JPH07121543A - Convensational sentence analyzing device - Google Patents

Convensational sentence analyzing device

Info

Publication number
JPH07121543A
JPH07121543A JP5266301A JP26630193A JPH07121543A JP H07121543 A JPH07121543 A JP H07121543A JP 5266301 A JP5266301 A JP 5266301A JP 26630193 A JP26630193 A JP 26630193A JP H07121543 A JPH07121543 A JP H07121543A
Authority
JP
Japan
Prior art keywords
dictionary
sentence
conversational sentence
syntactic
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5266301A
Other languages
Japanese (ja)
Inventor
Mikio Nakano
幹生 中野
Akira Shimazu
明 島津
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP5266301A priority Critical patent/JPH07121543A/en
Publication of JPH07121543A publication Critical patent/JPH07121543A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To analyze a conversational sentence similarly to vocabularies for written words. CONSTITUTION:A conversational sentence analyzing device 100 consists of an analytical dictionary 110, a syntax analyzing rule 120 and a syntax semantic analyzing part 130. A written word dictionary 111 and a conversational sentence dictionary 112 are prepared as the dictionary 110 and infixes (e.g. 'ano', 'etto') and abbreviated words (e.g. 'ja') are previously registered in the dictionary 112. The analyzing part 130 judges parts of speech for respective words in an inputted conversational sentence by the use of both the dictionaries 111, 112 and repeatedly executes processing for applying the rule 120 to the judged words to find out tree structure for the conversational sentence. The meaning expression of the conversational sentence is determined by the tree structure and outputted.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、会話文解析装置に関
し、たとえば人間が日常使っている言葉を、音声認識装
置、文字認識装置、計算機端末のキーボード、あるい
は、これらの組合せを介して文字列として計算機に入力
し、その文字列が意味する内容を計算機で処理する技術
に関するものである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a conversational sentence analysis device, for example, a character string that is used by a person in daily life through a voice recognition device, a character recognition device, a keyboard of a computer terminal, or a combination thereof. It is related to the technology of inputting into a computer as, and processing by the computer what the character string means.

【0002】[0002]

【従来の技術】従来の自然言語処理技法は、もっぱら書
き言葉を対象としたものであり、話し言葉で頻繁に現れ
るところの、「あのー」、「えっと」などの躊躇を表す
語や、「(〜は)ですね」といった文節の切れ目に挿入
される語句などの挿入語、および、「それでは」の代わ
りに用いられる「それじゃ」や、「行っている」の代わ
りに用いられる「行ってる」などの縮約語等を含む文を
解析するには向いていない。会語文を扱うためには、前
処理において、挿入語を取り除いたり、縮約語を書き言
語の語彙に変換したりした後、構文意味解析を行う必要
があった。
2. Description of the Related Art Conventional natural language processing techniques are intended mainly for written words. Words that frequently appear in spoken words, such as "Ah" and "Etto", and "(- Inserted words such as phrases inserted at breaks in phrases such as "ha"), and "sore" used in place of "there" and "going" used in place of "going" It is not suitable for analyzing sentences including contracted words such as. In order to handle synonym sentences, in the preprocessing, it was necessary to remove inserted words or convert contracted words into vocabulary of written language, and then perform syntactic and semantic analysis.

【0003】[0003]

【発明が解決しようとする課題】会話文特有の挿入語や
縮約語を、前処理において取り除いたり書き言葉の語彙
に変換したりした後、構文意味解析を行なう方法におい
ては、以下に述べるような欠点がある。
A method of performing syntactic and semantic analysis after removing an insertion word or a contraction word peculiar to a conversational sentence in a preprocessing or converting it into a vocabulary of a written word is as follows. There are drawbacks.

【0004】挿入語を前処理において発見し取り除いて
から構文意味解析を行うと、「ですね」などのように、
挿入語以外にも用いられる語も取り除かれてしまう。例
えば、「それについてはですね、今度までに考えてきま
す」という文なら「ですね」を取り除いてもよいが、
「原稿の締切は31日ですね」という文では、「です
ね」には確認を求めるという機能があるため、取り除く
ことはできない。従来の方法では、このような挿入語以
外に用いられる場合も取り除かれてしまう。
When the syntactic and semantic analysis is performed after the inserted word is found and removed in the preprocessing, it looks like "is".
Words used in addition to the inserted words are also removed. For example, in the sentence "I think about that, I will think about it", you can remove "
In the sentence "The manuscript deadline is 31st," it cannot be removed because "ne" has the function of requesting confirmation. In the conventional method, the case where it is used other than such an insertion word is also removed.

【0005】また、縮約語を書き言葉の語に変換する場
合も、問題がある。例えば、「じゃ」という語は「で
は」の縮約語として用いられるが、会話文中のすべての
「じゃ」が「では」におきかえられるわけではない。
There is also a problem in converting a contracted word into a written word. For example, the word "ja" is used as an abbreviation for "wa", but not all "ja" in conversational sentences are replaced with "wa".

【0006】本発明の目的は、会話文特有の挿入語や縮
約語を扱うための特別な手続きを用いず、書き言葉の語
彙と同じように扱うことによって、会話文の解析を可能
にすることにある。
An object of the present invention is to enable the analysis of conversational sentences by treating them in the same way as the vocabulary of written words without using a special procedure for treating insertion words and contracted words peculiar to conversational sentences. It is in.

【0007】[0007]

【課題を解決するための手段】上記目的を達成するため
に、本発明は、書き言葉用辞書と、構文解析規則と、こ
れらを用いて入力文の構文意味解析を行う構文意味解析
部とからなる文解析装置において、前記書き言葉用辞書
に会話文用辞書を付加して、該会話文用辞書に、会話文
特有の縮約表現や冗長表現などの構文意味情報を登録
し、前記構文意味解析部は、前記書き言葉用辞書に加え
て前記会話文用辞書を用い、前記構文解析規則をもと
に、入力された会話文の構文意味解析を行うことを特徴
とする。
In order to achieve the above object, the present invention comprises a dictionary for written words, a syntactic analysis rule, and a syntactic and semantic analysis unit for performing syntactic and semantic analysis of an input sentence using these. In the sentence analysis device, a conversational sentence dictionary is added to the written word dictionary, and syntactic and semantic information such as reduced expressions and redundant expressions peculiar to the conversational sentence is registered in the conversational sentence dictionary, and the syntactic and semantic analysis unit is described. Is characterized in that, in addition to the written word dictionary, the conversation sentence dictionary is used to perform syntactic and semantic analysis of the input conversation sentence based on the syntactic analysis rule.

【0008】[0008]

【作用】会話文用辞書には、挿入語や縮約語を登録す
る。例えば、「あのー」や、「えっと」などの挿入語
は、動詞句を修飾するが、意味に影響を及ぼさない様な
修飾語として登録する。また、「ですね」などの助詞の
あとに挿入される語は、助詞句に後続し、その意味に影
響を及ぼさない助詞として登録する。また、「では」の
かわりになる縮約語「じゃ」は、「では」と同じ機能の
語として登録する。このように、会話文特有の語を、書
き言葉用の辞書に付加する形で辞書登録することによっ
て、書き言葉と同じ規則と構文意味解析手法を用いて会
話文の構文意味解析を行うことができる。
[Operation] Inserted words and contracted words are registered in the conversation sentence dictionary. For example, an insertion word such as "an" or "umu" modifies a verb phrase, but is registered as a modifier that does not affect the meaning. In addition, a word inserted after a particle such as "is" is registered as a particle that follows the particle phrase and does not affect its meaning. Further, the contracted word “ja”, which replaces “”, is registered as a word having the same function as “”. In this way, by registering the words specific to the conversational sentence in the form of being added to the dictionary for the written language, the syntactic and semantic analysis of the conversational sentence can be performed using the same rules and the syntactic and semantic analysis method as the written language.

【0009】[0009]

【実施例】以下、本発明の一実施例について図面により
説明する。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【0010】図1は本発明の一実施例のブロック図であ
る。図において、会話文解析装置100は、解析用辞書
110と構文解析規則120と構文意味解析部130よ
りなる。辞書110には、書き言葉用辞書111に加え
て会話文用辞書112を用意し、この会話文用辞書11
2に会話文特有の挿入語や縮約語を登録しておく。構文
意味解析部130は、会話文が入力されると、書き言葉
用辞書110に加えて会話文用辞書112も参照し、構
文規則120にしたがって入力会話文の構文意味解析を
行い、該会話文の意味表現を出力する。
FIG. 1 is a block diagram of an embodiment of the present invention. In the figure, the conversational sentence analysis device 100 includes an analysis dictionary 110, a syntactic analysis rule 120, and a syntactic and semantic analysis unit 130. In the dictionary 110, a conversation sentence dictionary 112 is prepared in addition to the written word dictionary 111.
The insertion words and contracted words peculiar to the conversational sentence are registered in 2. When a conversational sentence is input, the syntactic and semantic analysis unit 130 refers to the conversational sentence dictionary 112 in addition to the written word dictionary 110, analyzes the syntactical and semantics of the input conversational sentence according to the syntactic rule 120, and analyzes the input conversational sentence. Output a semantic expression.

【0011】図2に、本会話文解析装置の利用態様の一
例として、自然言語処理システムに用いた場合のブロッ
ク図を示す。図において、処理対象となる会話文の表現
(音声)を音声認識装置10に入力し、認識処理して文
字列に変換し、会話文解析装置20へ送る。会話文解析
装置20は図1に示す構成をとり、構文意味解析部13
0では、書き言葉用の辞書111に加えて会話文用の辞
書112も参照して、入力された会話文の文字列の構文
意味解析を書き言葉と同じ構文規則120を用いて行
い、該会話文の意味表現を出力し、応答生成装置30へ
送る。応答生成装置30は、会話文の意味表現から適当
な応答を計画し、音声合成装置40を用いて音声で応答
を出力する。
FIG. 2 shows a block diagram when used in a natural language processing system, as an example of a mode of use of the conversational sentence analysis apparatus. In the figure, the expression (speech) of the conversational sentence to be processed is input to the voice recognition device 10, subjected to recognition processing, converted into a character string, and sent to the conversational sentence analysis device 20. The conversational sentence analysis device 20 has the configuration shown in FIG.
In 0, by referring to the dictionary 112 for conversational sentences in addition to the dictionary 111 for written sentences, the syntactic and semantic analysis of the character string of the input conversational sentence is performed using the same syntax rule 120 as the written language, The meaning expression is output and sent to the response generation device 30. The response generation device 30 plans an appropriate response from the semantic representation of the conversational sentence, and outputs the response by voice using the voice synthesis device 40.

【0012】次に、一例として、「それについてはです
ね、今度まで考えてきます」(以下、例文1と称す)お
よび「原稿の締切は31日ですね」(以下、例文2と称
す)という会話文の構文意味解析処理を示す。
[0012] Next, as an example, "I will think about it, I will consider it until now" (hereinafter referred to as example sentence 1) and "The manuscript deadline is 31 days" (hereinafter referred to as example sentence 2). The syntactic and semantic analysis processing of a conversation sentence is shown.

【0013】図1の会話文解析装置100の書き言葉用
辞書111、会話文用辞書112及び構文規則120
は、具体的に、それぞれ図3、図4、図5のようになっ
ているとする。ここで、図5の1行ごとが1つの構文規
則に対応し、例えば、1番目の構文規則「動詞句←名詞
句動詞句」は、名詞句と動詞句の並びが動詞句を構成す
ることを意味する。
The written sentence dictionary 111, the conversational sentence dictionary 112, and the syntax rules 120 of the conversational sentence analysis device 100 of FIG.
Is assumed to be specifically as shown in FIGS. 3, 4, and 5, respectively. Here, each line in FIG. 5 corresponds to one syntax rule. For example, in the first syntax rule "verb phrase ← noun phrase verb phrase", a sequence of noun phrases and verb phrases constitutes a verb phrase. Means

【0014】会話文解析装置100の構文意味解析部1
30では、書き言葉用辞書111および会話文用辞書1
12を用いて、入力された会話文の各単語の品詞を判定
し、これに構文規則120を適用する処理を繰り返し実
行して、入力会話文の木構造を求め、この求まった木構
造より、該会話文の意味表現を決定する。
The syntactic and semantic analysis unit 1 of the conversational sentence analysis apparatus 100.
In 30, the written word dictionary 111 and the conversational sentence dictionary 1
12, the part of speech of each word of the input conversational sentence is determined, and the process of applying the syntax rule 120 to this is repeatedly executed to obtain the tree structure of the input conversational sentence. From the obtained tree structure, Determine the semantic expression of the conversation sentence.

【0015】例文1の場合、木構造は図6のように求ま
る。ここで、「それについてはですね」に着目した場
合、図3の書き言葉用辞書より、「それ」は名詞(代名
詞)、「については」は助詞で、図5の3番目の構文規
則「名詞句←名詞助詞」より、「それについては」の部
分は名詞句となる。次の「ですね」は「です」+「ね」
なのか、間投詞の「ですね」なのかが曖昧である。そこ
で、「ですね」+「ね」として、図3の書き言葉用辞書
により「です」を判定詞と仮定すると、図5の構文規則
には名詞句と判定詞の並びがない。一方、「ですね」と
すると、これは図4の会話文用辞書に間投助詞として登
録されており、図5には「名詞句←名詞句間投助詞」の
構文規則がある(6番目)。この結果、「それについて
はですね」の「ですね」は間投助詞であったことが分か
る。「今度までに考えてきます」の部分の解析は従来と
同様である。結局、図6の木構造より、例文1の意味表
現は図7のようになる。
In the case of example sentence 1, the tree structure is obtained as shown in FIG. Here, when focusing on "about that," from the dictionary for written words in FIG. 3, "that" is a noun (pronoun), "about" is a particle, and the third syntactic rule "noun" in FIG. From "phrase ← noun particle", the part "about it" becomes a noun phrase. Next "is" is "is" + "ne"
However, it is ambiguous whether it is an interjection "is". Therefore, assuming that "is" + "ne" is a decision word in the written word dictionary of FIG. 3, the syntactic rule of FIG. 5 does not have a sequence of noun phrases and decision words. On the other hand, if it is "is", this is registered as an interjection particle in the conversational sentence dictionary in FIG. 4, and in FIG. 5, there is a syntax rule of "noun phrase ← noun phrase interpositional particle" (6 ). As a result, it can be seen that “” about “that” was an interjection particle. The analysis of the “I will think about this time” part is the same as before. After all, the semantic representation of the example sentence 1 becomes as shown in FIG. 7 from the tree structure of FIG.

【0016】例文2の場合は、木構造は図8のようにな
り、意味表現は図9のようになる。この場合、図5の構
文規則「判定詞句←名詞判定詞」(10番目)および
「文←文終助詞」(12番目)より、「ですね」は「で
す」(判定詞)+「ね」(終助点)であることが分か
る。
In the case of example sentence 2, the tree structure is as shown in FIG. 8 and the semantic representation is as shown in FIG. In this case, according to the syntax rule “determinant phrase ← noun determiner” (10th) and “sentence ← sentence final particle” (12th) in FIG. 5, “is” is “is” (determinant) + “ne”. It turns out that it is the (ending point).

【0017】[0017]

【発明の効果】本発明の会話解析装置を用いることによ
り、従来の書き言葉用の自然言語処理技法では扱えなか
った会話文の構文意味解析が可能である。また、本発明
により、前処理を行なわなくても、会話文の解析が可能
である。前処理では、文の構文意味解析を行なう前に、
文を変換してしまうため、間違った変換をしてしまう可
能性があるが、本発明では、構文意味解析と同時に、会
話文特有の語を認識するため、より正確である。本発明
の会話解析装置を用いることにより、音声入力による自
然言語処理システムが、より多くの種類の文を扱えるよ
うになる。
By using the conversation analysis device of the present invention, it is possible to perform a syntactic and semantic analysis of a conversation sentence which cannot be handled by the conventional natural language processing technique for written words. Further, according to the present invention, it is possible to analyze a conversational sentence without performing preprocessing. In the preprocessing, before performing the syntactic and semantic analysis of the sentence,
Since the sentence is converted, there is a possibility of making an incorrect conversion, but the present invention is more accurate because it recognizes a word peculiar to a conversational sentence at the same time as the syntactic and semantic analysis. By using the conversation analysis device of the present invention, the natural language processing system by voice input can handle more kinds of sentences.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の会話文解析装置の一実施例のブロック
図である。
FIG. 1 is a block diagram of an embodiment of a conversational sentence analysis device of the present invention.

【図2】本発明の会話文解析装置の利用態様の一例を示
す図である。
FIG. 2 is a diagram showing an example of a usage mode of the conversational sentence analysis device of the present invention.

【図3】書き言葉用辞書の一例を示す図である。FIG. 3 is a diagram showing an example of a written word dictionary.

【図4】会話文用辞書の一例を示す図である。FIG. 4 is a diagram showing an example of a conversational sentence dictionary.

【図5】構文規則の一例を示す図である。FIG. 5 is a diagram showing an example of syntax rules.

【図6】例文1の木構造である。6 is a tree structure of example sentence 1. FIG.

【図7】例文1の意味表現である。7 is a semantic representation of example sentence 1. FIG.

【図8】例文2の木構造である。8 is a tree structure of example sentence 2. FIG.

【図9】例文2の意味表現である。9 is a semantic representation of example sentence 2. FIG.

【符号の説明】[Explanation of symbols]

100 会話文解析装置 110 解析用辞書110 111 書き言葉用辞書 112 会話文用辞書 120 構文規則 130 構文意味解析部 100 Conversational sentence analysis device 110 Analysis dictionary 110 111 Written language dictionary 112 Conversational sentence dictionary 120 Syntax rules 130 Syntax meaning analysis unit

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 書き言葉用辞書と、構文解析規則と、こ
れらを用いて入力文の構文意味解析を行う構文意味解析
部とからなる文解析装置において、 前記書き言葉用辞書に会話文用辞書を付加して、該会話
文用辞書に、会話文特有の縮約表現や冗長表現などの構
文意味情報を登録し、 前記構文意味解析部は、前記書き言葉用辞書に加えて前
記会話文用辞書を用い、前記構文解析規則をもとに、入
力された会話文の構文意味解析を行うことを特徴とする
会話文解析装置。
1. A sentence analysis device comprising a written word dictionary, a syntactic analysis rule, and a syntactic and semantic analysis unit for performing syntactic and semantic analysis of an input sentence using these, and a conversational sentence dictionary is added to the written word dictionary. Then, the syntactic and semantic information such as a contracted expression and a redundant expression peculiar to the conversational sentence is registered in the conversational sentence dictionary, and the syntactic and semantic analysis unit uses the conversational sentence dictionary in addition to the written word dictionary. A conversational sentence analysis device for performing a syntactic and semantic analysis of an input conversational sentence based on the syntactic analysis rule.
JP5266301A 1993-10-25 1993-10-25 Convensational sentence analyzing device Pending JPH07121543A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5266301A JPH07121543A (en) 1993-10-25 1993-10-25 Convensational sentence analyzing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5266301A JPH07121543A (en) 1993-10-25 1993-10-25 Convensational sentence analyzing device

Publications (1)

Publication Number Publication Date
JPH07121543A true JPH07121543A (en) 1995-05-12

Family

ID=17429041

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5266301A Pending JPH07121543A (en) 1993-10-25 1993-10-25 Convensational sentence analyzing device

Country Status (1)

Country Link
JP (1) JPH07121543A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6132198A (en) * 1997-08-26 2000-10-17 Toshiba Kikai Kabushiki Kaisha Clamping apparatus for injection molding machine

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6132198A (en) * 1997-08-26 2000-10-17 Toshiba Kikai Kabushiki Kaisha Clamping apparatus for injection molding machine

Similar Documents

Publication Publication Date Title
US6374224B1 (en) Method and apparatus for style control in natural language generation
US6278968B1 (en) Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system
US6442524B1 (en) Analyzing inflectional morphology in a spoken language translation system
US6223150B1 (en) Method and apparatus for parsing in a spoken language translation system
US6282507B1 (en) Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection
US6243669B1 (en) Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation
US6356865B1 (en) Method and apparatus for performing spoken language translation
US6266642B1 (en) Method and portable apparatus for performing spoken language translation
US8566076B2 (en) System and method for applying bridging models for robust and efficient speech to speech translation
JP2004355625A (en) Method and system for training machine translator
WO2010046782A2 (en) Hybrid machine translation
JPH0682377B2 (en) Emotion information extraction device
Szarvas et al. Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR
JP3441400B2 (en) Language conversion rule creation device and program recording medium
JPH07121543A (en) Convensational sentence analyzing device
Shukla et al. A Framework of Translator from English Speech to Sanskrit Text
JP3309896B2 (en) Utterance analysis device
Watanabe et al. An automatic interpretation system for travel conversation.
JPS6229796B2 (en)
JPH08190562A (en) Machine translation system
Jun Paraphrasing Chinese Idioms: Paraphrase Acquisition, Rewording and Scoring
Ying et al. A hybrid approach to Chinese-English machine translation
Růžička Normalization of numbers into spoken form for text-to-speech systems
JP2504449B2 (en) Japanese phrase interdependency analysis device
WO2000045289A1 (en) A method and apparatus for example-based spoken language translation with examples having grades of specificity