JPS62232076A - Learning system - Google Patents

Learning system

Info

Publication number
JPS62232076A
JPS62232076A JP61075818A JP7581886A JPS62232076A JP S62232076 A JPS62232076 A JP S62232076A JP 61075818 A JP61075818 A JP 61075818A JP 7581886 A JP7581886 A JP 7581886A JP S62232076 A JPS62232076 A JP S62232076A
Authority
JP
Japan
Prior art keywords
ambiguity
sentence
information
knowledge base
knowledge
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61075818A
Other languages
Japanese (ja)
Inventor
Takenori Makino
牧野 武則
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP61075818A priority Critical patent/JPS62232076A/en
Publication of JPS62232076A publication Critical patent/JPS62232076A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To register a new knowledge in a knowledge base by analyzing a sentence by using syntactical information and knowledge, and verifying the ambiguity of the obtained fact. CONSTITUTION:An ambiguity decision means 2 to verify the ambiguity of the fact obtained by a sentence analysis means 1, is provided. The means 1 retrieves the content of the syntactical information 3 and that of the knowledge base 4 respectively through transmission lines 31 and 41, and analyzes a sentence by using these contents. The result of the analysis is transferred to the means 2 through a transmission line 12, so that a fact which is not ambiguous and can be determined as unitary is found. And the result is transferred through a transmission line 24 and stored in the knowledge base 4.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、機械翻訳や自然言語処理に必要な言語情報を
自動的に獲得するための学習方式に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a learning method for automatically acquiring linguistic information necessary for machine translation and natural language processing.

〔従来技術とその問題点〕[Prior art and its problems]

従来、詳細な言語情報を集め、辞書等を構築する場合、
既存の辞書等を参照しながら、決めら6次仕様に従がい
、人手により電子化辞書に登録していた。自動学習につ
いては既にいろいろ外方式が提案されてはいるが、笑用
的な方式は存在せず。
Traditionally, when collecting detailed linguistic information and constructing dictionaries,
They were manually registered in the electronic dictionary by referring to existing dictionaries and following the established sixth-order specifications. Various methods have already been proposed for automatic learning, but no practical method exists.

きわめて限定さn、l’e分野でしか有用でない方式が
ほとんどである。
Most of the methods are useful only in very limited fields.

本発明では、すでにあるテキストを分析し、その分析結
果から学習すべき情報を取り出し、その情報を評価する
ことにより、自動的に言語情報を学習するシステムを構
築するための方式t−機供することが目的である。
The present invention provides a method for constructing a system that automatically learns linguistic information by analyzing existing text, extracting information to be learned from the analysis results, and evaluating that information. is the purpose.

この目的を連取するためには%まずテキストを読み、文
章解°析する手段と1文章解析された結果から新たに学
習すべき情報を取ジ出丁手段と、取り出された情報全知
識ベースに格納するため、その情報を評価する手段と、
評1曲し次結果にエフ知識ベースを書き換える手yiを
新たに提供しなけnはならない。
In order to achieve this purpose, first read the text, use a means to analyze the text, extract new information to be learned from the result of the text analysis, and put all the retrieved information into a knowledge base. means for storing and evaluating the information;
After one review, a new way to rewrite the F knowledge base based on the results must be provided.

文章解析についてtゴ、すでに多くの報告がめるが、こ
こでは、言語情報(知識)の獲得が目的であるため、依
存文法にもとずく係り受は解析を中心とし九解析手段を
導入する。ここで解決しなければならない問題は、現在
ある知識を使って、文章に含まわる事実(1;’act
)を固定する除に生ずる解析のあいまいさである。あい
まいさのある事実の学習は非常に危険であり、排除しな
けnばならない。すなわち、新しいB5災を含んでいる
文意を現在ある言語情報を使って、あいまいさのない事
実として同定できる機構が必斐である。
There have already been many reports regarding sentence analysis, but since the purpose here is to acquire linguistic information (knowledge), we will focus on analysis of dependencies based on dependent grammars and introduce nine analysis means. The problem we need to solve here is to use our current knowledge to determine the facts (1;'act) contained in the sentence.
) is an ambiguity in the analysis that arises when fixed. Learning ambiguous facts is extremely dangerous and must be avoided. In other words, it is essential to have a mechanism that can identify the meaning of a sentence that includes the new B5 disaster as an unambiguous fact using existing linguistic information.

〔問題点を)界決するだめの手段〕[Means to settle the issue]

発明の学習方式は、文章をπ〔析する情報として統語情
報と知識ベースをもち、前記統語情報と知識ベースを用
いて文章を解析手段と、前記文章解析手段に工って得ら
nた事実のあいまい性を検証するあいない性判定手段と
、前記あいまい性判定手段にエフ、あいまい性のないと
判定された事実を前記知識ベースに格納する格納手段と
を具備することを特徴とする。
The learning method of the invention has syntactic information and a knowledge base as information for analyzing a sentence, a means for analyzing the sentence using the syntactic information and the knowledge base, and a fact obtained by using the means for analyzing the sentence. The present invention is characterized by comprising: ambiguity determining means for verifying ambiguity; and storing means for storing facts determined to be unambiguous in the knowledge base.

〔作 用〕[For production]

言語情報として、構文といった言語学的情報と、意味に
関する知識情報とに分けらnる0ここでは言語学的情報
は充分に与えらnているものとし、知識情報に関し学習
するものとする0 いま、簡単な例を考える。次の文が与えらfL九とする
Linguistic information is divided into linguistic information such as syntax and knowledge information regarding meaning.Here, it is assumed that sufficient linguistic information has been provided, and learning is based on knowledge information. , consider a simple example. Suppose the following sentence is given fL9.

「赤いバラが咲いた」 この文は「バラの花の色は赤である」と「バラは咲くも
のである」という事実を示している0この2つの事実は
上の文を統語規則を使用して取り吊すことができるだろ
う。つま9簡単な文ならば。
``A red rose has bloomed.'' This sentence shows the facts that ``The color of a rose flower is red'' and ``A rose is something that blooms.'' 0These two facts can be explained using syntactic rules in the above sentence. You will be able to hang it. 9. If it's a simple sentence.

そこで示さnている事実を知識を使用しなくても「赤い
」が形容詞で体言に係り、「バラが」は連用型であり、
用言に係るという2つの規則でその係り受けが同定でき
、同定さn友関係から事実を取り吊すことか原則的には
可能である。こうし友事実を学習しておけは、例えば次
の文 透明な赤いバラの花ビン が与えらnたとき、「赤い」は「バラ」に係り。
Even if you do not use knowledge of the facts shown there, ``red'' is an adjective and relates to the noun, and ``baraga'' is a conjunctive form.
The dependency can be identified by the two rules of ``relating to predicates,'' and it is in principle possible to remove the fact from the identified n-friend relationship. For example, in the following sentence, when given a transparent vase of red roses, ``red'' refers to ``rose.''

「透明」ニ「バラ」に係らず「花ビン」に係ることが、
得らnている知識を用いて解析することができ、ニジ正
確な意味解析を行うことが可能となるO 〔実施例〕 第1図は本発明の一実施例を示す、機能的ブロックダイ
アグラムである。文章解析手段1は統語情報3の内容と
知識ベース4の内容をそれぞれ転送線31,41t−介
して検索し、それらの内容を用いて文章を解析する。解
析された結果は転送線12を介して、あいまい性判定手
段2に送られ、あいまい性がなく、−意に決定できる事
実を発見し、転送#1j124t−介して知識ベース4
にその事実を格納する。なお、格納手段については図示
していない。
Regardless of whether it is “transparent” or “rose”, it is related to “flower bottle”.
[Embodiment] Figure 1 is a functional block diagram showing an embodiment of the present invention. be. The text analysis means 1 retrieves the content of the syntactic information 3 and the content of the knowledge base 4 via transfer lines 31 and 41t, respectively, and analyzes the text using these contents. The analyzed results are sent to the ambiguity determination means 2 via the transfer line 12, which discovers facts that are unambiguous and can be determined at will, and sent to the knowledge base 4 via transfer #1j124t.
Store that fact in . Note that the storage means is not illustrated.

〔発明の効果〕〔Effect of the invention〕

本発明にLnは、統語情報と知識を用いて文を解析し、
得らnた事実のあいまい性を検証することで、新友な知
識を知識ベースに登録することが可能となる。
In the present invention, Ln analyzes sentences using syntactic information and knowledge,
By verifying the ambiguity of the obtained facts, new knowledge can be registered in the knowledge base.

本発明に工nば、さらに、テキストからの自動的な知識
の獲得が可能になる。
The invention further enables automatic knowledge acquisition from text.

【図面の簡単な説明】[Brief explanation of drawings]

第1図に本発明の一実施例を示す機能的ブロックダイア
グラムである。
FIG. 1 is a functional block diagram showing an embodiment of the present invention.

Claims (1)

【特許請求の範囲】[Claims] 文章を解析する情報として統語情報と知識ベースをもち
、前記統語情報と知識ベースを用いて文章を解析する文
章解析手段と、前記文章解析手段によって得られた事実
のあいまい性を検証するあいまい性判定手段と、前記あ
いまい性判定手段によりあいまい性がないと判定された
事実を前記知識ベースに格納する格納手段とを具備する
ことを特徴とする学習方式。
A text analysis means that has syntactic information and a knowledge base as information for analyzing a text, uses the syntactic information and knowledge base to analyze the text, and an ambiguity judgment that verifies the ambiguity of the facts obtained by the text analysis means. and storage means for storing, in the knowledge base, facts determined to have no ambiguity by the ambiguity determining means.
JP61075818A 1986-04-01 1986-04-01 Learning system Pending JPS62232076A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61075818A JPS62232076A (en) 1986-04-01 1986-04-01 Learning system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61075818A JPS62232076A (en) 1986-04-01 1986-04-01 Learning system

Publications (1)

Publication Number Publication Date
JPS62232076A true JPS62232076A (en) 1987-10-12

Family

ID=13587143

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61075818A Pending JPS62232076A (en) 1986-04-01 1986-04-01 Learning system

Country Status (1)

Country Link
JP (1) JPS62232076A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04503894A (en) * 1989-03-03 1992-07-09 ドイチェ トムソン―ブラント ゲゼルシャフト ミット ベシュレンクテル ハフツング Method for image reproduction of digital video signals

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04503894A (en) * 1989-03-03 1992-07-09 ドイチェ トムソン―ブラント ゲゼルシャフト ミット ベシュレンクテル ハフツング Method for image reproduction of digital video signals

Similar Documents

Publication Publication Date Title
US20040128292A1 (en) Search data management
Bharadwaj et al. Language independent identification of parallel sentences using wikipedia
Saxena et al. Epie dataset: A corpus for possible idiomatic expressions
Al-Taani et al. A rule-based approach for tagging non-vocalized Arabic words.
Cholakov et al. Better statistical machine translation through linguistic treatment of phrasal verbs
CN109871548A (en) A kind of patent document interpretation method
JPS62232076A (en) Learning system
JPS62232077A (en) Learning system
Floyd Sentence complexity and clause subordination in children's Spanish
Sun Semantics-driven shallow parsing for Chinese semantic role labeling
KR20160050652A (en) Method for constructing treebank of new language and method thereof
Tovar et al. BUAP: A first approximation to relational similarity measuring
JPH0157826B2 (en)
JPH06295308A (en) Method and device for analyzing natural language
Wu Learning verb-noun relations to improve parsing
JPH0474259A (en) Document summarizing device
KR100371135B1 (en) Declinable-word morphology analyzing apparatus using a declinable-word derivative-dictionary and method therefor
Carl et al. English terminology in CLAT
JP2830097B2 (en) Sentence search method
JPH04211867A (en) System for analyzing japanese syntax
Rijkhoff Number’disagreement
JPH0262668A (en) Sentence information retrieving system using sentence information analyzing technique
Bowden Cooperative error handling and shallow processing
Li et al. Translation of Long English Sentences Based on Clause Complex Theory
KOGKITSIDOU et al. Normalisation of 16𝑡ℎ and 17𝑡ℎ century texts in French and geographical named entity recognition