JPH06290181A - Processor for retrieving derivative - Google Patents

Processor for retrieving derivative

Info

Publication number
JPH06290181A
JPH06290181A JP5101906A JP10190693A JPH06290181A JP H06290181 A JPH06290181 A JP H06290181A JP 5101906 A JP5101906 A JP 5101906A JP 10190693 A JP10190693 A JP 10190693A JP H06290181 A JPH06290181 A JP H06290181A
Authority
JP
Japan
Prior art keywords
word
derivative
search
retrieved
search processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5101906A
Other languages
Japanese (ja)
Inventor
Hideyasu Naka
英康 中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP5101906A priority Critical patent/JPH06290181A/en
Publication of JPH06290181A publication Critical patent/JPH06290181A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To enable the effective editing work by referring to a derivative development table prepared in advance by taking a conjugated form as a key, developing a word to be retrieved to a derivative word group, and retrieving the word to be retrieved and the derivative group. CONSTITUTION:An input section 11 inputs a word to be retrieved and its conjugated form from the input device such as keyboard. A derivative development section 13 accepts the word to be retrieved and its conjugated form from a control section 12 and obtains the derivative group of the word to be retrieved by taking the conjugated form as a key and referring to a conjugated form development table 16, passing the information to the control section 12. A retrieval processing section 14 accepts the word to be retrieved and the derivative group from the control section 12, retrieving the character string of the word to be retrieved and the derivative group from the object sentence and passing the retrieved information to the control section 12. An output section 15 accepts the retrieved information from the section 12 and displays it on an output device such as a display.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は派生語検索処理装置に関
し、特に自然言語で記述された文章から任意の語を検索
する処理を行うワードプロセッサや機械翻訳システム等
の文書編集システムにおける派生語検索処理装置に関す
る。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a derivative word search processing device, and more particularly, a derivative word search process in a document editing system such as a word processor or a machine translation system for searching for an arbitrary word from a sentence written in natural language. Regarding the device.

【0002】[0002]

【従来の技術】従来、自然言語(日本語)の文章から、
例えば「書く」という語を検索する場合、その検索手法
は「書く」という文字列のマッチング処理によるもので
あったので、「書かない」や「書きます」といった「書
く」の派生語は検索されなかった。そこで、それらの派
生語を含めて検索したい場合には、その語幹部分「書」
を検索対象語として検索を行っていた。
2. Description of the Related Art Conventionally, from natural language (Japanese) sentences,
For example, when searching for the word "write", the search method was based on the matching process of the character string "write", so the derivative words of "write" such as "do not write" and "write" are searched. There wasn't. Therefore, when you want to search including those derivative words, the stem part "call"
The search was performed with "."

【0003】派生語を検索する必要がある例として、機
械翻訳システムを利用する場合と、ワードプロセッサを
利用する場合とを挙げてみる。
As an example in which it is necessary to search for a derivative word, a case of using a machine translation system and a case of using a word processor will be cited.

【0004】まず、機械翻訳の場面において、原文中の
ある語の訳語を変更したとき(「書く」の訳語を”wr
ite”から”draw”に変更したとき)に、訳語の
統一を図るためにその語(「書く」)の訳語をすべて変
更したい。その際に、「書く」で検索するのでは不十分
であるし、また「書」で検索するのでは「書籍」や「図
書」といった余計な語まで検索してしまう。
First, when the translation of a certain word in the original sentence is changed in the machine translation scene (the translation of "writing" is changed to "wr"
When changing from "ite" to "draw"), I want to change all translations of that word ("write") in order to unify the translations. At that time, searching by "writing" is not enough, and searching by "book" results in searching for extra words such as "book" and "book".

【0005】次に、ワードプロセッサ等で文章を編集す
る場面において、ある語の表記を変更した場合(例え
ば、「書く」を「描く」にした場合)、その語「書く」
に関してすべて検索をして必要ならば同様に変更した
い。ここでも、機械翻訳の場面と同じ問題が発生する。
Next, when the notation of a word is changed (for example, when "writing" is changed to "draw") in a scene where a sentence is edited by a word processor or the like, the word "writing" is performed.
I'd like to search all about and make the same changes if necessary. Here again, the same problem as in the machine translation scene occurs.

【0006】しかし、従来は、上記のような派生語まで
加えた編集作業においては、利用者は仕方なく語幹部分
で検索することによって編集を行っていたので、効率的
な編集作業を行うことができなかった。
However, in the past, in the editing work including the above-mentioned derivative words, the user had to do the search by searching the word stem portion, so that the efficient editing work could be performed. could not.

【0007】[0007]

【発明が解決しようとする課題】上述した従来の派生語
検索処理方法では、検索対象語を派生語まで含めて検索
したい場合には語幹部分を検索対象語として検索を行っ
ていたので、派生語だけでなく、語幹部分を含む熟語等
の余計な語まで検索してしまうことになり、利用者にと
って不便であるという問題点があった。
In the above-described conventional derivative word search processing method, when the search target word is to be searched including the derivative word, the word stem portion is searched as the search target word. Not only that, but also unnecessary words such as idioms including the stem portion are searched, which is a problem for the user.

【0008】なお、従来、特開昭63−204461号
公報に開示された文章解析装置のように、文章解析にお
いて派生語を分解して利用する技術はあったが、派生語
をあらかじめ発生させて検索する技術はなかった。
Conventionally, there has been a technique for decomposing and using derivative words in sentence analysis, such as the sentence analysis apparatus disclosed in Japanese Patent Laid-Open No. 63-204461. There was no search technology.

【0009】本発明の目的は、上述の点に鑑み、検索対
象語とその活用形とを入力として受け取り、その活用形
をキーにして、あらかじめ用意しておいた派生形展開テ
ーブルを参照して検索対象語を派生語群に展開し、その
検索対象語および派生語群を検索することにより、文章
の編集作業を効率的に行えるようにした派生語検索処理
装置を提供することにある。
In view of the above points, an object of the present invention is to receive a search target word and its inflectional form as input, and refer to a derived form expansion table prepared with the inflectional form as a key. It is an object of the present invention to provide a derivative word search processing device capable of efficiently editing a sentence by expanding a search object word into a derivative word group and searching the search object word and the derivative word group.

【0010】[0010]

【課題を解決するための手段】本発明の派生語検索処理
装置は、自然言語で記述された文章から任意の語を検索
する検索処理装置において、検索対象語とその活用形と
を入力する入力部と、この入力部により入力された検索
対象語を活用形をキーにして、あらかじめ用意した活用
形展開テーブルを参照して派生語群に展開する派生語展
開部と、検索対象語および前記派生語展開部により展開
された派生語群を文章中から検索する検索処理部と、こ
の検索処理部による検索処理の結果を出力する出力部と
を有する。
A derivative word search processing device of the present invention is a search processing device for searching an arbitrary word from a sentence written in natural language, and an input for inputting a search target word and its inflectional form. Part, a derivation word expansion part for expanding a derivation word group by referring to a derivation word expansion table prepared in advance using the retrieval target word input by this input part as a key, the retrieval target word and the derivation It has a search processing unit that searches a derivative word group expanded by the word expansion unit from a sentence, and an output unit that outputs the result of the search processing by this search processing unit.

【0011】[0011]

【実施例】次に、本発明について図面を参照して詳細に
説明する。
The present invention will be described in detail with reference to the drawings.

【0012】図1は、本発明の一実施例に係る派生語検
索処理装置の構成を示すブロック図である。本実施例の
派生語検索処理装置は、入力部11と、制御部12と、
派生語展開部13と、検索処理部14と、出力部15
と、活用形展開テーブル16とから構成されている。
FIG. 1 is a block diagram showing the arrangement of a derivative word search processing apparatus according to an embodiment of the present invention. The derivative word search processing device according to the present embodiment includes an input unit 11, a control unit 12, and
Derived word expansion unit 13, search processing unit 14, and output unit 15
And an inflection type expansion table 16.

【0013】入力部11は、例えばキーボード等の入力
装置から得られる検索対象語(原形(日本語の場合には
終止形)には限られない)およびその活用形を入力す
る。
The input unit 11 inputs a search target word (not limited to the original form (end form in the case of Japanese)) obtained from an input device such as a keyboard and its inflection form.

【0014】制御部12は、以下の3つの機能を持つ。 1.入力部11から検索対象語とその活用形とを取り込
み、派生語展開部13に渡す。 2.派生語展開部13から検索対象語および派生語群を
取り込み、検索処理部14に渡す。 3.検索処理部14から対象となる文章中の検索箇所の
情報を取り込み、出力部15に渡す。
The control unit 12 has the following three functions. 1. The search target word and its inflection form are fetched from the input unit 11 and passed to the derivative word expansion unit 13. 2. The search target word and the derivative word group are fetched from the derivative word expansion unit 13 and passed to the search processing unit 14. 3. Information on the search location in the target sentence is fetched from the search processing unit 14 and passed to the output unit 15.

【0015】派生語展開部13は、制御部12から検索
対象語および活用形を受け取り、活用形をキーとして活
用形展開テーブル16を参照して検索対象語の派生語群
を求め、その情報を制御部12に渡す。
The derivative word expansion unit 13 receives the search target word and the inflected form from the control unit 12, obtains a derivative word group of the search target word by referring to the inflected form expansion table 16 using the inflected form as a key, and outputs the information. It is passed to the control unit 12.

【0016】検索処理部14は、制御部12から検索対
象語および派生語群を受け取り、対象文章中から検索対
象語および派生語群の文字列を検索し、その検索箇所の
情報を制御部12に渡す。
The search processing unit 14 receives the search target word and the derivative word group from the control unit 12, searches the target sentence for a character string of the search target word and the derivative word group, and obtains the information of the search location from the control unit 12. Pass to.

【0017】出力部15は、制御部12から検索箇所の
情報を受け取り、その検索箇所を、例えばディスプレイ
等の出力装置に表示する。
The output unit 15 receives the information on the search location from the control unit 12, and displays the search location on an output device such as a display.

【0018】図2は、本実施例の派生語検索処理装置に
おいて受け渡しされる情報の具体例を示すブロック図で
ある。
FIG. 2 is a block diagram showing a specific example of information passed in the derivative word search processing device of this embodiment.

【0019】図3は、派生語展開部13において参照さ
れる活用形展開テーブル16の一例を示す図である。活
用形展開テーブル16には、五段活用,下一段活用等の
活用の種類と、活用語尾の行と、未然形,連用形,終止
形,連体形,仮定形および命令形の活用語尾とが対応さ
れて格納されている。
FIG. 3 is a diagram showing an example of the inflectional form expansion table 16 referred to by the derivative word expansion unit 13. The inflectional form expansion table 16 corresponds to the types of inflections such as the five-stage utilization, the lower one-stage utilization, and the lines of the inflectional inflections, and the inflectional forms of the preformed form, the combined form, the final form, the union form, the assumed form, and the imperative form Has been stored.

【0020】図4は、検索対象となる文章の一例を示す
図である。
FIG. 4 is a diagram showing an example of a sentence to be searched.

【0021】図5は、図4の文章中の検索される箇所を
例示する図である。
FIG. 5 is a diagram showing an example of a searched portion in the sentence shown in FIG.

【0022】次に、このように構成された本実施例の派
生語検索処理装置の動作について、図2ないし図5を参
照しながら説明する。ここでは、図4の文章中の「書
く」を派生語を含めて検索する場合について述べる。
Next, the operation of the derivative word search processing apparatus of the present embodiment thus constructed will be described with reference to FIGS. Here, a case will be described in which "writing" in the sentence of FIG. 4 is searched including derivative words.

【0023】まず、入力部11は、キーボード等から検
索対象語に「書く」、その活用形に「五段」を入力す
る。
First, the input unit 11 inputs "write" as the search target word and "fifth stage" as the inflection from a keyboard or the like.

【0024】制御部12は、入力部11から(検索対象
語,活用形)のペアとして(「書く」,「五段」)を取
り込み、派生語展開部13に渡す。このとき、名詞等の
活用しない品詞の場合や派生語を検索したくない場合に
は、活用形を指定しなければよい。例えば、「図書館」
という名詞を検索する場合には、(検索対象語,活用
形)のペアは(「図書館」,φ)と指定する。
The control unit 12 fetches (“write”, “five”) as a pair of (search target word, inflectional form) from the input unit 11 and passes it to the derivative word expansion unit 13. At this time, if there is no part of speech such as a noun or if it is not desired to search for a derivative word, the inflectional form need not be specified. For example, "library"
When searching for a noun called, the pair of (search target word, inflectional form) is specified as ("library", φ).

【0025】派生語展開部13は、制御部12から
(「書く」,「五段」)のペアを受け取り、活用形をキ
ーにして図3に示す活用形展開テーブル16を参照し
て、検索対象語の派生語群を求める。この場合、か行五
段活用であるので、図3の活用形展開テーブル16の*
の行が参照されて、「書く」の派生語群は、{「書
か」,「書こ」,「書き」,「書い」,「書け」}と求
められる。求めた派生語群と検索対象語「書く」とは、
制御部12に渡される。
The derivative word expansion unit 13 receives the pair ("write", "fifth stage") from the control unit 12, refers to the inflectional form expansion table 16 shown in FIG. Find a derivative word group of the target word. In this case, since it is a five-row utilization, the utilization type expansion table 16 of FIG.
Is referred to, the derivative word group of “writing” is obtained as {“writing”, “writing”, “writing”, “writing”, “writing”}. The derived terms and the search term "writing" are
It is passed to the control unit 12.

【0026】次に、制御部12は、検索対象語および派
生語群を検索処理部14に渡す。
Next, the control unit 12 passes the search target word and the derivative word group to the search processing unit 14.

【0027】検索処理部14は、文章中から検索対象語
および派生語群の文字列を検索し、その検索箇所の情報
を制御部12に渡す。その情報とは、例えば図4におい
て、{文の12〜13文字目,文の4〜5文字目,
文の3〜4文字目,文の7〜8文字目}となる。
The search processing unit 14 searches the text for a character string of a search target word and a derivative word group, and passes information on the search location to the control unit 12. The information is, for example, in FIG. 4, {the 12th to 13th characters of the sentence, the 4th to 5th characters of the sentence,
The 3rd to 4th characters of the sentence and the 7th to 8th characters of the sentence}.

【0028】続いて、制御部12は、検索箇所の情報を
出力部15に渡す。
Subsequently, the control unit 12 passes the information on the search location to the output unit 15.

【0029】出力部15は、その検索箇所をディスプレ
イ(図示せず)上に表示する。表示方法は様々あるが、
表示される検索箇所は図5の下線部分である。
The output unit 15 displays the search location on a display (not shown). There are various display methods,
The displayed search location is the underlined portion in FIG.

【0030】利用者は、表示された検索箇所をワードプ
ロセッサや機械翻訳システム等の文書編集システムの置
換機能を用いて必要に応じて「書く」から「描く」に置
換していけばよい。
The user may replace the displayed search location with "write" from "draw" as necessary using the replacement function of the document editing system such as a word processor or machine translation system.

【0031】参考までに、(検索対象語,活用形)を
(「図書館」,φ)とすれば、図4の{文の1〜3文
字目}が検索箇所となる。また、(検索対象語,活用
形)を(「書く」,φ)とすれば、図4の{文の4〜
5文字目,文の3〜4文字目}が検索箇所となる。
For reference, if the (search target word, inflectional form) is (“library”, φ), the {1st to 3rd letters of the sentence} in FIG. 4 becomes the search location. If (search target word, inflected form) is (“write”, φ), {sentence 4 to 4 in FIG.
The 5th character, the 3rd to 4th character of the sentence} is the search location.

【0032】なお、上記実施例では、文章が日本語で記
述されている場合を例にとって説明したが、文章を記述
する言語はかならずしも日本語に限られず、英語,仏語
等の他の言語であっても本発明が同様に適用できること
はいうまでもない。
In the above embodiment, the case where the text is written in Japanese has been described as an example, but the language in which the text is written is not necessarily limited to Japanese, but may be another language such as English or French. However, it goes without saying that the present invention can be similarly applied.

【0033】[0033]

【発明の効果】以上説明したように本発明は、検索対象
語とその活用形とを入力として受け取り、その活用形を
キーにしてあらかじめ用意しておいた派生形展開テーブ
ルを参照して、検索対象語を派生語群に展開し、その検
索対象語および派生語群を検索することにより、機械翻
訳での訳語の統一やワードプロセッサでの語の統一のた
めの編集作業を効率的に行うことを可能にするという効
果がある。
As described above, according to the present invention, a search target word and its inflectional form are received as an input, and the derivation form expansion table prepared in advance using the inflectional form as a key is referred to for retrieval. By expanding the target word into a derivative word group and searching for the search target word and the derivative word group, it is possible to efficiently perform editing work for unifying translation words in machine translation and unifying words in a word processor. It has the effect of enabling it.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の派生語検索処理装置の構成を示すブロ
ック図である。
FIG. 1 is a block diagram showing a configuration of a derivative word search processing device of the present invention.

【図2】本発明の一実施例に係る派生語検索処理装置の
構成を示すブロック図である。
FIG. 2 is a block diagram showing a configuration of a derivative word search processing device according to an embodiment of the present invention.

【図3】図1中の派生語展開部において参照される活用
形展開テーブルの一例を示す図である。
FIG. 3 is a diagram showing an example of an inflection form expansion table referred to by a derivative word expansion unit in FIG.

【図4】本実施例の派生語検索処理装置において検索対
象となる文章の一例を示す図である。
FIG. 4 is a diagram showing an example of a sentence to be searched by the derivative word search processing device according to the embodiment.

【図5】本実施例の派生語検索処理装置において検索さ
れる箇所を例示する図である。
FIG. 5 is a diagram exemplifying locations searched for in the derivative word search processing device according to the embodiment.

【符号の説明】[Explanation of symbols]

11 入力部 12 制御部 13 派生語展開部 14 検索処理部 15 出力部 16 活用形展開テーブル 11 Input Section 12 Control Section 13 Derived Word Expansion Section 14 Search Processing Section 15 Output Section 16 Inflectional Expansion Table

Claims (3)

【特許請求の範囲】[Claims] 【請求項1】 自然言語で記述された文章から任意の語
を検索する検索処理装置において、 検索対象語とその活用形とを入力する入力部と、 この入力部により入力された検索対象語を活用形をキー
にして、あらかじめ用意した活用形展開テーブルを参照
して派生語群に展開する派生語展開部と、 検索対象語および前記派生語展開部により展開された派
生語群を文章から検索する検索処理部と、 この検索処理部による検索処理の結果を出力する出力部
とを有することを特徴とする派生語検索処理装置。
1. A search processing apparatus for searching an arbitrary word from a sentence written in natural language, and an input section for inputting a search target word and its inflection, and a search target word input by this input section. Using the inflectional form as a key, refer to the inflectional form expansion table that has been prepared in advance, and develop a derivative word group that expands to a derivative word group, and search for the search target word and the derivative word group expanded by the derivative word group And a search processing unit for outputting the result of the search processing by the search processing unit.
【請求項2】 前記入力部が活用形として活用しない旨
の情報を入力した場合に、前記派生語展開部が動作しな
い請求項1記載の派生語検索処理装置。
2. The derivative word search processing device according to claim 1, wherein the derivative word expansion unit does not operate when the input unit inputs information indicating that it is not used as a conjugation form.
【請求項3】 前記自然言語が日本語でなる請求項1記
載の派生語検索処理装置。
3. The derivative word search processing device according to claim 1, wherein the natural language is Japanese.
JP5101906A 1993-04-05 1993-04-05 Processor for retrieving derivative Pending JPH06290181A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5101906A JPH06290181A (en) 1993-04-05 1993-04-05 Processor for retrieving derivative

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5101906A JPH06290181A (en) 1993-04-05 1993-04-05 Processor for retrieving derivative

Publications (1)

Publication Number Publication Date
JPH06290181A true JPH06290181A (en) 1994-10-18

Family

ID=14312954

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5101906A Pending JPH06290181A (en) 1993-04-05 1993-04-05 Processor for retrieving derivative

Country Status (1)

Country Link
JP (1) JPH06290181A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61208563A (en) * 1985-03-14 1986-09-16 Toshiba Corp Sentence editing device
JPS62282364A (en) * 1986-05-30 1987-12-08 Nec Corp Character string retrieval system
JPS6336367A (en) * 1986-07-30 1988-02-17 Seiko Epson Corp Sentence retrieving device
JPH01307865A (en) * 1988-06-06 1989-12-12 Nec Corp Character string retrieving system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61208563A (en) * 1985-03-14 1986-09-16 Toshiba Corp Sentence editing device
JPS62282364A (en) * 1986-05-30 1987-12-08 Nec Corp Character string retrieval system
JPS6336367A (en) * 1986-07-30 1988-02-17 Seiko Epson Corp Sentence retrieving device
JPH01307865A (en) * 1988-06-06 1989-12-12 Nec Corp Character string retrieving system

Similar Documents

Publication Publication Date Title
EP0201324B1 (en) Language forming system
JPH06290181A (en) Processor for retrieving derivative
JP2838984B2 (en) General-purpose reference device
JP4643183B2 (en) Translation apparatus and translation program
JPH0944502A (en) Informatin receiving device and machine translating device
JPH09185629A (en) Machine translation method
JP4087829B2 (en) Valency dictionary expansion device, method, and program
JP2896152B2 (en) Japanese text generation processor
JPH06282568A (en) Document edition processor
JPH11282844A (en) Preparing method of document, information processor and recording medium
JPH0350668A (en) Character processor
JPH0778166A (en) Translating method and machine translation system
JPS62282364A (en) Character string retrieval system
JPH0773185A (en) Machine translation system and method therefor
JP2003108578A (en) Network retrieving system
JP2006065542A (en) Machine translation method
JP2002041516A (en) Device and method for machine translation, and computer-readable storage medium with machine translating program recorded thereon
JPH086950A (en) Machine translation apparatus with keyword translation function
JPH04241066A (en) Electronic dictionary retrieval system of document processor
JPH01181158A (en) Translation back-up device
JP2002032369A (en) Dictionary-preparing device
JPH03233671A (en) Sentence editing method and sentence generating device using this method
JPH0368074A (en) Context processing system dependent upon area designation
JPH05303589A (en) Translating device
JPH0981555A (en) Method and device for document processing