JPH04152458A

JPH04152458A - Word processor

Info

Publication number: JPH04152458A
Application number: JP2276494A
Authority: JP
Inventors: Shigemi Nakazato; 茂美中里; Hirofumi Goto; 浩文後藤
Original assignee: Toshiba Corp; Toshiba Software Engineering Corp
Current assignee: Toshiba Corp; Toshiba Software Engineering Corp
Priority date: 1990-10-17
Filing date: 1990-10-17
Publication date: 1992-05-26

Abstract

PURPOSE:To easily confirm the contents of a document by analyzing each word, each paragraph, and an entire sentence for each sentence, segmenting plural words showing the gists, and generating the auxiliary verbs, etc., for production of the sentences. CONSTITUTION:A syntax analyzing part 14 absorbs the auxiliary verbs, etc., into the precedent words and arranges them in the paragraphs in a word list analyzed by a morpheme analyzing part 13. Thus, the part 14 produces a reference notion structure. A semantic analyzing part 15 produces the Japanese word notion reliant structure data where the active-passive relation is adjusted against the reference notion structure. A gist segmenting part 16 segments the necessary words against the Japanese word notion reliant structure data based on a rule stored in a rule memory 19. A generating part 17 produces an auxiliary verb absorbed by the part 14 and produces a sentence. This sentence is outputted to a control part 10 and displayed at an output part 12. In such a constitution, the gist of a produced document is surely extracted and the contents of the document can be easily confirmed from the extracted gist.

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）本発明は、文書の要点を作成する機能を備えた文書作成
装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Industrial Application Field) The present invention relates to a document creation device having a function of creating main points of a document.

（従来の技術）近年、文書作成装置（ワードプロセッサ）においては、
文章推敲機能を備えたものがある。この文章推敲機能に
より作成文章をチエツクする場合、文章中から１文を切
り出した後、解析辞書の検索により形態素解析を行なっ
て１文の系列に解析し、所定のルールに従って文中の不
適当な部分に対する置換候補を作成し、プロパティ等に
よりユーザーに示し、ユーザーの指示に従って校正処理
する。上記文中の不適当な部分に対する置換候補を作成
する場合、従来では文の意味解析を行ない、その解析結
果に基づいて文の並び変え等を行なって分かり易い文を
作成するようにしている。(Prior art) In recent years, in document creation devices (word processors),
There are some that have a text editing function. When checking a created sentence using this sentence refinement function, after cutting out one sentence from the sentence, morphological analysis is performed by searching an analysis dictionary to analyze it into a series of sentences, and inappropriate parts of the sentence are removed according to predetermined rules. A replacement candidate is created, shown to the user using properties, etc., and proofreaded according to the user's instructions. When creating replacement candidates for the inappropriate portions of a sentence, conventionally the sentences are semantically analyzed and the sentences are rearranged based on the analysis results to create easy-to-understand sentences.

（発明が解決しようとする課題）上記のように従来の文書作成装置では、文章推敲におい
て置換候補を示す場合、単に文の並び変え等を行なって
分かり易い文を作成するだけであり、要点だけを切り出
すことができなかった。(Problems to be Solved by the Invention) As mentioned above, in conventional document creation devices, when presenting replacement candidates in text editing, they simply rearrange the sentences to create an easy-to-understand sentence, and only focus on the main points. I couldn't cut it out.

本発明は上記のような点に鑑みてなされたもので、文の
要点だけを切り出すことかでき、文章推敲に際して置換
候補の選択が極めて容易な文書作成装置を提供すること
を目的とする。The present invention has been made in view of the above-mentioned points, and it is an object of the present invention to provide a document creation device that can extract only the main points of a sentence and that makes it extremely easy to select replacement candidates when revising a sentence.

［発明の構成］（課題を解決するための手段と作用）すなわち、本発明に係る文書作成装置は、日本語解析に
使用する知識情報を記憶してなる解析辞書と、この解析
辞書を参照して単語単位の解析を行なう形態素解析手段
と、この形態素解析手段の解析結果に基づいて文節単位
の解析を行ない助動詞等を前の単語に吸収させる構文解
析手段と、この構文解析手段の解析結果に基づき一文を
通しての解析を行なう意味解析手段と、この意味解析手
段の解析結果から必要な単語を切り出す要点切り出し手
段と、この要点切り田し手段により切り出された単語に
対し、上記構文解析手段で吸収された助動詞を生成して
１つの文を作成する文生成手段とを具備する。[Structure of the Invention] (Means and Effects for Solving the Problems) That is, the document creation device according to the present invention includes an analysis dictionary that stores knowledge information used for Japanese language analysis, and a document creation device that refers to this analysis dictionary. A morphological analysis means that performs word-by-word analysis based on the analysis result of this morphological analysis means, a syntactic analysis means that performs clause-by-clause analysis based on the analysis result of this morphological analysis means and absorbs auxiliary verbs etc. into the previous word, a semantic analysis means for analyzing a whole sentence based on the analysis results; a gist extraction means for cutting out necessary words from the analysis result of the semantic analysis means; and a syntactic analysis means for absorbing the words extracted by the gist cutting means. and sentence generation means for generating one sentence by generating the auxiliary verbs.

この結果、作成された文書は、各文毎に「単語単位」、
「文節単位」、「−文全体」を対象として順次解析が行
なわれ、その解析結果に基づいて要点を示す単語が抽出
される。そして、この抽出された単語に対して助動詞等
が生成され、要点を示す文が作成される。従って、この
抽出された要点から文書内容を容易に確認することが可
能になる。As a result, the created document is created for each sentence in "word units",
Analyzes are performed sequentially for "phrase units" and "- whole sentences," and words indicating key points are extracted based on the analysis results. Then, auxiliary verbs and the like are generated for the extracted words, and a sentence indicating the main point is created. Therefore, it becomes possible to easily confirm the document content from the extracted key points.

（実施例）以下、図面を参照して本発明の一実施例を説明する。(Example) Hereinafter, one embodiment of the present invention will be described with reference to the drawings.

第１図はこの実施例の文書作成装置の基本的な構成を示
すブロック図である。FIG. 1 is a block diagram showing the basic configuration of the document creation device of this embodiment.

第１図に於いて、１０はマイクロプロセッサ（ＣＰＵ）
等を用いて構成される制御部、１１は同制御部１０に接
続される入力部、１２は制御部１０に接続される表示部
である。上記入力部１１は、各種のキー及びマウスなど
による入力を受付けて上記制御部ｌＯに入力する。また
、表示部１２は、入力データあるいは処理データ等を表
示するもので、ＣＲＴあるいはドツトマトリクス方式の
液晶デイスプレィ装置等からなり、例えば６４０Ｘ４０
０ドツトの大きさに設定される。In Figure 1, 10 is a microprocessor (CPU)
11 is an input section connected to the control section 10, and 12 is a display section connected to the control section 10. The input section 11 accepts inputs using various keys, a mouse, etc., and inputs them to the control section IO. The display unit 12 displays input data or processed data, and is composed of a CRT or a dot matrix type liquid crystal display device, for example, a 640×40
The size is set to 0 dot.

上記制御部１０は、装置全体の制御を行なうもので、入
力指示に従うプログラムの起動で、文書作成処理、文書
編集処理、表示制御処理等を行なうと共に、形態素解析
部１３、構文解析部１４、意味解析部１５、要点切り出
し部１６、生成部１７を制御して、要点の切り出し制御
を実行する。The control unit 10 controls the entire device, and performs document creation processing, document editing processing, display control processing, etc. by starting a program according to input instructions, and also controls the morphological analysis unit 13, the syntax analysis unit 14, the semantic analysis unit 14, and the like. The analysis section 15, main point extraction section 16, and generation section 17 are controlled to execute main point extraction control.

形態素解析部１３は、制御部１０から送られてくる文に
対し、解析辞書１８を検索して単語単位に解析したリス
トを作成する。解析辞書１８には、この日本語解析に使
用する知識情報が収容されている。The morphological analysis unit 13 searches the analysis dictionary 18 for the sentences sent from the control unit 10 and creates a list of the sentences analyzed word by word. The analysis dictionary 18 stores knowledge information used for this Japanese language analysis.

構文解析部１４は、形態素解析部１３により解析された
単語リストの中で助動詞等を前の単語に吸収させて文節
にまとめ、基準概念構造を作成する。The syntactic analysis unit 14 absorbs auxiliary verbs and the like into previous words in the word list analyzed by the morphological analysis unit 13 and combines them into clauses to create a reference conceptual structure.

意味解析部１５は、−文を通しての解析を行なうもので
、構文解析部１４により作成された基準概念構造に対し
、その係り受けを調整した日本語概念依存構造データを
作成する。The semantic analysis unit 15 performs analysis through the - sentence, and creates Japanese concept dependent structure data in which the dependencies are adjusted for the reference concept structure created by the syntactic analysis unit 14.

要点切り出し部１６は、上記日本語概念依存構造データ
に対し、規則メモリ１つに記憶されている規則に基づい
て必要な単語を切り出し、生成部１７へ出力する。規則
メモリ１９には、文の要点を切り出すための規則が収容
されている。The gist extraction section 16 extracts necessary words from the Japanese concept dependent structure data based on the rules stored in one rule memory and outputs them to the generation section 17 . The rule memory 19 stores rules for cutting out the main points of sentences.

生成部１７は、構文解析部１４て吸収された助動詞第を
生成し、１つの文を作成して制御部１０に出力し、表示
部１２に表示する。The generation unit 17 generates the auxiliary verbs absorbed by the syntax analysis unit 14, creates one sentence, outputs it to the control unit 10, and displays it on the display unit 12.

次に、上記実施例における要点切り出し動作について第
２図のフローチャートを参照して説明する。Next, the gist extraction operation in the above embodiment will be explained with reference to the flowchart of FIG.

要点の切り出しを行なう場合には、画面上に対象となる
文章が表示されている状態で、入力部１１から文章を指
定して要点の切り出しを指示する。制御部１０は、要点
の切り出しが指示されると、第２図のフローチャートに
示す処理動作を開始する。すなわち、制御部１０は、要
点の切り出しが指示されると、その対象となる文を形態
素解析部１３に出力する。これにより形態素解析部１３
は、第２図のステップＡＩに示す形態素解析動作を行な
う。When cutting out the main points, the user specifies the text from the input section 11 and instructs to cut out the main points while the target text is displayed on the screen. When the control unit 10 is instructed to cut out the main points, it starts the processing operation shown in the flowchart of FIG. 2. That is, when the control unit 10 is instructed to cut out the main points, it outputs the target sentence to the morphological analysis unit 13. As a result, the morphological analysis unit 13
performs the morphological analysis operation shown in step AI in FIG.

今、例えば第３図のＢｌに示すように、私はかなり勉強
したので、今回の試験に合格することができた。Now, for example, as shown in Figure 3, Bl, I studied a lot, so I was able to pass this exam.

の文か制御部１０から形態素解析部１３に送られたとす
ると、形態素解析部１３は解析辞書１８を参照して形態
素解析を行なう。この形態素解析により、第３図のＢ２
に示すように単語単位に解析したリスト、私／はｌかなり７勉強した／ので／、ｌ今回／の７試験
／に７合格すること／が／でき／た。７を作成する。When a sentence is sent from the control unit 10 to the morphological analysis unit 13, the morphological analysis unit 13 refers to the analysis dictionary 18 and performs morphological analysis. Through this morphological analysis, B2 in Figure 3
As shown in the list, analyzed word by word, I studied quite a lot, so I was able to pass the exam this time. Create 7.

次に制御部１０は、上記形態素解析部１３により解析さ
れた単語リストを構文解析部１４に出力する。この構文
解析部１４は、上記単語リストの中で助動詞等を前の単
語に吸収させて文節にまとめ、第３図の８３に示すよう
に基準概念構造を作成する（ステップＡ２）。即ち、目
的語である「合格」を頂点として■「かなり」、「勉強
」。Next, the control unit 10 outputs the word list analyzed by the morphological analysis unit 13 to the syntax analysis unit 14. The syntactic analysis unit 14 absorbs auxiliary verbs and the like into the previous words in the word list and combines them into clauses, creating a reference conceptual structure as shown at 83 in FIG. 3 (step A2). In other words, with the object word ``pass'' at the top, ■ ``quite'' and ``study''.

「ので」、■「今回」、「試験」の２つの系統が関連付
けられる。また、「勉強」に対して「私」が関連付けら
れる。Two systems are associated: "So", ■ "This time", and "Test". Moreover, "I" is associated with "study".

そして、上記構文解析部１４で作成された基準概念構造
データは、意味解析部１５へ送られてその意味が解析さ
れ（ステップＡ３）、第３図のＢ４に示すように係り受
けを調整した日本語概念依存構造が作成される。即ち、
基準概念構造において、「勉強」に係っていた「私」が
「合格」に係るように意味解析部１５により調整され、
要点切り出し部１６へ送られる。The reference concept structure data created by the syntax analysis unit 14 is then sent to the semantic analysis unit 15, where its meaning is analyzed (step A3), and the dependencies are adjusted as shown in B4 of FIG. A word concept dependent structure is created. That is,
In the standard concept structure, "I" which was related to "studying" is adjusted by the semantic analysis unit 15 so that it is related to "passing",
The information is sent to the main point extraction section 16.

要点切り出し部１６は、意味解析部１５で作成された日
本語概念依存構造データから規則メモリ１９に記憶して
いる要点切り出し規則に基づいて必要な単語を切り出す
（ステップＡ４）。即ち、要点切り出し部１６は、第３
図の８５に示すように日本語概念依存構造データから要
点切り出し規則に基づいて主客と目的格のアークである
「私」。The gist cutting unit 16 cuts out necessary words from the Japanese concept dependent structure data created by the semantic analysis unit 15 based on gist cutting rules stored in the rule memory 19 (step A4). That is, the main point cutting section 16
As shown at 85 in the figure, ``Washi'' is the arc of the subject and objective case based on the gist extraction rules from the Japanese concept-dependent structure data.

「合格」、「試験」を切り出し、生成部１７へ出力する
。“Pass” and “Test” are cut out and output to the generation unit 17.

生成部１７は、要点切り出し部１６により切り出された
「私」、「合格」、「試験」から、上記構文解析部１４
で吸収された助動調節を生成し、第３図の８６に示すよ
うに１つの文「私は試験に合格した。」を作成する（ス
テップＡ５）。この生成部１７により生成された文は、
制御部１０へ送られ、表示部１２に表示される。The generation unit 17 generates the syntax analysis unit 14 from “I”, “pass”, and “examination” extracted by the gist extraction unit 16.
The system generates the auxiliary adjustment absorbed in the step A5, and creates one sentence "I passed the exam" as shown at 86 in FIG. 3 (step A5). The sentence generated by this generation unit 17 is
It is sent to the control section 10 and displayed on the display section 12.

上記のようにして作成文書に対する要点の抽出が行なわ
れる。The main points of the created document are extracted as described above.

［発明の効果］以上のように本発明に係る文書作成装置は、作成された
文書に対し、各文毎に「単語単位」、「文節単位」、「
−文全体」を対象として順次解析を行ない、その解析結
果に基づいて要点を示す複数の単語を切り出し、この単
語に対して助動詞等を生成して文を作成するようにした
ので、作成文書の要点を確実に抽出することができ、こ
の要点から文書内容を容易に確認することができる。[Effects of the Invention] As described above, the document creation device according to the present invention can process the created document for each sentence by "word unit", "bunse unit unit", or "bunsetsu unit".
- The entire sentence is sequentially analyzed, and based on the analysis results, multiple words that indicate the main points are extracted, and auxiliary verbs are generated for these words to create a sentence. The key points can be reliably extracted, and the content of the document can be easily confirmed from the key points.

[Brief explanation of drawings]

第１図は本発明の一実施例による文書作成装置を示すブ
ロック図、第２図は同実施例の動作を示すフローチャー
ト、第３図は同実施例における要点切り出しの具体的な
例を示す図である。１０・・・制御部、１１・・・入力部、１２・・表示部
、１３・・・形態素解析部、１４・構文解析部、１５・
・・意味解析部、１６・・・要点切り出し部、１７・・
・生成部、１８・・・解析辞書、１９・・・規則メモリ
。FIG. 1 is a block diagram showing a document creation device according to an embodiment of the present invention, FIG. 2 is a flowchart showing the operation of the embodiment, and FIG. 3 is a diagram showing a specific example of key point extraction in the embodiment. It is. DESCRIPTION OF SYMBOLS 10... Control part, 11... Input part, 12... Display part, 13... Morphological analysis part, 14. Syntax analysis part, 15.
...Semantic analysis part, 16...Gist extraction part, 17...
- Generation unit, 18... Analysis dictionary, 19... Rule memory.

Claims

[Claims]

In the document creation device, there is an analysis dictionary that stores knowledge information used for Japanese language analysis, a morphological analysis means that performs word-by-word analysis by referring to this analysis dictionary, and an analysis result of the morphological analysis means. There is a syntactic analysis means that analyzes each clause and absorbs auxiliary verbs into the previous word, a semantic analysis means that analyzes the whole sentence based on the analysis results of this syntactic analysis means, and a syntactic analysis means that analyzes the entire sentence based on the analysis results of this syntactic analysis means. The present invention further comprises: gist extraction means for cutting out words; and sentence generation means for generating one sentence by generating auxiliary verbs absorbed by the parsing means for the words extracted by the gist extraction means. Characteristic document creation device.