JPH0682367B2

JPH0682367B2 - Document creation / proofreading support device

Info

Publication number: JPH0682367B2
Application number: JP61314280A
Authority: JP
Inventors: 茂起空閑; 正寛和田; 敏幸田中; 太朗森下; 信夫中村
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1986-12-26
Filing date: 1986-12-26
Publication date: 1994-10-19
Anticipated expiration: 2009-10-19
Also published as: JPS63163957A

Description

【発明の詳細な説明】＜産業上の利用分野＞本発明は、日本語文章の中から、形態素解析を行うこと
により、固有名詞に関連する部分を抽出し、固有名詞の
正誤の確認及び校正を行い易くする文書作成・校正支援
装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION <Industrial application field> The present invention extracts a part related to a proper noun from a Japanese sentence by performing a morphological analysis, and confirms and corrects the correctness of the proper noun. The present invention relates to a document creation / proofreading support device that facilitates performing.

＜従来の技術＞現在、日本語ワードプロセッサが実用化されており、そ
れに関連した、日本語の入出力、編集、かな漢字変換ア
ルゴリズム、辞書の技術などの基本技術が確立してい
る。<Conventional Technology> Currently, Japanese word processors have been put into practical use, and related basic technologies such as Japanese input / output, editing, Kana-Kanji conversion algorithm, and dictionary technology have been established.

また、日本語処理技術では、形態素解析、構文解析、意
味解析などの基本的な技術が知られている。In addition, basic Japanese processing techniques such as morphological analysis, syntactic analysis, and semantic analysis are known.

欧米では、ワードプロセッサが早くから発展したため関
連技術が進んでおり、スペルのチェック、コレクトの機
能を持った装置が実用化されている。In Europe and the United States, word processors have developed from an early stage, so related technologies have advanced, and devices with spell check and collect functions have been put into practical use.

欧米の言葉が単語単位に句切られて記述されるのに比
べ、日本語は句切りのない漢字仮名交り文で記述される
のが通常であり、正書法が徹底していないため、解析が
難しく校正を自動化する装置は実用化されていない。Compared to Western words that are written in units of words, Japanese is usually written in Kanji Kana altering sentences without punctuation. A device that is difficult to automate the calibration has not been put to practical use.

従来、正確な日本語を扱うことが要求される場合、複数
の人が対になり読み合わせをして問題のある点を抽出し
たり、あるいは校正の専門的な知識を持った人が逐次照
合を加え、校正する方法などの方法が採用されている。In the past, when it was required to handle accurate Japanese, multiple people would be paired and read to identify problematic points, or people with specialized knowledge of proofreading would be able to perform sequential matching. In addition, methods such as calibration method are adopted.

最近、このような校正作業を支援するための装置が開発
されつつある。それらの装置は、大きく分けると２つの
方式に分類できる。Recently, a device for supporting such calibration work is being developed. These devices can be roughly classified into two types.

一つは、前者に対するものであり、漢字仮名交りの文章
を解析して音声に変換し、機械との間で、音声を介した
読み合わせを行うことのできる校正装置である。One is for the former, which is a proofreading device capable of analyzing a sentence containing kanji and kana and converting it into a voice, and performing reading through voice with a machine.

今一つは、後者に対するもので、校正の自動化あるいは
校正の専門化の支援を行うことを目的としたものであ
る。この装置は前者の装置に比べ、より高度の技術が要
求されるため、概念の提案があるのみで該装置を構成す
る手段、校正の具体的な手段の報告は少ない。The other is for the latter, with the purpose of supporting the automation of calibration or specialized calibration. Since this device requires higher technology than the former device, there are few reports of means for constructing the device and specific means of calibration only by the proposal of the concept.

また、その他の技術として、言語処理研究の中では、KW
IC（Key Word In Context）に関する技術は一般的で
ある。In addition, as another technique, in language processing research, KW
The technology related to IC (Key Word In Context) is general.

言語処理以外では、ワークステーションに関連した技術
が確立しており、作業の効率を上げるためのマルチウイ
ンンドウを用いた装置が実用化されている。Other than language processing, techniques related to workstations have been established, and devices using multi-windows have been put to practical use to improve work efficiency.

＜発明が解決しようとする問題点＞近年日本語のワードプロセッサが普及し、該装置で作成
した文書が多くなっている。ワードプロセッサでは、入
力の簡便なかな漢字変換方式（以下特に断りがない場
合、ローマ字漢字変換方式を含む）を採用した機種が多
くなっている。<Problems to be Solved by the Invention> In recent years, Japanese word processors have become widespread, and the number of documents created by the device has increased. In word processors, Kana-Kanji conversion method that is easy to input (including the Roman-Kanji conversion method unless otherwise specified) has been adopted in many models.

かな漢字変換のアルゴリズムは、かなを漢字に変換する
過程で単語辞書、文法などの言語的な裏付けのある情報
との照合が行われるため、確率的な基盤に基づいてはい
るが、ある程度の妥当性が確かめられている。The kana-kanji conversion algorithm is based on a probabilistic basis, because it matches with linguistically-supported information such as word dictionaries and grammars in the process of converting kana to kanji. Has been confirmed.

固有名詞はその一部が辞書に登録され単語辞書と同じよ
うに、検索、確認される場合が多い。Part of proper nouns is registered in a dictionary and often searched and confirmed in the same manner as a word dictionary.

しかし、固有名詞はその数が多いため、単語辞書にその
すべてを登録することは経済、処理時間、ハードウェア
の制約などの面で実現的でない。However, since there are many proper nouns, it is not feasible to register all of them in the word dictionary in terms of economy, processing time, hardware restrictions, and the like.

又、固有名詞は変化することが多く、その更新に注意が
必要である。Also, proper nouns often change, so be careful when updating them.

又、固有名詞は間違えた場合に影響が大きく、信頼関係
を損ねたり。人権問題、訴訟問題に発展する可能性があ
り、その正誤の確認には注意が必要である。Also, if you make a mistake with a proper noun, it will have a great effect, and you will lose trust. Human rights and litigation issues may develop, and it is necessary to be careful in checking their correctness.

上述のごとく、最近開発されつつある校正機能を持った
装置の場合も固有名詞の部分を重点的に確認、校正する
機能は報告されていない。As described above, even in the case of a device having a proofreading function which is being developed recently, the function of checking and proofreading the proper noun part has not been reported.

たとえば、音声読み上げの機能を持った装置では、文章
を逐一かな文字列に分解・変換し、音声合成装置で読み
上げるため、固有名詞の部分に対しても厳重な確認が可
能であるが、処理速度が遅いという欠点があった。For example, with a device that has a voice reading function, sentences are decomposed and converted into character strings one by one and read aloud by a voice synthesizer, so it is possible to perform strict confirmation even for proper noun parts, but processing speed Has the drawback of being slow.

又、同字異音の固有名詞、文字が多いため音声を用いた
読み合わせを行っても確実に校正できるという訳ではな
かった。In addition, since there are many proper nouns and letters with the same characters and different sounds, it was not always possible to proofread accurately even when reading with voice.

又、該装置では音声合成装置、スピーカーなどの付加装
置が必要であり、コスト高になるという欠点があった。In addition, this device requires an additional device such as a voice synthesizer and a speaker, which has a drawback of high cost.

又、該装置では、言語と音声に関する処理が必要であ
り、処理が複雑になるとともにプログラムの容量が増大
するという欠点があった。In addition, the apparatus requires processing related to language and voice, which makes the processing complicated and increases the capacity of the program.

又、音声を用いる騒音が問題になり、その解消のために
ヘンドフォンが必要になったり、それを装着する手間が
かかったりする欠点があった。In addition, there is a problem that a noise using voice becomes a problem, a hendphone is required to eliminate the noise, and it takes time and effort to put it on.

一方、形態素、構文、意味解析を用いる方式では、固有
名詞の間違いを校正する機能は報告されておらず、従来
通り、文中から固有名詞の部分を押出し、確認の後、校
正を行わねばならず、作業者の負担、時間が強要される
という欠点があった。On the other hand, in the method using morpheme, syntactic and semantic analysis, the function of proofreading the mistake of proper nouns has not been reported, and as before, the proper noun portion must be pushed out from the sentence and proofreading must be performed after confirmation. However, there is a drawback that the worker is burdened and time is required.

又、両方式に共通する問題として、文書の中で、固有名
詞が大きな意味をなす場合が、多いが、固有名詞の部分
を取り出して確認を行うという機能は報告されておら
ず、固有名詞の部分のみの確認を行う場合、作業の効率
を落とすという欠点があった。In addition, as a problem common to both formulas, proper nouns often have a large meaning in a document, but the function of extracting the proper noun part and performing confirmation has not been reported. When checking only the part, there is a drawback that the work efficiency is reduced.

本発明の方式は、文章の中から固有名詞に関連した部分
を形態訴解析によって抽出し、固有名詞の確認が行い易
い形で表示するとともに、装置の使用者が簡単に校正で
きるようにし、かかる問題を解決しようとするものであ
る。The method of the present invention extracts a part related to a proper noun from a sentence by morphological complaint analysis, displays it in a form that allows easy confirmation of the proper noun, and enables a user of the device to easily proofread. It tries to solve the problem.

＜問題点を解決するための手段＞本発明は、日本語を入力・編集する手段と、前記入力さ
れた日本語を記憶する手段と、辞書を記憶する辞書手段
と、文法を記憶する文法手段と、前記入力された日本語
の中から校正すべき文字・記号列を抽出する手段と、前
記辞書手段と前記文法手段により、前記入力された日本
語に対応する候補文字・記号列の形態素解析を行い、固
有名詞に関係する形態素の部分を抽出し、該固有名詞を
検出する検出手段と、前記検出手段に応答し、前記固有
名詞のみを他の文章の部分と区別が付くように反転表示
する手段と、前記表示手段にて表示された前記固有名詞
にて校正すべき候補文字・記号列がある場合に、該候補
文字・記号列を修正する手段とを備えることを特徴とす
る文書作成・校正支援装置である。<Means for Solving Problems> The present invention provides means for inputting / editing Japanese, means for storing the input Japanese, dictionary means for storing a dictionary, and grammar means for storing grammar. And means for extracting a character / symbol string to be proofread from the input Japanese, and morphological analysis of candidate characters / symbol strings corresponding to the input Japanese by the dictionary means and the grammar means. The morpheme portion related to the proper noun is extracted, and the detecting means for detecting the proper noun and the responding to the detecting means are displayed in reverse video so that only the proper noun can be distinguished from other sentence parts. And a means for correcting a candidate character / symbol string to be proofread by the proper noun displayed by the display means, if necessary. -It is a calibration support device.

＜作用＞入力手段により計算機などに入力された日本語の中か
ら、形態素解析を行い、固有名詞に関係する形態素の部
分を抽出するとともに、他の文章の部分と区別が付くよ
うに表示し、固有名詞に関連した部分の正誤の確認が行
い易くするように作用する。<Operation> From the Japanese input to the computer by the input means, morpheme analysis is performed to extract the morpheme part related to proper nouns and display it so as to distinguish it from other sentence parts. It works to make it easier to confirm the correctness of the part related to the proper noun.

押出された部分をカーソルなどのポインティングデバイ
スで指示したとき、固有名詞部分をキーとするKWICを作
成し、別の場所に表示することにより、固有名詞に関連
した部分の正誤の確認を更に、行き易くするように作用
する。When you point the extruded part with a pointing device such as a cursor, create a KWIC that uses the proper noun part as a key and display it in another place to further confirm the correctness of the part related to the proper noun. Acts to facilitate.

文章中に固有名詞の部分が複数個存在する場合は、文章
中の他の部分と固有名詞部分を区別した情報を利用し
て、抽出された部分のみの指示を可能とする機能を付加
し、更に固有名詞の部分の正誤の確認を行い易くするよ
うに作用する。If there is more than one proper noun part in the sentence, use the information that distinguishes the proper noun part from other parts of the sentence, and add a function that allows you to specify only the extracted part. Further, the function of facilitating confirmation of correctness of the proper noun portion is provided.

上記の確認作業において、もし、抽出された部分に間違
いのあることが確認された場合は、原文あるいはKWIC中
の間違いの部分を入力編集機能を用いて修正し、文章を
校正するように作用する。In the above confirmation work, if it is confirmed that there is an error in the extracted part, it corrects the error part in the original sentence or KWIC using the input editing function, and acts to proofread the sentence .

以上のような、確認、修正機能を用いて、固有名詞の部
分を効率良く、校正し、前述の問題点を改良するように
作用する。Using the checking and correcting functions as described above, the proper noun portion is efficiently proofread, and the above-mentioned problems are improved.

＜実施例＞以下図に基づいて本発明を詳細に説明する。第１図は本
発明に係わる日本語文章校正装置のブロック構成図であ
る。<Example> The present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram of a Japanese sentence proofreading device according to the present invention.

図において１は日本語の文字列を入力・編集する手段で
ある。In the figure, 1 is a means for inputting / editing a Japanese character string.

２は該入力手段により入力された日本語の文字列を記憶
する手段である。入力手段は通常キーボードが用いられ
るが逐次的に入力を行なわないで、たとえばフロッピー
ディスク、磁気テープなどのように入力した日本語の文
字列を記憶する外部記憶手段で代用することも可能であ
る。即ち、１が省略された構成も存在しうる。Reference numeral 2 is a means for storing the Japanese character string input by the input means. A keyboard is usually used as the input means, but it is also possible to substitute an external storage means for storing the input Japanese character string such as a floppy disk or a magnetic tape instead of sequentially inputting. That is, there may be a configuration in which 1 is omitted.

３は上記２に蓄積された日本語の文字・記号列を形態素
解析するための辞書を記憶する手段である。3 is a means for storing a dictionary for morphological analysis of the Japanese character / symbol strings accumulated in 2 above.

４は同様の目的のために使用するもので文法、その他の
辞書を記憶する手段である。4 is used for the same purpose and is a means for storing grammar and other dictionaries.

５は２に蓄えられた文字列の中から、固有名詞の部分を
抽出したり、途中結果を記憶したり、表示の司令などを
行ったりする制御手段である。該制御手段には制御によ
って得られる結果を記憶する手段を含む。Reference numeral 5 is a control means for extracting a proper noun portion from the character string stored in 2, storing an intermediate result, and controlling display. The control means includes means for storing the result obtained by the control.

６は入力された文字列、照合の途中結果、校正すべき文
字列、KWICなどを表示するCRTなどの表示の手段であ
る。Reference numeral 6 is a display means such as a CRT for displaying an input character string, a result of collation, a character string to be calibrated, KWIC and the like.

７は６によって表示されたKWICの中に誤りがあった場
合、その修正結果を原文中に正しく反映するための校正
手段である。Reference numeral 7 is a proofreading means for correctly reflecting the correction result in the original text when there is an error in the KWIC displayed by 6.

第２図は１によって入力れた文字列の例であり、５ほ制
御手段により、６に表示された状態を表している。この
文章を８とする。また、この文章は間違いであり、正し
くは「これは空閑氏らの発明である。」であるとする。
即ち、固有名詞の「空間」は誤りであり、「空閑」が正
しいものとする。正しく校正された後の文章は８と区別
して９とする。FIG. 2 shows an example of the character string input by 1, and shows the state displayed on 6 by the control means 5. This sentence is 8. In addition, this sentence is incorrect, and it is correct that it is "this is the invention of Kugara et al."
That is, the proper noun "space" is incorrect, and "void" is correct. The sentence after being proofread correctly is set to 9 in distinction from 8.

第３図は本発明に係わる表示の例を示している。本図は
文章８の中から固有名詞の部分10を抽出し、他と区別し
て表示した結果の例を示している。FIG. 3 shows an example of a display according to the present invention. This figure shows an example of a result in which the proper noun portion 10 is extracted from the sentence 8 and displayed separately from the others.

第４図はKWICを用いた本発明の別の表示の例であり、固
有名詞にカーソルのようなポインティングデバイスを合
わせたときの状態の例を示している。図中、KWICの中の
キーワードに該当する部分を11、KWICの全体を12とす
る。FIG. 4 is another example of the display of the present invention using KWIC, showing an example of a state in which a pointing device such as a cursor is placed on the proper noun. In the figure, the part corresponding to the keyword in KWIC is 11, and the whole KWIC is 12.

第５図は単語辞書の内容の例を示している。13は見出し
であり、14は自立語あるいは付属語の区別を表す情報で
あり、15は品詞または分類の情報である。FIG. 5 shows an example of the contents of the word dictionary. Reference numeral 13 is a headline, 14 is information indicating a distinction between an independent word or an adjunct word, and 15 is information about a part of speech or a classification.

第６図は文法の例を示している。16は文節の条件を表し
たものである。［］はその中の要素が文節が成立するた
めの必須の条件であることを示している。その他の要素
は省略されることがありうる。FIG. 6 shows an example of grammar. 16 is the condition of the clause. [] Indicates that the element in it is an indispensable condition for the clause to be established. Other elements may be omitted.

第７図は形態素を組み上げて文節を形成するための情報
を表したものである。17は先行する形態素、18は後続す
る形態素、19はこれらの形態素をつないだときの文字列
が文節の終了条件を満足するか否かの情報を示すもので
ある。FIG. 7 shows information for assembling morphemes to form a phrase. Reference numeral 17 is a preceding morpheme, 18 is a succeeding morpheme, and 19 is information indicating whether or not the character string connecting these morphemes satisfies the ending condition of the phrase.

第８図は固有名詞を抽出し他と区別して表示するための
情報を示している。20は文字であり、21は表示画面の中
のその文字の存在する行の位置を示し、22は列の位置を
示している。又、23は固有名詞か否かを区別するための
情報である。24は固有名詞の開始と終了点を記述し、KW
ICなどを作成するとき利用するための情報である。他と
区別がつけば何であっても構わないが、説明を分かりや
すくするため、固有名詞の開始点をＳ、終了点をＥとし
ておく。FIG. 8 shows information for extracting proper nouns and displaying them so as to distinguish them from others. 20 is a character, 21 is the position of the line where the character exists in the display screen, and 22 is the position of the column. Further, 23 is information for distinguishing whether or not it is a proper noun. 24 describes the start and end points of proper nouns, KW
This information is used when creating ICs. It may be anything as long as it can be distinguished from the others, but in order to make the explanation easy to understand, the start point of the proper noun is set to S and the end point is set to E.

第９図は表示装置の特定の位置25がM,Nの行、列の番号
で記述できることを示したものである。FIG. 9 shows that the specific position 25 of the display device can be described by the numbers of rows and columns of M and N.

第10図は本発明の該略フロー図である。第２図の例文、
８を用いて、以下に詳しく述べる。FIG. 10 is a schematic flow chart of the present invention. The example sentence in Figure 2,
8 will be described in detail below.

まず、入力・編集手段１により入力された「これは空間
氏らの発明である。」という文が、入力記憶装置２に蓄
積される。26はこの処理ブロックである。First, the sentence “This is the invention of Mr. Shira et al.” Input by the input / editing means 1 is accumulated in the input storage device 2. 26 is this processing block.

各入力文字列は単語辞書、文法と照合され形態素の要素
が決定される。27は単語辞書照合の処理ブロックであ
る。ここでは、単語辞書検索の結果まず、「これ」が検
索され、代名詞であることが分かる。次に「は」が検索
され、係助詞であり代名詞と接続が可能で、第７図から
文節の終了条件も満足することが分かる。文節成立のた
めの条件、形態素間の接続の条件、文節終了の条件など
を文法を照合する処理ブロックである。これを28とす
る。Each input character string is collated with a word dictionary and grammar to determine morpheme elements. 27 is a processing block for word dictionary matching. Here, as a result of the word dictionary search, “kore” is searched first, and it is found that it is a pronoun. Next, "ha" is searched, and it is understood that it is a particle and can be connected to a pronoun, and that the ending condition of the phrase is also satisfied from FIG. It is a processing block that matches the grammar with conditions for clause establishment, conditions for connection between morphemes, conditions for ending phrases, and the like. This is 28.

その結果、辞書、文法の双方の照合に成功する場合とそ
うでない場合が生ずる。後者の場合は、エラー処理35が
行われ、次文字の処理に移る。As a result, there are cases where both the dictionary and the grammar are successfully matched, and cases where they are not. In the latter case, error processing 35 is performed, and the processing for the next character is started.

処理28では、第７図を用い、先行する形態素の要素と後
続する形態素の要素間の接続の可否を調べる。たとえ
ば、代名詞「これ」の場合は、これ単体では、文節の終
了を満足せず、「これは」まで文字列がつながって文節
になることが分かる。次に、「空間」の処理に進む。空
間は単語辞書によれば自立語で名詞の場合と氏を表わす
固有名詞の場合が存在する。In process 28, using FIG. 7, it is checked whether or not the connection between the element of the preceding morpheme and the element of the subsequent morpheme. For example, in the case of the pronoun “kore”, it can be seen that this alone does not satisfy the end of the phrase, and the character string is connected to “kowa” to form the phrase. Next, the process of "space" is performed. According to the word dictionary, there are cases where the space is a noun with independent words and a proper noun representing Mr.

そのような場合、いずれかを選択する必要がある。形態
素解析上矛盾の無いものは候補としてスタックに蓄積し
ておき、次の形態素の解析を行い、形態素間の接続関係
で候補の中から妥当なものを選択できる場合がある。た
とえば、名詞の「空間」と接尾語「氏」は接続が不可で
あるが、固有名詞の「空間」と接尾語「氏」は接続が可
能である。このような場合、最長一致法が知られてお
り、文節数を少なくし、文字列が一番長い候補を選択す
る。これにより固有名に関連した「空間氏」のほうが選
択される。同様な処理を繰り返すことにより、「空間氏
らの」が一つの文節として切り出される。この文節を組
み上げる処理ブロックを29とする。In such a case, it is necessary to select either one. In some cases, there is no contradiction in the morphological analysis, which is accumulated in the stack as a candidate, the next morpheme is analyzed, and a proper one can be selected from the candidates in the connection relation between the morphemes. For example, the noun “space” and the suffix “Mr.” cannot be connected, but the proper noun “space” and the suffix “Mr.” can be connected. In such a case, the longest match method is known, the number of clauses is reduced, and the candidate with the longest character string is selected. As a result, "Space Mr." associated with the proper name is selected. By repeating the same processing, "Space Mr. et al." Is cut out as one clause. The processing block that composes this clause is 29.

入力文字列のコードの照合順序は、特に固定されたもの
はなく、いずれの文字から行っても本発明に影響はな
い。ここでは、通常文章を読むときの順序、「これは空
間・・・」の順で考えている。The collating order of the codes of the input character string is not fixed in particular, and any character does not affect the present invention. Here, we consider the order of reading ordinary texts in the order of "This is a space ...".

27、28、29により、「空間・・」の「空」の文字の処の
24にＳの記号が付けられ、「間」の文字の24にＥが付け
られ、「空間」が一つの固有名詞であることが分かる。By 27, 28, 29, "space ..."
The symbol S is attached to 24, and the letter 24 of the character "ma" is attached to E, and it can be seen that "space" is one proper noun.

30は形態素解析処理の終了の判断を行うための処理ブロ
ックである。後続する文字があれば、その文字を制御装
置に呼び込み、スタックにセットし、形態素解析のため
の準備を行う。この処理ブロックを31とする。この処理
は、後続文字が無くなるまで繰り返される。Reference numeral 30 is a processing block for determining the end of the morphological analysis processing. If there is a subsequent character, that character is called into the control device, set on the stack, and prepared for morphological analysis. This processing block is 31. This process is repeated until there are no subsequent characters.

32は文字判断処理の終了後、第８図に示した情報が保存
されたスタックの結果をもとに、原文中の固有名詞を他
と区別して表示する処理ブロックである。Reference numeral 32 is a processing block for displaying the proper noun in the original sentence separately from the others based on the result of the stack in which the information shown in FIG.

33は固有名詞を抽出して表示する別の方法である。すな
わち、固有名詞をキーとしたKWICを生成する処理ブロッ
クである。33 is another method for extracting and displaying proper nouns. That is, it is a processing block that generates a KWIC using a proper noun as a key.

34は固有名詞を確認し、間違いがあれば修正・編集を行
う処理ブロックである。この中で、固有名詞の「空間」
は「空閑」と修正される。Reference numeral 34 is a processing block that confirms proper nouns and corrects / edits any mistakes. Among them, the proper noun "space"
Is corrected as "Kuga".

確認の終了後、一連の作業を終了することができる。After the confirmation is completed, a series of operations can be completed.

次に、上記の説明以外の実現手段の例について述べる。Next, examples of implementation means other than the above description will be described.

第８図で固有名詞とその他の文字との区別を記述する情
報と固有名詞の開始と終了を表す情報を区別して23、24
に保存したがこれを共用した構成も考えられる。In FIG. 8, the information describing the distinction between proper nouns and other characters and the information indicating the start and end of proper nouns are distinguished 23, 24
Although it was saved in, it is possible to use a shared configuration.

又、上の説明では原文中の固有名詞を抽出した後で、KW
ICを作成するようにしたが、この順序にはとらわれない
で良い。すなわち、原文中の固有名詞部分のみを抽出し
て、KWICの生成、表示は行わない方法、あるいは、KWIC
を生成してから原文中の固有名詞抽出する方法などもあ
りうる。抽出された、固有名詞の表示は、全部一様に表
示する方法、カーソルなどで指示された部分のみを表示
する方法などが考えられ、いずれの方法を採用しても本
発明に影響はない。Also, in the above explanation, after extracting proper nouns in the original sentence, KW
I made IC, but don't worry about this order. That is, only the proper noun part in the original sentence is extracted and KWIC is not generated or displayed, or KWIC is not displayed.
There may be a method of generating proper names and then extracting proper nouns from the original sentence. The extracted proper nouns may be displayed in a uniform manner, or may be displayed only in a portion designated by a cursor or the like. Whichever method is adopted, the present invention is not affected.

又、固有名詞の校正は、原文中の抽出された固有名詞部
分に対して直接に行う方法、KWICの中のキーワードに対
し行い、その結果を原文中に取り込む方法などが考えら
れ、いずれの方法を採用しても本発明に影響はない。In addition, proper nouns can be calibrated directly by using the extracted proper nouns in the original text, or by using keywords in KWIC and incorporating the results in the original text. Even if the above is adopted, the present invention is not affected.

＜発明の効果＞本発明の効果は、固有名詞が重要な意味を持つ、文書の
中から、固有名詞部分のみを抽出し、確認・校正を行い
やすくできる点にある。<Effects of the Invention> An effect of the present invention is that proper nouns have an important meaning, and only proper nouns can be extracted from a document to facilitate confirmation and proofreading.

又、本発明によれば、固有名詞部分が確実に抽出される
ため、校正の精度を高めるという点で効果がある。Further, according to the present invention, the proper noun portion is reliably extracted, which is effective in improving the accuracy of calibration.

又、原文中から固有名詞の部分を抽出する手間が省ける
ため、校正の処理時間の短縮、校正を行う人の精神的負
担や疲労を軽減という点で効果がある。Further, since it is possible to omit the work of extracting the proper noun portion from the original sentence, it is effective in reducing the processing time of the proofreading and reducing the mental burden and fatigue of the person who performs the proofreading.

又、音声読み上げ装置、スピーカー、音声合成装置など
の特別な装置を必要としないとういう利点がある。Further, there is an advantage that no special device such as a voice reading device, a speaker, a voice synthesizing device is required.

又、ヘッオフォンなどの特別な装置の装着が不必要であ
り、装着の手間及び装置購入が省略できるという利点が
ある。In addition, there is an advantage that a special device such as a phonophone is not required to be attached, and the labor and the purchase of the device can be omitted.

又、言語処理、音声処理などの複雑な処理が不必要であ
り、処理のプログラム作成、メンテナンス、記憶容量の
点で効果がある。In addition, complicated processing such as language processing and voice processing is unnecessary, and it is effective in terms of processing program creation, maintenance, and storage capacity.

又、簡単な処理で実行することができるため、処理速度
が速いという点で効果がある。Further, since it can be executed by a simple process, it is effective in that the processing speed is high.

又、音声のような騒音の原因になる要素を含んでいない
ため周囲への影響を考えないで構成の設置や作業ができ
るという点で効果がある。In addition, since it does not include a noise-causing element such as voice, it is effective in that the configuration and the work can be performed without considering the influence on the surroundings.

[Brief description of drawings]

第１図は本発明装置の構成ブロック図、第２図は入力手
段により入力された文字列の例を示す図、第３図は表示
の例を示す図、第４図はKWICを用いた表示の例を示す
図、第５図は単語辞書の内容の例を示す図、第６図は文
法の例を示す図、第７図は文節成立条件の例を示す図、
第８図は固有名詞抽出スタック情報の例を示す図、第９
図は表示装置の特定点の指定方法の例を示す図、第10図
は本発明の概略フロー図である。１……入力・編集手段２……入力文字記憶手段３……辞書記憶手段４……文法記憶手段５……制御手段６……表示手段７……校正手段８……誤りを含んだ例文９……正しい例文 10……文章中の固有名詞の部分 11……KWIC中のキーワード 12……KWIC全体 13……単語辞書中の見出し 14……単語辞書中の自立語、付属語区別情報 15……単語辞書中の品詞、区別情報 16……文節成立条件 17……先行形態素 18……後続形態素 19……文節終了条件 20……画面表示用の文字スタック 21……画面表示用の行位置スタック 22……画面表示用の列位置スタック 23……画面表示用の固有名詞判定スタック 24……固有名詞判定情報スタック 25……表示装置上の特定な位置 26……入力文字列蓄積処理ブロック 27……単語辞書照合処理ブロック 28……文法照合処理ブロック 29……文節決定照合処理ブロック 30……形態素解析終了処理ブロック 31……次文字処理ブロック 32……固有名詞抽出、表示処理ブロック 33……固有名詞KWIC表示処理ブロック 34……校正処理ブロック 35……エラー処理ブロックFIG. 1 is a block diagram of the device of the present invention, FIG. 2 is a diagram showing an example of a character string input by input means, FIG. 3 is a diagram showing an example of display, and FIG. 4 is a display using a KWIC. , FIG. 5 is a diagram showing an example of the contents of a word dictionary, FIG. 6 is a diagram showing an example of a grammar, FIG. 7 is a diagram showing an example of a clause establishment condition,
FIG. 8 is a diagram showing an example of proper noun extraction stack information, and FIG.
FIG. 10 is a diagram showing an example of a method of designating a specific point of a display device, and FIG. 10 is a schematic flow chart of the present invention. 1 ... input / editing means 2 ... input character storage means 3 ... dictionary storage means 4 ... grammar storage means 5 ... control means 6 ... display means 7 ... proofreading means 8 ... example sentences including errors 9 …… Correct example sentence 10 …… Part of proper noun in the sentence 11 …… Keyword in KWIC 12 …… Entire KWIC 13 …… Heading in word dictionary 14 …… Independent word in word dictionary, auxiliary word distinction information 15… … Part of speech in word dictionary, distinction information 16 …… Branch formation condition 17 …… Preceding morpheme 18 …… Succeeding morpheme 19 …… Phrase ending condition 20 …… Character stack for screen display 21 …… Line position stack for screen display 22 …… Column position stack for screen display 23 …… Proper noun decision stack for screen display 24 …… Proper noun decision information stack 25 …… Specific position on display 26 …… Input character string accumulation processing block 27… … Word dictionary matching processing block 28 …… Grammar matching processing block C 29 ... Phrase decision matching processing block 30 ... Morphological analysis end processing block 31 ... Next character processing block 32 ... Proper noun extraction and display processing block 33 ... Proper noun KWIC display processing block 34 ... Proofreading processing block 35 ...... Error processing block

───────────────────────────────────────────────────── フロントページの続き (72)発明者森下太朗大阪府大阪市阿倍野区長池町22番22号シャープ株式会社内 (72)発明者中村信夫大阪府大阪市阿倍野区長池町22番22号シャープ株式会社内 (56)参考文献特開昭55−43627（ＪＰ，Ａ) 特開昭60−123960（ＪＰ，Ａ) 特開昭63−94365（ＪＰ，Ａ) ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Taro Morishita 22-22 Nagaike-cho, Abeno-ku, Osaka City, Osaka Prefecture Sharp Corporation (72) Nobuo Nakamura 22-22, Nagaike-cho, Abeno-ku, Osaka City, Osaka Prefecture Incorporated (56) Reference JP 55-43627 (JP, A) JP 60-123960 (JP, A) JP 63-94365 (JP, A)

Claims

[Claims]

1. A means for inputting / editing Japanese, a means for storing the input Japanese, a dictionary for storing a dictionary, a grammar for storing a grammar, and a dictionary for storing the input Japanese. A means for extracting a character / symbol string to be proofed from the inside, a morphological analysis of a candidate character / symbol string corresponding to the input Japanese by the dictionary means and the grammar means,
Detecting means for extracting a part of a morpheme related to a proper noun and detecting the proper noun, and means for responding to the detecting means and displaying only the proper noun in reverse video so as to be distinguished from other sentence parts. A document creation / proofreading support, characterized in that, when there is a candidate character / symbol string to be proofread by the proper noun displayed by the display means, the candidate character / symbol string is corrected. apparatus.