JP2008102802A

JP2008102802A - Document support system and document support program

Info

Publication number: JP2008102802A
Application number: JP2006285678A
Authority: JP
Inventors: Kazunari Tsuhara; 一成津原; Keiko Fujii; 桂子藤井; Sachiko Yoshimura; 幸子吉村
Original assignee: Kyocera Mita Corp
Current assignee: Kyocera Document Solutions Inc
Priority date: 2006-10-20
Filing date: 2006-10-20
Publication date: 2008-05-01

Abstract

<P>PROBLEM TO BE SOLVED: To provide a document support system which supports users other than a document creator to understand words etc. without misunderstanding and to view annotations therefor on a monitor screen or on a printout, and adds annotations without conversion of adding the annotation to original sentences, while keeping original expressions as they are. <P>SOLUTION: The document support system includes: an analysis part 6 for analyzing words included in sentences of an acquired document file; an extraction part 7 for extracting words to be annotated when they are included in words analyzed by the analysis part 6; a retrieval part 9 for calling annotations corresponding to the words to be annotated, which are extracted by the extraction part 7, from a dictionary database 8 prestored with the annotations; and a writing part 10 for generating an annotated document file by adding the annotation called by the retrieval part 9 to the document by using a document format of the document file analyzed by the analysis part 6. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、作成された文書ファイルの文章中に含まれる単語（語彙）等に対する注釈を付与する文書支援システム及び文書支援プログラムに関する。 The present invention relates to a document support system and a document support program for giving annotations to words (vocabulary) included in a sentence of a created document file.

従来から、パーソナルコンピュータにインストールされた日本語辞書に基づいてワードプロセッサ等で文書を作成している作業中に単語変換（漢字変換）等を行う場合に、モニタ画面上に同音異義語の意味を注釈的に表示することによって誤った漢字変換を行わないようにする文書支援システムが知られている。 Conventionally, when performing word conversion (kanji conversion) while creating a document with a word processor based on a Japanese dictionary installed on a personal computer, the meaning of the homonyms on the monitor screen is annotated. Document support systems are known that prevent erroneous Kanji conversion by displaying them dynamically.

また、作成した文章中の単語等を解析し、同一文章中の複数個所で用いている単語（語彙）の使用形態が異なっている場合（例えば、「コンピュータ」と「コンピューター」）にモニタ画面上の該当単語中にアンダーライン等を付与することで同一の使用形態に変換するワードチェックツールや、文章中に使用されている単語（語彙）が複数個所で用いられているような場合に強調語として表示フォントサイズ等を変換する強調変換ツールといった文書支援システムも知られている（例えば、特許文献１参照。）。 In addition, when words etc. in the created sentence are analyzed and the usage forms of words (vocabulary) used in multiple places in the same sentence are different (for example, “Computer” and “Computer”) on the monitor screen Word check tool to convert to the same usage pattern by adding underline etc. to the corresponding word of, or emphasized word when the word (vocabulary) used in the sentence is used in multiple places Document support systems such as emphasis conversion tools that convert display font sizes and the like are also known (for example, see Patent Document 1).

さらに、上述した同音異義語の意味を注釈的に表示する文書支援システムとして、単語変換時のみならず、変換決定後の文章中に含まれる単語（語彙）を検索し、利用者の理解補助や誤解防止の観点から必要があると判断した単語に対してポップアップメニューやサイクリック表示などによって表現の変換を可能とした文書支援システムも知られている（例えば、特許文献２参照。）。 Furthermore, as a document support system that displays the meanings of the above-mentioned homonyms as annotations, the word (vocabulary) included in the sentence after the conversion decision is searched for not only at the time of word conversion, There is also known a document support system that enables expression conversion of a word that is determined to be necessary from the viewpoint of preventing misunderstanding by using a pop-up menu, cyclic display, or the like (see, for example, Patent Document 2).

これにより、例えば、「ＡＴＭ」という略語に関し、金融業界で用いた場合の「ＡＴＭ」（＝現金自動預入支払機［Ａｕｔｏｍａｔｅｄ
ｔｅｌｌｅｒｍａｃｈｉｎｅ］）と、軍事業界で用いた場合の「ＡＴＭ」（＝対戦車ミサイル［Ａｎｔｉ−ＴａｎｋＭｉｓｓｉｌｅ］）の何れの意味を意図して文書作成者が用いているかを作成者が指定することにより、他の利用者がこの文書を閲覧した場合の理解補助や誤解防止を実現することができる。
特開平１１−２３８０５８号公報特開２００４−１５１８８２号公報 Thus, for example, regarding the abbreviation “ATM”, when used in the financial industry, “ATM” (= automatic teller machine [Automated Cash Machine]
teller machine]) and “ATM” (= Anti-Tank Missile) when used in the military industry, the author must specify what the document creator is using. Thus, it is possible to realize understanding assistance and prevention of misunderstanding when another user views this document.
Japanese Patent Laid-Open No. 11-238058 JP 2004-151882 A

ところが、上記の如く構成された文書支援システムにあっては、漢字変換等を行う場合に同音異義語の意味を注釈的に表示するものにあっては、主として文書作成中に文書作成者の誤変換を防止するために用いられ、他の利用者がモニタ画面上或いはプリント出力した状態での理解補助や誤解防止に貢献することはできなかった。 However, in the document support system configured as described above, when performing kanji conversion, etc., the meaning of the homonym is annotated, which is mainly an error of the document creator during document creation. It was used to prevent conversion, and could not contribute to assisting understanding or preventing misunderstandings on the monitor screen or printed out by other users.

また、作成した文章中の単語等を解析するワードチェックツールや強調変換ツールにあっては、単なるワードチャックや文字強調といった機能のみに用いられ、検索した単語（語彙）等に対する理解補助や誤解防止に貢献するものではなかった。 In addition, word check tools and emphasis conversion tools that analyze words in created sentences are used only for functions such as simple word chucking and character emphasis, and assist in understanding and preventing misunderstanding of searched words (vocabulary). Did not contribute to.

さらに、変換決定後の文章中に含まれる単語（語彙）を検索し、利用者の理解補助や誤解防止の観点から必要があると判断した単語に対してポップアップメニューやサイクリック表示などによって表現の変換を可能とした文書支援システムにあっては、利用者の観点からのみ単語（語彙）等の意味を注釈的に添付又は変換することができるものであり、しかも、その注釈表示はモニタ画面上でのみ可能であるという問題があった。 In addition, words (vocabulary) included in the text after the conversion decision is searched, and words that are judged to be necessary from the viewpoint of user assistance and prevention of misunderstandings are expressed by a pop-up menu or cyclic display. In a document support system that enables conversion, the meaning of a word (vocabulary) or the like can be annotated or converted only from the user's point of view, and the annotation display is on the monitor screen. There was a problem that was possible only with.

そこで、本発明は、上記事情を考慮し、文書作成者以外の利用者に対する単語等の理解補助や誤解防止を実現することができる文書支援システムを提供することを目的とする。 In view of the above circumstances, an object of the present invention is to provide a document support system that can realize word assistance and prevention of misunderstandings for users other than the document creator.

また、理解補助や誤解防止のための注釈を、モニタ画面上のみならずプリント出力した場合であっても目視可能とすることができ、しかも、元の文章に注釈を添付するといった変換を行うことなく元の表現を残したまま注釈を追加することができる文書支援システム及び文書支援プログラムを提供することを目的とする。 In addition, annotations for assisting understanding and preventing misunderstandings can be viewed not only on the monitor screen but also when printed out, and conversion is performed such that the annotation is attached to the original text. It is an object of the present invention to provide a document support system and a document support program that can add annotations without changing the original expression.

本発明の文書支援システムは、文書ファイルを取得すると共にその取得した文書ファイルの文章中に含まれる単語を解析する解析部と、該解析部で解析した単語中に注釈が必要な注釈対象単語が含まれている場合にそれを抽出する抽出部と、該抽出部で抽出した注釈対象単語に対応する注釈を予め格納された辞書データベースから呼び出す検索部と、該検索部で呼び出した注釈を前記解析部で解析した文書ファイルの文書フォーマットを利用して注釈付文書ファイルを作成する書込部とを備えていることを特徴とする。 The document support system of the present invention acquires a document file and analyzes a word included in the sentence of the acquired document file, and an annotation target word that requires an annotation in the word analyzed by the analysis unit. An extraction unit for extracting the annotation when it is included, a search unit for calling an annotation corresponding to the annotation target word extracted by the extraction unit from a dictionary database stored in advance, and the analysis for the annotation called by the search unit And a writing unit that creates an annotated document file using the document format of the document file analyzed by the unit.

この際、前記書込部は、文書フォーマットで指定された余白部分又は最終頁に注釈を添付した注釈付文書ファイルを作成するのが好ましい。 At this time, it is preferable that the writing unit creates an annotated document file in which an annotation is attached to a blank part specified in the document format or the last page.

また、前記辞書データベースには、複数の辞書に対応するデータベースが優先順序立てして格納されていることが好ましい。 The dictionary database preferably stores databases corresponding to a plurality of dictionaries in a priority order.

さらに、前記抽出部は、予め規定された選択基準に応じた名詞を注釈対象単語として抽出するのが好ましい。この際、その選択基準としては、外来語等の可能性の高いカタカナ単語や略語等の可能性の高い大文字のみから構成された英数字を優先して注釈対象単語とすることが考えられる。 Furthermore, it is preferable that the extraction unit extracts a noun according to a predetermined selection criterion as an annotation target word. In this case, as a selection criterion, it is conceivable to preferentially use alphanumeric characters composed only of capital letters having high possibility such as foreign words and abbreviated words and abbreviations as annotation target words.

また、前記抽出部は、文章中に複数回使用されている同一単語は最初の１回のみ注釈対象単語として抽出するのが好ましい。 In addition, it is preferable that the extraction unit extracts the same word that is used a plurality of times in the sentence as an annotation target word only once.

さらに、前記書込部は、文書フォーマットと注釈対象単語数とを考慮して前記検索部によって呼び出した前記辞書データベースに格納された注釈文のうち最初の指定文字数分のみを注釈として使用しても良い。 Further, the writing unit may use only the first designated number of characters as an annotation among the annotation sentences stored in the dictionary database called by the search unit in consideration of the document format and the number of annotation target words. good.

本発明の文書支援システムによれば、文書ファイルの文章中に注釈が必要であると想定される単語を注釈対象単語として解析し、その注釈対象単語に対応する注釈を含む注釈付文書ファイルを作成することにより、文書作成者による文書作成字や文書作成者が任意に指定した単語のみならず、幅広い注釈付きの文章とすることができる。 According to the document support system of the present invention, an annotated document file including an annotation corresponding to the annotation target word is generated by analyzing a word assumed to require an annotation in a document file as an annotation target word. By doing so, not only the document creation characters by the document creator and the words arbitrarily designated by the document creator but also a wide annotated sentence can be obtained.

さらに、本発明の文書支援プログラムは、文書ファイルの文章中に含まれる単語を解析し、その解析した単語中に注釈が必要であると想定される注釈対象単語を抽出した後、予め格納された辞書データベースを検索して注釈対象単語に対応する注釈が存在している場合に文書ファイルの文書フォーマットを利用して注釈付文書ファイルを作成する。 Furthermore, the document support program of the present invention analyzes words included in the text of the document file, extracts the annotation target words that are assumed to require annotation in the analyzed words, and stores them in advance. When an annotation corresponding to the annotation target word exists in the dictionary database, an annotated document file is created using the document format of the document file.

本発明の文書支援システムは、文書作成者以外の利用者に対する単語等の理解補助や誤解防止を実現することができるばかりでなく、理解補助や誤解防止のための注釈をモニタ画面上のみならずプリント出力した場合であっても目視可能とすることができ、しかも、元の文章に注釈を添付するといった変換を行うことなく元の表現を残したまま注釈を追加することができる。 The document support system of the present invention can not only realize understanding of words and the like and prevent misunderstandings for users other than the document creator, but also provide annotations for understanding assistance and prevention of misunderstandings not only on the monitor screen. Even if it is printed out, it can be made visible, and an annotation can be added while leaving the original expression without conversion such as attaching an annotation to the original sentence.

次に、本発明の一実施形態に係る文書支援システム及び文書支援プログラムについて、図面を参照して説明する。 Next, a document support system and a document support program according to an embodiment of the present invention will be described with reference to the drawings.

図１は本発明の一実施形態に係る文書支援システムのシステム構成例の説明図、図２は本発明の一実施形態に係る文書支援システムのシステム構成例のブロック説明図、図３は本発明の一実施形態に係る文書支援システムで作成された文書ファイル例の説明図、図４は本発明の一実施形態に係る文書支援システムルーチンのフロー図である。 1 is an explanatory diagram of a system configuration example of a document support system according to an embodiment of the present invention, FIG. 2 is a block diagram of a system configuration example of a document support system according to an embodiment of the present invention, and FIG. FIG. 4 is an explanatory diagram of an example of a document file created by the document support system according to the embodiment, and FIG. 4 is a flowchart of the document support system routine according to the embodiment of the present invention.

図１に示すように、本発明の一実施形態に係る文書支援システムは、複数のパーソナルコンピュータ１及び複合機２がネットワーク３を介してサーバー４に接続されている。尚、このようなネットワークシステムは、図示したものに限定されるものではない。また、サーバー４は、インターネット５等と接続することも可能である。 As shown in FIG. 1, in a document support system according to an embodiment of the present invention, a plurality of personal computers 1 and a multifunction device 2 are connected to a server 4 via a network 3. Such a network system is not limited to the one shown in the figure. The server 4 can also be connected to the Internet 5 or the like.

サーバー４は、図２に示すように、パーソナルコンピュータ１にインストールされたワードプロセッサソフトを利用して作成・保存された文書ファイルを取得すると共にその取得した文書ファイルの文章中に含まれる単語を解析する解析部６と、解析部６で解析した単語中に注釈が必要な注釈対象単語が含まれている場合にそれを抽出する抽出部７と、抽出部７で抽出した注釈対象単語に対応する注釈を予め格納された辞書データベース８から呼び出す検索部９と、検索部９で呼び出した注釈を解析部６で解析した文書ファイルの文書フォーマットを利用して注釈付文書ファイルを作成する書込部１０とを備えている。 As shown in FIG. 2, the server 4 acquires a document file created and stored using word processor software installed in the personal computer 1 and analyzes words included in the sentence of the acquired document file. An analysis unit 6, an extraction unit 7 that extracts an annotation target word that needs an annotation in the word analyzed by the analysis unit 6, and an annotation corresponding to the annotation target word extracted by the extraction unit 7 A search unit 9 that calls from a dictionary database 8 stored in advance, and a writing unit 10 that creates an annotated document file using a document format of a document file obtained by analyzing an annotation called by the search unit 9 using an analysis unit 6; It has.

解析部６は、取得した文書ファイルの文章を句読点や記号（例えば、コンマ、ピリオド、スラッシュ等）毎に区切って形態素解析し、単語（語彙）に分割する。 The analysis unit 6 divides the sentence of the acquired document file into punctuation marks and symbols (for example, commas, periods, slashes, etc.) and morphologically analyzes them, and divides them into words (vocabulary).

抽出部７は、予め規定された選択基準に応じた名詞を注釈対象単語として抽出する。この際、予め規定された選択基準としては、例えば、人名・地名・国名・団体名（会社名・学校名など）・年号（元号と西暦等との対応）・商号・作品名（書名・映画の題名等）等の固有名詞や、外来語等の可能性の高いカタカナ単語や何らかの略語としての可能性の高い大文字のみから構成された英数字（例えば、「ＡＴＭ」「ＭＰ３」等）を優先して注釈対象単語として抽出するといったように、業界等に応じて設定することができる。また、抽出部７は、文章中に複数回使用されている同一単語は最初の１回のみ注釈対象単語として抽出することで重複を避けることができる。 The extraction unit 7 extracts a noun according to a predetermined selection criterion as an annotation target word. In this case, the selection criteria specified in advance include, for example, the name of a person, place, country, organization, name (company name, school name, etc.), year (correspondence between the era name and the year), trade name, title of work (book title) · Alphabet composed of proper nouns such as movie titles), katakana words that are likely to be foreign words, or capital letters that are likely to be abbreviations (eg, “ATM” “MP3”, etc.) Can be set according to the industry or the like. Further, the extraction unit 7 can avoid duplication by extracting the same word used multiple times in the sentence as the annotation target word only once for the first time.

辞書データベース８には、複数の辞書に対応するデータベースが優先順序立てして格納されている。これにより、業界等で異なった意味として用いられる重複単語が存在していても、文書作成者が意図する使用形態と一致する注釈を添付することができる。 The dictionary database 8 stores databases corresponding to a plurality of dictionaries in priority order. Thereby, even if there is a duplicate word used as a different meaning in the industry or the like, it is possible to attach an annotation that matches the usage pattern intended by the document creator.

この際、複数辞書の優先順序設定は、業界等の利用者によって様々であるため、利用者側で設定することができる。 At this time, the priority order setting of a plurality of dictionaries varies depending on users in the industry and the like, and can be set on the user side.

尚、辞書データベース８に格納された注釈は、通常の辞書等に記載されている内容よりも簡素な文字数の少ない注釈文で格納するのが好ましい。 Note that the annotation stored in the dictionary database 8 is preferably stored as a simple annotation with a smaller number of characters than the contents described in a normal dictionary or the like.

検索部９は、抽出部７で抽出された単語（語彙）が辞書データベース８に格納されているか否かを検索し、辞書データベース８に該当する単語が含まれていた場合に辞書データベース８からその該当した単語に対する注釈を取得する。また、辞書データベース８に該当する単語が含まれていなかった場合にはスキップして次の単語を検索すると同時に少なくとも同一文書ファイルに対する次回以降（次頁以降）の検索対象から除外する。尚、検索部９は、辞書データベース８以外として（又は併用して）、インターネット５を経由して注釈対象単語を検索して説明文を取得しても良い。この際、辞書データベース８の場合と同様に複数のサイト（専門用語の解説サイトなど）を予め登録しておくと同時に、そのサイトの検索順序等を指定しておくことにより精度を向上することができる。 The search unit 9 searches whether or not the word (vocabulary) extracted by the extraction unit 7 is stored in the dictionary database 8. If the dictionary database 8 includes a corresponding word, the search unit 9 searches the dictionary database 8 for the word (vocabulary). Get annotations for the corresponding word. If the corresponding word is not included in the dictionary database 8, the next word is skipped and at the same time, at the same time, it is excluded from at least the next search (next page and later) for the same document file. Note that the search unit 9 may acquire an explanatory text by searching for an annotation target word via the Internet 5 other than (or in combination with) the dictionary database 8. At this time, as in the case of the dictionary database 8, a plurality of sites (such as technical term commentary sites) are registered in advance, and at the same time, by specifying the search order of the sites, the accuracy can be improved. it can.

書込部１０は文書フォーマットで指定された余白部分又は最終頁に注釈を添付した注釈付文書ファイルを作成する。また、書込部１０は、例えば、余白スペース等には文字数の制限が発生することから、文書フォーマットと注釈対象単語数とを考慮して検索部９によって呼び出した辞書データベース８に格納された注釈文のうち最初の指定文字数分のみを注釈として使用しても良い。さらに、書込部１０は作成した注釈付文書ファイルを元の文書ファイルのファイル名とは異なった注釈付文書ファイル名でパーソナルコンピュータ１に出力することにより、注釈文無しの原本と注釈文有りの新たなファイルとを別々に保存することができる。 The writing unit 10 creates an annotated document file in which an annotation is attached to a blank portion specified in the document format or the last page. Further, the writing unit 10 has an annotation stored in the dictionary database 8 called by the search unit 9 in consideration of the document format and the number of annotation target words, for example, because the number of characters is limited in the blank space. Only the first specified number of characters in the sentence may be used as an annotation. Furthermore, the writing unit 10 outputs the created annotated document file to the personal computer 1 with an annotated document file name that is different from the original document file name, so that the original with no annotated sentence and a new one with an annotated sentence are present. Files can be saved separately.

尚、書込部１０は、パーソナルコンピュータ１から複合機２へと文書ファイルを含む印刷指令データが出力された場合においても、その印刷指令データを解析部６で取得し、注釈付文書ファイルの新たな印刷指令データとして複合機２へと出力しても良い。また、文書作成中においても、例えば、所定時間毎や所定漢字変換操作数をトリガーとしてワードプロセッサソフト上で設定したバックアップデータ保存時等に、上述した解析部６から書込部１０を経由して注釈分を余白等に作成し、モニタ画面上で確認しつつ文書作成をすることも可能である。この際、文書作成者が注釈対象単語として指定することも可能であり、その注釈を選択（指定）することも可能である。さらに、利用者側においても注釈付文書ファイルを呼び出してモニタ画面を確認しつつ、不要な注釈文を削除することも可能である。 Even when print command data including a document file is output from the personal computer 1 to the multi-function device 2, the writing unit 10 acquires the print command data with the analysis unit 6 and creates a new annotated document file. The print command data may be output to the multifunction device 2. Further, during document creation, for example, when storing backup data set on the word processor software with a predetermined number of times or a predetermined number of Kanji conversion operations as a trigger, an annotation is made from the analysis unit 6 via the writing unit 10 described above. It is also possible to create a document while creating the minutes in margins and confirming them on the monitor screen. At this time, the document creator can also designate it as an annotation target word, and can select (specify) the annotation. Furthermore, it is possible for the user side to call out the annotated document file and check the monitor screen while deleting unnecessary annotation sentences.

これにより、図３に示すように、プリントアウト後の用紙１１（又は、印刷表示モードでのモニタ画面上）には、書込部１０で余白部（例えばＡ４縦向きのレイアウトならば下側）１１ａに注釈が追記されると共に、本文中の注釈対象単語に注１）、注２）・・・等の注釈記号が添記される。 As a result, as shown in FIG. 3, in the paper 11 after printing out (or on the monitor screen in the print display mode), the writing unit 10 has a margin (for example, the lower side in the case of an A4 portrait layout). An annotation is added to 11a, and annotation symbols such as Note 1), Note 2), etc. are added to the annotation target word in the text.

尚、ワードプロセッサソフト上で設定された余白部分が少ない場合には、注釈対象単語数を考慮して注釈文を短くしたり、注釈文のフォントサイズを小さくしたり、余白面積を広げる（行間を短くしたり行数を少なくして次頁以降に繰り越す等）といったレイアウト変更を可能としても良い。また、辞書データベース８に、同一頁内での注釈対象単語数が１つの場合、２つの場合、３つの場合等に応じた専用の注釈文を複数格納し、注釈単語数に応じて検索部９で選択的に用いることも可能である。 If there are few margins set on the word processor software, consider the number of annotation target words, shorten the annotation sentence, reduce the font size of the annotation sentence, and increase the margin area (shorten the line spacing). It is also possible to change the layout, such as reducing the number of lines and moving forward to the next page. The dictionary database 8 stores a plurality of dedicated annotation sentences corresponding to the number of annotation target words in the same page, two cases, three cases, etc., and the search unit 9 according to the number of annotation words. It can also be used selectively.

次に、図４のフロー図を参照しつつ、本発明の文書支援システムの注釈文作成手順を説明する。 Next, the comment sentence creation procedure of the document support system of the present invention will be described with reference to the flowchart of FIG.

（ステップＳ１）
ステップＳ１では、解析部６により、文書作成者が作成した文書ファイルをその作成者又は他の利用者のパーソナルコンピュータ１から取得してステップＳ２へと移行する。 (Step S1)
In step S1, the analysis unit 6 acquires the document file created by the document creator from the personal computer 1 of the creator or another user, and proceeds to step S2.

（ステップＳ２）
ステップＳ２では、解析部６により、取得した文書ファイルから図柄等が含まれている場合を想定し、その部分を除いた文章を構成する文章データ（テキストデータ）のみを対象とし、その文章データの句読点や記号（例えば、コンマ、ピリオド、スラッシュ等）毎に区切って形態素解析しつつ単語（語彙）に分割してステップＳ３へと移行する。 (Step S2)
In step S2, assuming that a pattern or the like is included from the acquired document file by the analysis unit 6, only sentence data (text data) constituting the sentence excluding the part is targeted, and the sentence data The punctuation marks and symbols (for example, commas, periods, slashes, etc.) are divided into words (vocabulary) while being divided into morphemes and the process proceeds to step S3.

尚、この単語（語彙）分割は、ＪＩＳの区点コード等に基づいて文字を参酌すると同時に、その句点コードの組み合わせから、予め単語登録された単語を抽出するなど、公知の単語検索機能を利用して分割する。 This word (vocabulary) segmentation uses a well-known word search function such as extracting a word registered in advance from a combination of punctuation codes at the same time as taking into account characters based on JIS mark codes. And split.

（ステップＳ３）
ステップＳ３では、抽出部７により、解析部６で分割された単語のうち、文頭から１つ目の単語を参酌してステップＳ４へと移行する。尚、このステップＳ３で、抽出部７は、参酌した単語は参酌済み単語として記憶処理しておく。 (Step S3)
In step S3, the extraction unit 7 refers to the first word from the beginning of the words divided by the analysis unit 6, and proceeds to step S4. In step S3, the extraction unit 7 stores the visited word as a visited word.

（ステップＳ４）
ステップＳ４では、抽出部７により、参酌した単語が難解な単語等、注釈の必要がある注釈対象単語であるか否かが判断され、注釈対象単語である場合にはステップＳ５へと移行し、注釈対象単語で無い場合にはステップＳ７へとスキップする。 (Step S4)
In step S4, the extraction unit 7 determines whether or not the considered word is an annotation target word that needs to be annotated, such as an esoteric word. If it is an annotation target word, the process proceeds to step S5. If it is not an annotation target word, the process skips to step S7.

（ステップＳ５）
ステップＳ５では、検索部９により、ステップＳ４で注釈対象単語であると判断した単語に対応する注釈が辞書データベース８に登録されているか否が確認され、注釈有りの場合にはステップＳ６へと移行し、注釈無しの場合にはステップＳ７へとスキップする。 (Step S5)
In step S5, the search unit 9 checks whether or not an annotation corresponding to the word determined to be an annotation target word in step S4 is registered in the dictionary database 8. If there is an annotation, the process proceeds to step S6. If there is no comment, the process skips to step S7.

（ステップＳ６）
ステップＳ６では、検索部９により、注釈対象単語に対応する注釈を辞書データベース８から取得してステップＳ７へと移行する。 (Step S6)
In step S6, the search unit 9 acquires an annotation corresponding to the annotation target word from the dictionary database 8, and the process proceeds to step S7.

（ステップＳ７）
ステップＳ７では、解析部６により、注釈対象単語の有無処理を行っていない次の単語があるか否かが判断され、以下、全文章データに対する注釈対象単語の検索が終了するまでしてステップＳ２〜ステップＳ６のルーチンを繰り返し、全文章データに対する注釈対象単語の検索が終了したらばステップＳ８へと移行する。 (Step S7)
In step S7, the analysis unit 6 determines whether or not there is a next word that has not been subjected to the annotation target word presence / absence processing. When the routine of Step S6 is repeated and the search for the annotation target word for all the text data is completed, the routine proceeds to Step S8.

（ステップＳ８）
ステップＳ８では、書込み部１０により、注釈対象単語に対する注釈を、文書フォーマットで指定された余白部分の該当頁に（又は最終頁にまとめて）、その注釈数（注釈対象単語数）や注釈文字数などを考慮して注釈を追加したうえで、新たな文書データ（注釈付文書ファイル）として保存してパーソナルコンピュータ１に出力する。 (Step S8)
In step S8, the writing unit 10 adds annotations to the annotation target word to the corresponding page of the blank portion specified in the document format (or to the last page), the number of annotations (number of annotation target words), the number of annotation characters, and the like. Is added to the personal computer 1 and stored as new document data (annotated document file).

このように、本発明の文書支援システムにあっては、文書作成者以外の利用者がその文書を閲覧（モニタ画面・プリント出力を問わない）した場合に、理解しづらい専門用語等の理解補助や誤解の虞がある単語が含まれている可能性があることから、文書作成者又は利用者が文書ファイルをサーバー４へ送信することにより、そのサーバー４でその文書ファイルを解析し、余白部分に説明が必要と思われる語句への注釈を付加する新たな注釈付文書ファイルを作成することによって、利用者は注釈付の分かりやすい文章を手に入れることができる。 As described above, in the document support system of the present invention, when a user other than the document creator browses the document (regardless of whether it is a monitor screen or print output), it assists understanding of technical terms that are difficult to understand. Or a word that may be misunderstood, the document creator or the user sends the document file to the server 4 to analyze the document file, and the margin part By creating a new annotated document file that adds annotations to words that may need explanation, users can get annotated and easy-to-understand text.

ところで、上記実施の形態では、本発明の文書支援システムをサーバー４で行う旨を開示したが、各パーソナルコンピュータ１で行うようにしても良い。この際、本発明の文書支援システムは、辞書データベース７を含めたソフトウエアとしてパーソナルコンピュータ１にインストールすることで実現可能となる。 By the way, in the above embodiment, it has been disclosed that the document support system of the present invention is performed by the server 4, but may be performed by each personal computer 1. At this time, the document support system of the present invention can be realized by installing it in the personal computer 1 as software including the dictionary database 7.

本発明の一実施形態に係る文書支援システムのシステム構成例の説明図である。It is explanatory drawing of the system configuration example of the document assistance system which concerns on one Embodiment of this invention. 本発明の一実施形態に係る文書支援システムのシステム構成例のブロック説明図である。It is a block explanatory diagram of a system configuration example of a document support system according to an embodiment of the present invention. 本発明の一実施形態に係る文書支援システムで作成された文書ファイル例の説明図である。It is explanatory drawing of the example of a document file produced with the document assistance system which concerns on one Embodiment of this invention. 本発明の一実施形態に係る文書支援システムルーチンのフロー図である。It is a flowchart of the document assistance system routine which concerns on one Embodiment of this invention.

Explanation of symbols

６…解析部
７…抽出部
８…辞書データベース
９…検索部
１０…書込部 6 ... Analysis unit 7 ... Extraction unit 8 ... Dictionary database 9 ... Search unit 10 ... Writing unit

Claims

An analysis unit that acquires a document file and analyzes a word included in the sentence of the acquired document file, and if an analysis target word that requires an annotation is included in the analyzed word, An extraction unit to extract, a search unit for calling an annotation corresponding to the annotation target word extracted by the extraction unit from a dictionary database stored in advance, and a document file document obtained by analyzing the annotation called by the search unit by the analysis unit A document support system comprising a writing unit that creates an annotated document file using a format.

2. The document support system according to claim 1, wherein the writing unit creates an annotated document file in which an annotation is attached to a margin part or a last page specified in a document format.

3. The document support system according to claim 1, wherein databases corresponding to a plurality of dictionaries are stored in the dictionary database in a priority order.

4. The document support system according to claim 1, wherein the extraction unit extracts a noun according to a predetermined selection criterion as an annotation target word. 5.

5. The document support system according to claim 4, wherein the extraction unit preferentially extracts alphanumeric characters including only katakana words or capital letters as annotation target words. 6.

6. The document support system according to claim 1, wherein the extraction unit extracts the same word used a plurality of times in a sentence as an annotation target word only for the first time.

The writing unit uses, as an annotation, only the first designated number of characters in the annotation sentence stored in the dictionary database called by the search unit in consideration of the document format and the number of annotation target words. The document support system according to any one of claims 1 to 6.

After analyzing the words included in the text of the document file and extracting the annotation target words that are assumed to require annotations in the analyzed words, the dictionary database stored in advance is searched to make the annotation target words A document support program that creates an annotated document file using a document format of a document file when a corresponding annotation exists.