JP2012221285A

JP2012221285A - Document generation support method, document generation support device and document generation support program

Info

Publication number: JP2012221285A
Application number: JP2011087157A
Authority: JP
Inventors: Takehiko Ono; 健彦大野; Ai Nakane; 愛中根; Momoko Nakatani; 桃子中谷
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-04-11
Filing date: 2011-04-11
Publication date: 2012-11-12

Abstract

PROBLEM TO BE SOLVED: To support creation of converted document data obtained through conversion from document data according to appropriate difficulty.SOLUTION: A document generation support method includes: a target word extraction step for extracting a target word in target word data 12a from document data 11a, and storing the extracted target word in to-be-used word data 13a; and a conversion step for extracting complexity of a to-be-used word included in the to-be-used word data 13a, extracting a to-be-converted word, which is a synonym of the to-be-used word and whose complexity is in a range of reference complexity, from synonym data 15a when the extracted complexity of the to-be-used word is outside the range of reference complexity, and outputting converted document data 16a obtained by converting the target word in the document data 11a into the extracted to-be-converted word.

Description

本発明は、文書データから、適切な難易度に変換した変換文書データの生成を支援する文書生成支援方法、文書生成支援装置および文書生成支援プログラムに関する。 The present invention relates to a document generation support method, a document generation support apparatus, and a document generation support program that support generation of converted document data converted from document data to an appropriate difficulty level.

一般に、文書を生成する際、読み手の理解度に応じた単語を用いる必要がある。そのため、作成者は、文書中で用いる単語が適切でない場合、類義語辞典などを用いて、適切な単語に置換し、または、他の表現に置換する必要がある。 In general, when generating a document, it is necessary to use words according to the reader's level of understanding. Therefore, when the word used in the document is not appropriate, the creator needs to replace it with an appropriate word or other expression using a synonym dictionary or the like.

そこで、文書中に用いる単語の親密度に基づいて、読み手が理解しやすいか否かの文書解析結果を出力する方法がある（例えば、特許文献１参照。）。この特許文献１は、文書を単語単位で分割し、単語単位に分割された分割候補のそれぞれについて、複数の異なる解析構造を有する解析候補から、それぞれの解析候補に含まれる単語の親密度を算出する。さらに、特許文献１に記載の技術は、各単語の親密度の平均が最も低くなるような分割候補を出力する。 Therefore, there is a method of outputting a document analysis result indicating whether or not the reader can easily understand based on the familiarity of words used in the document (see, for example, Patent Document 1). This patent document 1 divides a document in units of words, and calculates the closeness of words included in each analysis candidate from analysis candidates having a plurality of different analysis structures for each of the division candidates divided in units of words. To do. Furthermore, the technique described in Patent Document 1 outputs a division candidate that has the lowest average familiarity of each word.

ここで、単語の親密度は、単語が読み手にとってどの程度難しいかを示した指標である（例えば、非特許文献１参照。）。非特許文献１に記載の方法は、予め各単語に対して単語親密度を算出することで、なじみのある単語であれば単語親密度が低く、なじみのない単語については単語親密度が高いことから、各単語がなじみのある単語か否かを判定することができる。 Here, the word familiarity is an index indicating how difficult the word is for the reader (for example, see Non-Patent Document 1). The method described in Non-Patent Document 1 calculates the word familiarity for each word in advance, so that the word familiarity is low for familiar words, and the word familiarity is high for unfamiliar words. Thus, it can be determined whether or not each word is a familiar word.

特開２００７−１１７７４号公報JP 2007-11774 A

佐藤、笠原、金杉、天野、”単語親密度に基づく基本語彙の選定”、人工知能学会論文誌、Ｖｏｌ．１９，Ｎｏ．６、ｐｐ．５０２−５１０、２００４Sato, Kasahara, Kanesugi, Amano, “Selection of basic vocabulary based on word familiarity”, Journal of the Japanese Society for Artificial Intelligence, Vol. 19, no. 6, pp. 502-510, 2004

しかしながら、特許文献１に記載の方法は、読み手が理解しやすいか否かの文書解析結果を出力するのみで、読み手にとって適切な文書を出力するものではない。 However, the method described in Patent Document 1 only outputs a document analysis result indicating whether or not the reader can easily understand, and does not output a document appropriate for the reader.

特に、取扱説明書、保守用マニュアル等の技術文書において、技術文書を利用する読み手にとって理解できない概念を含む専門用語を用いると、読み手が、技術文書を理解しづらい問題があった。一般に、技術文書を作成する際に、どのような専門用語を用いるかは、作成者に任されている場合が多い。しかしながら、様々な専門用語がどの程度理解が困難であるかを判定することは難しく、適切でない難易度の単語を用いてしまう場合が多かった。 In particular, in technical documents such as instruction manuals and maintenance manuals, when technical terms including concepts that cannot be understood by readers using the technical documents are used, there is a problem that it is difficult for the readers to understand the technical documents. In general, it is often up to the creator to determine what technical terms are used when creating a technical document. However, it is difficult to determine how difficult various technical terms are to understand, and words of inappropriate difficulty are often used.

例えば一般ユーザが利用することを想定したマニュアルで、一般ユーザが理解しにくい専門用語を用いると、理解が非常に難しくなる。一方、専門用語を使わずに、一般的な用語で同等の内容を説明すると、説明が冗長になったり、正確な説明が難しくなるという問題がある。 For example, in a manual that is assumed to be used by a general user, using technical terms that are difficult for a general user to understand makes it very difficult to understand. On the other hand, if the same contents are explained in general terms without using technical terms, there are problems that the explanation becomes redundant and that accurate explanation becomes difficult.

従って本発明の目的は、文書データから、適切な難易度に変換した変換文書データを生成する文書生成支援方法、文書生成支援装置および文書生成支援プログラムを提供することである。 Accordingly, an object of the present invention is to provide a document generation support method, a document generation support apparatus, and a document generation support program for generating converted document data converted from document data to an appropriate difficulty level.

上記課題を解決するために、本発明の第１の特徴は、文書データから、適切な難易度に変換した変換文書データの生成を支援する文書生成支援方法に関する。本発明の第１の特徴は、文書データから、変換の対象となる対象単語と、当該対象単語の複雑度と、を対応づけた対象単語データに含まれる対象単語を抽出して、使用対象単語データに記憶する対象単語抽出ステップと、使用対象単語データに含まれる使用対象単語について、当該使用対象単語の複雑度を抽出し、抽出した当該使用対象単語の複雑度が、変換文書データに用いる対象単語の複雑度の範囲の基準となる基準複雑度の範囲外の場合、対象単語の類義語を対応づけた類義語データから、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語を抽出し、文書データ中の当該対象単語を、抽出した変換対象単語に変換した変換文書データを出力する変換ステップを備える。 In order to solve the above-mentioned problem, a first feature of the present invention relates to a document generation support method for supporting generation of converted document data converted from document data to an appropriate difficulty level. The first feature of the present invention is that a target word included in target word data in which a target word to be converted is associated with complexity of the target word is extracted from the document data, and used word A target word extraction step to be stored in the data, and for the use target word included in the use target word data, the complexity of the use target word is extracted, and the extracted complexity of the use target word is used for the conversion document data If the word complexity is outside the range of the reference complexity, the synonym data that associates the synonym of the target word, and the conversion target word within the reference complexity range that is a synonym of the target word And a conversion step of outputting converted document data obtained by converting the target word in the document data into the extracted conversion target word.

ここで変換ステップは、抽出した当該使用対象単語の複雑度が基準複雑度の範囲外で、類義語データに、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語がない場合、文書データ中の当該対象単語を変換することなく、変換できない旨を前記変換文書データに出力しても良い。 Here, when the complexity of the extracted use target word is outside the range of the reference complexity, and the synonym data has no conversion target word within the reference complexity that is a synonym of the use target word The fact that conversion is not possible may be output to the converted document data without converting the target word in the document data.

変換文書データをプレビュー表示するとともに、変換文書データに基準複雑度の範囲外の使用対象単語が含まれている場合、当該使用対象単語を、意味のない文字列に置換して、表示装置に表示する出力ステップをさらに備えても良い。 In addition to previewing the converted document data, if the converted document data contains words to be used that are outside the standard complexity range, the words to be used are replaced with meaningless character strings and displayed on the display device. An output step may be further included.

使用対象単語データに含まれる使用対象単語と、当該使用対象単語の複雑度と、を対応づけて表示装置に表示し、操作者によって指定された基準複雑度の範囲を、基準複雑度データに設定する基準複雑度設定ステップをさらに備えても良い。 The target word included in the target word data and the complexity of the target word are displayed in association on the display device, and the reference complexity range specified by the operator is set in the reference complexity data. A reference complexity setting step may be further included.

本発明の第２の特徴は、文書データから、適切な難易度に変換した変換文書データの生成を支援する文書生成支援装置に関する。本発明の第２の特徴に係る文書生成支援装置は、文書データが記憶された文書データ記憶部と、変換の対象となる対象単語と、当該対象単語の複雑度と、を対応づけた対象単語データが記憶された対象単語データ記憶部と、変換文書データに用いる対象単語の複雑度の範囲の基準となる基準複雑度の範囲を含む基準複雑度データが記憶された基準複雑度データ記憶部と、対象単語の類義語を対応づけた類義語データが記憶された類義語データ記憶部と、文書データから、対象単語データに含まれる対象単語を抽出して、使用対象単語データに記憶する対象単語抽出手段と、使用対象単語データに含まれる使用対象単語について、当該使用対象単語の複雑度を抽出し、抽出した当該使用対象単語の複雑度が基準複雑度の範囲外の場合、類義語データから、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語を抽出し、文書データ中の当該対象単語を、抽出した変換対象単語に変換した変換文書データを出力する変換手段を備える。 The second feature of the present invention relates to a document generation support apparatus that supports generation of converted document data converted from document data to an appropriate difficulty level. The document generation support apparatus according to the second feature of the present invention is a target word in which a document data storage unit storing document data, a target word to be converted, and a complexity of the target word are associated with each other. A target word data storage unit in which data is stored; a reference complexity data storage unit in which reference complexity data including a reference complexity range that serves as a reference for a range of complexity of target words used for converted document data is stored; A synonym data storage unit storing synonym data in which synonyms of the target word are associated, and target word extracting means for extracting the target word included in the target word data from the document data and storing it in the target word data When the complexity of the target word is extracted from the target word included in the target word data, and the complexity of the extracted target word is out of the reference complexity range, A conversion target word that is a synonym of the target word and that is within the range of the reference complexity is extracted from the data, and converted document data obtained by converting the target word in the document data into the extracted conversion target word is output. Means.

ここで変換手段は、抽出した当該使用対象単語の複雑度が基準複雑度の範囲外で、類義語データに、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語がない場合、文書データ中の当該対象単語を変換することなく、変換できない旨を前記変換文書データに出力しても良い。 Here, when the complexity of the extracted use target word is outside the range of the reference complexity, and the synonym data has no conversion target word within the reference complexity that is a synonym of the use target word The fact that conversion is not possible may be output to the converted document data without converting the target word in the document data.

変換文書データをプレビュー表示するとともに、変換文書データに基準複雑度の範囲外の使用対象単語が含まれている場合、当該使用対象単語を、意味のない文字列に置換して、表示装置に表示する出力手段をさらに備えても良い。 In addition to previewing the converted document data, if the converted document data contains words to be used that are outside the standard complexity range, the words to be used are replaced with meaningless character strings and displayed on the display device. Output means may be further provided.

使用対象単語データに含まれる使用対象単語と、当該使用対象単語の複雑度と、を対応づけて表示装置に表示し、操作者によって指定された基準複雑度の範囲を、基準複雑度データに設定する基準複雑度設定手段をさらに備えても良い。 The target word included in the target word data and the complexity of the target word are displayed in association on the display device, and the reference complexity range specified by the operator is set in the reference complexity data. Reference complexity setting means may be further provided.

本発明の第３の特徴は、コンピュータに、本発明の第１の特徴に記載のステップを実行させるための文書生成支援プログラムに関する。 A third feature of the present invention relates to a document generation support program for causing a computer to execute the steps described in the first feature of the present invention.

本発明によれば、文書データから、適切な難易度に変換した変換文書データを生成する文書生成支援方法、文書生成支援装置および文書生成支援プログラムを提供することができる。 According to the present invention, it is possible to provide a document generation support method, a document generation support apparatus, and a document generation support program for generating converted document data converted to an appropriate difficulty level from document data.

本発明の実施の形態に係る文書生成支援方法の処理の概要を説明するフローチャートである。It is a flowchart explaining the outline | summary of a process of the document generation assistance method which concerns on embodiment of this invention. 本発明の実施の形態に係る文書生成支援装置の機能ブロック図である。It is a functional block diagram of the document generation assistance apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る文書データのデータの一例を説明する図である。It is a figure explaining an example of the data of the document data which concerns on embodiment of this invention. 本発明の実施の形態に係る対象単語データのデータ構造とデータの一例を説明する図である。It is a figure explaining an example of data structure and data of object word data concerning an embodiment of the invention. 本発明の実施の形態に係る使用対象単語データの一例を説明する図である。It is a figure explaining an example of use object word data concerning an embodiment of the invention. 本発明の実施の形態に係る基準複雑度データの一例を説明する図である。It is a figure explaining an example of the reference | standard complexity data which concern on embodiment of this invention. 本発明の実施の形態に係る類義語辞書データのデータ構造とデータの一例を説明する図である。It is a figure explaining an example of data structure and data of synonym dictionary data concerning an embodiment of the invention. 本発明の実施の形態に係る文書生成支援装置が表示する基準複雑度設定画面の一例を説明する図である。It is a figure explaining an example of the reference | standard complexity setting screen which the document generation assistance apparatus concerning embodiment of this invention displays. 本発明の実施の形態に係る変換文書データのデータの一例を説明する図である。It is a figure explaining an example of the data of the conversion document data which concerns on embodiment of this invention. 本発明の実施の形態に係る変換文書データを表示する表示画面の一例を説明する図である。It is a figure explaining an example of the display screen which displays the conversion document data which concerns on embodiment of this invention.

次に、図面を参照して、本発明の実施の形態を説明する。以下の図面の記載において、同一または類似の部分には同一または類似の符号を付している。 Next, embodiments of the present invention will be described with reference to the drawings. In the following description of the drawings, the same or similar parts are denoted by the same or similar reference numerals.

（文書生成支援方法）
本発明の実施の形態に係る文書生成支援方法は、文書データから、適切な難易度に変換した変換文書データを生成する。これにより本発明の実施の形態に係る文書生成支援方法は、読み手に理解しやすい文書データを出力することができる。この文書データは、例えば、専門家が一般ユーザに向けて頒布する文書のデータである。具体的には文書データは、機器の取扱説明書や、保守用マニュアルなどである。 (Document generation support method)
The document generation support method according to the embodiment of the present invention generates converted document data converted from document data to an appropriate difficulty level. As a result, the document generation support method according to the embodiment of the present invention can output document data that can be easily understood by the reader. This document data is, for example, data of a document distributed to a general user by an expert. Specifically, the document data is an instruction manual of the device or a maintenance manual.

図１を参照して、本発明の実施の形態に係る文書生成支援方法の概略を説明する。図１に示す文書生成支援方法は、図２に示す文章生成支援装置１によって実現される。 With reference to FIG. 1, an outline of a document generation support method according to an embodiment of the present invention will be described. The document generation support method shown in FIG. 1 is realized by the text generation support apparatus 1 shown in FIG.

まず文書生成支援装置１は、ステップＳ１において、変換対象となる文書データ１１ａを取得し、ステップＳ２において、この文書データ１１ａから、変換の対象となる対象単語である使用対象単語を取得する。ステップＳ３において、ステップＳ２において取得された対象単語を含む使用対象単語データ１３ａが、記憶装置１０に記憶される。また、文書生成支援装置１はステップＳ４において、基準複雑度データ１４ａを取得する。この基準複雑度データ１４ａは、文書データに用いる対象単語の複雑度であって、作成者が想定する文書データの読み手に適切な対象単語の複雑度の範囲が設定されたデータである。 First, in step S1, the document generation support device 1 acquires document data 11a to be converted, and in step S2, acquires a use target word that is a target word to be converted from the document data 11a. In step S3, use target word data 13a including the target word acquired in step S2 is stored in the storage device 10. Further, the document generation support apparatus 1 acquires the reference complexity data 14a in step S4. The reference complexity data 14a is the complexity of the target word used for the document data, and is a data in which a range of complexity of the target word appropriate for the reader of the document data assumed by the creator is set.

次に、使用対象単語データ１３ａに含まれる全ての対象単語、具体的には、文書データ１１ａに含まれる全ての対象単語について、ステップＳ５ないしステップＳ８の処理を繰り返す。まず、ステップＳ５において文書生成支援装置１は、当該使用対象単語が、基準複雑度設定データ１４ａで設定された範囲内であるか否かを判定する。範囲内である場合、次の使用対象単語について、ステップＳ５ないしステップＳ８の処理を繰り返す。 Next, the process of step S5 thru | or step S8 is repeated about all the target words contained in use object word data 13a, specifically, all the target words contained in the document data 11a. First, in step S5, the document generation support apparatus 1 determines whether the use target word is within the range set in the reference complexity setting data 14a. If it is within the range, the process from step S5 to step S8 is repeated for the next word to be used.

一方、範囲外である場合、ステップＳ６において文書生成支援装置１は、基準複雑度の範囲内であって、当該使用対象単語の類義語を検索する。類義語が検索された場合、ステップＳ７において、当該使用対象単語を、検索された類義語に置換する。一方、検索されなかった場合、ステップＳ８において、置換できない旨を出力する。 On the other hand, when it is out of the range, in step S6, the document generation support apparatus 1 searches for a synonym of the use target word that is within the reference complexity range. When a synonym is searched, in step S7, the use target word is replaced with the searched synonym. On the other hand, if no search is made, a message indicating that the replacement cannot be performed is output in step S8.

全ての使用対象単語についてステップＳ５ないしステップＳ８の処理が終了すると、ステップＳ９において、変換文書データ１６ａを出力する。変換文書データ１６ａにおいて使用される対象単語は、原則、予め設定された基準複雑度の範囲内である。基準複雑度の範囲外の使用対象単語であって、基準複雑度の範囲内の類義語がない場合、例外的に、この使用対象単語が含まれる。また、変換文書データ１６ａの基準複雑度の範囲外の使用対象単語には、変換できなかった旨のコメント等が含まれても良い。 When the processing from step S5 to step S8 is completed for all words to be used, converted document data 16a is output in step S9. In principle, the target word used in the converted document data 16a is within the preset reference complexity range. If the word is a target word outside the range of the reference complexity and there is no synonym within the range of the reference complexity, the target word is exceptionally included. In addition, the use target word outside the range of the reference complexity of the converted document data 16a may include a comment or the like indicating that the conversion could not be performed.

ステップＳ１０において文書生成支援装置１は、ステップＳ９で出力した変換文書データ１６ａを、表示装置４０に表示する。ここで文書生成支援装置１は、変換文書データ１６ａの変換後のデータを表示するのみならず、文書データ１１ａから変換文書データ１６ａの差分がわかるように表示しても良い。また、ステップＳ８において、置換できない旨が設定された対象単語について、意味のない文字列で置換して表示しても良い。これにより、読み手の見え方を作成者にシミュレーションさせることができる。 In step S10, the document generation support device 1 displays the converted document data 16a output in step S9 on the display device 40. Here, the document generation support apparatus 1 may display not only the converted data of the converted document data 16a but also the difference between the converted document data 16a from the document data 11a. Further, in step S8, the target word that is set as not replaceable may be replaced with a meaningless character string and displayed. Thereby, a creator can be made to simulate how a reader looks.

（文書生成支援装置）
図２を参照して、本発明の実施の形態に係る文書生成支援装置１を説明する。本発明の実施の形態に係る文書生成支援装置１は、記憶装置１０、中央処理制御装置２０、入力装置３０および表示装置４０を備える一般的なコンピュータである。文書生成支援装置１は、所定の処理を実行するための文書生成支援プログラムが、一般的なコンピュータにインストールされることにより実装される。入力装置３０は、マウス、キーボードなどで、表示装置４０は、液晶ディスプレイなどである。 (Document generation support device)
With reference to FIG. 2, the document generation assistance apparatus 1 which concerns on embodiment of this invention is demonstrated. The document generation support apparatus 1 according to the embodiment of the present invention is a general computer including a storage device 10, a central processing control device 20, an input device 30, and a display device 40. The document generation support apparatus 1 is implemented by installing a document generation support program for executing a predetermined process in a general computer. The input device 30 is a mouse, a keyboard or the like, and the display device 40 is a liquid crystal display or the like.

記憶装置１０は、文書データ記憶部１１、対象単語データ記憶部１２、使用対象単語データ記憶部１３、基準複雑度データ記憶部１４、類義語データ記憶部１５および変換文書データ記憶部１６を備える。また記憶装置１０には、文書生成支援プログラムも記憶される。 The storage device 10 includes a document data storage unit 11, a target word data storage unit 12, a use target word data storage unit 13, a reference complexity data storage unit 14, a synonym data storage unit 15, and a converted document data storage unit 16. The storage device 10 also stores a document generation support program.

文書データ記憶部１１は、記憶装置１０のうち、文書データ１１ａが記憶された記憶領域である。文書データ１１ａは、本発明の実施の形態に係る文書生成支援装置１が、生成を支援する変換文書データ１６ａの元となる文書のデータである。文書データ１１ａは、作成者によって作成されたデータである。文書データ１１ａは、文書生成支援装置１が、文書データ１１ａに含まれる文字を識別し、また編集可能な形式であれば良い。文書データ１１ａは、例えば、テキスト形式、ＨＴＭＬ形式、文書生成アプリケーションプログラム用の形式、ＰＤＦ形式などである。 The document data storage unit 11 is a storage area in the storage device 10 in which the document data 11a is stored. The document data 11a is document data that is the basis of the converted document data 16a that is supported by the document generation support apparatus 1 according to the embodiment of the present invention. The document data 11a is data created by the creator. The document data 11a may be in any format that allows the document generation support apparatus 1 to identify and edit the characters included in the document data 11a. The document data 11a is, for example, a text format, an HTML format, a format for a document generation application program, a PDF format, or the like.

文書データ１１ａは、例えば図３に示すデータである。図３に示す文書データ１１ａは、ＨＴＭＬ形式である。この文書データ１１ａは、作成者が作成した文書データそのものである。文書データ１１ａには、基準複雑度の範囲外の対象単語も含まれる。 The document data 11a is data shown in FIG. 3, for example. The document data 11a shown in FIG. 3 is in the HTML format. The document data 11a is document data itself created by the creator. The document data 11a also includes target words outside the range of the reference complexity.

対象単語データ記憶部１２は、記憶装置１０のうち、対象単語データ１２ａが記憶された記憶領域である。対象単語データ１２ａは、本発明の実施の形態に係る文書生成支援装置１の変換の対象となる対象単語と、当該対象単語の複雑度と、を対応づけたデータである。 The target word data storage unit 12 is a storage area in the storage device 10 in which target word data 12a is stored. The target word data 12a is data in which the target word to be converted by the document generation support apparatus 1 according to the embodiment of the present invention is associated with the complexity of the target word.

対象単語データ１２ａは、例えば、図４に示すデータである。図４に示す対象単語データ１２ａは、対象単語と、その複雑度が対応づけられている。この対象単語は、例えば、専門用語などであって、読み手の知識などによって理解度が大きく異なる単語である。この複雑度は、例えば、単語親密度に基づいて算出される。図４に示す例においてこの複雑度は、０に近いほど低く、１に近いほど高い。 The target word data 12a is, for example, data shown in FIG. In the target word data 12a shown in FIG. 4, the target word is associated with its complexity. The target word is, for example, a technical term or the like, and is a word whose degree of understanding varies greatly depending on the knowledge of the reader. This complexity is calculated based on the word familiarity, for example. In the example shown in FIG. 4, this complexity is lower as it is closer to 0 and higher as it is closer to 1.

使用対象単語データ記憶部１３は、記憶装置１０のうち、使用対象単語データ１３ａが記憶された記憶領域である。使用対象単語データ１３ａは、対象単語データ１２ａに含まれる対象単語のうち、文書データ１１ａで使用される対象単語である。 The use target word data storage unit 13 is a storage area in the storage device 10 in which the use target word data 13a is stored. The use target word data 13a is a target word used in the document data 11a among the target words included in the target word data 12a.

使用対象単語データ１３ａは、例えば図５に示すデータである。図５に示す使用対象単語データ１３ａは、図３に示す文書データ１１ａで使用される対象単語を含む。図５に示す例において、使用対象単語は、”ＰＣ”、”インターネット”（登録商標）、”有線ＬＡＮ”などである。図５において、これらの使用対象単語の文字区切りとして、”／（スラッシュ）”を用いているが、これに限られない。例えば、改行、スペース、カンマなどが、使用対象単語の文字区切りとして使用されても良い。 The use target word data 13a is, for example, data shown in FIG. The use target word data 13a shown in FIG. 5 includes the target words used in the document data 11a shown in FIG. In the example shown in FIG. 5, the words to be used are “PC”, “Internet” (registered trademark), “wired LAN”, and the like. In FIG. 5, “/ (slash)” is used as a character delimiter for these words to be used, but this is not restrictive. For example, a line break, a space, a comma, or the like may be used as a character delimiter for the use target word.

基準複雑度データ記憶部１４は、記憶装置１０のうち、基準複雑度データ１４ａが記憶された記憶領域である。基準複雑度データ１４ａは、変換文書データ１６ａに用いる対象単語の複雑度の範囲の基準となる基準複雑度の範囲を含むデータである。基準複雑度データ１４ａは、予め記憶装置１０に記憶されても良いし、後述する基準複雑度設定手段２３において、作成者によって入力装置３０から設定されても良い。 The reference complexity data storage unit 14 is a storage area in the storage device 10 in which the reference complexity data 14a is stored. The reference complexity data 14a is data including a reference complexity range that serves as a reference for the complexity range of the target word used for the converted document data 16a. The reference complexity data 14a may be stored in the storage device 10 in advance, or may be set from the input device 30 by the creator in the reference complexity setting unit 23 described later.

基準複雑度データ１４ａは、例えば、図６に示すデータである。図６に示す基準複雑度データ１４ａは、変換文書データ１６ａに含まれる使用対象単語の複雑度が”０．０〜０．４”となるように、変換文書データ１６ａが生成されることを示している。ただし、図１に示すステップＳ８の場合のように、基準複雑度の範囲内に含まれる類義語がない場合、変換文書データ１６ａに、基準複雑度の範囲外の対象単語が含まれる場合がある。 The reference complexity data 14a is, for example, data shown in FIG. The reference complexity data 14a shown in FIG. 6 indicates that the converted document data 16a is generated such that the complexity of the word to be used included in the converted document data 16a is “0.0 to 0.4”. ing. However, if there is no synonym included in the range of the reference complexity as in step S8 shown in FIG. 1, the converted document data 16a may include a target word outside the range of the reference complexity.

類義語データ記憶部１５は、記憶装置１０のうち、類義語データ１５ａが記憶された記憶領域である。類義語データ１５ａは、対象単語の類義語を対応づけたデータである。類義語データ１５ａにおいては、対象単語データ１２ａに含まれる対象単語のうち、同じ概念を持つ対象単語を対応づけたデータである。 The synonym data storage unit 15 is a storage area in the storage device 10 in which synonym data 15a is stored. The synonym data 15a is data in which synonyms of the target word are associated with each other. The synonym data 15a is data that associates target words having the same concept among the target words included in the target word data 12a.

類義語データ１５ａは、例えば、図７に示すデータである。図７に示す類義語データ１５ａは、複数の対象単語が対応づけられた複数のグループが含まれる。図７に示す例において、グループａは、対象単語”パソコン”および”ＰＣ”、”コンピュータ”を対応づけている。これらの対象単語”パソコン”および”ＰＣ”、”コンピュータ”は、同じ概念を持つ類義語である。同様に、グループｂ、グループｃ…においてそれぞれ、互いに類義語となる複数の対象単語が対応づけられる。 The synonym data 15a is, for example, data shown in FIG. The synonym data 15a shown in FIG. 7 includes a plurality of groups in which a plurality of target words are associated. In the example shown in FIG. 7, the group a associates the target words “PC”, “PC”, and “Computer”. These target words “PC”, “PC”, and “computer” are synonyms having the same concept. Similarly, in group b, group c,..., A plurality of target words that are synonyms are associated with each other.

変換文書データ記憶部１６は、記憶装置１０のうち、変換文書データ１６ａが記憶された記憶領域である。変換文書データ１６ａは、本発明の実施の形態に係る文書生成支援装置１が、文書データ１１ａから、文書データ１１ａに含まれる使用対象単語を基準複雑度の範囲内になるように変換したデータである。変換文書データ１６ａについては、後に詳述する。 The converted document data storage unit 16 is a storage area in the storage device 10 in which the converted document data 16a is stored. The converted document data 16a is data obtained by converting the word to be used included in the document data 11a from the document data 11a so as to be within the range of the reference complexity by the document generation support apparatus 1 according to the embodiment of the present invention. is there. The converted document data 16a will be described in detail later.

中央処理制御装置２０は、文書取得手段２１、対象単語抽出手段２２、基準複雑度設定手段２３、変換手段２４および出力手段２５を備える。 The central processing control device 20 includes a document acquisition unit 21, a target word extraction unit 22, a reference complexity setting unit 23, a conversion unit 24, and an output unit 25.

文書取得手段２１は、文書データ１１ａを取得して、文書データ記憶部１１に記憶する。文書取得手段２１は、例えば、作成者の操作により、本発明の実施の形態に係る文書生成支援装置１の処理の対象となる文書データ１１ａを特定する手段であっても良い。 The document acquisition unit 21 acquires the document data 11 a and stores it in the document data storage unit 11. For example, the document acquisition unit 21 may be a unit that specifies the document data 11a to be processed by the document generation support apparatus 1 according to the embodiment of the present invention by the operation of the creator.

対象単語抽出手段２２は、文書データ１１ａから、対象単語データ１２ａに含まれる対象単語を抽出して、使用対象単語データ１３ａに記憶する。 The target word extraction means 22 extracts the target word contained in the target word data 12a from the document data 11a and stores it in the use target word data 13a.

対象単語抽出手段２２は、例えば、文書データ１１ａを読み出して、形態素解析を実行し、文書データ１１ａを各形態素に分割するとともに、各形態素の品詞を特定する。この形態素解析の手法は、既存の手法で良い。対象単語抽出手段２２は、形態素解析の結果に基づいて、文書データ１１ａから名詞のみを抽出する。対象単語抽出手段２２は、文書データ１１ａから抽出した名詞と対象単語データ１２ａに含まれる対象単語とを比較し、文書データ１１ａから抽出した名詞と一致し、対象単語データ１２ａに含まれる対象単語を抽出する。対象単語抽出手段２２は、文書データ１１ａで使用される対象単語を抽出すると、この抽出した対象単語を、使用対象単語データ１３ａとして出力する。 For example, the target word extracting unit 22 reads the document data 11a, performs morpheme analysis, divides the document data 11a into each morpheme, and specifies the part of speech of each morpheme. This morphological analysis method may be an existing method. The target word extraction unit 22 extracts only nouns from the document data 11a based on the result of morphological analysis. The target word extraction means 22 compares the noun extracted from the document data 11a with the target word included in the target word data 12a, matches the noun extracted from the document data 11a, and selects the target word included in the target word data 12a. Extract. When the target word extraction unit 22 extracts the target word used in the document data 11a, the target word extraction unit 22 outputs the extracted target word as the use target word data 13a.

基準複雑度設定手段２３は、文書データ１１ａに対して基準複雑度を設定する際に用いられる。基準複雑度設定手段２３は、使用対象単語データ１３ａに含まれる対象単語と、当該使用対象単語の複雑度と、を対応づけて表示装置４０に表示し、作成者によって指定された基準複雑度の範囲を、基準複雑度データ１４ａに設定する。 The reference complexity setting means 23 is used when setting the reference complexity for the document data 11a. The reference complexity setting unit 23 displays the target word included in the use target word data 13a and the complexity of the use target word on the display device 40 in association with each other, and has the reference complexity specified by the creator. The range is set in the reference complexity data 14a.

図８を参照して、基準複雑度設定手段２３が表示装置４０に表示する基準複雑度設定画面Ｐ１０１を説明する。基準複雑度設定画面Ｐ１０１においては、使用対象単語データ１３ａに含まれる使用対象単語が表示される。この使用対象単語は、対象単語データ１２ａで設定された複雑度と関連づけて、表示される。図８に示す例では、画面左部に複雑度を記した軸が表示される。画面上部に表示される使用対象単語の複雑度が高く、画面下部になるほど使用対象単語の複雑度が低くなるように表示されている。また、基準複雑度設定画面Ｐ１０１では、基準複雑度設定ボタンＢ１０１ａおよびＢ１０１ｂが表示されている。作成者は、マウスなどの入力装置３０を使って、この基準複雑度設定ボタンＢ１０１ａを操作し、基準複雑度の設定値の上限値を決定するとともに、基準複雑度設定ボタンＢ１０１ｂを操作し、基準複雑度の設定値の下限値を決定する。基準複雑度設定手段２３は、基準複雑度設定ボタンＢ１０１ａおよびＢ１０１ｂが示す複雑度の範囲を基準複雑度の範囲として、基準複雑度データ１４ａに記録する。 The reference complexity setting screen P101 displayed on the display device 40 by the reference complexity setting unit 23 will be described with reference to FIG. In the reference complexity setting screen P101, the usage target words included in the usage target word data 13a are displayed. This use target word is displayed in association with the complexity set in the target word data 12a. In the example shown in FIG. 8, an axis describing complexity is displayed on the left side of the screen. The complexity of the word to be used displayed at the top of the screen is high, and the complexity of the word to be used is low as it goes to the bottom of the screen. In the reference complexity setting screen P101, reference complexity setting buttons B101a and B101b are displayed. The creator operates the reference complexity setting button B101a using the input device 30 such as a mouse to determine the upper limit value of the reference complexity setting value, and operates the reference complexity setting button B101b to Determine the lower limit of the complexity setting. The reference complexity setting means 23 records the complexity range indicated by the reference complexity setting buttons B101a and B101b as the reference complexity range in the reference complexity data 14a.

変換手段２４は、使用対象単語データ１３ａに含まれる使用対象単語について、当該使用対象単語の複雑度を抽出する。変換手段２４は、抽出した当該使用対象単語の複雑度が基準複雑度の範囲外の場合、類義語データ１５ａから、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語を抽出する。変換手段２４は、文書データ１１ａ中の当該対象単語を、抽出した変換対象単語に変換した変換文書データ１６ａを出力する。 The conversion means 24 extracts the complexity of the use target word for the use target word included in the use target word data 13a. When the complexity of the extracted use target word is outside the range of the reference complexity, the conversion unit 24 extracts a conversion target word that is a synonym of the use target word and within the reference complexity range from the synonym data 15a. To do. The conversion unit 24 outputs converted document data 16a obtained by converting the target word in the document data 11a into the extracted conversion target word.

変換手段２４はさらに、抽出した当該使用対象単語の複雑度が基準複雑度の範囲外で、類義語データ１５ａに、当該使用対象単語の類義語であって基準複雑度の範囲内の変換対象単語がない場合、文書データ１１ａ中の当該対象単語を変換することなく、変換できない旨を変換文書データ１６ａに変換文書データ１６ａを出力する。ここで、変換できない旨とは、例えば、変換できなかった文書データ１１ａ中の対象単語が、基準複雑度の範囲の下限値より低いのか、あるいは上限値より高いのかを示す文字列、記号などのデータである。 Further, the conversion means 24 has a complexity of the extracted use target word outside the range of the reference complexity, and the synonym data 15a has no conversion target word that is a synonym of the use target word and within the reference complexity range. In this case, the converted document data 16a is output to the converted document data 16a to the effect that conversion is not possible without converting the target word in the document data 11a. Here, the fact that conversion is not possible means, for example, a character string or symbol indicating whether the target word in the document data 11a that could not be converted is lower than the lower limit value or higher than the upper limit value of the reference complexity range. It is data.

例えば、図３に示す文書データ１１ａについて説明する。この文書データ１１ａで使用される対象単語の一つは、図５に示すように”ＰＣ”である。変換手段２４は、記憶装置１０から対象単語データ１２ａを読み出し、使用対象単語”ＰＣ”の複雑度”０．５”を取得する。一方、基準複雑度データ１４ａは、図６に示すように、”０．０〜０．４”に設定されている。ここで、使用対象単語”ＰＣ”の複雑度は”０．５”なので、この基準複雑度の範囲外となり、変換対象となる。 For example, the document data 11a shown in FIG. 3 will be described. One of the target words used in the document data 11a is “PC” as shown in FIG. The conversion unit 24 reads the target word data 12a from the storage device 10, and acquires the complexity “0.5” of the use target word “PC”. On the other hand, the reference complexity data 14a is set to “0.0 to 0.4” as shown in FIG. Here, since the complexity of the use target word “PC” is “0.5”, it is out of the range of the reference complexity and becomes a conversion target.

そこで、変換手段２４は、記憶装置１０から類義語データ１５ａおよび対象単語データ１２ａを読み出して、使用対象単語”ＰＣ”の類義語であって、複雑度が”０．０〜０．４”の対象単語を抽出する。図７に示す類義語データ１５ａにおいて、対象単語”ＰＣ”の類義語として、”パソコン”および”コンピュータ”が対応づけられている。ここで、図４に示す対象単語データ１２ａにおいて、対象単語”パソコン”の複雑度は０．２で、基準複雑度の範囲内である。一方対象単語”コンピュータ”の複雑度は０．６で、基準複雑度の範囲外である。そこで、変換手段２４は、文書データ１１ａで使用されている対象単語”ＰＣ”を、”パソコン”に変換して、変換文書データ１６ａを生成する。 Therefore, the conversion unit 24 reads the synonym data 15a and the target word data 12a from the storage device 10, and is a synonym of the use target word “PC” and has a complexity of “0.0 to 0.4”. To extract. In the synonym data 15a shown in FIG. 7, “PC” and “computer” are associated as synonyms for the target word “PC”. Here, in the target word data 12a shown in FIG. 4, the complexity of the target word “PC” is 0.2, which is within the range of the reference complexity. On the other hand, the complexity of the target word “computer” is 0.6, which is outside the range of the standard complexity. Therefore, the conversion means 24 converts the target word “PC” used in the document data 11a into “personal computer” to generate converted document data 16a.

文書データ１１ａに含まれる全ての使用対象単語について、複雑度の比較および置換の処理を繰り返す。 The complexity comparison and replacement process is repeated for all words to be used included in the document data 11a.

この例では、基準複雑度の範囲内に含まれる類義語が、”パソコン”のみであるが、基準複雑度の範囲内に含まれる類義語が複数の場合もある。その場合、変換手段２４は、基準複雑度の範囲内に含まれる複数の類義語から、変換文書データ１６ａで使用する一つの使用対象単語を選択する必要がある。例えば変換手段２４は、基準複雑度の範囲内に含まれる複数の類義語のうち、最も複雑度の低い類義語を、変換文書データ１６ａに用いても良いし、基準複雑度の範囲の中央値に最も近い類義語を、変換文書データ１６ａに用いても良い。また、基準複雑度の範囲内に含まれる類義語が複数ある場合の変換手段２４の処理を、予め作成者が指定しても良い。 In this example, “PC” is the only synonym included in the standard complexity range, but there may be a plurality of synonyms included in the standard complexity range. In that case, the conversion means 24 needs to select one use target word to be used in the converted document data 16a from a plurality of synonyms included within the range of the reference complexity. For example, the converting unit 24 may use the synonym having the lowest complexity among the plurality of synonyms included in the range of the reference complexity in the converted document data 16a, or the highest in the median of the range of the reference complexity. Close synonyms may be used for the converted document data 16a. Further, the creator may specify in advance the processing of the conversion means 24 when there are a plurality of synonyms included within the range of the reference complexity.

基準複雑度の範囲内に含まれる類義語が存在しない場合、変換手段２４は、変換すべき対象単語を見つけられず変換することができない。そこで変換手段２４は、類義語が存在しなかった旨を、この使用対象単語と関連づけて出力する。例えば変換手段２４は、変換できなかった使用対象単語の一覧データを出力しても良いし、変換文書データ１６ａに、その情報を付加しても良い。 If there is no synonym included within the range of the reference complexity, the conversion unit 24 cannot find the target word to be converted and cannot convert it. Therefore, the conversion means 24 outputs that the synonym does not exist in association with this use target word. For example, the conversion means 24 may output list data of words to be used that could not be converted, or may add the information to the converted document data 16a.

ここで、図９を参照して、変換データ１６ａの例を説明する。図９に示す例は、変換の履歴が付加された変換文書データ１６ａである。図９に示す変換データ１６ａは、図３に示す文書データ１１ａからの使用対象単語の変換履歴を含む。具体的には、文書データ１１ａの使用対象単語”ＰＣ”が、”パソコン”に変換されている。図９に示す変換文書データ１６ａの変換後の”パソコン”に、変換されたことを示す下線が施されている。 Here, an example of the conversion data 16a will be described with reference to FIG. The example shown in FIG. 9 is converted document data 16a to which a conversion history is added. The conversion data 16a shown in FIG. 9 includes the conversion history of the words to be used from the document data 11a shown in FIG. Specifically, the word “PC” to be used in the document data 11a is converted to “PC”. The converted “computer” of the converted document data 16a shown in FIG. 9 is underlined to indicate that it has been converted.

さらに、変換された”パソコン”に続いて、”［←ＰＣ］”のデータが続く。この”［”および”］”は、この”［”から”］”の文字列が、変換手段２４によって変換履歴等が挿入されたことを示す区切り文字である。”←ＰＣ”は、文書データ１１ａの”ＰＣ”から”パソコン”に変換されたことを示す。 Further, the data “[← PC]” follows the converted “personal computer”. These “[” and “]” are delimiters indicating that the conversion history or the like has been inserted into the character string “[” to “]” by the conversion means 24. “← PC” indicates that “PC” of the document data 11a is converted to “PC”.

ここで、図９に示す例において、”有線ＬＡＮ”、”無線ＬＡＮ”などに、”［難］”の文字列が付加されている。この”［難］”は、この文字列の直前の使用対象単語”有線ＬＡＮ”、”無線ＬＡＮ”などが、基準複雑度の範囲外であって、類義語データ１５ａから、基準複雑度の範囲内の類義語を取得できなかった使用対象単語であることを示す。”［難］”の文字列は、具体的には、図１のステップＳ８において説明した”置換できない旨”の一例である。このように変換できなかった使用対象単語についても、下線が施されている。 Here, in the example shown in FIG. 9, a character string “[difficult]” is added to “wired LAN”, “wireless LAN”, and the like. This “[difficult]” means that the words “wired LAN”, “wireless LAN”, etc. used immediately before this character string are out of the standard complexity range, and within the standard complexity range from the synonym data 15a. This indicates that it is a use target word that could not be obtained. The character string “[difficult]” is an example of “cannot be replaced” described in step S8 in FIG. The use target words that could not be converted in this way are also underlined.

図９に示す例において”有線ＬＡＮ”、”無線ＬＡＮ”などは、基準複雑度の上限値を超える複雑度を持つ対象単語であるので、変換手段２４は、これらの単語に”［難］”の文字列を付加して出力する。一方、基準複雑度の範囲の下限値を下回る複雑度を持ち、類義語を取得できなかった使用対象単語については、”［易］”の文字列を付加しても良い。このように、基準複雑度の範囲を上回るのか、下回るのかによって、変換手段２４が付加する文字列を変更する。これにより文書生成支援装置１は、作成者が基準複雑度に合致する文書データを作成できるように、支援することができる。具体的には、作成者は、基準複雑度の範囲を上回る使用対象単語については、注釈を付加し、基準複雑度の範囲を下回る使用対象単語について、変換対象の単語を検討することができる。 In the example shown in FIG. 9, “wired LAN”, “wireless LAN”, and the like are target words having a complexity exceeding the upper limit of the reference complexity, so the conversion unit 24 adds “[difficult]” to these words. Output with the character string added. On the other hand, a character string of “[Easy]” may be added to words to be used that have complexity below the lower limit of the range of the reference complexity and for which synonyms could not be acquired. In this way, the character string added by the conversion means 24 is changed depending on whether it exceeds or falls below the reference complexity range. As a result, the document generation support apparatus 1 can support the creator so as to create document data that matches the reference complexity. Specifically, the creator can add an annotation to a use target word that exceeds the reference complexity range, and examine the conversion target word for the use target word that falls below the reference complexity range.

出力手段２５は、表示装置４０に、変換文書データ１６ａをプレビュー表示する。図１０（ａ）に、変換文書データ１６ａをプレビュー表示した例を説明する。図１０（ａ）においては、変換履歴や下線などがはずされた状態で表示されている。このようなプレビュー表示により、作成者は、変換後の文書を確認することができる。 The output unit 25 displays the converted document data 16a on the display device 40 as a preview. FIG. 10A illustrates an example in which the converted document data 16a is displayed as a preview. In FIG. 10A, the conversion history and underline are removed. By such a preview display, the creator can check the converted document.

また出力手段２５は、図９に示すように変換履歴を含めて表示装置４０に表示しても良い。このような表示により、作成者は、文書生成支援装置１の変換手段２４によって変換された履歴を確認することができる。 Further, the output means 25 may display the display device 40 including the conversion history as shown in FIG. With such a display, the creator can check the history converted by the conversion unit 24 of the document generation support apparatus 1.

また出力手段２５は、変換文書データ１６ａに基準複雑度の範囲外の使用対象単語が含まれている場合、当該使用対象単語を、意味のない文字列に置換して、表示装置４０に表示しても良い。ここで、変換文書データ１６ａに含まれる基準複雑度の範囲外の使用対象単語は、文書データ１１ａの基準複雑度の範囲外の使用対象単語であって、類義語データ１５ａから、基準複雑度の範囲内の類義語を取得できなかった使用対象単語である。図９に示す例では、”［難］”の文字列が付された、”無線ＬＡＮ”、”有線ＬＡＮ”などの単語である。これらの文字列は、基準複雑度の範囲外であるので、作成者が想定する読み手が読むと、意味のない文字になってしまうと考えられる。そこで出力手段２５は、このような文字列を意味のない文字列で表記することにより、読み手がこの変換文書データ１６ａを読んだ印象をシミュレーションすることができる。 Further, when the conversion document data 16a includes a use target word outside the range of the reference complexity, the output unit 25 replaces the use target word with a meaningless character string and displays the replacement target word on the display device 40. May be. Here, the use target word outside the range of the reference complexity included in the converted document data 16a is a use target word outside the range of the reference complexity of the document data 11a, and the range of the reference complexity from the synonym data 15a. This is a target word for which a synonym within the word could not be acquired. In the example shown in FIG. 9, words such as “wireless LAN”, “wired LAN”, etc., with the character string “[difficult]” attached thereto. Since these character strings are outside the range of the standard complexity, it is considered that the characters become meaningless when read by the reader assumed by the creator. Therefore, the output unit 25 can simulate the impression that the reader has read the converted document data 16a by expressing such a character string as a meaningless character string.

図１０（ｂ）を参照して、シミュレーション表示画面Ｐ１０３を説明する。図１０（ｂ）に示すシミュレーション表示画面Ｐ１０３は、図９を参照して説明した変換文書データ１６ａをシミュレーション表示したものである。図９に示す変換文書データ１６ａにおいて、”［難］”の文字列が付された”無線ＬＡＮ”、”有線ＬＡＮ”などの使用対象文字列について、これらの使用対象文字列の代わりに、”●×△◎”や、”☆△○■”など、意味のない文字列が表示されている。このように、基準複雑度の範囲外の対象単語であって、類義語に変換できなかったものについて、意味のない文字列に置換して表示することにより、読み手の見え方をシミュレーションすることができる。これにより作成者は、変換文書データ１６ａの仕上がりを確認し、また必要によっては、文書データ１１ａそのものの修正の要否を判断することができる。 With reference to FIG. 10B, the simulation display screen P103 will be described. A simulation display screen P103 shown in FIG. 10B is a simulation display of the converted document data 16a described with reference to FIG. In the converted document data 16a shown in FIG. 9, for character strings to be used such as “wireless LAN” and “wired LAN” to which the character string “[difficult]” is attached, instead of these character strings to be used, “ ● A meaningless character string such as “X △ ◎” or “☆ △ ○ ■” is displayed. In this manner, the appearance of the reader can be simulated by replacing a target word outside the range of the reference complexity, which cannot be converted into a synonym, with a meaningless character string and displaying it. . Thus, the creator can confirm the finish of the converted document data 16a, and can determine whether or not the document data 11a itself needs to be corrected, if necessary.

このように本発明の実施の形態に係る文書生成支援装置１によれば、作成者が作成した文書データから、専門用語などの変換対象の候補となる単語を抽出し、対象単語に設定された複雑度を用いて、作成者が想定している読み手に適切な複雑度の単語が用いられているか否かを判別することができる。 As described above, according to the document generation support device 1 according to the embodiment of the present invention, words that are candidates for conversion, such as technical terms, are extracted from the document data created by the creator and set as target words. By using the complexity, it is possible to determine whether or not a word having an appropriate complexity is used for the reader assumed by the creator.

文書生成支援装置１は、文書データ１１ａから読み手に不適切な複雑度の単語を抽出し、適切な類義語がある場合、その類義語に変換して文書変換データ１６ａを出力することができる。これにより、作成者の作業を省略して、自動的に基準複雑度内の単語を用いた変換文書データを出力することができる。 The document generation support apparatus 1 can extract a word having an inappropriate complexity for the reader from the document data 11a, and if there is an appropriate synonym, the document generation support apparatus 1 can convert the synonym and output the document conversion data 16a. As a result, it is possible to automatically output converted document data using words within the reference complexity without the creator's work.

また、適切な類義語に変換できない場合でも、例えば、基準複雑度の上限値を上回る複雑度を持つ使用対象単語に、”［難］”の文字列を付加し、基準複雑度の下限値を下回る複雑度を持つ使用対象単語に、”［易］”の文字列を付加する。これにより文書生成支援装置１は、適切な類義語がない場合でも、作成者に文書データの適切な修正を促すことができる。 Even if it cannot be converted into an appropriate synonym, for example, a character string “[difficult]” is added to the word to be used that has a complexity that exceeds the upper limit of the reference complexity, and falls below the lower limit of the reference complexity. A character string “[Easy]” is added to the target word having complexity. As a result, the document generation support apparatus 1 can prompt the creator to appropriately correct the document data even when there is no appropriate synonym.

さらに、基準複雑度の範囲外の使用対象単語が適切な類義語に変換できない場合でも、文書生成支援装置１は、この使用対象単語を意味のない文字列に変換することにより、読み手の見え方をシミュレーションし、作成者に、必要に応じて、文書データの適切な修正を促すことができる。これにより、読み手に理解しづらい文書データの作成を阻止することができる。 Furthermore, even when the use target word outside the range of the reference complexity cannot be converted into an appropriate synonym, the document generation support device 1 converts the use target word into a meaningless character string, thereby Simulation can be performed to prompt the creator to correct the document data as necessary. This can prevent the creation of document data that is difficult for the reader to understand.

このように本発明の実施の形態に係る文書生成支援装置１によれば、文書データ１１ａから、適切な難易度に変換した変換文書データ１６ａを生成することができる。 As described above, according to the document generation support apparatus 1 according to the embodiment of the present invention, the converted document data 16a converted to an appropriate difficulty level can be generated from the document data 11a.

（その他の実施の形態）
上記のように、本発明の実施の形態によって記載したが、この開示の一部をなす論述および図面はこの発明を限定するものであると理解すべきではない。この開示から当業者には様々な代替実施の形態、実施例および運用技術が明らかとなる。 (Other embodiments)
As described above, the embodiments of the present invention have been described. However, it should not be understood that the descriptions and drawings constituting a part of this disclosure limit the present invention. From this disclosure, various alternative embodiments, examples, and operational techniques will be apparent to those skilled in the art.

例えば、本発明の最良の実施の形態に記載した文書生成支援装置は、図２に示すように一つのハードウェア上に構成されても良いし、その機能や処理数に応じて複数のハードウェア上に構成されても良い。また、既存の情報処理システム上に実現されても良い。 For example, the document generation support apparatus described in the best embodiment of the present invention may be configured on a single piece of hardware as shown in FIG. 2, or a plurality of pieces of hardware depending on the functions and the number of processes. It may be configured above. Moreover, you may implement | achieve on the existing information processing system.

本発明はここでは記載していない様々な実施の形態等を含むことは勿論である。従って、本発明の技術的範囲は上記の説明から妥当な特許請求の範囲に係る発明特定事項によってのみ定められるものである。 It goes without saying that the present invention includes various embodiments not described herein. Therefore, the technical scope of the present invention is defined only by the invention specifying matters according to the scope of claims reasonable from the above description.

１文書生成支援装置
１０記憶装置
１１文書データ記憶部
１２対象単語データ記憶部
１３使用対象単語データ記憶部
１４基準複雑度データ記憶部
１５類義語データ記憶部
１６変換文書データ記憶部
２０中央処理制御装置
２１文書取得手段
２２対象単語抽出手段
２３基準複雑度設定手段
２４変換手段
２５出力手段
３０入力装置
４０表示装置 DESCRIPTION OF SYMBOLS 1 Document production | generation assistance apparatus 10 Storage apparatus 11 Document data storage part 12 Target word data storage part 13 Use object word data storage part 14 Reference | standard complexity data storage part 15 Synonym data storage part 16 Conversion document data storage part 20 Central processing control apparatus 21 Document acquisition means 22 Target word extraction means 23 Reference complexity setting means 24 Conversion means 25 Output means 30 Input device 40 Display device

Claims

A document generation support method for supporting generation of converted document data converted from document data to an appropriate difficulty level,
A target word extraction step of extracting a target word included in target word data in which a target word to be converted and a complexity of the target word are associated from the document data, and storing the target word in the target word data When,
For the use target word included in the use target word data, the complexity of the use target word is extracted, and the extracted complexity of the use target word is a criterion for the range of complexity of the target word used for the converted document data If it is outside the range of the reference complexity, the synonym data corresponding to the synonym of the target word is extracted from the synonym data of the target word and the conversion target word within the reference complexity range, and the document data A document generation support method comprising: a conversion step of outputting converted document data obtained by converting the target word in the extracted conversion target word.

In the conversion step, when the complexity of the extracted use target word is outside the range of the reference complexity, the synonym data includes a synonym of the use target word and a conversion target word within the reference complexity range. 2. The document generation support method according to claim 1, wherein if there is not, the fact that conversion is not possible is output to the converted document data without converting the target word in the document data.

A preview display of the converted document data, and when the converted document data includes a target word outside the range of the reference complexity, the target word is replaced with a meaningless character string and displayed. The document generation support method according to claim 2, further comprising an output step of displaying on the apparatus.

The use target word included in the use target word data and the complexity of the use target word are displayed in association with each other on the display device, and the reference complexity range specified by the operator is displayed on the reference complexity data. The document generation support method according to claim 1, further comprising: a reference complexity setting step for setting to

A document generation support device that supports generation of converted document data converted from document data to an appropriate difficulty level,
A document data storage unit storing the document data;
A target word data storage unit that stores target word data in which target words to be converted are associated with the complexity of the target words;
A reference complexity data storage unit in which reference complexity data including a reference complexity range that is a reference of a complexity range of a target word used for converted document data is stored;
A synonym data storage unit storing synonym data in which synonyms of the target word are associated;
A target word extracting means for extracting a target word contained in the target word data from the document data and storing the target word in the target word data;
For the use target word included in the use target word data, the complexity of the use target word is extracted, and when the extracted complexity of the use target word is outside the range of the reference complexity, from the synonym data, A conversion unit that extracts a conversion target word that is a synonym of the use target word and is within the range of the reference complexity, and outputs converted document data obtained by converting the target word in the document data into the extracted conversion target word A document generation support apparatus comprising:

The conversion means has the extracted complexity of the use target word outside the range of the reference complexity, and the synonym data includes a synonym of the use target word and a conversion target word within the range of the reference complexity. 6. The document generation support apparatus according to claim 5, wherein if there is no conversion, the fact that the target word in the document data cannot be converted is output to the converted document data without conversion.

A preview display of the converted document data, and when the converted document data includes a target word outside the range of the reference complexity, the target word is replaced with a meaningless character string and displayed. The document generation support apparatus according to claim 6, further comprising output means for displaying on the apparatus.

The use target word included in the use target word data and the complexity of the use target word are displayed in association with each other on the display device, and the reference complexity range specified by the operator is displayed on the reference complexity data. The document generation support apparatus according to claim 5, further comprising reference complexity setting means for setting to

A document generation support program for causing a computer to execute the steps according to any one of claims 1 to 4.