JP2014232494A - Document creation assist device and operation method thereof

Info

Publication number: JP2014232494A
Application number: JP2013114093A
Authority: JP
Inventors: Ryo Yamashita; Chihiro Takayama; Takehiko Ono; Yoko Asano
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2013-05-30
Filing date: 2013-05-30
Publication date: 2014-12-11

Abstract

PROBLEM TO BE SOLVED: To obtain a prediction value of a response to a document. SOLUTION: A response prediction acquisition unit 4 searches a response history DB for a record fulfilling (condition 1) that a difference between a time of day 202 and the current time of day is within a prescribed range, (condition 2) that a situation of whether or not the day in question is a holiday matches a holiday flag 203, (condition 3) that the number of characters in a document concerned matches the number of characters 206, and (condition 4) that a category of the document concerned and category information 204 match (S2071). The response prediction acquisition unit 4 reads out a first period response count 211, a second period response count 212, a third period response count 213, a fourth period response count 214, and a fifth period response count 215, respectively as a first period response prediction value, a second period response prediction value, a third period response prediction value, a fourth period response prediction value, and a fifth period response prediction value (S2077), and then terminates the process. A response prediction output unit 5 outputs the first period response prediction value, the second period response prediction value, the third period response prediction value, the fourth period response prediction value, and the fifth period response prediction value.

Description

The present invention relates to a document creation support apparatus and an operation method thereof.

Conventionally, a document is either composed by its author unaided or created with assistance that saves the author effort on the basis of predetermined rules.

Japanese Patent Laid-Open No. 02-297124
Japanese Patent Laid-Open No. 02-116956

When authors write a document by themselves, they need to become proficient at writing documents that readily draw responses, for example by posting documents, having people read them, and receiving responses.

On the other hand, when a document is created with assistance, the creation time is shortened, but whether the document readily draws responses is not known until responses are actually received.

In either case, an author who is not accustomed to writing documents may be unable to write a document that readily draws responses, or cannot predict the response, and therefore bears a heavy psychological burden when posting.

The present invention has been made in view of the above problems, and its object is to provide a document creation support apparatus, and an operation method thereof, capable of obtaining a predicted value of the response to a document.

To solve the above problems, a document creation support apparatus according to a first aspect of the present invention includes a response prediction acquisition unit that obtains a feature quantity of a target document, which is the document whose creation is to be supported, and obtains, based on the feature quantity, a response prediction value, which is a predicted value of the response to the target document.

For example, the document creation support apparatus includes a database that has a plurality of records for documents other than the target document, each record containing the feature quantity of the corresponding document and the number of responses to that document. The response prediction acquisition unit searches the database for a record corresponding to the feature quantity of the target document and obtains the response prediction value based on the number of responses contained in that record.

For example, the feature quantity includes the number of characters remaining after unnecessary characters are deleted from the target document.

For example, the feature quantity includes the number of characters remaining after unnecessary characters are deleted from the target document, the category of the target document, and the subcategory of the target document.

For example, the document creation support apparatus includes a document classification database that has a record for each group of documents sharing a common category and subcategory, each record containing elements that indicate the characteristics of the group. The response prediction acquisition unit generates a feature vector indicating the characteristics of the target document, searches the document classification database for the records corresponding to the category of the target document, generates for each retrieved record a representative point vector composed of the elements contained in that record, obtains the distance between the feature vector and each representative point vector, and takes the subcategory corresponding to the minimum distance as the subcategory of the target document.
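The subcategory selection described above amounts to a nearest-representative-point search. The following is a minimal sketch under stated assumptions: the text does not name the distance measure, so Euclidean distance is assumed here, and the function name, subcategory labels, and vector values are hypothetical.

```python
import math


def nearest_subcategory(feature_vector, representatives):
    """Return the subcategory whose representative point is closest.

    representatives maps a subcategory label to its representative point
    vector, which must have the same length as feature_vector.
    """
    best_label, best_distance = None, math.inf
    for label, point in representatives.items():
        # Euclidean distance is an assumption; the source only says "distance".
        distance = math.dist(feature_vector, point)
        if distance < best_distance:
            best_label, best_distance = label, distance
    return best_label


# Hypothetical representative points for three subcategories of one category.
representatives = {
    "1": [0.7, 0.1, 0.1, 0.1],
    "2": [0.1, 0.6, 0.2, 0.1],
    "3": [0.2, 0.2, 0.2, 0.4],
}
target_vector = [0.6, 0.2, 0.1, 0.1]
chosen = nearest_subcategory(target_vector, representatives)
```

With these illustrative values the target vector lies closest to the representative point of subcategory "1".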

A document creation support apparatus according to a second aspect of the present invention includes a document input unit that inputs a target document, a response prediction acquisition unit that obtains a feature quantity of the target document and obtains, based on the feature quantity, a response prediction value that is a predicted value of the response to the target document, and a response prediction output unit that outputs the response prediction value.

An operation method of a document creation support apparatus according to a third aspect of the present invention includes a step in which a response prediction acquisition unit of the document creation support apparatus obtains a feature quantity of a target document, and a step in which the response prediction acquisition unit obtains, based on the feature quantity, a response prediction value that is a predicted value of the response to the target document.

An operation method of a document creation support apparatus according to a fourth aspect of the present invention includes a step in which a document input unit of the document creation support apparatus inputs a target document, which is the document whose creation is to be supported, a step in which a response prediction acquisition unit of the document creation support apparatus obtains a feature quantity of the target document, a step in which the response prediction acquisition unit obtains, based on the feature quantity, a response prediction value that is a predicted value of the response to the target document, and a step in which a response prediction output unit of the document creation support apparatus outputs the response prediction value.

According to the present invention, a response prediction value, which is a predicted value of the response to a document, can be obtained.

A diagram showing the configuration of the document creation support apparatus according to the first embodiment.
A diagram showing an example of the data structure of the response history DB 2.
A diagram showing an example of a screen of the display device connected to the document creation support apparatus.
A flowchart showing the flow of processing of the document input unit 1.
An overall flowchart showing the flow of processing of the response prediction acquisition unit 4 and the response prediction output unit 5.
A flowchart of the acquisition of response prediction values (S207) by the response prediction acquisition unit 4.
A diagram showing an example of records retrieved from the response history DB 2 by the response prediction acquisition unit 4.
A diagram showing another example of a screen of the display device connected to the document creation support apparatus.
A diagram showing the configuration of the document creation support apparatus according to the second embodiment.
A diagram showing an example of the data structure of the response prediction DB 6.
A diagram showing an example of the data structure of the document classification DB 7.
A diagram showing an example of the data structure of the topic DB 8.
A flowchart showing the flow of the process by which the response prediction acquisition unit 4 generates the document classification DB 7.
A diagram showing an example of past document vectors.
An overall flowchart showing the flow of processing of the response prediction acquisition unit 4 and the response prediction output unit 5.
A flowchart of target document subcategory detection (S205) by the response prediction acquisition unit 4.
A flowchart of the acquisition of response prediction values (S208) by the response prediction acquisition unit 4.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

[First Embodiment]
FIG. 1 is a diagram illustrating the configuration of a document creation support apparatus according to the first embodiment.

The document creation support apparatus includes a document input unit 1 that inputs a document whose creation is to be supported (target document), a response history database DB 2 (hereinafter, "database" is abbreviated as DB) that stores the history of responses to documents created in the past (past documents), a response acquisition unit 3 that acquires responses to documents, a response prediction acquisition unit 4 that obtains a feature quantity of the target document and obtains, based on the feature quantity, a response prediction value that is a predicted value of the response to the target document, and a response prediction output unit 5 that outputs the response prediction value.

FIG. 2 is a diagram showing an example of the data structure of the response history DB 2, and shows the portion for the category "Japanese movie".

The response history DB 2 has a record for each document generated in the past (hereinafter referred to as a past document).

Each record includes a document number 201 of the past document, a time 202 at which the past document was generated, a holiday flag 203 indicating whether or not the day on which the past document was generated was a holiday, category information 204 of the past document, a title 205 of the past document, a number of characters 206 of the past document, and a past document 207, which is the past document itself. Each record also includes response counts for the first, second, third, fourth, and fifth periods that arrive in sequence after the past document is created: a first period response count 211, a second period response count 212, a third period response count 213, a fourth period response count 214, and a fifth period response count 215, each being the number of responses in the corresponding period.
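One way to picture such a record is as a small data class. The sketch below is only illustrative: the field names, types, and sample values are assumptions, and only the reference numerals in the comments come from the text.

```python
from dataclasses import dataclass, field
from datetime import time
from typing import List


@dataclass
class ResponseHistoryRecord:
    document_number: int   # document number 201
    created_at: time       # time 202
    is_holiday: bool       # holiday flag 203
    category: str          # category information 204
    title: str             # title 205
    char_count: int        # number of characters 206
    body: str              # past document 207
    # Response counts 211 to 215, one entry per period.
    period_counts: List[int] = field(default_factory=lambda: [0] * 5)


# Hypothetical record; one response arrives during the first period.
record = ResponseHistoryRecord(
    document_number=1,
    created_at=time(10, 0),
    is_holiday=False,
    category="Japanese movie",
    title="Example title",
    char_count=200,
    body="Example body text",
)
record.period_counts[0] += 1
```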

The periods are defined, for example, as follows: the first period runs from the time of creation until 10 minutes later, the second from 10 minutes to 1 hour later, the third from 1 hour to 2 hours later, the fourth from 2 hours to 5 hours later, and the fifth from 5 hours to 10 hours later. Further periods may be provided.
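Assigning a response to a period is a lookup against the period end times. The sketch below uses the example boundaries given above; the function name is hypothetical, and treating each boundary as belonging to the earlier period is an assumption, since the text does not say which period a response exactly on a boundary falls into.

```python
from bisect import bisect_left

# End of each period in minutes after creation (example values from the text).
PERIOD_END_MINUTES = [10, 60, 120, 300, 600]


def period_index(minutes_since_creation):
    """Return the 1-based period for a response, or None if outside all periods."""
    if minutes_since_creation < 0:
        return None
    # Assumption: a response exactly at a boundary counts toward the earlier period.
    position = bisect_left(PERIOD_END_MINUTES, minutes_since_creation)
    if position >= len(PERIOD_END_MINUTES):
        return None
    return position + 1
```

For instance, a response arriving 45 minutes after creation falls in the second period, and one arriving after more than 10 hours falls in none of the five.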

The items stored in a record of the response history DB 2 are not limited to these. Other usable items include the proportion of kanji or hiragana in the document, a numerical degree of specialization of the document calculated from a dictionary of word specialization prepared in advance, a numerical degree of politeness of the document calculated from a dictionary of word endings prepared in advance, and a numerical degree of attention of the document calculated from a dictionary of word attention prepared in advance.

For each actual document (past document), the response acquisition unit 3 acquires the number of responses in each of the first, second, third, fourth, and fifth periods that arrive in sequence after the document is created, and stores them in the first period response count 211, the second period response count 212, the third period response count 213, the fourth period response count 214, and the fifth period response count 215 of the corresponding record in the response history DB 2. The response acquisition unit 3 updates the response history DB 2, for example, each time a new response is obtained.

A response to a document can be measured by numerical indices such as the following. The number of responses is, for example: (1) the number of answers received, if the document is a question posted on a question board; (2) the number of replies, if the document is an email; (3) the number of presses of a rating button or the number of comments, if the document is an article posted on a social networking service (SNS); and (4) the number of views, the number of presses of a rating button, or the number of comments, if the document is an article posted on a web page.

For example, when the document is displayed on the screen of a computer used by a friend of the author and the friend presses the rating button on the screen, the response acquisition unit 3 acquires the number of presses as the number of responses and updates the response history DB 2.

FIG. 3 is a diagram illustrating an example of a screen of the display device connected to the document creation support apparatus.

The document input unit 1 displays a document input screen 100 on the display device, and the response prediction output unit 5 displays a response prediction screen 200. When a document is input to the document input screen 100, the response prediction output unit 5 displays the response prediction values on the response prediction screen 200. When the posting button 300 is pressed, the document is transmitted (posted) to the outside.

For a document whose creation has been completed in this way, that is, a past document, a record for the document is generated in the response history DB 2.

The display contents of the response prediction screen 200 will be described later.

FIG. 4 is a flowchart showing the flow of processing of the document input unit 1.

The document input unit 1 determines whether a document has been input to the document input screen 100 or the content of the document has changed (S101). If so (S101: YES), it calls the response prediction acquisition unit 4 and the response prediction output unit 5 (S103) and returns to step S101. The author of the document selects, for example, a category for the document from among a plurality of categories, and that category is given to the document input unit 1. The document input unit 1 passes to the response prediction acquisition unit 4 the document in the document input screen 100 at the time that unit is called (hereinafter, the target document) together with the category (hereinafter, the target document category).
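The S101 and S103 loop can be sketched as a small change detector. This is an illustrative reading only: the class name, the `poll` method, and the callback are hypothetical, and how the screen content is actually observed is not described in the text.

```python
class DocumentInputWatcher:
    """Calls on_change whenever new or changed text is observed (S101, S103)."""

    def __init__(self, on_change):
        self._on_change = on_change
        self._last_text = ""

    def poll(self, current_text, category):
        # S101: has a document been input, or has its content changed?
        if current_text == self._last_text:
            return False
        self._last_text = current_text
        # S103: hand the target document and its category to the predictor.
        self._on_change(current_text, category)
        return True


calls = []
watcher = DocumentInputWatcher(lambda text, category: calls.append((text, category)))
watcher.poll("", "Japanese movie")             # nothing typed yet: no call
watcher.poll("Great film", "Japanese movie")   # new input: first call
watcher.poll("Great film", "Japanese movie")   # unchanged: no call
watcher.poll("Great film!", "Japanese movie")  # changed: second call
```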

FIG. 5 is an overall flowchart showing the flow of processing of the response prediction acquisition unit 4 and the response prediction output unit 5.

The response prediction acquisition unit 4 decomposes the target document into words (S201), deletes unnecessary words (S203), detects the number of characters of all the remaining words (hereinafter, the word group) (S204), and acquires the response prediction values (S207).

Next, the response prediction output unit 5 displays the response prediction values on the response prediction screen 200 (S209), and control returns to S101.

The response prediction values need not be displayed; they may instead be printed.

The response prediction acquisition unit 4 holds in advance, for example, an unnecessary word list that covers unnecessary words, and in S203 it searches the target document for words and punctuation marks that match entries in the unnecessary word list and deletes them.

In S204, for example, the response prediction acquisition unit 4 rounds off the last digit of the character count and uses the result as the number of characters of the target document. This keeps the number of matching records from becoming too small in the search of the response history DB 2 described later.
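Steps S201 to S204 can be sketched as follows. This is a simplified illustration, not the patented procedure: whitespace splitting stands in for the word decomposition (Japanese text would need a morphological analyzer), the unnecessary word list is hypothetical, and rounding the last digit is implemented as rounding half up to the nearest ten.

```python
import string

# Hypothetical unnecessary word list; the source does not give its contents.
UNNECESSARY_WORDS = {"the", "a", "of", "is"}


def count_characters(document, unnecessary_words=UNNECESSARY_WORDS):
    """Return the character count of the remaining words, rounded to tens."""
    words = document.split()                                  # S201 (stand-in)
    stripped = [w.strip(string.punctuation) for w in words]   # drop punctuation
    kept = [w for w in stripped
            if w and w.lower() not in unnecessary_words]      # S203
    raw_count = sum(len(w) for w in kept)                     # S204
    # Round half up on the last digit using integer arithmetic, because
    # Python's built-in round() rounds halves to the nearest even number.
    return (raw_count + 5) // 10 * 10


rounded = count_characters("The plot of the film is a delight.")
```

In this example the remaining words are "plot", "film", and "delight", 15 characters in total, which rounds to 20.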

FIG. 6 is a flowchart of the acquisition of response prediction values (S207) by the response prediction acquisition unit 4.

The response prediction acquisition unit 4 searches the response history DB 2 for records that satisfy the following: (condition 1) the difference between the time 202 and the current time is within a predetermined range; (condition 2) whether or not the current day is a holiday matches the holiday flag 203; (condition 3) the number of characters of the target document matches the number of characters 206; and (condition 4) the target document category matches the category information 204 (S2071).

FIG. 7 is a diagram illustrating an example of records retrieved from the response history DB 2 by the response prediction acquisition unit 4.

For example, when the current time is "10:01", the current day is not a holiday, the number of characters of the target document is "200", and the category of the target document is "Japanese movie", three records are retrieved that contain, for example, the time 202 "10:00", the holiday flag 203 "0", the number of characters 206 "200", and the category information 204 "Japanese movie".
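The search in S2071 can be sketched as a filter over the four conditions. The dictionary keys, the 30-minute window for the "predetermined range", and the wrap-around at midnight are all assumptions made for illustration; the sample history is hypothetical and only mirrors the example values above.

```python
from datetime import datetime, time, timedelta


def find_matching_records(records, now, is_holiday, char_count, category,
                          max_gap=timedelta(minutes=30)):
    """Return the records that satisfy conditions 1 to 4."""
    matches = []
    for record in records:
        created = datetime.combine(now.date(), record["time"])
        gap = abs(now - created)
        # Assumption: times of day wrap around midnight (23:50 is near 00:05).
        gap = min(gap, timedelta(days=1) - gap)
        if (gap <= max_gap                           # condition 1
                and record["holiday"] == is_holiday  # condition 2
                and record["chars"] == char_count    # condition 3
                and record["category"] == category): # condition 4
            matches.append(record)
    return matches


history = [
    {"time": time(10, 0), "holiday": False, "chars": 200, "category": "Japanese movie"},
    {"time": time(10, 0), "holiday": False, "chars": 200, "category": "Japanese movie"},
    {"time": time(10, 0), "holiday": False, "chars": 200, "category": "Japanese movie"},
    {"time": time(22, 0), "holiday": False, "chars": 200, "category": "Japanese movie"},
    {"time": time(10, 0), "holiday": True, "chars": 200, "category": "Japanese movie"},
]
found = find_matching_records(
    history, datetime(2013, 5, 30, 10, 1), False, 200, "Japanese movie")
```

Here three of the five hypothetical records match; the other two fail the time condition and the holiday condition respectively.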

Returning to FIG. 6, if the number of records is two or more (S2073: YES), the response prediction acquisition unit 4 calculates the average of the first period response counts 211, the average of the second period response counts 212, the average of the third period response counts 213, the average of the fourth period response counts 214, and the average of the fifth period response counts 215. It takes the average of the first period response counts 211 as the response prediction value for the first period (hereinafter, the first period response prediction value), the average of the second period response counts 212 as the response prediction value for the second period (hereinafter, the second period response prediction value), the average of the third period response counts 213 as the response prediction value for the third period (hereinafter, the third period response prediction value), the average of the fourth period response counts 214 as the response prediction value for the fourth period (hereinafter, the fourth period response prediction value), and the average of the fifth period response counts 215 as the response prediction value for the fifth period (hereinafter, the fifth period response prediction value) (S2075), reads out each value, and ends the process.

Taking FIG. 7 as an example, the first period response prediction value is "2", the second period response prediction value is "1", the third period response prediction value is "1", the fourth period response prediction value is "1", and the fifth period response prediction value is "0.66".

If the number of records is one (S2073: NO), the response prediction acquisition unit 4 takes the first period response count 211 as the first period response prediction value, the second period response count 212 as the second period response prediction value, the third period response count 213 as the third period response prediction value, the fourth period response count 214 as the fourth period response prediction value, and the fifth period response count 215 as the fifth period response prediction value (S2077), reads out each value, and ends the process.
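The two branches of S2073 can be sketched in a few lines. The function name and the sample counts are hypothetical; the counts are chosen only so that the averages reproduce the values quoted for FIG. 7, and the handling of an empty search result is an assumption because the text does not cover that case.

```python
from statistics import mean


def predict_period_responses(matching_counts):
    """matching_counts holds one sequence of five period counts per record."""
    if not matching_counts:
        return None  # assumption: the source does not describe this case
    if len(matching_counts) == 1:
        return list(matching_counts[0])                  # S2077
    return [mean(column) for column in zip(*matching_counts)]  # S2075


# Hypothetical counts for three retrieved records (periods 1 to 5).
retrieved = [
    (3, 1, 1, 2, 1),
    (2, 1, 1, 0, 1),
    (1, 1, 1, 1, 0),
]
prediction = predict_period_responses(retrieved)
```

The fifth average here is 2/3, which corresponds to the "0.66" quoted for FIG. 7 when truncated to two decimal places.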

On the response prediction screen 200 of FIG. 3, the response prediction output unit 5 plots time on the horizontal axis and the response prediction value on the vertical axis, and displays a line graph connecting the following points: point P1, corresponding to the first period response prediction value and the end time of the first period; point P2, corresponding to the sum of the first and second period response prediction values and the end time of the second period; point P3, corresponding to the sum of the first, second, and third period response prediction values and the end time of the third period; point P4, corresponding to the sum of the first, second, third, and fourth period response prediction values and the end time of the fourth period; and point P5, corresponding to the sum of the first, second, third, fourth, and fifth period response prediction values and the end time of the fifth period.

This lets the author of the target document see how the cumulative number of responses would develop if the current document were posted. The response prediction values may also be displayed as numbers or the like, and they may be printed.
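The points P1 to P5 are running totals of the per-period prediction values paired with the period end times. A minimal sketch, assuming the example period boundaries given earlier (expressed in minutes) and a hypothetical function name:

```python
from itertools import accumulate

# End of each period in minutes after posting (example values from the text).
PERIOD_END_MINUTES = [10, 60, 120, 300, 600]


def cumulative_points(period_predictions):
    """Return (end time, cumulative prediction) pairs for points P1 to P5."""
    return list(zip(PERIOD_END_MINUTES, accumulate(period_predictions)))


# Hypothetical per-period prediction values.
points = cumulative_points([2, 1, 1, 1, 1])
```

Each pair can then be handed to whatever plotting facility draws the line graph.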

(Modification)
Alternatively, the following aspect may be adopted in this embodiment.

In step S2071, the response prediction acquisition unit 4 provisionally sets the number of characters of the target document to each of 50, 100, 150, 200, and 250.

As shown in FIG. 8, the response prediction output unit 5 plots the number of characters on the horizontal axis and the response prediction value (for example, the first period response prediction value) on the vertical axis, and displays a line graph connecting the following points: point P11, corresponding to the character count "50" and the response prediction value obtained with the number of characters set to 50; point P12, corresponding to the character count "100" and the response prediction value obtained with the number of characters set to 100; point P13, corresponding to the character count "150" and the response prediction value obtained with the number of characters set to 150; point P14, corresponding to the character count "200" and the response prediction value obtained with the number of characters set to 200; and point P15, corresponding to the character count "250" and the response prediction value obtained with the number of characters set to 250. It highlights the point corresponding to the number of characters of the target document, for example point P12 for a character count of "100".

This lets the author of the target document learn roughly how much the number of characters of the current document should be increased or decreased to raise the number of responses.

The number of characters that gives the highest response prediction value (the optimum number of characters) may also be obtained, and the current number of characters and the optimum number of characters may be displayed as numbers or the like. The response prediction values may be printed.
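Finding the optimum number of characters is a maximum search over the provisional character counts. The sketch below assumes the prediction values for each count have already been obtained; the function name and all numbers are hypothetical.

```python
def best_char_count(predictions_by_char_count):
    """Return the character count with the highest response prediction value."""
    return max(predictions_by_char_count, key=predictions_by_char_count.get)


# Hypothetical first period prediction values for each provisional count.
predictions = {50: 0.5, 100: 1.2, 150: 2.0, 200: 1.6, 250: 0.9}
current_count = 100
optimal_count = best_char_count(predictions)
characters_to_add = optimal_count - current_count
```

With these illustrative values the optimum is 150 characters, so the author would be advised to add about 50 characters.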

[Second Embodiment]
Next, a second embodiment of the present invention will be described. The second embodiment uses apparatus and apparatus configurations that are the same as or similar to those of the first embodiment. Elements that are the same or similar are given the reference numerals used in the first embodiment and are not described again; the description focuses on matters that differ from the first embodiment.

FIG. 9 is a diagram illustrating the configuration of a document creation support apparatus according to the second embodiment.

The document creation support apparatus includes the document input unit 1, the response history DB 2, the response acquisition unit 3, the response prediction acquisition unit 4, the response prediction output unit 5, a response prediction DB 6 that stores data for response prediction, a document classification DB 7 that stores data obtained by classifying documents, and a topic DB 8 that stores data on the topics of documents.

FIG. 10 is a diagram illustrating an example of the data structure of the response prediction DB 6.

A group is formed by one or more documents in the response history DB 2 that share common values for the time 202, the holiday flag 203, the category information 204, and the number of characters 206, and that also share a common subcategory, a subcategory being a finer subdivision of the category information 204. When there are a plurality of such groups, the response prediction DB 6 has a record for each group.

Each record includes a time 601 identical to the relevant time 202, a holiday flag 602 identical to the relevant holiday flag 203, category information 603 identical to the relevant category information 204, subcategory information 604 indicating the relevant subcategory, a number of characters 605 identical to the relevant number of characters 206, a first period response count 611 calculated from the relevant first period response counts 211, a second period response count 612 calculated from the relevant second period response counts 212, a third period response count 613 calculated from the relevant third period response counts 213, a fourth period response count 614 calculated from the relevant fourth period response counts 214, and a fifth period response count 615 calculated from the relevant fifth period response counts 215.
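Building such group records can be sketched as a group-by over the history. This passage says only that the counts 611 to 615 are "calculated from" the history counts, so the use of the mean below is an assumption; the dictionary keys, function name, and sample rows are also hypothetical.

```python
from collections import defaultdict
from statistics import mean


def build_prediction_records(history):
    """Group history rows by their shared attributes and aggregate the counts."""
    groups = defaultdict(list)
    for row in history:
        key = (row["time"], row["holiday"], row["category"],
               row["subcategory"], row["chars"])
        groups[key].append(row["period_counts"])
    # Assumption: each group's counts are aggregated with the mean.
    return {key: [mean(column) for column in zip(*counts)]
            for key, counts in groups.items()}


# Hypothetical history rows that already carry a subcategory.
rows = [
    {"time": "10:00", "holiday": False, "category": "Japanese movie",
     "subcategory": "1", "chars": 200, "period_counts": [2, 0, 0, 0, 0]},
    {"time": "10:00", "holiday": False, "category": "Japanese movie",
     "subcategory": "1", "chars": 200, "period_counts": [4, 2, 0, 0, 0]},
    {"time": "10:00", "holiday": False, "category": "Japanese movie",
     "subcategory": "2", "chars": 200, "period_counts": [1, 1, 1, 1, 1]},
]
prediction_db = build_prediction_records(rows)
```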

図１１は、文書分類ＤＢ７のデータ構成の一例を示す図であり、カテゴリ「邦画」且つサブカテゴリ「１」の部分を示す。 FIG. 11 is a diagram showing an example of the data structure of the document classification DB 7 and shows a portion of the category “Japanese movie” and the subcategory “1”.

文書分類ＤＢ７は、反応履歴ＤＢ２のカテゴリ情報２０４について共通な値を有し、且つ、サブカテゴリについても共通な１以上の文書（グループという）が複数ある場合に、グループ毎のレコードを有する。 The document classification DB 7 has one record per group when there are multiple groups, where a group is a set of one or more documents that share the same value of the category information 204 in the reaction history DB 2 and also share the same subcategory.

レコードは、該当のカテゴリ情報２０４と同じカテゴリ情報７０１、該当のサブカテゴリを示すサブカテゴリ情報７０２、該当の文書において所定のトピック（第１トピック）に関する単語が占める割合であるトピック割合７１１、該当の文書において別なトピック（第２トピック）に関する単語が占める割合であるトピック割合７１２、該当の文書において別なトピック（第３トピック）に関する単語が占める割合であるトピック割合７１３、該当の文書において別なトピック（第４トピック）に関する単語が占める割合であるトピック割合７１４を備える。 Each record includes category information 701, which is the same as the corresponding category information 204; subcategory information 702 indicating the corresponding subcategory; a topic ratio 711, which is the proportion of words in the corresponding documents that relate to a predetermined topic (first topic); a topic ratio 712, which is the proportion of words that relate to another topic (second topic); a topic ratio 713, which is the proportion of words that relate to another topic (third topic); and a topic ratio 714, which is the proportion of words that relate to yet another topic (fourth topic).

図１２は、トピックＤＢ８のデータ構成の一例を示す図であり、カテゴリ「邦画」の部分を示す。 FIG. 12 is a diagram showing an example of the data configuration of the topic DB 8 and shows a portion of the category “Japanese movie”.

トピックＤＢ８は、カテゴリとトピックの組み合わせ（組という）が複数ある場合に、組毎のレコードを有する。 The topic DB 8 has one record per pair when there are multiple combinations of a category and a topic (each combination is referred to as a pair).

レコードは、該当のカテゴリを示すカテゴリ情報８０１、該当のトピックを示すトピック番号８０２、該当の文書における該当のトピックに関する第１、第２、第３の単語をそれぞれ示す単語情報８１１、単語情報８１２、単語情報８１３を備える。 The record includes category information 801 indicating the corresponding category, topic number 802 indicating the corresponding topic, word information 811 indicating the first, second, and third words related to the topic in the corresponding document, word information 812, Word information 813 is provided.

なお、単語情報（単語情報８１１等）の数は、３以下であってもよく、最大で３とし、トピック数は、４とした。 Note that the number of word information items (word information 811 and so on) in a record may be three or fewer; in this example the maximum is three and the number of topics is four.

このようなトピックＤＢ８は、トピック数を４として、図２に示す反応履歴ＤＢ２の部分にLDA(Latent Dirichlet Allocation), LSA(Latent Semantic Analysis), pLSA(Probabilistic Latent Semantic Analysis)などを適用することで生成できる。 Such a topic DB 8 can be generated by setting the number of topics to four and applying LDA (Latent Dirichlet Allocation), LSA (Latent Semantic Analysis), pLSA (Probabilistic Latent Semantic Analysis), or a similar method to the portion of the reaction history DB 2 shown in FIG. 2.

図１３は、反応予測取得部４による文書分類ＤＢ７の生成処理の流れを示すフローチャートである。 FIG. 13 is a flowchart showing a flow of processing for generating the document classification DB 7 by the reaction prediction acquisition unit 4.

ここでは、１つのカテゴリ（以下、対象カテゴリという）につき、対象カテゴリに一致するカテゴリ情報７０１を含むレコードの生成方法を説明する。他の対象カテゴリに一致するカテゴリ情報７０１を含むレコードも同様に生成される。 Here, a method for generating a record including category information 701 matching the target category for one category (hereinafter referred to as a target category) will be described. A record including category information 701 that matches another target category is generated in the same manner.

反応予測取得部４は、反応履歴ＤＢ２から対象カテゴリに一致するカテゴリ情報２０４を含むレコードを検索し、過去文書２０７（以下、過去文書という）を取り出す（Ｓ６０１）。 The reaction prediction acquisition unit 4 searches the reaction history DB 2 for a record including the category information 204 that matches the target category, and extracts a past document 207 (hereinafter referred to as a past document) (S601).

反応予測取得部４は、トピックＤＢ８から対象カテゴリに一致するカテゴリ情報８０１を含むレコードを検索し、トピック番号８０２を取り出す（Ｓ６０３）。 The reaction prediction acquisition unit 4 searches the topic DB 8 for a record including the category information 801 that matches the target category, and extracts the topic number 802 (S603).

ここで、４つのトピック番号（以下、トピック番号８０２１、８０２２、８０２３、８０２４という）が取り出されたこととする。 Here, it is assumed that four topic numbers (hereinafter referred to as topic numbers 8021, 8022, 8023, and 8024) are extracted.

反応予測取得部４は、過去文書とトピック番号の組ごとに、当該過去文書にあり且つ当該トピック番号を含むトピックＤＢ８のレコード内の単語情報８１１、単語情報８１２、単語情報８１３のいずれかに一致する単語の数を求める（Ｓ６０５）。 For each pair of past document and topic number, the reaction prediction acquisition unit 4 matches one of the word information 811, word information 812, and word information 813 in the record of the topic DB 8 that is in the past document and includes the topic number. The number of words to be calculated is obtained (S605).

反応予測取得部４は、過去文書に関係なくトピック番号８０２１について求めた単語数の総和（以下、総和８０２１０）、過去文書に関係なくトピック番号８０２２について求めた単語数の総和（以下、総和８０２２０）、過去文書に関係なくトピック番号８０２３について求めた単語数の総和（以下、総和８０２３０）を求める（Ｓ６０７）。 The reaction prediction acquisition unit 4 sums the number of words obtained for the topic number 8021 regardless of the past document (hereinafter, sum 80210), and sums of the number of words obtained for the topic number 8022 regardless of the past document (hereinafter, sum 80220). Then, the sum of the number of words obtained for the topic number 8023 regardless of the past document (hereinafter, sum 80230) is obtained (S607).

反応予測取得部４は、過去文書ごとに、当該過去文書とトピック番号８０２１について求めた単語数の総和８０２１０に占める割合（以下、トピック割合９０１という）、トピック番号８０２２について求めた単語数の総和８０２２０に占める割合（以下、トピック割合９０２という）、トピック番号８０２３について求めた単語数の総和８０２３０に占める割合（以下、トピック割合９０３という）、トピック番号８０２４について求めた単語数の総和に占める割合（以下、トピック割合９０４という）、を要素とする過去文書ベクトルを生成する（Ｓ６０９）。 For each past document, the reaction prediction acquisition unit 4 generates a past document vector (S609) whose elements are: the ratio of the word count obtained for that past document and topic number 8021 to the sum 80210 (hereinafter, topic ratio 901); the ratio of the word count obtained for topic number 8022 to the sum 80220 (hereinafter, topic ratio 902); the ratio of the word count obtained for topic number 8023 to the sum 80230 (hereinafter, topic ratio 903); and the ratio of the word count obtained for topic number 8024 to the corresponding sum (hereinafter, topic ratio 904).

図１４は、図２に示す反応履歴ＤＢ２の各レコードに対応する文書の過去文書ベクトルの例を示す図である。 FIG. 14 is a diagram illustrating an example of a past document vector of a document corresponding to each record of the reaction history DB 2 illustrated in FIG.

文書番号「１」の過去文書については、過去文書ベクトルのトピック割合９０１は０．７であり、トピック割合９０２は０であり、トピック割合９０３は０であり、トピック割合９０４は０．３である。 For the past document with the document number “1”, the topic ratio 901 of the past document vector is 0.7, the topic ratio 902 is 0, the topic ratio 903 is 0, and the topic ratio 904 is 0.3. .

文書番号「２」の過去文書については、過去文書ベクトルのトピック割合９０１は０．１であり、トピック割合９０２は０．５であり、トピック割合９０３は０であり、トピック割合９０４は０．４である。 For the past document with the document number “2”, the topic ratio 901 of the past document vector is 0.1, the topic ratio 902 is 0.5, the topic ratio 903 is 0, and the topic ratio 904 is 0.4. It is.

文書番号「３」の過去文書については、過去文書ベクトルのトピック割合９０１は０．１であり、トピック割合９０２は０であり、トピック割合９０３は０．５であり、トピック割合９０４は０．４である。 For the past document with the document number “3”, the topic ratio 901 of the past document vector is 0.1, the topic ratio 902 is 0, the topic ratio 903 is 0.5, and the topic ratio 904 is 0.4. It is.

文書番号「４」の過去文書については、過去文書ベクトルのトピック割合９０１は０．６であり、トピック割合９０２は０であり、トピック割合９０３は０．１であり、トピック割合９０４は０．３である。 For the past document with the document number “4”, the topic ratio 901 of the past document vector is 0.6, the topic ratio 902 is 0, the topic ratio 903 is 0.1, and the topic ratio 904 is 0.3. It is.

図１３に戻り、反応予測取得部４は、２つの過去文書ベクトルｉ，ｊからなる組ごとに、以下の式により、当該過去文書ベクトル間の距離ｄを計算する（Ｓ６１１）。

Returning to FIG. 13, the reaction prediction acquisition unit 4 calculates the distance d between the past document vectors for each set of the two past document vectors i and j by the following formula (S611).

上記例では、要素数は４なので、ｚｍは４で計算される。 In the above example, since the number of elements is 4, zm is calculated as 4.
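
The formula referred to in S611 did not survive in this text. As a hedged reconstruction, the distances D quoted later in the worked example (0.339, 0.424, 0.616) are reproduced by the Euclidean distance, so the sketch below assumes that d has the same form. The element symbol $x_{i,z}$ (the z-th element of past document vector i) is introduced here for illustration only and does not appear in the original.

$$
d(i, j) = \sqrt{\sum_{z=1}^{z_m} \left( x_{i,z} - x_{j,z} \right)^2}, \qquad z_m = 4
$$

Under this assumption, the past document vectors of documents "1" and "4" in FIG. 14 give $d = \sqrt{0.1^2 + 0^2 + 0.1^2 + 0^2} \approx 0.141$.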

反応予測取得部４は、距離ｄに基づき、所定の分類技術（例えば、k-meansクラスタリング）を用い、各過去文書にサブカテゴリ情報を付与し、且つ、同一のサブカテゴリ情報を付与された２つの過去文書についての距離ｄは、異なるサブカテゴリ情報を付与された２つの過去文書についての距離ｄより短くなるようにする（Ｓ６１３）。 Based on the distance d, the reaction prediction acquisition unit 4 assigns subcategory information to each past document using a predetermined classification technique (for example, k-means clustering), such that the distance d between two past documents given the same subcategory information is shorter than the distance d between two past documents given different subcategory information (S613).

例えば、図１４に示す文書番号「１」、「４」の過去文書には、サブカテゴリ（サブカテゴリ情報「１」）が付与され、文書番号「２」の過去文書には、サブカテゴリ（サブカテゴリ情報「２」）が付与され、文書番号「３」の過去文書には、サブカテゴリ（サブカテゴリ情報「３」）が付与される。 For example, the past documents with document numbers “1” and “4” shown in FIG. 14 are assigned a subcategory (subcategory information “1”), and the past document with document number “2” is assigned a subcategory (subcategory information “2”). )) And a past category with document number “3” is given a subcategory (subcategory information “3”).

つまり、４つの過去文書が３つにクラスタリングされる。 That is, four past documents are clustered into three.

反応予測取得部４は、各サブカテゴリ情報につき、当該サブカテゴリ情報を付与された過去文書についてのトピック割合９０１の平均値、トピック割合９０２の平均値、トピック割合９０３の平均値、トピック割合９０４の平均値を計算する（Ｓ６１５）。 For each subcategory information, the reaction prediction acquisition unit 4 obtains the average value of the topic ratio 901, the average value of the topic ratio 902, the average value of the topic ratio 903, and the average value of the topic ratio 904 for the past document to which the subcategory information is assigned. Is calculated (S615).

反応予測取得部４は、各サブカテゴリ情報につき、文書分類ＤＢ７から、当該サブカテゴリ情報に一致するサブカテゴリ情報７０２を含むレコードを検索し、トピック割合９０１の平均値をトピック割合７１１、トピック割合９０２の平均値をトピック割合７１２、トピック割合９０３の平均値をトピック割合７１３、トピック割合９０４の平均値をトピック割合７１４として記憶させ（Ｓ６１７）、処理を終える。 The reaction prediction acquisition unit 4 searches the document classification DB 7 for each subcategory information for a record including the subcategory information 702 that matches the subcategory information, and sets the average value of the topic ratio 901 as the average value of the topic ratio 711 and the topic ratio 902. Are stored as the topic ratio 713, the average value of the topic ratio 903 as the topic ratio 713, and the average value of the topic ratio 904 as the topic ratio 714 (S617), and the process ends.
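
The averaging in S615 can be sketched as follows. This is a minimal illustration, not the apparatus's actual implementation: the vectors are the FIG. 14 values quoted above, the subcategory assignment is the one given in the example for S613, and the names `past_vectors`, `subcategory_of`, and `representative_points` are hypothetical.

```python
from collections import defaultdict

# Past document vectors quoted from the FIG. 14 example:
# document number -> (topic ratio 901, 902, 903, 904).
past_vectors = {
    1: (0.7, 0.0, 0.0, 0.3),
    2: (0.1, 0.5, 0.0, 0.4),
    3: (0.1, 0.0, 0.5, 0.4),
    4: (0.6, 0.0, 0.1, 0.3),
}

# Subcategory information given to each document in the S613 example.
subcategory_of = {1: 1, 4: 1, 2: 2, 3: 3}


def representative_points(vectors, assignment):
    """Average the topic ratios of the documents in each subcategory (S615)."""
    groups = defaultdict(list)
    for doc_id, vec in vectors.items():
        groups[assignment[doc_id]].append(vec)
    return {
        sub: tuple(sum(column) / len(vecs) for column in zip(*vecs))
        for sub, vecs in groups.items()
    }


reps = representative_points(past_vectors, subcategory_of)
for sub in sorted(reps):
    print(sub, [round(v, 2) for v in reps[sub]])
```

Each resulting tuple corresponds to the values that S617 stores as topic ratios 711 to 714 in the record for that subcategory.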

文書入力部１の処理については、第１の実施の形態と同様なので説明を省略する。 Since the processing of the document input unit 1 is the same as that of the first embodiment, description thereof is omitted.

図１５は、反応予測取得部４と反応予測出力部５の処理の流れを示す全体フローチャートである。 FIG. 15 is an overall flowchart showing a processing flow of the reaction prediction acquisition unit 4 and the reaction prediction output unit 5.

反応予測取得部４は、対象文書を単語に分解し（Ｓ２０１）、不要な単語を削除し（Ｓ２０３）、残った単語群の文字数を検出し（Ｓ２０４）、単語群と対象文書カテゴリを基に対象文書のサブカテゴリ（以下、対象文書サブカテゴリという）を検出し（Ｓ２０５）、反応予測値を取得する（Ｓ２０８）。 The reaction prediction acquisition unit 4 decomposes the target document into words (S201), deletes unnecessary words (S203), detects the number of characters in the remaining word group (S204), and based on the word group and the target document category A subcategory of the target document (hereinafter referred to as a target document subcategory) is detected (S205), and a predicted reaction value is acquired (S208).

図１６は、反応予測取得部４による対象文書サブカテゴリ検出（Ｓ２０５）のフローチャートである。 FIG. 16 is a flowchart of target document subcategory detection (S205) by the reaction prediction acquisition unit 4.

反応予測取得部４は、トピックＤＢ８から対象文書カテゴリに一致するカテゴリ情報８０１を含むレコードを検索する（Ｓ２０５１）。 The reaction prediction acquisition unit 4 searches the topic DB 8 for a record including category information 801 that matches the target document category (S2051).

反応予測取得部４は、検索された各レコードにつき、例えば、図１２に示す各レコードにつき、単語群にあり且つ単語情報８１１、８１２、８１３のいずれかに一致する単語の数を求める（Ｓ２０５３）。 The reaction prediction acquisition unit 4 obtains the number of words that are in the word group and match any one of the word information 811, 812, and 813 for each searched record, for example, for each record shown in FIG. 12 (S 2053). .

反応予測取得部４は、各単語数の総和（各単語数の総和）に占める割合を要素とする特徴ベクトルを生成する（Ｓ２０５５）。例えば、反応予測取得部４は、例えば、図１２のサブカテゴリ情報「１」、「２」、「３」、「４」のそれぞれに対応する割合を要素とする特徴ベクトルを生成する。 The reaction prediction acquisition unit 4 generates a feature vector whose elements are the ratios of each word count to the sum of the word counts (S2055). For example, the reaction prediction acquisition unit 4 generates a feature vector whose elements are the ratios corresponding to the subcategory information "1", "2", "3", and "4" in FIG. 12.

反応予測取得部４は、文書分類ＤＢ７から対象文書カテゴリに一致するカテゴリ情報７０１を含むレコードを検索する（Ｓ２０５７）。 The reaction prediction acquisition unit 4 searches the document classification DB 7 for records including the category information 701 that matches the target document category (S2057).

反応予測取得部４は、検索された各レコードにつき、トピック割合７１１、トピック割合７１２、トピック割合７１３、トピック割合７１４を要素とする代表点ベクトルを生成する（Ｓ２０５９）。 The reaction prediction acquisition unit 4 generates a representative point vector having the topic ratio 711, the topic ratio 712, the topic ratio 713, and the topic ratio 714 as elements for each retrieved record (S2059).

反応予測取得部４は、特徴ベクトルｍと代表点ベクトルｎの組ごとに、以下の式により、特徴ベクトルｍと代表点ベクトルｎの間の距離Ｄを計算する（Ｓ２０６１）。

The reaction prediction acquisition unit 4 calculates the distance D between the feature vector m and the representative point vector n by the following formula for each set of the feature vector m and the representative point vector n (S2061).

反応予測取得部４は、最小の距離Ｄに対応する文書分類ＤＢ７のレコードからサブカテゴリ情報７０２、すなわち、対象文書サブカテゴリを取り出し（Ｓ２０６３）、処理を終える。 The reaction prediction acquisition unit 4 extracts the subcategory information 702, that is, the target document subcategory from the record of the document classification DB 7 corresponding to the minimum distance D (S2063), and ends the process.

図１７は、反応予測取得部４による反応予測値の取得（Ｓ２０８）のフローチャートである。 FIG. 17 is a flowchart of the reaction prediction value acquisition (S208) by the reaction prediction acquisition unit 4.

反応予測取得部４は、反応予測ＤＢ６から、（条件１）時刻６０１と現在時刻の差が所定範囲内にある、（条件２）当日が休日であるか否かの状況が休日フラグ６０２に一致する、（条件３）対象文書の文字数が文字数６０５に一致する、（条件４）対象文書カテゴリとカテゴリ情報６０３が一致し且つ対象文書サブカテゴリとサブカテゴリ情報６０４が一致する、を充足するレコードを検索する（Ｓ２０８１）。 The reaction prediction acquisition unit 4 determines from the reaction prediction DB 6 that (condition 1) the difference between the time 601 and the current time is within a predetermined range, (condition 2) whether the current day is a holiday or not matches the holiday flag 602. Search for records satisfying (Condition 3) The number of characters of the target document matches the number of characters 605, (Condition 4) The target document category and category information 603 match, and the target document subcategory and subcategory information 604 match. (S2081).

反応予測取得部４は、第１期間反応数６１１を第１期間反応予測値、第２期間反応数６１２を第２期間反応予測値、第３期間反応数６１３を第３期間反応予測値、第４期間反応数６１４を第４期間反応予測値、第５期間反応数６１５を第５期間反応予測値とし（Ｓ２０８７）、それぞれを読み出して、処理を終える。 The reaction prediction acquisition unit 4 takes the first period reaction number 611 as the first period reaction prediction value, the second period reaction number 612 as the second period reaction prediction value, the third period reaction number 613 as the third period reaction prediction value, the fourth period reaction number 614 as the fourth period reaction prediction value, and the fifth period reaction number 615 as the fifth period reaction prediction value (S2087), reads each of them out, and ends the process.

このように、予め反応予測ＤＢ６を設け、反応予測ＤＢ６から対象文書に応じた反応予測値を取り出すことで、反応予測値を迅速に得ることができる。 Thus, by providing reaction prediction DB6 beforehand and taking out the reaction prediction value according to the object document from reaction prediction DB6, a reaction prediction value can be obtained rapidly.

反応予測出力部５の処理は第１の実施の形態と同様である。 The process of the reaction prediction output unit 5 is the same as that in the first embodiment.

例えば、対象文書が、「あなたが過去に見た作品の中で一番おすすめする、面白いと思った邦画のタイトルを教えてください。私はアクションとホラーがとても好きなので、出来ればアクションかホラーの映画で教えていただけると嬉しいです。」とする。 For example, the target document is “Please tell me the title of the most interesting Japanese movie you have seen in the past. I really like action and horror. I would be happy if you could tell me in the movie. "

対象文書は単語に分解され（図１５：Ｓ２０１）、不要な単語を削除され（図１５：Ｓ２０３）、残った単語群には、「おすすめ」「面白い」「邦画」「タイトル」「映画」が含まれることとなる。 The target document is decomposed into words (FIG. 15: S201), unnecessary words are deleted (FIG. 15: S203), and “recommended”, “interesting”, “Japanese film”, “title”, “movie” are included in the remaining word group. Will be included.

例えば、上記５単語のうち、図１２のカテゴリ情報７０１「邦画」とサブカテゴリ情報「１」の組に対応する単語は「おすすめ」「面白い」である。 For example, among the above five words, the words corresponding to the set of category information 701 “Japanese movie” and subcategory information “1” in FIG. 12 are “recommended” and “interesting”.

カテゴリ情報７０１「邦画」とサブカテゴリ情報「２」の組に対応する単語は「名前」である。カテゴリ情報７０１「邦画」とサブカテゴリ情報「３」の組に対応する単語は無い。カテゴリ情報７０１「邦画」とサブカテゴリ情報「４」の組に対応する単語は「映画」「邦画」である。 The word corresponding to the set of category information 701 “Japanese film” and subcategory information “2” is “name”. There is no word corresponding to the set of the category information 701 “Japanese movie” and the subcategory information “3”. The words corresponding to the set of category information 701 “Japanese movie” and subcategory information “4” are “movie” and “Japanese movie”.

よって、特徴ベクトルは、｛2/5, 1/5, 0, 2/5｝＝｛0.40, 0.20, 0, 0.40｝となる。 Therefore, the feature vector is {2/5, 1/5, 0, 2/5} = {0.40, 0.20, 0, 0.40}.

ここでは、有効数字２桁で計算した。 Here, the calculation was made with two significant figures.
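
The feature vector computation of S2053 and S2055 can be sketched as below. This is only an illustration under stated assumptions: FIG. 12 is not reproduced in this text, so `TOPIC_WORDS` is a hypothetical stand-in for the topic DB 8 rows of the category "Japanese movie", chosen so that the five remaining words fall into the groups 2, 1, 0, 2 described above. The function name and the all-zero result for a document with no matching words are also assumptions.

```python
# Hypothetical stand-in for the topic DB 8 rows of one category:
# topic number -> words held in word information 811, 812, 813.
TOPIC_WORDS = {
    1: {"recommended", "interesting"},
    2: {"title"},
    3: {"director"},
    4: {"movie", "Japanese movie"},
}


def feature_vector(word_group, topic_words):
    """Count the words matching each topic (S2053), then normalise the
    counts by their sum (S2055)."""
    counts = [
        sum(1 for word in word_group if word in topic_words[topic])
        for topic in sorted(topic_words)
    ]
    total = sum(counts)
    if total == 0:
        # Assumption: no matching words yields an all-zero vector.
        return [0.0] * len(counts)
    return [count / total for count in counts]


# Word group left after S201 and S203 in the worked example.
word_group = ["recommended", "interesting", "Japanese movie", "title", "movie"]
vec = feature_vector(word_group, TOPIC_WORDS)
print([round(v, 2) for v in vec])
```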

例えば、図１１に示すサブカテゴリ情報７０２「１」に対応する代表点ベクトルと上記特徴ベクトル｛0.40, 0.20, 0, 0.40｝の距離Ｄは、0.339である。サブカテゴリ情報７０２「２」に対応する代表点ベクトルと上記特徴ベクトルの距離Ｄは、0.424である。サブカテゴリ情報７０２「３」に対応する代表点ベクトルと上記特徴ベクトルの距離Ｄは、0.616である。 For example, the distance D between the representative point vector corresponding to the subcategory information 702 "1" shown in FIG. 11 and the feature vector {0.40, 0.20, 0, 0.40} is 0.339. The distance D between the representative point vector corresponding to the subcategory information 702 "2" and the feature vector is 0.424. The distance D between the representative point vector corresponding to the subcategory information 702 "3" and the feature vector is 0.616.

よって、サブカテゴリ情報７０２「１」に対応する代表点ベクトルが選択され、すなわち、対象文書サブカテゴリは、「１」となる。 Therefore, the representative point vector corresponding to the subcategory information 702 "1" is selected; that is, the target document subcategory is "1".
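
The nearest-representative selection of S2061 and S2063 can be checked numerically with the sketch below. It rests on two assumptions that are not stated in this text: that the missing distance formula is the Euclidean distance, and that the FIG. 11 representative points equal the per-subcategory averages of the FIG. 14 vectors. Together these assumptions reproduce the three distances quoted above. The names `euclidean` and `representatives` are illustrative.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (assumed form of D)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Feature vector of the target document from the worked example.
feature = (0.40, 0.20, 0.0, 0.40)

# Assumed representative points (topic ratios 711 to 714) per subcategory,
# derived by averaging the FIG. 14 vectors within each subcategory.
representatives = {
    1: (0.65, 0.0, 0.05, 0.3),
    2: (0.1, 0.5, 0.0, 0.4),
    3: (0.1, 0.0, 0.5, 0.4),
}

distances = {sub: euclidean(feature, rep) for sub, rep in representatives.items()}
target_subcategory = min(distances, key=distances.get)

for sub in sorted(distances):
    print(sub, round(distances[sub], 3))
print("target document subcategory:", target_subcategory)
```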

また、対象文書の文字数は９９文字であるため、最後の桁を四捨五入すると、文字数は１００となる。 Further, since the number of characters of the target document is 99, the number of characters becomes 100 when the last digit is rounded off.

図１０においては、時刻６０１と現在時刻の差が所定範囲内にあり、当日が休日であるか否かの状況が休日フラグ６０２に一致すると仮定すると、最も上にあるレコードについては、対象文書の文字数「１００」が文字数６０５に一致し、対象文書カテゴリとカテゴリ情報６０３が一致し且つ対象文書サブカテゴリとサブカテゴリ情報６０４が一致するので、第１期間反応数６１１「６」（第１期間反応予測値）、第２期間反応数６１２「４」（第２期間反応予測値）、第３期間反応数６１３「１」（第３期間反応予測値）、第４期間反応数６１４「１」（第４期間反応予測値）、第５期間反応数６１５「０」（第５期間反応予測値）が得られる。 In FIG. 10, assuming that the difference between the time 601 and the current time is within a predetermined range, and the status of whether or not the current day is a holiday is the same as the holiday flag 602, the uppermost record is Since the number of characters “100” matches the number of characters 605, the target document category and category information 603 match, and the target document subcategory and subcategory information 604 match, the first period response number 611 “6” (first period response prediction value) ), Second period response number 612 “4” (second period response predicted value), third period response number 613 “1” (third period response predicted value), fourth period response number 614 “1” (fourth) (Period response predicted value), 5th period response number 615 “0” (5th period response predicted value) is obtained.
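
The lookup of S2081 and S2087 can be sketched as follows. This is a simplified illustration, not the apparatus's implementation: the record fields mirror reference numerals 601 to 615, but the stored time (12:00), the weekday flag, the 60-minute "predetermined range", the handling of times across midnight, and the `None` result when nothing matches are all assumptions, since this text does not specify them. The reaction counts are those quoted for the top record of FIG. 10.

```python
from dataclasses import dataclass
from datetime import time
from typing import List, Optional, Tuple


@dataclass
class PredictionRecord:
    time_601: time
    holiday_602: bool
    category_603: str
    subcategory_604: int
    char_count_605: int
    reactions: Tuple[int, int, int, int, int]  # reaction numbers 611 to 615


def round_last_digit(n: int) -> int:
    """Round the last digit half up, e.g. 99 -> 100."""
    return (n + 5) // 10 * 10


def minutes(t: time) -> int:
    return t.hour * 60 + t.minute


def lookup(db: List[PredictionRecord], now: time, is_holiday: bool,
           category: str, subcategory: int, char_count: int,
           max_diff_min: int = 60) -> Optional[Tuple[int, ...]]:
    """Return the reaction numbers of the first record meeting conditions 1 to 4."""
    for rec in db:
        diff = abs(minutes(rec.time_601) - minutes(now))
        diff = min(diff, 24 * 60 - diff)  # assumption: wrap around midnight
        if (diff <= max_diff_min                                   # condition 1
                and rec.holiday_602 == is_holiday                  # condition 2
                and rec.char_count_605 == round_last_digit(char_count)  # condition 3
                and rec.category_603 == category                   # condition 4
                and rec.subcategory_604 == subcategory):
            return rec.reactions
    return None  # assumption: the no-match case is not described here


db = [PredictionRecord(time(12, 0), False, "Japanese movie", 1, 100, (6, 4, 1, 1, 0))]
prediction = lookup(db, time(12, 30), False, "Japanese movie", 1, 99)
print(prediction)
```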

これにより、例えば、図３に示すような反応予測値の表示がなされる。 Thereby, for example, a reaction predicted value as shown in FIG. 3 is displayed.

第２の実施の形態によれば、対象文書サブカテゴリ、つまり、対象文書に内在するカテゴリを求め、これを基に反応予測値を取得するので、反応予測の精度向上を図ることができる。 According to the second embodiment, the target document subcategory, that is, the category inherent in the target document is obtained, and the reaction prediction value is acquired based on this, so that the accuracy of the reaction prediction can be improved.

ところで、ソーシャル・ネットワーキング・サービス（SNS）ではユーザ毎に異なるコミュニティが築かれており、挙動が異なる可能性がある。この場合、新規ユーザの文書について適切な反応予測値が得られないことが想定される。 By the way, in the social networking service (SNS), different communities are established for each user, and the behavior may be different. In this case, it is assumed that an appropriate response prediction value cannot be obtained for the new user's document.

そこで、文書を閲覧するユーザ毎に反応予測値を分類し、閲覧するユーザが文書の作成者に対して過去に見せた反応に基づいて、新規ユーザの文書に対する反応予測値を計算し、全ての閲覧するユーザに対して計算が終わった後に、反応予測値の総和を計算し、画面に表示する方法も考えられる。 Therefore, the predicted response value is classified for each user who views the document, and the predicted response value for the new user's document is calculated based on the response that the browsing user has shown to the creator of the document in the past. A method is also conceivable in which, after the calculation is completed for the user who browses, the sum of the predicted reaction values is calculated and displayed on the screen.

以上のように、本実施の形態に係る文書作成支援装置によれば、対象文書の特徴量を求め、対象文書への反応の予測値である反応予測値を特徴量に基づいて求める反応予測取得部を備えるので、対象文書の作成者は、反応予測値に基づいて対象文書への反応を予め知ることができ、すなわち、より良い対象文書の作成を支援することができる。 As described above, according to the document creation support apparatus according to the present embodiment, the response prediction acquisition for obtaining the feature amount of the target document and obtaining the reaction predicted value that is the predicted value of the response to the target document based on the feature amount is obtained. Therefore, the creator of the target document can know in advance the reaction to the target document based on the predicted response value, that is, can assist in creating a better target document.

また、過去の履歴を利用することで、例えば、文書の対象であるコミュニティの性質を考慮した反応予測値を得ることができる。例えば、主婦の多いコミュニティへ文書を投稿した場合には、平日の昼に最も多く反応が得られ、休日には反応が少ないなどという知見を得ることが期待できる。また、上記トピックを用いた方法（潜在トピック推定法）を用いて文書分類を行うことで反応予測値の推定精度が向上する。また、文書を投稿する前に反応者の反応の推定値（反応予測値）が表示されることで、文書投稿時の心理的負荷が低減される。 Further, by using the past history, for example, it is possible to obtain a predicted response value in consideration of the nature of the community that is the object of the document. For example, when a document is posted to a community with many housewives, it can be expected to obtain the knowledge that the most reaction is obtained at noon on weekdays and the reaction is less on holidays. In addition, the accuracy of estimation of reaction prediction values is improved by performing document classification using the above-described method using topics (latent topic estimation method). In addition, since the estimated value of the reaction of the responder (reaction predicted value) is displayed before the document is posted, the psychological load at the time of document posting is reduced.

また、文書作成支援装置は、対象文書でない文書について複数のレコードを有し、且つ、当該各レコードは該当の文書の特徴量と当該文書への反応数を含むデータベース（反応履歴ＤＢ２や反応予測ＤＢ６）を備え、反応予測取得部は、データベースから、対象文書の特徴量に対応するレコードを検索し、当該レコードに含まれる反応数に基づいて反応予測値を求めることで、対象文書の作成を支援することができる。 The document creation support apparatus has a plurality of records for a document that is not a target document, and each record includes a database (reaction history DB2 or reaction prediction DB6) that includes the feature amount of the corresponding document and the number of reactions to the document. ), And the reaction prediction acquisition unit searches the record corresponding to the feature quantity of the target document from the database and obtains the predicted response value based on the number of reactions included in the record, thereby supporting the creation of the target document. can do.

また、特徴量は、対象文書の文字数を含むので、文字数に基づいて求めた反応予測値により対象文書の作成を支援することができる。 In addition, since the feature amount includes the number of characters of the target document, creation of the target document can be supported by a predicted response value obtained based on the number of characters.

また、特徴量は、対象文書の文字数、対象文書のカテゴリおよび対象文書のサブカテゴリを含むので、反応予測値の精度向上が期待できる。 Further, since the feature amount includes the number of characters of the target document, the category of the target document, and the subcategory of the target document, it can be expected to improve the accuracy of the predicted response value.

また、カテゴリとサブカテゴリが共通する複数の文書で構成されるグループ毎のレコードを有し、且つ、当該レコードは当該グループの特徴を示す要素を備える文書分類ＤＢを備え、反応予測取得部は、対象文書の特徴を示す特徴ベクトルを生成し、文書分類データベースから対象文書カテゴリに対応するレコードを検索し、検索された各レコードにつき、該レコードに含まれる要素からなる代表点ベクトルを生成し、特徴ベクトルと各代表点ベクトルとの間の距離を求め、最小の距離に対応するサブカテゴリを対象文書のサブカテゴリとするので、反応予測値の精度向上が期待できる。 In addition, the apparatus includes a document classification DB that has a record for each group of documents sharing a common category and subcategory, each record including elements indicating the characteristics of the group. The reaction prediction acquisition unit generates a feature vector indicating the features of the target document, searches the document classification database for records corresponding to the target document category, generates for each retrieved record a representative point vector composed of the elements included in that record, obtains the distance between the feature vector and each representative point vector, and takes the subcategory corresponding to the minimum distance as the subcategory of the target document. An improvement in the accuracy of the reaction prediction value can therefore be expected.

なお、文書作成支援装置としてコンピュータを機能させるためのコンピュータプログラムは、半導体メモリ、磁気ディスク、光ディスク、光磁気ディスク、磁気テープなどのコンピュータ読み取り可能な記録媒体に記録でき、また、インターネットなどの通信網を介して伝送させて、広く流通させることができる。 A computer program for causing a computer to function as a document creation support apparatus can be recorded on a computer-readable recording medium such as a semiconductor memory, a magnetic disk, an optical disk, a magneto-optical disk, a magnetic tape, or a communication network such as the Internet And can be widely distributed.

１…文書入力部
２…反応履歴ＤＢ
３…反応取得部
４…反応予測取得部
５…反応予測出力部
６…反応予測ＤＢ
７…文書分類ＤＢ
８…トピックＤＢ
１００…文書入力画面
２００…反応予測画面
２０１…文書番号
２０２、６０１…時刻
２０３、６０２…休日フラグ
２０４、６０３、７０１、８０１…カテゴリ情報
２０５…タイトル
２０６、６０５…文字数
２０７…過去文書
２１１、６１１…第１期間反応数
２１２、６１２…第２期間反応数
２１３、６１３…第３期間反応数
２１４、６１４…第４期間反応数
２１５、６１５…第５期間反応数
３００…投稿ボタン
６０４、７０２…サブカテゴリ情報
７１１、７１２、７１３、７１４、９１１、９１２、９１３、９１４…トピック割合
８０２、８０２１、８０２２、８０２３、８０２４…トピック番号
８１１、８１２、８１３…単語情報
８０２１０、８０２２０、８０２３０…総和
ｄ、Ｄ…距離
ｉ，ｊ…過去文書ベクトル
ｍ…特徴ベクトル
ｎ…代表点ベクトル

DESCRIPTION OF SYMBOLS
1 ... Document input unit
2 ... Reaction history DB
3 ... Reaction acquisition unit
4 ... Reaction prediction acquisition unit
5 ... Reaction prediction output unit
6 ... Reaction prediction DB
7 ... Document classification DB
8 ... Topic DB
100 ... Document input screen
200 ... Reaction prediction screen
201 ... Document number
202, 601 ... Time
203, 602 ... Holiday flag
204, 603, 701, 801 ... Category information
205 ... Title
206, 605 ... Number of characters
207 ... Past document
211, 611 ... First period reaction number
212, 612 ... Second period reaction number
213, 613 ... Third period reaction number
214, 614 ... Fourth period reaction number
215, 615 ... Fifth period reaction number
300 ... Post button
604, 702 ... Subcategory information
711, 712, 713, 714, 911, 912, 913, 914 ... Topic ratio
802, 8021, 8022, 8023, 8024 ... Topic number
811, 812, 813 ... Word information
80210, 80220, 80230 ... Sum
d, D ... Distance
i, j ... Past document vector
m ... Feature vector
n ... Representative point vector

Claims

A document creation support apparatus comprising a reaction prediction acquisition unit that obtains a feature amount of a target document, which is the document whose creation is to be supported, and obtains, based on the feature amount, a reaction prediction value that is a predicted value of a reaction to the target document.

The document creation support apparatus according to claim 1, further comprising a database that has a plurality of records for documents other than the target document, each record including the feature amount of the corresponding document and the number of reactions to that document,
wherein the reaction prediction acquisition unit searches the database for a record corresponding to the feature amount of the target document and obtains the reaction prediction value based on the number of reactions included in the record.

The document creation support apparatus according to claim 1, wherein the feature amount includes a number of characters after unnecessary characters are deleted from the target document.

The document creation support apparatus according to claim 1, wherein the feature amount includes a number of characters after unnecessary characters are deleted from the target document, a category of the target document, and a subcategory of the target document.

The document creation support apparatus according to claim 4, further comprising a document classification database that has a record for each group of documents sharing a common category and subcategory, each record including elements indicating characteristics of the group,
wherein the reaction prediction acquisition unit:
generates a feature vector indicating features of the target document;
searches the document classification database for records corresponding to the category of the target document and generates, for each retrieved record, a representative point vector composed of the elements included in that record;
obtains a distance between the feature vector and each representative point vector; and
takes the subcategory corresponding to the minimum distance as the subcategory of the target document.

A document creation support apparatus comprising:
a document input unit that inputs a target document;
a reaction prediction acquisition unit that obtains a feature amount of the target document and obtains, based on the feature amount, a reaction prediction value that is a predicted value of a reaction to the target document; and
a reaction prediction output unit that outputs the reaction prediction value.

An operation method of a document creation support apparatus, comprising:
a step in which a reaction prediction acquisition unit of the document creation support apparatus obtains a feature amount of a target document; and
a step in which the reaction prediction acquisition unit obtains, based on the feature amount, a reaction prediction value that is a predicted value of a reaction to the target document.

An operation method of a document creation support apparatus, comprising:
a step in which a document input unit of the document creation support apparatus inputs a target document, which is the document whose creation is to be supported;
a step in which a reaction prediction acquisition unit of the document creation support apparatus obtains a feature amount of the target document;
a step in which the reaction prediction acquisition unit obtains, based on the feature amount, a reaction prediction value that is a predicted value of a reaction to the target document; and
a step in which a reaction prediction output unit of the document creation support apparatus outputs the reaction prediction value.

A computer program for causing a computer to function as the document creation support apparatus according to claim 1.

A computer program for causing a computer to function as the document creation support apparatus according to claim 6.