JP2017078978A

JP2017078978A - Natural language processor

Info

Publication number: JP2017078978A
Application number: JP2015207013A
Authority: JP
Inventors: 浩野武; Hiroshi Notake
Original assignee: Jetrun; Jetrun Tech Corp; JETRUN TECHNOLOGY CORP
Current assignee: Jetrun; Jetrun Tech Corp; JETRUN TECHNOLOGY CORP
Priority date: 2015-10-21
Filing date: 2015-10-21
Publication date: 2017-04-27
Anticipated expiration: 2035-10-21
Also published as: JP6722852B2

Abstract

PROBLEM TO BE SOLVED: To provide a natural language processor capable of converting document data, such as news articles, into expressions that are subjective and easily convey true intention as if a human type robot, a personified character, and the like had feelings in the human type robot, the personified character, and the like.SOLUTION: In a natural language processing part 20, 5W1H elements are determined by analyzing document data. In a scenario creation processing part 30, an adverbial usage calculation part 31 calculates adverbial usages corresponding to a prescribed adverbial usage calculation reference by referring to an adverbial usage DB 50. An adverbial usage replacement part 32 replaces prescribed 5W1H elements of the document data by adverbial usages calculated by an adverbial usage calculation part 31. A document output part 40 outputs new document data with the 5W1H elements replaced by the adverbial usage.SELECTED DRAWING: Figure 1

Description

本発明は、ヒト型のロボットや擬人化されたキャラクター等において、ニュース記事等の文書データの自然言語処理を実行する自然言語処理装置に関するものである。 The present invention relates to a natural language processing apparatus that executes natural language processing of document data such as a news article in a humanoid robot or anthropomorphic character.

近年、文書データに対する形態素解析や構文解析などの自然原語処理技術の向上に伴って、ヒト型のロボットや擬人化されたキャラクター等にニュース記事等の文書データの読み上げを代行させることが行われている。例えば、ソフトバンク社のＰｅｐｐｅｒ（登録商標）に代表されるように、人の要求に応じて、インターネット上からニュース記事等の文書データを収集して、それを読み上げることが注目されている。 In recent years, with the improvement of natural language processing technology such as morphological analysis and syntactic analysis for document data, human-type robots and anthropomorphic characters are used to read out document data such as news articles. Yes. For example, as represented by Pepper (registered trademark) of SOFTBANK CORP., It has been noticed that document data such as news articles is collected from the Internet and read out in response to a human request.

ところが、従来のヒト型のロボットや擬人化されたキャラクター等では、ニュース記事等の文書データを「何が」「どこで」「どうなった」などの定型フォーマットである「報道事の慣行」と呼ばれる５Ｗ１Ｈによって、単調かつ主観的でない読み上げが繰り返され、いかにも感情のない機械的な印象を受けるものであった。 However, in conventional humanoid robots and anthropomorphic characters, document data such as news articles are called “reporting practices”, which are standard formats such as “what”, “where” and “what” With 5W1H, reading was repeated monotonously and not subjectively, and received a mechanical impression with no emotion.

この点、過去の自然言語処理技術において、文書データをそのまま読み上げるのではなく、各種ルールに従って言語処理を行うことが開示されている（特許文献１〜４参照）。 In this regard, in past natural language processing techniques, it has been disclosed to perform language processing in accordance with various rules instead of reading out document data as it is (see Patent Documents 1 to 4).

特開２００８−１５２６３７号公報JP 2008-152637 A 特開２００７−３１０８２９号公報JP 2007-310829 A 特開２００２−１８３１１７号公報JP 2002-183117 A 特開平７−１７５８０８号公報JP-A-7-175808

しかしながら、いずれの自然言語処理技術においても、未だ感情のない機械的な印象を受けるものであった。 However, any natural language processing technology still has a mechanical impression without feelings.

本発明は、上述の技術的背景に鑑みてなされたものであって、ヒト型のロボットや擬人化されたキャラクター等において、ニュース記事等の文書データをあたかも感情を持っているかのような主観的で真意の伝わりやすい表現に変換することができる自然言語処理装置を提供することを目的とする。 The present invention has been made in view of the above-described technical background, and it is subjective in human-like robots, anthropomorphic characters, etc. as if the document data such as news articles have emotions. It is an object of the present invention to provide a natural language processing device that can be converted into an expression that is easy to convey with real intention.

本発明は、上記目的を達成するために、文書データが入力される文書入力部と、該文書入力部により入力された文書データを解析することにより、少なくとも何（Ｗｈａｔ）、誰（Ｗｈｏ）、何時（Ｗｈｅｎ）、何処（Ｗｈｅｒｅ）、何故（Ｗｈｙ）、どうした（Ｈｏｗ）の一ないし複数の５Ｗ１Ｈの要素の判定を行う自然言語処理部と、副詞的用法を算定するための副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法とが互いに関連付けられながら記憶された副詞的用法データベースと、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の副詞的用法算定基準に対応する副詞的用法を算定する副詞的用法算定部と、文書データの所定の５Ｗ１Ｈの要素を前記副詞的用法算定部により算定された副詞的用法に置換する副詞的用法置換部と、前記副詞的用法置換部により所定の５Ｗ１Ｈの要素が副詞的用法に置換された新たな文書データを出力する文書出力部とを備えることを特徴とする。
なお、本発明では、日本語の文法における副詞（主に用言（動詞・形容詞・形容動詞）を説明し、修飾語となる語）に加え、英語の文法における副詞のように、名詞、代名詞、句、節、あるいは文全体を説明し、修飾語となる品詞・句・節に係る用法を「副詞的用法」と定義することとする。 In order to achieve the above object, the present invention analyzes a document input unit to which document data is input and the document data input by the document input unit, so that at least what (What), who (Who), A natural language processing unit for determining one or more 5W1H elements of when, where, why, and why, and adverbial usage calculation for calculating adverbial usage The adverbial usage database stored while the criteria and the adverbial usage corresponding to the adverbial usage calculation criterion are associated with each other, and the document data on which the 5W1H element determination is performed by the natural language processing unit, By referring to the adverbial usage database, an adverbial usage calculation unit that calculates adverbial usage corresponding to a predetermined adverbial usage calculation standard, and the document data An adverbial usage replacement unit that replaces an element of 5W1H with an adverbial usage calculated by the adverbial usage calculation unit, and a new 5W1H element replaced with an adverbic usage by the adverbial usage replacement unit And a document output unit for outputting document data.
In addition, in the present invention, in addition to adverbs in Japanese grammar (mainly a predicate (a verb / adjective / adjective verb), a modifier), as well as adverbs in English grammar, nouns, pronouns The phrase, clause, or entire sentence is explained, and the usage related to the part-of-speech, phrase, or clause as a modifier is defined as “adverbial usage”.

これによれば、文書データの内容に基づいて副詞的用法を算定して、文書データの５Ｗ１Ｈの要素を該副詞的用法に置換するため、ヒト型のロボットや擬人化されたキャラクター等において、ニュース記事等の文書データをあたかも感情を持っているかのような主観的で真意の伝わりやすい表現に変換することができる。 According to this, in order to calculate the adverbial usage based on the contents of the document data and to replace the 5W1H element of the document data with the adverbial usage, in a humanoid robot, anthropomorphized character, etc. It is possible to convert document data such as articles into an expression that is easy to convey subjective and sincere feeling as if it has emotion.

また、前記副詞的用法データベースは、副詞的用法の置換対象となる５Ｗ１Ｈの要素が副詞的用法算定基準と関連付けられながら記憶され、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素とを算定し、前記副詞的用法置換部は、前記副詞的用法算定部により算定された副詞的用法と５Ｗ１Ｈの要素の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換するのが好ましい。 The adverbial usage database stores 5W1H elements to be substituted for adverbial usage while being associated with adverbial usage calculation criteria, and the adverbial usage calculation unit uses the natural language processing unit to store 5W1H elements. By referring to the adverbial usage database, the adverbial usage corresponding to a predetermined adverbial usage calculation standard and the 5W1H element to be replaced with the adverbial usage are obtained. And the adverbial usage replacement unit replaces the 5W1H element of the document data with the adverbial usage on the basis of the adverbial usage calculated by the adverbial usage calculation unit and the 5W1H element calculation result. Is preferred.

また、前記副詞的用法データベースは、文書データにおける副詞的用法の挿入位置が副詞的用法算定基準と関連付けられながら記憶され、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける副詞的用法の挿入位置とを算定し、前記副詞的用法置換部は、前記副詞的用法算定部により算定された副詞的用法と、５Ｗ１Ｈの要素と、副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける副詞的用法の挿入位置に挿入するのが好ましい。 Further, the adverbial usage database stores the adverbial usage insertion position in the document data while being associated with the adverbial usage calculation criteria, and the adverbial usage calculation unit determines the 5W1H element by the natural language processing unit. By referring to the adverbial usage database, the adverbial usage corresponding to the predetermined adverbial usage calculation standard, the 5W1H element to be replaced with the adverbial usage, and the document data The adverbial usage replacement unit calculates the adverbial usage calculated by the adverbial usage calculation unit, the element of 5W1H, and the calculation result of the insertion position of the adverbial usage. Based on this, it is preferable that the 5W1H element of the document data is replaced with the adverbial usage and then inserted into the insertion position of the adverbial usage in the document data.

また、前記副詞的用法データベースは、ユーザのパーソナリティに関する基準が副詞的用法算定基準と関連付けられながら記憶され、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、ユーザのパーソナリティに関する基準に応じて、所定の副詞的用法算定基準に対応する副詞的用法を算定してもよい。 Further, the adverbial usage database is stored with the criteria related to the user's personality being associated with the adverbial usage calculation criteria, and the adverbial usage calculation unit determines the 5W1H element by the natural language processing unit. By referring to the adverbial usage database for the document data, an adverbial usage corresponding to a predetermined adverbial usage calculation criterion may be calculated according to a criterion related to the personality of the user.

また、前記副詞的用法算定基準は対語に関するものであり、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の対語に関する副詞的用法算定基準に対応する副詞的用法を算定してもよい。 In addition, the adverbial usage calculation standard relates to a counter word, and the adverbial usage calculation unit refers to the adverbial usage database for the document data for which the natural language processing unit has determined the 5W1H element. Accordingly, an adverbial usage corresponding to the adverbial usage calculation standard for a predetermined counter-word may be calculated.

また、前記副詞的用法算定基準は接続パターンに関するものであり、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の接続パターンに関する副詞的用法算定基準に対応する副詞的用法を算定してもよい。 The adverbial usage calculation standard relates to a connection pattern, and the adverbial usage calculation unit refers to the adverbial usage database for the document data for which the natural language processing unit has determined the 5W1H element. By doing so, the adverbial usage corresponding to the adverbial usage calculation standard for the predetermined connection pattern may be calculated.

また、前記副詞的用法算定基準は誇張表現に関するものであり、前記副詞的用法算定部は、前記自然言語処理部により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法データベースを参照することにより、所定の誇張表現に関する副詞的用法算定基準に対応する副詞的用法を算定してもよい。 The adverbial usage calculation standard relates to exaggerated expressions, and the adverbial usage calculation unit refers to the adverbial usage database for the document data for which the natural language processing unit has determined the 5W1H element. By doing so, an adverbial usage corresponding to the adverbial usage calculation standard for a predetermined exaggerated expression may be calculated.

本発明によれば、文書データの内容に基づいて副詞的用法を算定して、文書データの５Ｗ１Ｈの要素を該副詞的用法に置換するため、ヒト型のロボットや擬人化されたキャラクター等において、ニュース記事等の文書データをあたかも感情を持っているかのような主観的で真意の伝わりやすい表現に変換することができる。 According to the present invention, in order to calculate the adverbial usage based on the content of the document data and replace the 5W1H element of the document data with the adverbial usage, in a humanoid robot, anthropomorphic character, etc. It is possible to convert document data such as news articles into a subjective and true expression that feels as if it has emotion.

本装置の概略構成を示す図である。It is a figure which shows schematic structure of this apparatus. 自然言語処理部による文書データの解析を説明する図である。It is a figure explaining analysis of document data by a natural language processing part. 実施例１に係る文書データの副詞的用法の置換を説明する図である。It is a figure explaining substitution of adverbial usage of document data concerning Example 1. FIG. 実施例１に係る副詞的用法算定基準の対応関係を示す図である。It is a figure which shows the correspondence of the adverbial usage calculation standard which concerns on Example 1. FIG. 実施例２に係る文書データの副詞的用法の置換を説明する図である。It is a figure explaining substitution of adverbial usage of document data concerning Example 2. FIG. 実施例２に係る副詞的用法算定基準の対応関係を示す図である。It is a figure which shows the correspondence of the adverbial usage calculation standard which concerns on Example 2. FIG. 実施例２に係るパーソナリティ基準の対応関係を示す図である。It is a figure which shows the correspondence of the personality reference | standard which concerns on Example 2. FIG. 実施例３に係る文書データの副詞的用法の置換を説明する図である。It is a figure explaining substitution of adverbial usage of document data concerning Example 3. 実施例３に係る副詞的用法算定基準の対応関係を示す図である。It is a figure which shows the correspondence of the adverbial usage calculation standard which concerns on Example 3. FIG. 実施例３に係る単語と単語グループの対応関係を示す図である。It is a figure which shows the correspondence of the word which concerns on Example 3, and a word group. 実施例４に係る文書データの副詞的用法の置換を説明する図である。It is a figure explaining substitution of the adverbial usage of the document data concerning Example 4. 実施例４に係る副詞的用法算定基準の対応関係を示す図である。It is a figure which shows the correspondence of the adverbial usage calculation standard which concerns on Example 4. FIG. 実施例５に係る文書データの副詞的用法の置換を説明する図である。It is a figure explaining substitution of adverbial usage of document data concerning Example 5. 実施例５に係る副詞的用法算定基準の対応関係を示す図である。It is a figure which shows the correspondence of the adverbial usage calculation standard which concerns on Example 5. FIG.

次に、本発明に係る自然言語処理装置（以下、本装置１という）の実施形態について図１を参照しつつ説明する。 Next, an embodiment of a natural language processing apparatus (hereinafter referred to as the present apparatus 1) according to the present invention will be described with reference to FIG.

本装置１は、図１に示すように、文書入力部１０と、文書入力部１０の出力側に設けられた自然言語処理部２０と、自然言語処理部２０の出力側に設けられたシナリオ生成処理部３０と、シナリオ生成処理部３０の出力側に設けられた文書出力部４０とを備える。 As shown in FIG. 1, the apparatus 1 includes a document input unit 10, a natural language processing unit 20 provided on the output side of the document input unit 10, and a scenario generation provided on the output side of the natural language processing unit 20. A processing unit 30 and a document output unit 40 provided on the output side of the scenario generation processing unit 30 are provided.

前記文書入力部１０は、インターネットから配信されたり、ユーザにより読み込まれたニュース記事等の文書データが入力される。 The document input unit 10 receives document data such as news articles distributed from the Internet or read by a user.

前記自然言語処理部２０は、文書入力部１０の出力側に設けられた形態素解析部２１と、形態素解析部２１の出力側に設けられた構文解析部２２と、構文解析部２２の出力側に設けられた５Ｗ１Ｈ判定部２３とを備える。 The natural language processing unit 20 includes a morphological analysis unit 21 provided on the output side of the document input unit 10, a syntax analysis unit 22 provided on the output side of the morpheme analysis unit 21, and an output side of the syntax analysis unit 22. And a 5W1H determination unit 23 provided.

前記形態素解析部２１は、文書入力部１０により入力された文書データの各文の形態素解析を行う。この形態素解析とは、対象言語の文法の知識（文法のルールの集まり）や辞書（品詞等の情報付きの単語リスト）を情報源として用い、自然言語で書かれた文を形態素と呼ばれる言語学上で意味を持つ最小の品詞単位に分割し、それぞれの品詞を判別する作業を示す。例えば、図２に示すように、「１９７７年２月１７日の昼過ぎに久米島で観測史上初となる雪が記録された。」の文に対して形態素解析を行うと、「１９７７」「年」「２」「月」「１７」「日」・・・のように最小の品詞単位に分割する。 The morpheme analysis unit 21 performs morpheme analysis of each sentence of the document data input by the document input unit 10. This morphological analysis uses grammatical knowledge (collection of grammar rules) and a dictionary (a word list with information such as parts of speech) as information sources, and linguistics called morphemes are sentences written in natural language. The work of dividing each part of speech into the smallest parts of speech that have the meaning above is shown. For example, as shown in FIG. 2, when morphological analysis is performed on the sentence “The first snow in Kumejima was recorded in the afternoon of February 17, 1977.”, “1977” “Year” “2”, “Month”, “17”, “Day”...

前記構文解析部２２は、形態素解析部２１により形態素解析された文書データの各文の構文解析を行う。この構文解析とは、最小単位に分割した品詞を用いて、句と呼ばれる意味や目的を持った言葉のかたまりにする作業を示す。例えば、図２に示すように、上述のように形態素解析された文に対して構文解析を行うと、「『１９７７年２月１７日の』『昼過ぎに』」「久米島で」「『観測史上初と』『なる』『雪が』」「記録された」と意味や目的を持った言葉のかたまりにする。 The syntax analysis unit 22 performs a syntax analysis of each sentence of the document data subjected to the morphological analysis by the morpheme analysis unit 21. This parsing shows the work of making words that have meaning and purpose called phrases by using parts of speech divided into minimum units. For example, as shown in FIG. 2, when syntax analysis is performed on a sentence that has been morphologically analyzed as described above, ““ February 17, 1977 ”“ Afternoon ”” “In Kumejima” “ It is a group of words with meaning and purpose, such as “First in history”, “Naru”, “Yukiga”, and “Recorded”.

前記５Ｗ１Ｈ判定部２３は、構文解析部２２により解析された文書データの各文の５Ｗ１Ｈの判定を行う。この５Ｗ１Ｈの判定とは、各文において意味や目的をもった言葉のかたまりである各句に対して、何（Ｗｈａｔ）、誰（Ｗｈｏ）、何時（Ｗｈｅｎ）、何処（Ｗｈｅｒｅ）、何故（Ｗｈｙ）、どうした（Ｈｏｗ）の要素の判定を行う。本実施形態では、５Ｗ１Ｈとして一般的な５つの要素の判定を行うが、さらに「誰に（Ｗｈｏｍ）や「いくら（Ｈｏｗｍｕｃｈ）」「どれだけ（Ｈｏｗｍａｎｙ）」などの要素の判定も行ってもよい。例えば、図２に示すように、上述の構文解析された文に対して５Ｗ１Ｈ判定を行うと、ＷＨＥＮ「１９７７年２月１７日の昼過ぎに」、ＷＨＥＲＥ「久米島で」、ＷＨＡＴ「観測史上初となる雪が」、ＨＯＷ「記録された」と判定する。 The 5W1H determination unit 23 performs 5W1H determination on each sentence of the document data analyzed by the syntax analysis unit 22. This determination of 5W1H means what (What), who (Who), what (When), where (Where), why (Why) for each phrase that is a group of words having meaning and purpose in each sentence. ), What (how) element is determined. In the present embodiment, five general elements are determined as 5W1H, but elements such as “who”, “how much”, and “how many” are also determined. Also good. For example, as shown in FIG. 2, when a 5W1H determination is performed on the above-mentioned parsed sentence, WHEN “after noon on February 17, 1977”, WHERE “in Kumejima”, WHAT “first in observation history” It is determined that “the snow that becomes” is “recorded”.

前記シナリオ生成処理部３０は、５Ｗ１Ｈ判定部２３の出力側に設けられた副詞的用法算定部３１と、副詞的用法算定部３１の出力側に設けられた副詞的用法置換部３２とを備え、副詞的用法データベース（以下、副詞的用法ＤＢ５０という）が通信可能な状態で接続されている。 The scenario generation processing unit 30 includes an adverbial usage calculation unit 31 provided on the output side of the 5W1H determination unit 23, and an adverbial usage replacement unit 32 provided on the output side of the adverbial usage calculation unit 31. An adverbial usage database (hereinafter referred to as adverbial usage DB 50) is connected in a communicable state.

この副詞的用法ＤＢ５０は、副詞的用法を算定するための副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 This adverbial usage DB 50 includes adverbial usage calculation criteria for calculating adverbial usage, adverbial usage corresponding to the adverbial usage calculation criteria, and elements of 5W1H to be replaced with the adverbial usage, The insertion position of the adverbial usage in the document data is stored while being associated with each other.

前記副詞的用法算定部３１は、前記５Ｗ１Ｈ判定部２３により５Ｗ１Ｈの要素の判定が行われた文書データについて、前記副詞的用法ＤＢ５０を参照することにより、所定の副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とを算定する。具体的な副詞的用法の算定方法については、実施例１〜実施例５にて具体的に説明する。 The adverbial usage calculation unit 31 refers to the adverbial usage DB 50 with respect to the document data for which the 5W1H determination unit 23 has determined the 5W1H element, and thereby corresponds to a predetermined adverbial usage calculation standard. The usage, the 5W1H element to be replaced with the adverb usage, and the insertion position of the adverb usage in the document data are calculated. Specific methods for calculating adverbial usage will be specifically described in Examples 1 to 5.

なお、本発明では、日本語の文法における副詞（主に用言（動詞・形容詞・形容動詞）を説明し、修飾語となる語）に加え、英語の文法における副詞のように、名詞、代名詞、句、節、あるいは文全体を説明し、修飾語となる品詞・句・節に係る用法を「副詞的用法」と定義することとする。 In addition, in the present invention, in addition to adverbs in Japanese grammar (mainly a predicate (a verb / adjective / adjective verb), a modifier), as well as adverbs in English grammar, nouns, pronouns The phrase, clause, or entire sentence is explained, and the usage related to the part-of-speech, phrase, or clause as a modifier is defined as “adverbial usage”.

前記副詞的用法置換部３２は、前記副詞的用法算定部３１により算定された副詞的用法と、５Ｗ１Ｈの要素と、副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける副詞的用法の挿入位置に挿入する。副詞的用法の置換方法については、実施例１〜実施例５にて具体的に説明する。 The adverbial usage replacement unit 32 calculates the 5W1H element of the document data based on the adverbial usage calculated by the adverbial usage calculation unit 31, the 5W1H element, and the calculation result of the insertion position of the adverbial usage. Is replaced with the adverbial usage and inserted at the insertion position of the adverbial usage in the document data. The replacement method of the adverbial usage will be specifically described in Examples 1 to 5.

前記文書出力部４０は、前記副詞的用法置換部３２により５Ｗ１Ｈの要素が副詞的用法に置換された新たな文書データを所定の形式で出力する。この新たな文書データの出力に際しては、例えばヒト型のロボットや仮想のキャラクターにより音声形式で出力することが挙げられる。 The document output unit 40 outputs, in a predetermined format, new document data in which the 5W1H element is replaced with the adverbial usage by the adverbial usage replacement unit 32. When outputting this new document data, for example, it is possible to output it in a voice format by a humanoid robot or a virtual character.

本実施例では、図３および図４を参照しながら、文書データの何時（Ｗｈｅｎ）の要素を副詞的用法に置換する場合について説明する。 In the present embodiment, a case where an element of “What” in document data is replaced with an adverbial usage will be described with reference to FIGS. 3 and 4.

前記副詞的用法ＤＢ５０は、図４（ａ）に示すように、副詞的用法を算定するための副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 As shown in FIG. 4 (a), the adverbial usage DB 50 includes an adverbial usage calculation standard for calculating an adverbial usage, an adverbial usage corresponding to the adverbial usage calculation standard, and the adverbial usage. The 5W1H element to be replaced and the insertion position of the adverbial usage in the document data are stored while being associated with each other.

この副詞的用法算定基準は、例えば「ＨＯＷに謝罪が含まれる」「ＨＯＷがポジティブな場合」「ＨＯＷがネガティブな場合」というように５Ｗ１Ｈの「ＨＯＷ」の要素に関する算定基準である。 This adverbial usage calculation standard is a calculation standard for the element of “HOW” of 5W1H, such as “when HOW contains an apology”, “when HOW is positive”, and “when HOW is negative”.

なお、これらの副詞的用法算定基準に対応する副詞的用法は、例えば謝罪、感謝、悲嘆などに関するものであり、図４（ｂ）に示すように、謝罪の場合には「『ご迷惑がかからないようにする。』と、」、感謝の場合は「有り難いことに、」、悲嘆の場合は「残念ですが、」の具体的な表現がそれぞれ対応している。なお、このような副詞的用法の具体的な表現は実施例２〜実施例５でも同様に対応するものである。 The adverbial usage corresponding to these adverbial usage calculation standards relates to, for example, apology, gratitude, grief, etc. As shown in FIG. ”,” “Thank you” for appreciation, “Unfortunately” for grief, and “I ’m sorry,” respectively. In addition, the concrete expression of such an adverbic usage respond | corresponds similarly in Example 2-5.

また、これらの副詞的用法の置換対象となるなる５Ｗ１Ｈの要素は、「ＷＨＥＮ」に設定されている。 Further, the element of 5W1H to be replaced with these adverbial usages is set to “WHEN”.

また、文書データにおける前記副詞的用法の挿入位置は、「文頭」に設定されている。 Also, the insertion position of the adverbial usage in the document data is set to “sentence”.

而して、文書入力部１０に図３に示す文書データＡ１が入力されると、自然言語処理部２０において、形態素解析部２１が、図３の文書データＡ２に示すように形態素解析を行い、構文解析部２２が図３の文書データＡ３に示すように構文解析を行ったあと、５Ｗ１Ｈ判定部２３が図３の文書データＡ４に示すように５Ｗ１Ｈの判定を行う。 Thus, when the document data A1 shown in FIG. 3 is input to the document input unit 10, in the natural language processing unit 20, the morpheme analysis unit 21 performs morphological analysis as shown in the document data A2 in FIG. After the syntax analysis unit 22 performs the syntax analysis as shown in the document data A3 in FIG. 3, the 5W1H determination unit 23 performs the determination of 5W1H as shown in the document data A4 in FIG.

次に、シナリオ生成処理部３０において、前記副詞的用法算定部３１が、図３の文書データＡ４について、前記副詞的用法ＤＢ５０を参照することにより、所定の副詞的用法算定基準に対応する副詞的用法を算定する。本実施例の文書データＡ４では、ＨＯＷに謝罪が含まれるため、その副詞的用法算定基準「ＨＯＷに謝罪が含まれる」に対応する副詞的用法（謝罪）を算定する。また、この副詞的用法（謝罪）の置換対象となる５Ｗ１Ｈの要素「ＷＨＥＮ」と挿入位置「文頭」も算定する。 Next, in the scenario generation processing unit 30, the adverbial usage calculation unit 31 refers to the adverbial usage DB 50 for the document data A4 in FIG. Calculate usage. In the document data A4 of the present embodiment, since an apology is included in the HOW, the adverbial usage (apology) corresponding to the adverbial usage calculation criterion “HOW includes an apology” is calculated. Also, the 5W1H element “WHEN” and the insertion position “sentence” to be replaced with this adverbial usage (apology) are calculated.

そして、前記副詞的用法置換部３２は、前記副詞的用法算定部３１により算定された副詞的用法と５Ｗ１Ｈの要素と副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける該副詞的用法の挿入位置に挿入する。本実施例の文書データＡ４では、図３の文書データＡ５に示すように、５Ｗ１Ｈの要素「ＷＨＥＮ」の「６月１日に」を副詞的用法（謝罪）の「「ご迷惑がかからないようにする。」と、」に置換した上で文頭に挿入する。 Then, the adverbial usage replacement unit 32 calculates the 5W1H element of the document data based on the adverbial usage calculated by the adverbial usage calculation unit 31, the 5W1H element, and the calculation result of the insertion position of the adverbial usage. Is replaced with the adverbial usage and then inserted at the insertion position of the adverbial usage in the document data. In the document data A4 of this embodiment, as shown in the document data A5 of FIG. 3, “June 1” of the element “WHEN” of 5W1H is used as an adverbial usage (apology) ““ Do not cause trouble. Replace with "and" and insert it at the beginning of the sentence.

あとは、前記文書出力部４０が、５Ｗ１Ｈの要素「ＷＨＥＮ」の「６月１日に」が副詞的用法（謝罪）の「「ご迷惑がかからないようにする。」と、」に置換された、図３に示す文書データＡ５を所定の形式で出力する。 After that, in the document output unit 40, the element “WHEN” of 5W1H “June 1” has been replaced with the adverbial usage (apology) “Make no inconvenience”. The document data A5 shown in FIG. 3 is output in a predetermined format.

本実施例では、図５〜図７を参照しながら、ユーザのパーソナリティ基準に応じて、文書データの５Ｗ１Ｈの要素を副詞的用法に置換する場合について説明する。 In the present embodiment, a case where the 5W1H element of the document data is replaced with an adverbial usage according to the personality criterion of the user will be described with reference to FIGS.

前記副詞的用法ＤＢ５０は、図６に示すように、副詞的用法を算定するための副詞的用法算定基準と、ユーザのパーソナリティに関する基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 As shown in FIG. 6, the adverbial usage DB 50 includes adverbial usage calculation criteria for calculating adverbial usage, criteria related to user personality, adverbial usage corresponding to the adverbial usage calculation criteria, The 5W1H element to be replaced by the adverbial usage and the insertion position of the adverbial usage in the document data are stored while being associated with each other.

このユーザのパーソナリティに関する基準とは、ユーザの思考傾向を示すものであり、例えば政治に関する思考が右寄りまたは左寄り、海外派兵に関する思考がタカ派またはハト派などの基準が定められている。また、ユーザのパーソナリティに関する基準は、ユーザの選択等によりあらかじめ定めてもよいし、文書データの内容に基づいて定めてもよい。特に文書データの内容に基づいてパーソナリティに関する基準を定める場合、図７に示すように、ユーザの思考傾向ごとに判別を設定しておいて、一方の思考傾向の場合にはプラス、他方の政治傾向の場合にはマイナスとして、思考傾向を数値化した上でユーザのパーソナリティに関する基準を定める方法が考えられる。 The user personality standards indicate the user's thinking tendency. For example, the political thoughts are determined to be right or left, and the overseas soldiers are considered hawks or pigeons. In addition, the criteria regarding the personality of the user may be determined in advance by the user's selection or the like, or may be determined based on the contents of the document data. In particular, when determining personality standards based on the contents of document data, as shown in FIG. 7, a discrimination is set for each user's thinking tendency, and in the case of one thinking tendency, plus, the other political tendency In this case, as a minus, it is possible to define a standard for the user's personality after quantifying the thinking tendency.

而して、文書入力部１０に図５に示す文書データＢ１が入力されると、自然言語処理部２０において、形態素解析部２１が図５の文書データＢ２に示すように形態素解析を行い、構文解析部２２が図５の文書データＢ３に示すように構文解析を行ったあと、５Ｗ１Ｈ判定部２３が図５の文書データＢ４に示すように５Ｗ１Ｈの判定を行う。 Thus, when the document data B1 shown in FIG. 5 is input to the document input unit 10, in the natural language processing unit 20, the morpheme analysis unit 21 performs morpheme analysis as shown in the document data B2 in FIG. After the analysis unit 22 performs syntax analysis as shown in the document data B3 in FIG. 5, the 5W1H determination unit 23 performs determination of 5W1H as shown in the document data B4 in FIG.

次に、シナリオ生成処理部３０において、前記副詞的用法算定部３１が、図５の文書データＢ４について、前記副詞的用法ＤＢ５０を参照することにより、ユーザのパーソナリティに関する基準に応じて、所定の副詞的用法算定基準に対応する副詞的用法を算定する。本実施例の文書データＢ４では、改憲や憲法改正等の記載が「ＷＨＡＴ」にあるため、ユーザのパーソナリティの基準が「政治が右寄り」の場合、そのユーザのパーソナリティに関する基準「政治が右寄り」に応じて副詞的用法算定基準「改憲や憲法改正等の記載がＷＨＡＴにある」に対応する副詞的用法（悲願）を算定する。一方、そのユーザのパーソナリティが「政治が左寄り」の場合、そのユーザのパーソナリティに関する基準「政治が左寄り」に応じて副詞的用法算定基準「改憲や憲法改正等の記載がＷＨＡＴにある」に対応する副詞的用法（残念）を算定する。また、該副詞的用法（謝罪）の置換対象となる５Ｗ１Ｈの要素「ＷＨＥＮ」と挿入位置「文頭」も算定する。 Next, in the scenario generation processing unit 30, the adverbial usage calculation unit 31 refers to the adverbial usage DB 50 for the document data B4 in FIG. The adverbial usage corresponding to the standard usage calculation standard is calculated. In the document data B4 of this embodiment, since the description of the constitutional amendment or constitutional amendment is in “WHAT”, when the user's personality standard is “politics is right”, the user's personality standard is “politics is right” Accordingly, the adverbial usage (sorry) corresponding to the adverbial usage calculation standard “WHAT has descriptions of constitutional amendments and constitutional amendments” is calculated. On the other hand, if the user's personality is “politics is left”, it corresponds to the adverbial usage calculation standard “WHAT has a description of constitutional or constitutional amendments in WHAT” according to the standard “politics is left” regarding the user ’s personality Calculate adverbial usage (sorry). Also, the 5W1H element “WHEN” and the insertion position “sentence” to be replaced with the adverbial usage (apology) are calculated.

そして、前記副詞的用法置換部３２は、前記副詞的用法算定部３１による副詞的用法と５Ｗ１Ｈの要素と副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける該副詞的用法の挿入位置に挿入する。本実施例の文書データＢ４では、図５の文書データＢ５に示すように５Ｗ１Ｈの要素「ＷＨＥＮ」の「去る１５日に」を副詞的用法（悲願）の「待ちわびたことに、」または副詞的用法（残念）の「遺憾だが、」に置換した上で文頭に挿入する。 The adverbial usage replacement unit 32 then converts the 5W1H element of the document data into the adverb based on the adverbial usage by the adverbial usage calculation unit 31 and the calculation result of the 5W1H element and the insertion position of the adverbial usage. After replacing with the usage, it is inserted at the insertion position of the adverb usage in the document data. In the document data B4 of the present embodiment, as shown in the document data B5 in FIG. 5, the “Waiting for 15 days” of the element “WHEN” of the 5W1H is used as an adverbial usage (a long-awaited) or “adverbial” Replaced with usage (sorry) “regret but” and insert it at the beginning of the sentence.

あとは、前記文書出力部４０が、５Ｗ１Ｈの要素「ＷＨＥＮ」の「去る１５日に」が副詞的用法（悲願）の「待ちわびたことに、」または副詞的用法（残念）の「遺憾だが、」に置換された、図５に示す文書データＡ５を所定の形式で出力する。 After that, the document output unit 40 is that the 5W1H element “WHEN” “Last 15 days” is an adverbic usage (satisfied) “waiting” or an adverbic usage (sorry) “regret, The document data A5 shown in FIG. 5 is output in a predetermined format.

本実施例では、図８〜図１０を参照しながら、対語に関する副詞的用法算定基準に基づいて、文書データの５Ｗ１Ｈの要素を副詞的用法に置換する場合について説明する。 In this embodiment, the case where the 5W1H element of the document data is replaced with the adverbial usage based on the adverbial usage calculation standard regarding the counter word will be described with reference to FIGS.

前記副詞的用法ＤＢ５０は、図９に示すように、副詞的用法を算定するための対語に関する副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 As shown in FIG. 9, the adverbial usage DB 50 includes adverbial usage calculation standards for adversaries for calculating adverbial usage, adverbial usages corresponding to the adverbial usage calculation criteria, and adverbial usages. The 5W1H element to be replaced and the insertion position of the adverbial usage in the document data are stored while being associated with each other.

この対語に関する副詞的用法算定基準は、「暖かい」と「寒い」、「高い」と「低い」、「早い」と「遅い」などのように、意味が反対の対語に関する単語グループが互いに対応するともに、各対語に関する単語グループに対応して「寒暖」「高低」「遅早」等の識別が対応するように関連付けられている。また、この対語に関する単語グループについては、図１０に示すように、各単語グループに一ないし複数の単語が属するように設定されている。なお、本実施例では、対語に関する単語グループが互いに対応した副詞的用法算定基準としたが、対語に関する単語そのものが対応した副詞的用法算定基準であってもよい。 The adverbial usage criteria for this adversary correspond to groups of opposite words that have opposite meanings, such as “warm” and “cold”, “high” and “low”, “early” and “slow”. Both are associated so that identifications such as “warmth / warm”, “high / low”, “late”, etc. correspond to the word groups relating to the respective words. Further, as shown in FIG. 10, the word group related to this counter word is set so that one or a plurality of words belong to each word group. In this embodiment, the adverbial usage calculation standard corresponding to the word group related to the word is used, but the adverbial usage calculation standard corresponding to the word related to the word may be used.

而して、文書入力部１０に図８に示す文書データＣ１が入力されると、自然言語処理部２０において、形態素解析部２１が所定の形態素解析を行い、構文解析部２２が所定の構文解析を行ったあと、５Ｗ１Ｈ判定部２３が図８の文書データＣ２に示すように５Ｗ１Ｈの判定を行う。 Thus, when the document data C1 shown in FIG. 8 is input to the document input unit 10, in the natural language processing unit 20, the morpheme analysis unit 21 performs a predetermined morpheme analysis, and the syntax analysis unit 22 performs a predetermined syntax analysis. Then, the 5W1H determination unit 23 determines 5W1H as shown in the document data C2 in FIG.

次に、シナリオ生成処理部３０において、前記副詞的用法算定部３１が、図８の文書データＣ２について、前記副詞的用法ＤＢ５０を参照することにより、所定の対語に関する副詞的用法算定基準に対応する副詞的用法を算定する。本実施例の文書データＣ２では、単語グループ「暖かい」の属する単語「沖縄県」と、単語グループ「寒い」に属する単語「積雪」が５Ｗ１Ｈに含まれるため、その対語に関する副詞的用法算定基準（寒暖）に対応する副詞的用法（驚愕）を算定する。また、該副詞的用法（驚愕）の置換対象となる５Ｗ１Ｈの要素「ＷＨＥＮ」と挿入位置「文頭」も算定する。 Next, in the scenario generation processing unit 30, the adverbial usage calculation unit 31 refers to the adverbial usage DB 50 with respect to the document data C2 in FIG. Calculate adverbial usage. In the document data C2 of this embodiment, the word “Okinawa” to which the word group “warm” belongs and the word “snow snow” to the word group “cold” are included in 5W1H. Adverbial usage (startle) corresponding to (cold) is calculated. Also, the 5W1H element “WHEN” and the insertion position “sentence” to be replaced with the adverbial usage (amazing) are calculated.

そして、前記副詞的用法置換部３２は、前記副詞的用法算定部３１により副詞的用法と５Ｗ１Ｈの要素と副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける該副詞的用法の挿入位置に挿入する。本実施例の文書データＣ２では、図８の文書データＣ３に示すように、５Ｗ１Ｈの要素「ＷＨＥＮ」の「昨年に」を副詞的用法（驚愕）の「信じられないことに、」に置換した上で文頭に挿入する。 Then, the adverbial usage replacement unit 32 uses the adverbial usage calculation unit 31 to calculate the adverbial usage, the element of 5W1H, and the calculation result of the insertion position of the adverbial usage as the adverb. After replacing with the usage, it is inserted at the insertion position of the adverb usage in the document data. In the document data C2 of this embodiment, as shown in the document data C3 of FIG. 8, the “last year” of the 5W1H element “WHEN” is replaced with “unbelievable” of the adverbial usage (amazing). Insert at the beginning of the sentence above.

あとは、前記文書出力部４０が、５Ｗ１Ｈの要素「ＷＨＥＮ」の「昨年に」が副詞的用法（驚愕）の「信じられないことに、」に置換された、図８に示す文書データＣ３を所定の形式で出力する。 After that, the document output unit 40 replaces the document data C3 shown in FIG. 8 in which “last year” of the element “WHEN” of 5W1H is replaced with “unbelievable” of the adverbial usage (amazing). Output in a predetermined format.

なお、本実施例では、所定の単語について各単語グループがあらかじめ設定されるに際して、単語「苺」には単語グループ「赤い」といった連想する単語グループを設定してもよい。 In this embodiment, when each word group is set in advance for a predetermined word, an associated word group such as the word group “red” may be set for the word “苺”.

本実施例では、図１１および図１２を参照しながら、接続パターンに関する副詞的用法算定基準に基づいて、文書データの５Ｗ１Ｈの要素を副詞的用法に置換する場合について説明する。 In this embodiment, a case where the 5W1H element of the document data is replaced with the adverbial usage based on the adverbial usage calculation standard regarding the connection pattern will be described with reference to FIG. 11 and FIG.

前記副詞的用法ＤＢ５０は、図１２に示すように、副詞的用法を算定するための接続パターンに関する副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 As shown in FIG. 12, the adverbial usage DB 50 includes adverbial usage calculation standards related to connection patterns for calculating adverbial usage, adverbial usages corresponding to the adverbial usage calculation criteria, and adverbial usages. The 5W1H element to be replaced and the insertion position of the adverbial usage in the document data are stored while being associated with each other.

この接続パターンに関する副詞的用法算定基準は、仮定、逆接、順接等の接続種類と、「もし、Ａたら」「Ａだが、Ｂだ」「ＡだからＢ」（Ａ、Ｂには節・句・単語が入る）等の接続パターンとが互いに対応するように関連付けられている。 The adverbial usage calculation criteria for this connection pattern are as follows: connection type such as assumption, reverse connection, forward connection, etc., “If A, A”, “A, but B”, “A because B” (A and B are clauses / phrases) And a connection pattern such as a word is associated with each other so as to correspond to each other.

而して、文書入力部１０に図１１に示す文書データＤ１が入力されると、自然言語処理部２０において、形態素解析部２１が図１１の文書データＤ２に示すように形態素解析を行い、構文解析部２２が図１１の文書データＤ３に示すように構文解析を行ったあと、５Ｗ１Ｈ判定部２３が図１１の文書データＤ４に示すように５Ｗ１Ｈの判定を行う。 Thus, when the document data D1 shown in FIG. 11 is input to the document input unit 10, the morphological analysis unit 21 performs morpheme analysis in the natural language processing unit 20 as shown in the document data D2 in FIG. After the analysis unit 22 performs syntax analysis as shown in the document data D3 in FIG. 11, the 5W1H determination unit 23 performs determination of 5W1H as shown in the document data D4 in FIG.

次に、シナリオ生成処理部３０において、前記副詞的用法算定部３１が、図１１の文書データＤ４について、前記副詞的用法ＤＢ５０を参照することにより、所定の接続パターンに関する副詞的用法算定基準に対応する副詞的用法を算定する。本実施例の文書データＤ４では、文書データに接続パターン（仮定）の「もし、Ａたら」が含まれているため、その接続パターン（仮定）に関する副詞的用法算定基準に対応する副詞的用法（仮定）を算定する。また、該副詞的用法（驚愕）の置換対象となる５Ｗ１Ｈの要素「ＷＨＥＮ」と挿入位置「文頭」も算定する。 Next, in the scenario generation processing unit 30, the adverbial usage calculation unit 31 refers to the adverbial usage DB 50 with respect to the document data D4 in FIG. Calculate adverbial usage. In the document data D4 of this embodiment, since the connection data (assumed) “if, if A” is included in the document data, the adverbial usage corresponding to the adverbial usage calculation standard for the connection pattern (assumed) ( Assumption). Also, the 5W1H element “WHEN” and the insertion position “sentence” to be replaced with the adverbial usage (amazing) are calculated.

そして、前記副詞的用法置換部３２は、前記副詞的用法算定部３１により副詞的用法と５Ｗ１Ｈの要素と副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける該副詞的用法の挿入位置に挿入する。本実施例の文書データＤ４では、図１１の文書データＤ５に示すように、５Ｗ１Ｈの要素「ＷＨＥＮ」の「１９４１年１２月８日の」を副詞的用法（仮定）の「信じられないかもしれないが、」に置換した上で文頭に挿入する。このとき接続パターンに係る単語「もし」を省略してもよい。 Then, the adverbial usage replacement unit 32 uses the adverbial usage calculation unit 31 to calculate the adverbial usage, the element of 5W1H, and the calculation result of the insertion position of the adverbial usage as the adverb. After replacing with the usage, it is inserted at the insertion position of the adverb usage in the document data. In the document data D4 of the present embodiment, as shown in the document data D5 of FIG. 11, “December 8, 1941” of the element “WHEN” of 5W1H may be “unbelievable”. Insert it at the beginning of the sentence after replacing "". At this time, the word “if” related to the connection pattern may be omitted.

あとは、前記文書出力部４０が、５Ｗ１Ｈの要素「ＷＨＥＮ」の「１９４１年１２月８日の」が副詞的用法（仮定）の「信じられないかもしれないが、」に置換された、図１１に示す文書データＤ５を所定の形式で出力する。 After that, in the document output unit 40, the element “WHEN” of 5W1H “December 8, 1941” is replaced with the adverbial usage (assuming may be incredible). 11 is output in a predetermined format.

本実施例では、図１３および図１４を参照しながら、誇張表現に関する副詞的用法算定基準に基づいて、文書データの５Ｗ１Ｈの要素を副詞的用法に置換する場合について説明する。 In the present embodiment, a case where the 5W1H element of the document data is replaced with an adverbial usage based on the adverbial usage calculation standard regarding the exaggerated expression will be described with reference to FIGS. 13 and 14.

前記副詞的用法ＤＢ５０は、図１４に示すように、副詞的用法を算定するための誇張表現に関する副詞的用法算定基準と、該副詞的用法算定基準に対応する副詞的用法と、該副詞的用法の置換対象となる５Ｗ１Ｈの要素と、文書データにおける該副詞的用法の挿入位置とが互いに関連付けられながら記憶されている。 As shown in FIG. 14, the adverbial usage DB 50 includes adverbial usage calculation criteria for exaggerated expressions for calculating adverbial usage, adverbial usage corresponding to the adverbial usage calculation criteria, and adverbial usage. The 5W1H element to be replaced and the insertion position of the adverbial usage in the document data are stored while being associated with each other.

この誇張表現に関する副詞的用法算定基準は、簡易、唯一、絶対等の誇張表現と、「ＨＯＷに簡単」「ＷＨＡＴの前に世界で一つ」「ＨＯＷに絶対」等の誇張パターンとが互いに対応するように関連付けられている。 The adverbial usage calculation standard regarding this exaggeration corresponds to the exaggerated expressions such as simple, unique, absolute etc. and exaggerated patterns such as “easy to HOW”, “one in the world before WHAT”, “absolutely HOW” Associated to be.

而して、文書入力部１０に図１３に示す文書データＥ１が入力されると、自然言語処理部２０において、形態素解析部２１が図１３の文書データＥ２に示すように形態素解析を行い、構文解析部２２が図１３の文書データＥ３に示すように構文解析を行ったあと、５Ｗ１Ｈ判定部２３が図１３の文書データＥ４に示すように５Ｗ１Ｈの判定を行う。 Thus, when the document data E1 shown in FIG. 13 is input to the document input unit 10, in the natural language processing unit 20, the morpheme analysis unit 21 performs morpheme analysis as shown in the document data E2 in FIG. After the analysis unit 22 performs syntax analysis as shown in the document data E3 in FIG. 13, the 5W1H determination unit 23 performs determination of 5W1H as shown in the document data E4 in FIG.

次に、シナリオ生成処理部３０において、前記副詞的用法算定部３１が、図１３の文書データＥ４について、前記副詞的用法ＤＢ５０を参照することにより、誇張表現に関する副詞的用法算定基準に対応する副詞的用法を算定する。本実施例の文書データＥ４では、ＨＯＷに「簡単」の誇張表現が含まれているため、その誇張表現に関する副詞的用法算定基準（簡易）「ＨＯＷに簡単」に対応する副詞的用法（驚愕）「信じられないことに」を算定する。また、該副詞的用法（驚愕）の置換対象となる５Ｗ１Ｈの要素「ＷＨＥＮ」と挿入位置「文頭」も算定する。 Next, in the scenario generation processing unit 30, the adverbial usage calculation unit 31 refers to the adverbial usage DB 50 for the document data E4 in FIG. Calculate the usage. In the document data E4 of the present embodiment, the exaggeration of “easy” is included in the HOW, so the adverbial usage calculation standard (simple) relating to the exaggerated expression (simple) adverbial usage corresponding to “easy to HOW” (amazing) "Unbelievable" is calculated. Also, the 5W1H element “WHEN” and the insertion position “sentence” to be replaced with the adverbial usage (amazing) are calculated.

そして、前記副詞的用法置換部３２は、前記副詞的用法算定部３１により副詞的用法と５Ｗ１Ｈの要素と副詞的用法の挿入位置の算定結果に基づいて、文書データの該５Ｗ１Ｈの要素を該副詞的用法に置換した上で、文書データにおける該副詞的用法の挿入位置に挿入する。本実施例の文書データＥ４では、図１３の文書データＥ５に示すように、５Ｗ１Ｈの要素「ＷＨＥＮ」の「毎日」を副詞的用法（驚愕）の「信じられないことに、」に置換した上で文頭に挿入する。このとき５Ｗ１Ｈの要素「ＷＨＯ」の「リバウンダー」の語末に助詞「も」を追加するとよい。 Then, the adverbial usage replacement unit 32 uses the adverbial usage calculation unit 31 to calculate the adverbial usage, the element of 5W1H, and the calculation result of the insertion position of the adverbial usage as the adverb. After replacing with the usage, it is inserted at the insertion position of the adverb usage in the document data. In the document data E4 of the present embodiment, as shown in the document data E5 of FIG. 13, “everyday” of the element “WHEN” of 5W1H is replaced with “unbelievable” of the adverbial usage (amazing). Insert at the beginning of the sentence. At this time, the particle “mo” may be added to the end of “rebounder” of the element “WHO” of 5W1H.

あとは、前記文書出力部４０が、５Ｗ１Ｈの要素「ＷＨＥＮ」の「毎日」が副詞的用法（驚愕）の「信じられないことに、」に置換された、図１３に示す文書データＥ５を所定の形式で出力する。 After that, the document output unit 40 predetermines the document data E5 shown in FIG. 13 in which “everyday” of the element “WHEN” of 5W1H is replaced with “unbelievable” of the adverbial usage (amazing). Output in the form

なお、本実施形態では、副詞的用法算定部３１が所定の副詞的用法算定基準に対応する副詞的用法を算定するに際して、該副詞的用法の置換対象や挿入位置も併せて算定するものとしたが、それらを併せて算定しないものとしてもよい。この場合、副詞用法の置換対象の５Ｗ１Ｈの要素は例えば「ＷＨＥＮ」のみと予め設定したり、挿入位置は元の５Ｗ１Ｈの要素の位置等と予め設定することが考えられる。 In the present embodiment, when the adverbial usage calculation unit 31 calculates an adverbial usage corresponding to a predetermined adverbial usage calculation standard, the replacement target and insertion position of the adverbial usage are also calculated. However, they may not be calculated together. In this case, it is conceivable that the 5W1H element to be replaced by the adverb usage is preset as, for example, “WHEN” only, or the insertion position is preset as the position of the original 5W1H element.

また、文書データのうちの一の５Ｗ１Ｈの要素のみを副詞的用法に置換する場合について説明したが、複数の５Ｗ１Ｈの要素をそれぞれに対応した副詞的用法に置換してもよい。 Further, the case where only one 5W1H element of the document data is replaced with the adverbial usage has been described. However, a plurality of 5W1H elements may be replaced with the adverbial usage corresponding to each.

また、副詞的用法算定基準と副詞的用法は実施例１〜実施例５のものに限定されるものではなく、その他の副詞的用法算定基準と副詞的用法であってもよい。 Further, the adverbial usage calculation standard and the adverbial usage are not limited to those in the first to fifth embodiments, but may be other adverbial usage calculation standards and adverbial usage.

以上、図面を参照して本発明の実施形態を説明したが、本発明は、図示した実施形態のものに限定されない。図示された実施形態に対して、本発明と同一の範囲内において、あるいは均等の範囲内において、種々の修正や変形を加えることが可能である。 As mentioned above, although embodiment of this invention was described with reference to drawings, this invention is not limited to the thing of embodiment shown in figure. Various modifications and variations can be made to the illustrated embodiment within the same range or equivalent range as the present invention.

１０・・・文書入力部
２０・・・自然言語処理部
２１・・・形態素解析部
２２・・・構文解析部
２３・・・５Ｗ１Ｈ判定部
３０・・・シナリオ生成部
３１・・・副詞的用法算定部
３２・・・副詞的用法置換部
４０・・・文書出力部
５０・・・副詞的用法ＤＢ DESCRIPTION OF SYMBOLS 10 ... Document input part 20 ... Natural language processing part 21 ... Morphological analysis part 22 ... Syntax analysis part 23 ... 5W1H determination part 30 ... Scenario generation part 31 ... Adverbial usage Calculation unit 32 ... adverb usage replacement unit 40 ... document output unit 50 ... adverb usage DB

Claims

A document input unit for inputting document data;
By analyzing the document data input by the document input unit, at least one of what, who, what, where, why, and how A natural language processing unit for determining a plurality of 5W1H elements;
An adverbial usage database for storing adverbial usage calculation criteria for calculating adverbial usage and adverbial usages corresponding to the adverbial usage calculation criteria in association with each other;
Adverbial usage calculation for calculating adverbial usage corresponding to a predetermined adverbial usage calculation standard by referring to the adverbial usage database for the document data for which the 5W1H element has been determined by the natural language processing unit And
An adverbial usage replacement unit that replaces a predetermined 5W1H element of the document data with an adverbial usage calculated by the adverbial usage calculation unit;
A natural language processing apparatus comprising: a document output unit that outputs new document data in which predetermined 5W1H elements are replaced by adverbial usage by the adverbial usage replacement unit.

The adverbial usage database stores 5W1H elements to be substituted for adverbial usage while being associated with adverbial usage calculation criteria,
The adverbial usage calculation unit refers to the adverbial usage database for the document data for which the 5W1H element has been determined by the natural language processing unit, so that the adverbial usage calculation unit corresponds to a predetermined adverbial usage calculation standard. Calculate the usage and the 5W1H element to be replaced by the adverbial usage,
The adverbial usage replacement unit replaces the 5W1H element of the document data with the adverbial usage based on the adverbial usage calculated by the adverbial usage calculation unit and the calculation result of the 5W1H element. The natural language processing device described in 1.

In the adverbial usage database, the insertion position of the adverbial usage in the document data is stored while being associated with the adverbial usage calculation criterion,
The adverbial usage calculation unit refers to the adverbial usage database for the document data for which the 5W1H element has been determined by the natural language processing unit, so that the adverbial usage calculation unit corresponds to a predetermined adverbial usage calculation standard. Calculating the usage, the 5W1H element to be replaced by the adverbial usage, and the insertion position of the adverbial usage in the document data;
The adverbial usage replacement unit replaces the 5W1H element of the document data with the adverbial usage calculated by the adverbial usage calculation unit, the 5W1H element, and the calculation result of the insertion position of the adverbial usage. The natural language processing apparatus according to claim 1, wherein the natural language processing device is inserted into the insertion position of the adverbial usage in the document data after being replaced with the adverbial usage.

The adverbial usage database is stored with criteria relating to the user's personality being associated with adverbial usage criteria,
The adverbial usage calculation unit refers to the adverbial usage database with respect to the document data for which the 5W1H element has been determined by the natural language processing unit, so that a predetermined adverb The natural language processing apparatus according to any one of claims 1 to 3, wherein an adverbial usage corresponding to a standard usage calculation standard is calculated.

The adverbial usage calculation standard relates to the opposite word,
The adverbial usage calculation unit corresponds to the adverbial usage calculation standard for a predetermined opposite word by referring to the adverbial usage database for the document data for which the 5W1H element has been determined by the natural language processing unit. The natural language processing apparatus according to any one of claims 1 to 4, which calculates adverbial usage.

The adverbial usage calculation criteria relate to connection patterns;
The adverbial usage calculation unit corresponds to the adverbial usage calculation standard for a predetermined connection pattern by referring to the adverbial usage database for the document data for which the natural language processing unit has determined the 5W1H element. The natural language processing apparatus according to claim 1, wherein the adverbial usage is calculated.

The adverbial usage calculation standard relates to exaggerated expressions,
The adverbial usage calculation unit corresponds to the adverbial usage calculation standard for a predetermined exaggerated expression by referring to the adverbial usage database for the document data for which the natural language processing unit has determined the 5W1H element. The natural language processing apparatus according to claim 1, wherein the adverbial usage is calculated.