JPWO2015118683A1

JPWO2015118683A1 - Opinion collection device and system, and opinion collection method

Info

Publication number: JPWO2015118683A1
Application number: JP2015561135A
Authority: JP
Inventors: 芳樹丹羽; 直之神田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2014-02-10
Filing date: 2014-02-10
Publication date: 2017-03-23
Also published as: WO2015118683A1

Abstract

When collecting opinions on a topic, multiple values related to the topic are set so that the user can grasp the background of each opinion's speaking subject, that is, how the subject thinks about those values. A topic and the values related to it are input; opinions on the topic are retrieved; past opinions of each opinion's speaking subject are retrieved; the degree of importance that those past statements place on each value is calculated; and the retrieved opinions are displayed. Speaking subjects are mapped onto the display unit according to their degrees of importance for the multiple values, and each individual opinion on the topic is displayed linked to its speaking subject, so that each opinion can be read while grasping how much importance its speaking subject places on each value. The degree of importance of a value is calculated by finding the places in the speaking subject's past statements where expressions semantically similar to the value appear, and accumulating over the past statements the product of the expression's semantic similarity and the context factor at each occurrence.

Description

The present invention relates to an opinion collection device, an opinion collection system, and an opinion collection method.

When analyzing or making a decision on a topic, people generally try to reach the best decision by collecting a variety of opinions from both the pro and con positions and weighing the strengths and weaknesses of each. Technologies to support this effort have been developed in the past.
Patent Document 1 describes a technique for mapping the characteristic words and phrases of pro and con opinions according to their pro/con specificity and their topic specificity.

JP 2007-241901 A

When analyzing or making a decision on a topic, people generally try to reach the best decision by collecting a variety of opinions from both the pro and con positions and weighing the strengths and weaknesses of each. If, in doing so, one could also know the background of each opinion's speaking subject, namely how much importance that subject places on each of the (generally multiple) values involved in the topic, a more accurate analysis would be possible.
For example, for a topic asking whether animal experiments for developing new medicines are acceptable, those who emphasize medical progress often conclude that animal experiments are necessary, while those who emphasize animal welfare and animal bioethics often conclude that they should be prohibited. In this case, the topic arises because the two values, medical progress and animal bioethics, cannot both be satisfied.
Apart from the question of animal experiments, however, the common-sense view is that medical progress and protecting animal life are both important. Something that many people would, as a matter of common sense, consider important when asked about its merit independently of other values is called here a value (or value viewpoint).

Suppose that A, B, C, and D have all stated that animal experiments are necessary. If it were possible to estimate quantitatively, from their past statements and the like, how much importance each of them places on "medical progress" and on "animal bioethics" as independent values, separately from the topic of animal experiments, then the same opinion, "animal experiments are necessary," would be seen in a different light for each of them.
For example, if A places high importance on medical progress but has little interest in animal bioethics, the opinion can be viewed as being in line with A's background. If B, conversely, has a strong interest in animal bioethics, the opinion can be viewed as unexpected, departing from B's background. If C's past statements show low importance for both medical progress and animal ethics, C may simply have made an offhand remark. And if D places high importance on both values, the opinion may be a considered one, reached after weighing the two against each other.
Being able to estimate the importance placed on values quantitatively in this way is of great value for opinion analysis, but it has not been realized so far. One of the problems to be solved by the present invention is to provide a means of realizing it.
Although A to D are described here as individuals, in practice they may be organizations or media such as websites or magazines, so in the present invention they are collectively called speaking subjects.

In view of the above, an object of the present invention is to calculate, for a plurality of opinions on a given topic, the degree of importance (background and the like) that each speaking subject places on a plurality of values related to the topic, and to display or output it.

According to the first solution of the present invention, there is provided
an opinion collection device comprising:
a storage unit that holds in advance a plurality of document data items including document content, holds a plurality of opinion data items including document content and a speaking subject, and holds importance data by speaking subject and by value; and
an arithmetic unit,

wherein the arithmetic unit:
receives, as input from a terminal, a topic that defines what the documents to be collected are about, and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content, and stores a plurality of opinion data items including the document content and the speaking subject in the storage unit;
calculates, for each speaking subject included in the opinion data, the degree of importance for each of the values;
creates importance data by speaking subject and by value from the calculated degrees of importance, and stores it in the storage unit; and
causes a display unit to display, or an output unit to output, the importance data by speaking subject and by value.
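The sequence of operations performed by the arithmetic unit can be sketched roughly as follows. This is only an illustrative outline under stated assumptions: the record layout (`speaker` and `text` keys), the function name `collect_opinions`, the substring match used as a stand-in for document search, and the pluggable `importance_fn` are all hypothetical and are not specified by the claims.

```python
from collections import defaultdict


def collect_opinions(topic, values, documents, importance_fn):
    """Return (opinions, importance) for one topic.

    documents: list of dicts with 'speaker' and 'text' keys (assumed layout).
    importance_fn(texts, value) -> float is a placeholder for the
    importance calculation; its form is not fixed here.
    """
    # Search the document data for content related to the topic.
    # A plain substring match stands in for the real document search.
    opinions = [d for d in documents if topic in d["text"]]

    # Obtain the set of speaking subjects of the retrieved content.
    speakers = {d["speaker"] for d in opinions}

    # Group every stored text by speaking subject so that importance can be
    # computed from each subject's statements as a whole.
    texts_by_speaker = defaultdict(list)
    for d in documents:
        texts_by_speaker[d["speaker"]].append(d["text"])

    # Importance data by speaking subject and by value.
    importance = {
        s: {v: importance_fn(texts_by_speaker[s], v) for v in values}
        for s in speakers
    }
    return opinions, importance
```

Any scoring function with the same signature can be passed as `importance_fn`; a simple occurrence count is enough to exercise the data flow.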

According to the second solution of the present invention, there is provided
an opinion collection system comprising:
a terminal; and
an opinion collection device connected to the terminal via a communication network,

wherein the opinion collection device has:
a storage unit that holds in advance a plurality of document data items including document content, holds a plurality of opinion data items including document content and a speaking subject, and holds importance data by speaking subject and by value; and
an arithmetic unit,

and wherein the arithmetic unit:
receives, as input from the terminal, a topic that defines what the documents to be collected are about, and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content, and stores a plurality of opinion data items including the document content and the speaking subject in the storage unit;
calculates, for each speaking subject included in the opinion data, the degree of importance for each of the values;
creates importance data by speaking subject and by value from the calculated degrees of importance, and stores it in the storage unit; and
causes a display unit to display, or an output unit to output, the importance data by speaking subject and by value.

According to the third solution of the present invention, there is provided
an opinion collection method in an opinion collection device,
the opinion collection device comprising:
a storage unit that holds in advance a plurality of document data items including document content, holds a plurality of opinion data items including document content and a speaking subject, and holds importance data by speaking subject and by value; and
an arithmetic unit,

wherein the arithmetic unit:
receives, as input from a terminal, a topic that defines what the documents to be collected are about, and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content, and stores a plurality of opinion data items including the document content and the speaking subject in the storage unit;
calculates, for each speaking subject included in the opinion data, the degree of importance for each of the values;
creates importance data by speaking subject and by value from the calculated degrees of importance, and stores it in the storage unit; and
causes a display unit to display, or an output unit to output, the importance data by speaking subject and by value.

According to the present invention, for a plurality of opinions on a given topic, the degree of importance (background and the like) that each speaking subject places on a plurality of values related to the topic can be calculated and displayed or output.

A configuration diagram of an opinion collection system according to an embodiment of the present invention.
A diagram showing details of an example of the opinion browsing analysis support screen 251 displayed on the display unit of FIG. 1, and details of the opinion browsing analysis support work area 2212.
A diagram showing the structure of the value data stored in the value storage area of FIG. 2.
A diagram showing the structure of the opinion data stored in the opinion data storage area of FIG. 2.
A diagram showing the structure of the speaking subject data stored in the speaking subject data storage area of FIG. 2.
A diagram showing the structure of the speaking subject × value importance data stored in the speaking subject × value importance storage area of FIG. 2.
A configuration diagram of the opinion collection work area 1215 of the opinion collection device.
A configuration diagram of the opinion collection rules and data collection 1216 of the opinion collection device.
A diagram showing the sequence in which the opinion browsing analysis support management unit and the opinion collection management unit of FIG. 1 operate via the communication network.
A diagram explaining the procedure by which the opinion collection means 1211 collects opinions on a given topic in step F4 of FIG. 5.
A diagram showing an example of the pro/con expression data used in opinion collection.
A diagram showing an example of the reason/evidence expression data used in opinion collection.
A diagram showing an example of the negation expression data used in opinion collection.
A diagram showing an example of the statement expression data used in opinion collection.
A diagram explaining the procedure by which the value importance calculation means 1213, for a given speaking subject and a given value, collects the subject's past statements and calculates the importance of the value from the collected statements.
A diagram showing an example of the semantically similar phrase pair data used in importance calculation.
A diagram showing an example of the promotion/suppression phrase data used in importance calculation.
A diagram showing an example sentence stored in the sentence storage area during importance calculation, and the contents of the sentence structure storage area obtained by analyzing its syntactic structure.
A diagram showing the contents of the context factor calculation work area during importance calculation for the same example sentence as in FIG. 10.
A diagram showing an example of the main part factor data used in context factor calculation.
A diagram showing an example of the auxiliary part factor data used in context factor calculation.
A diagram showing an example of the modifier part factor data used in context factor calculation.
A screen example of the opinion browsing analysis support screen 251 provided with a time period setting section.
A screen example in which opinions about products and services are collected with price and performance as the values.
A screen example in which, for a diplomatically disputed claim, opinions of countries other than the parties are collected with their relations to the party countries (A and B) as the values.
A diagram showing an example of the document data 131 of FIG. 1.
A diagram showing an example of the search index data 132 of FIG. 1.
A diagram showing an example of the speaking subject data 133 of FIG. 1.
A diagram showing an example of the value data 134 of FIG. 1.
A diagram showing an example of the data created in the sentence structure storage area (FIG. 4A) as the value importance calculation means 1213 operates (FIG. 8).
A diagram showing an example of the sentence structure analysis dictionary 135 of FIG. 1.

A. Overview

A specific configuration example of the opinion collection system and opinion collection method according to this embodiment is described below.

The past statements of a speaking subject can be obtained by document search means. Therefore, given a means of calculating the importance of a given value from a given set of texts, the importance that the speaking subject places on that value can be calculated. In this embodiment, the means that takes a text set and a value as arguments and calculates the importance the text set places on the value is called the "importance calculation means."
When estimating the importance a text set places on a value, the wording given as the value may appear verbatim, but expressions that are worded differently yet are semantically close must also be taken into account. For medical progress, for example, "innovation in medical technology" and "development of new drugs" are semantically close expressions. The degree of semantic closeness between a value and another expression is called the semantic similarity.
Furthermore, even for the same expression, the degree to which it supports the value varies with the context in which it appears. If "medical progress" appears as XX in the context "XX is important," support for medical progress is high, whereas in a context such as "XX is not necessarily all good," support is considered low. The effect that context has on the degree of support is called the "context factor."
A representative example of this embodiment is as follows. The opinion collection system has an input unit for inputting a topic and values related to the topic, opinion collection means for retrieving opinions on the topic, past statement search means for retrieving past opinions of each opinion's speaking subject, calculation means for calculating the importance that the past statements place on the values, and a display unit for displaying the retrieved opinions.
In this embodiment, speaking subjects may also be mapped onto the display unit based on the multiple values, with each individual statement on the topic linked to its speaking subject and displayed in a form that shows whether it is for or against the topic.
The importance calculation means may detect, in past statements, expressions semantically similar to the value, look up the semantic similarity between each detected expression and the value, calculate the context factor of the context in which the expression appears, and calculate the importance by summing a value that combines the two (such as their product).
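The importance calculation outlined above, summing the product of semantic similarity and context factor over every occurrence, can be illustrated with a minimal sketch. The similarity table, the numeric context factors, and the substring matching below are illustrative assumptions only; this overview does not give concrete values, and the embodiment derives the context factor from sentence structure rather than from fixed phrases.

```python
# Hypothetical semantic-similarity table for the value "medical progress".
SIMILAR = {
    "medical progress": 1.0,
    "innovation in medical technology": 0.8,
    "development of new drugs": 0.7,
}


def context_factor(sentence):
    """Toy context factor: a hedged or negated context lowers support.

    The numeric values are made up for illustration.
    """
    if "not necessarily" in sentence or "not always" in sentence:
        return 0.2
    if "is important" in sentence:
        return 1.0
    return 0.5


def importance(sentences, similar=SIMILAR, ctx=context_factor):
    """Accumulate similarity * context factor over all occurrences."""
    total = 0.0
    for sentence in sentences:
        for expression, similarity in similar.items():
            if expression in sentence:
                total += similarity * ctx(sentence)
    return total


past_statements = [
    "medical progress is important",
    "development of new drugs is not necessarily all good",
    "the weather is nice",
]
score = importance(past_statements)  # 1.0 * 1.0 + 0.7 * 0.2
```

In this toy run the first sentence contributes 1.0 × 1.0, the second contributes 0.7 × 0.2, and the third contains no similar expression and contributes nothing.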

B. Embodiment

1. System and apparatus

An opinion collection system according to a first embodiment of the present invention is described. First, the basic configuration of this embodiment is described with reference to FIG. 1.

FIG. 1 is a configuration diagram of an opinion collection system 1000 according to an embodiment of the present invention. The opinion collection system 1000 includes an opinion collection device 100 and an opinion browsing analysis support terminal 200, which are connected by a communication network 300. The opinion collection device 100 and the opinion browsing analysis support terminal 200 may also be integrated. Printing means 400 such as a printer is connected to the communication network 300 or to the opinion browsing analysis support terminal 200. The opinion collection system 1000 is also connected as appropriate, via the communication network 300, to other terminals and servers of related departments, or to terminals and servers of related departments of external organizations.
The opinion collection device 100 can be configured as a computer including an arithmetic unit (CPU) 110, a main storage unit 120, an auxiliary storage unit (databases) 130, an input unit 140, a display unit 150, and a communication unit 160. Each of the means described below is realized by the arithmetic unit 110 executing the various programs stored in the main storage unit 120. That is, the arithmetic unit 110 controls the operation of the opinion collection device 100 by executing the programs stored in the main storage unit 120.

The main storage unit 120 stores an opinion collection management unit 121, which is a program implementing the opinion collection function provided by the opinion collection device 100. The opinion collection management unit 121 includes, as components, opinion collection means 1211 for a topic, past statement collection means 1212 for a speaking subject, value importance calculation means 1213, and context factor calculation means 1214. The main storage unit 120 also has an opinion collection work area 1215 that temporarily holds data while these means execute, and an opinion collection rules and data collection 1216, which is a collection of the various rules and data referred to at execution time. The operation of the opinion collection management unit is described in detail later.
The main storage unit 120 also stores the following means, which are called when the opinion collection management unit 121 executes: document search means 122, document entity acquisition means 123, sentence division means 124, sentence structure analysis means 125, word segmentation and part-of-speech tagging means 126, name identification means 127, named entity extraction means 128, and so on.
Since these means 121 to 128 are known techniques, a detailed description is omitted. Known methods may be used for the document search means 122 and the word segmentation and part-of-speech tagging means 126. The name identification means 127 is a technique that, when the same target (a person, place name, book title, etc.) has multiple names because of subtle differences in notation, reduces them to a single name. Known methods can also be used for the named entity extraction means 128. For the sentence division means 124, for example, one method is to apply the word segmentation and part-of-speech tagging means to the text and then split at the points recognized as sentence ends (such as full stops). A method is also described in Non-Patent Document 1. Known methods can also be used for the sentence structure analysis means 125.
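As a rough illustration of two of the helper means mentioned above, the sketch below splits text at sentence-final punctuation and reduces name variants to a single canonical name. It is a simplification under stated assumptions: it skips the word segmentation and part-of-speech step that the description applies before sentence division, so abbreviations and decimal points will be split incorrectly, and the alias table and the names in it are hypothetical.

```python
import re

# A run of non-terminator characters followed by any sentence-final marks
# (Japanese and Western full stops, exclamation and question marks).
_SENTENCE = re.compile(r"[^。．.!?！？]+[。．.!?！？]*")


def split_sentences(text):
    """Split text into sentences at sentence-final punctuation."""
    return [s.strip() for s in _SENTENCE.findall(text) if s.strip()]


# Hypothetical alias table: notation variants mapped to one canonical name.
ALIASES = {
    "A. Smith": "Alice Smith",
    "Smith, Alice": "Alice Smith",
}


def canonical_name(name):
    """Return the canonical name for a variant, or the name itself."""
    cleaned = name.strip()
    return ALIASES.get(cleaned, cleaned)
```

A production system would more likely rely on a morphological analyzer to decide where sentences end, as the description suggests.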

The auxiliary storage unit (database) 130 is configured by a hard disk or the like and stores the data and the knowledge databases, such as dictionaries, needed to execute each means of the opinion collection management unit 121: document data 131, search index data 132, speaking subject data 133, value data 134, a sentence structure analysis dictionary 135, and so on. The document data 131 is data obtained by digitizing a group of documents recording past opinions together with bibliographic data such as the speaker, date, time, and place. An example of the document data 131 is shown in FIG. 15A. The search index data 132 is index data used to search the document data 131 at high speed. Part of these data may be stored in a database of an external information processing device connected to the opinion collection device 100 via the communication network 300. An example of the search index data 132 is shown in FIG. 15B. An example of the speaking subject data 133 is shown in FIG. 15C, and an example of the value data 134 in FIG. 15D. An example of the sentence structure analysis dictionary 135 is shown in FIG. 17. These figures are described later.
The input unit 140 is a device, such as a mouse or keyboard, that receives operation input from a user. The display unit 150 displays the screens used when a system administrator or user operates the opinion collection device 100. The communication unit 160 communicates with the opinion browsing analysis support terminal 200 via the communication network 300, and transmits the opinion data, speaking subject data, and speaking subject × value importance data described later to the opinion browsing analysis support terminal 200.
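The relationship between the document data 131 and the search index data 132 can be pictured with a small inverted index. This is only a sketch of one common way to make document search fast; the actual layout of the index in FIG. 15B is not reproduced here, and the record fields and function names are assumptions.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase word to the set of ids of documents containing it.

    documents: {doc_id: {"text": ...}} (assumed layout; bibliographic fields
    such as speaker or date may be present but are ignored here).
    """
    index = defaultdict(set)
    for doc_id, doc in documents.items():
        for word in doc["text"].lower().split():
            index[word.strip(".,!?\"'")].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result
```

Whitespace tokenization is used only to keep the sketch short; Japanese text would first pass through the word segmentation means 126.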

The opinion browsing analysis support terminal 200 includes an arithmetic unit 210, a main storage unit 220, an auxiliary storage unit 230, an input unit 240, a display unit 250, and a communication unit 260. The arithmetic unit 210 controls the operation of the opinion browsing analysis support terminal 200 by executing the programs stored in the main storage unit 220. The main storage unit 220 stores an opinion browsing analysis support management unit 221 that executes the opinion browsing analysis support function provided by the terminal. The management unit includes opinion browsing analysis support means 2211, which is a program implementing that function, and a work area 2212 that stores the data generated at execution time.
The opinion browsing analysis support management unit 221 displays an opinion browsing analysis support screen 251 on the display unit 250 using the opinion data, speaking subject data, and speaking subject × value importance data (described later) received from the opinion collection device 100. The user uses the opinion browsing analysis support screen 251 to browse and analyze opinions.
The input unit 240 is a device, such as a mouse or keyboard, that receives operation input from a user. The communication unit 260 communicates with the opinion collection device 100 via the communication network 300.

FIG. 2 shows an example of the opinion browsing analysis support screen 251 displayed on the display unit 250 of the opinion browsing analysis support terminal 200 of FIG. 1. The arithmetic unit 210 of the terminal provides the opinion browsing analysis support screen 251 on the display unit 250 by executing the opinion browsing analysis support means 2211.
The opinion browsing analysis support screen 251 includes a topic setting section 2511, a value setting section 2512, an option setting section 2513, an opinion collection instruction section 2514, an opinion list display section 2515, and a detail display section 2516 for an individual opinion, which is displayed when, for example, the mouse cursor is placed over that opinion. The affirmative/negative selection field to the right of the topic setting section is used to turn on a switch that reverses the pro/con polarity of opinions when "negative" is selected. For example, to make the topic "prohibit animal experiments," select negative.
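The effect of the affirmative/negative switch can be shown with a very small sketch: an opinion that is pro for "animal experiments" counts as con once the topic is set to its negative form, "prohibit animal experiments." The function name and the string labels `"pro"` and `"con"` are hypothetical.

```python
def effective_stance(stance, negate_topic):
    """Flip 'pro' and 'con' when the topic is set to its negative form.

    Any other label (for example an undetermined stance) is returned as is.
    """
    if not negate_topic:
        return stance
    return {"pro": "con", "con": "pro"}.get(stance, stance)
```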

The opinion browsing analysis support means 2211 stores the topic entered in the topic setting unit 2511 in the topic storage area 2212001 in the main storage unit 220, and stores the values set in the value setting unit 2512 in the value storage area 2212002.
When an instruction to execute opinion collection is given from the opinion collection instruction unit 2514, both the topic and the values stored in the work area 2212 are transmitted to the opinion collection device 100. The terminal first receives from the opinion collection device 100 the opinion collection result (described in detail later with reference to FIG. 3B) and data on the list of speech subjects of the opinions, and stores them in the opinion data storage area 2212003 and the speech subject data storage area 2212004 of the work area 2212. It then receives the speech subject × value-oriented importance data (described in detail later with reference to FIG. 3D) from the opinion collection device 100 and stores it in the speech subject × value-oriented importance data storage area 2212005. The opinion browsing analysis support means 2211 displays the results in the opinion list display unit 2515 based on the opinion data, the speech subject data, and the speech subject × value-oriented importance data held in the work area 2212.

In this figure, "the pros and cons of animal experiments" is chosen as the topic, and "medical progress" and "animal life" as the related values. The opinion list display unit 2515 takes the importance of medical progress on the horizontal axis and the importance of animal life on the vertical axis, places each speaker (here A, B, C, and D) at the ordinate and abscissa determined by that speaker's importance values, and displays each speaker's opinions on the topic together with their stance (in this example, ○ for a supporting opinion and × for an opposing opinion).
Although omitted from this figure, when three or more values are set, a setting unit for specifying which of them to use for the vertical and horizontal axes is also displayed. When no such selection is given, it is considered preferable to choose two values that are as independent of each other as possible. One preferred method is therefore to select the pair that maximizes an independence index, such as a chi-square test between the distributions of importance across speech subjects.

A candidate presentation button is drawn to the right of the value setting unit 2512, so this value recommendation function is described here. This embodiment assumes that values are set based on the opinion collector's own insight, but in some cases a recommendation from the system may be wanted, and the function serves such cases. One example of an implementation is as follows.
Documents on the topic are retrieved. For every value registered in the value data 134, the frequencies with which all words judged to be semantically similar to that value (see the explanation of FIG. 9A below) appear in the retrieved documents are counted and summed. The values are sorted in descending order of this total frequency count, and a preset number (for example, five) are selected from the top. Treating the selected values as if they had been set, the speech subject × value-oriented importance data is created. Following the automatic pair selection method used when three or more values are set, pairs of values are listed in descending order of independence, and the top-ranked pairs are recommended as candidates.
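The frequency-ranking stage of the value recommendation described above might be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `recommend_values`, the representation of documents as token lists, and the `similar_words` mapping are all assumptions, and the later stage that ranks value pairs by independence is omitted.

```python
from collections import Counter

def recommend_values(documents, similar_words, top_k=5):
    """Rank registered values by the total frequency of their semantically
    similar words in the retrieved documents and return the top_k values.

    documents:     list of token lists (hypothetical tokenized documents)
    similar_words: {value: [word, ...]} (hypothetical similarity lexicon)
    """
    totals = Counter()
    for value, words in similar_words.items():
        # Sum the occurrences of every similar word over all documents.
        totals[value] = sum(doc.count(w) for doc in documents for w in words)
    # Descending total frequency; ties are broken by name so the result is deterministic.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return [value for value, _ in ranked[:top_k]]
```

The tie-breaking rule is an arbitrary choice made here for reproducibility; the text does not specify one.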

FIG. 3A is a diagram showing the structure of the value data stored in the value storage area 2212002 of FIG. 2. The value data includes a local number for distinguishing different values (only 1 and 2 in the example of FIG. 2), the identifier of the value, and its content. For a value already registered in the value data 134, the registered identifier is entered as the value identifier.
FIG. 15D is a diagram showing the data structure of the value data 134. The value data includes a value identifier and the content of the corresponding value. In this example, "medical progress" is already registered as value No. 086.

FIG. 3B is a diagram showing the structure of the opinion data stored in the opinion data storage area 2212003 of FIG. 2. In addition to the content of the opinion, the data includes an item indicating whether the content approves of or supports the topic (+1) or opposes or rejects it (-1), a relevance score indicating the strength of its relevance to the topic, a reason/evidence score indicating whether a reason or evidence for the opinion is given, the ID of the document in which the opinion is expressed, the title of that document, and the speech subject identifier.

FIG. 3C is a diagram showing the structure of the data on speech subjects stored in the speech subject data storage area of FIG. 2. In addition to a local number used only to number the speech subjects, the data consists of the speech subject identifier registered in the speech subject data 133, a name, an affiliated organization (or parent organization) identifier, and the like.
FIG. 15C is a diagram showing the data structure of the speech subject data 133. The speech subject data consists of a speech subject identifier and the corresponding speech subject name, aliases, an affiliated organization (or parent organization) identifier, and the like. The name is required; the other items are optional.

FIG. 3D is a diagram showing the structure of the speech subject × value-oriented importance data stored in the speech subject × value-oriented importance data storage area 2212005 of FIG. 2. Here, "×" denotes a matrix. The data is in table form: for each pair of a local number of a speech subject of the collected opinions (see FIG. 3C) and a local number of a value stored in the value storage area (see FIG. 3A), it records the importance that the speech subject places on that value.

FIG. 4A is a configuration diagram of the opinion collection work area 1215 of the opinion collection device 100. This work area consists of a topic storage area 1215001, a value storage area 1215002, an opinion data storage area 1215003, a speech subject data storage area 1215004, and a speech subject × value-oriented importance storage area 1215005 (these correspond to the data storage areas 2212001 to 2212005 of the terminal-side work area), together with a search condition storage area 1215010, a search result storage area 1215011, a document entity storage area 1215012, a bibliographic information storage area 1215013, a sentence storage area 1215020, a value sentence structure storage area 1215021, a value inversion flag storage area 1215022, a sentence structure storage area 1215023, a context factor calculation work area 1215024, and the like.

FIG. 4B is a configuration diagram of the opinion collection rule/data collection 1216 of the opinion collection device 100. It consists of approval/disapproval expression data 1216001, reason/evidence expression data 1216002, negative expression data 1216003, speech expression data 1216004, semantically similar phrase pair data 1216011, promotion/suppression phrase data 1216012, main part factor data 1216021, auxiliary part factor data 1216022, modifier part factor data 1216023, and the like.

2. Processing

FIG. 5 is a diagram showing the sequence in which the opinion browsing analysis support management unit 221 and the opinion collection management unit 121 of FIG. 1 operate via the communication network 300. Each step of FIG. 5 (steps F1 to F10B) is described below.

(Steps F1 to F3):
The opinion browsing analysis support terminal 200 starts the opinion browsing analysis support screen 251. After the topic and the (multiple) values have been set and the options specified through the user's input operations, the terminal receives an opinion collection instruction and transmits an opinion collection execution request, consisting of the topic and values together with the options, to the opinion collection management unit 121 of the opinion collection device 100.

(Steps F4 to F6):
The opinion collection means 1211 of the opinion collection device 100 collects, from the document data 131, opinions on the topic that was set at and transmitted from the opinion browsing analysis support terminal 200, and obtains the opinion data (D1) whose structure is exemplified in FIG. 3B (details are described later). In step F4, the opinion collection device 100 stores this opinion data in the opinion data storage area 1215003 on the opinion collection device 100 side. In step F5, the opinion collection device 100 transmits the opinion data to the terminal side, and the opinion browsing analysis support terminal 200 stores it in the terminal-side opinion data storage area 2212003.

(Steps F7 to F10):
The opinion collection device 100 further gathers the speech subjects contained in the opinion data (D1) to create the speech subject data (D2). For each speech subject in the opinion data (D1) and each value that was set, it calculates the importance the speech subject places on the value, using the speech subject past statement collection means 1212 and the value importance calculation means 1213, and compiles the resulting importance values into a table to create the speech subject × value-oriented importance data (D3). The opinion collection device 100 stores the obtained data D2 and D3 in the speech subject data storage area 1215004 and the speech subject × value-oriented importance data storage area 1215005 on the opinion collection device 100 side, respectively. In step F9, the opinion collection device 100 sends D2 and D3 to the terminal side, and the opinion browsing analysis support terminal 200 stores D2 in the terminal-side speech subject data storage area 2212004 and D3 in the terminal-side speech subject × value-oriented importance data storage area 2212005.

(Steps F11A and F11B):
When the result display option is normal display, the opinion browsing analysis support terminal 200 displays the results based on the opinion data (D1). When the result display option is the display that reflects the speech subjects' value importance, the terminal uses the speech subject × value-oriented importance data (D3) to place each speech subject at the coordinate position corresponding to its value importance, and displays that speech subject's opinions at that position.
When the values are "medical progress" and "animal life", the opinion browsing analysis support terminal 200 takes the importance of medical progress on the horizontal axis and the importance of animal life on the vertical axis, places each speaker at the ordinate and abscissa determined by the speaker's importance for each value, and displays the speaker's opinions at that position, marking supporting opinions with ○ and opposing opinions with ×.

FIG. 6 is a diagram explaining the procedure by which the opinion collection means 1211 collects opinions on a given topic in step F4 of FIG. 5.
In process 1211001, the opinion collection means 1211 creates a search condition Q for the given topic P. For example, Q is obtained by applying the word segmentation/part-of-speech tagging means 126 to the topic P, removing function words such as particles and auxiliary verbs, and taking the OR combination of the resulting list of content words. Because a simple OR combination also matches cases where the content words appear far apart from one another, it is also a good approach to add a condition limiting the distance between occurrence positions.
In process 1211002, the opinion collection means 1211 passes the search condition Q to the document search means 122. The document search means executes the search using the search index data 132, obtains a list of document IDs satisfying the search condition Q, and stores it in the search result storage area 1215011.
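The construction of the search condition Q in process 1211001 might look roughly like the sketch below. It assumes the topic has already been segmented and tagged; the tag names in `FUNCTION_POS`, the function name `build_query`, and the dictionary representation of Q are hypothetical choices made for illustration only.

```python
# Part-of-speech tags treated as function words (assumed tag names).
FUNCTION_POS = {"particle", "auxiliary_verb"}

def build_query(tagged_topic, max_distance=None):
    """Build an OR query from the content words of a tagged topic.

    tagged_topic: list of (word, pos) pairs, as a tagger might return them
    max_distance: optional limit on the distance between occurrence positions
    """
    content_words = []
    for word, pos in tagged_topic:
        # Drop function words and keep each content word once, in order.
        if pos not in FUNCTION_POS and word not in content_words:
            content_words.append(word)
    query = {"op": "OR", "terms": content_words}
    if max_distance is not None:
        query["max_distance"] = max_distance
    return query
```

For the topic 「動物実験の是非」, a tagger that yields 動物実験 (noun), の (particle), 是非 (noun) would give the terms 動物実験 and 是非 under these assumptions.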

FIG. 15B is a diagram showing the data structure of the search index data 132. The search index data 132 includes document IDs and lists of index terms (index data); for each index term, the number of occurrences in the document, a list of occurrence positions, and the like are recorded. The search index data 132 is created in advance by extracting index terms from the document data 131. In addition to index terms appearing in the body text, the data also includes an index on speech subjects (for example, "speech subject = W024"), for use when a search specifying a speech subject is performed later in process 1213002 of FIG. 8.
For index terms appearing in the body text, the index terms alone suffice when the search expression is only a logical combination of search terms (a combination of AND, OR, NOT, and so on). When the ranking is to be raised or lowered according to the number of occurrences, the count is also recorded. When a constraint is imposed on how many words apart multiple search terms may appear, the occurrence positions must be recorded as well. This embodiment can be carried out with index terms alone, but occurrence counts and positions are desirable so that documents more relevant to the topic are ranked higher. In response to an OR-combined search request, the document search means 122 accesses this data and outputs a list of the IDs of documents that contain any of the terms in the condition expression as an index term.
In loop 1211003, the opinion collection means 1211 performs the following processes 1211010 to 1211013 on each retrieved document ID (I).
In process 1211010, the opinion collection means 1211 uses the document entity acquisition means 123 to look up the document data 131 for the given document ID (I) and obtain the text (body) and title of the corresponding document. The opinion collection means 1211 stores the document ID, the text (body) of the document content, the title, and so on in the opinion data storage area 1215003.
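As an illustration of the search index data and the OR search described above, the following sketch builds a positional inverted index and answers an OR query against it. The layout (`term -> {doc_id: [positions]}`) and the names `build_index` and `search_or` are assumptions; FIG. 15B may organize the same information differently.

```python
from collections import defaultdict

def build_index(documents):
    """documents: {doc_id: [token, ...]} -> {term: {doc_id: [positions]}}.

    The occurrence count of a term in a document is the length of its
    position list, so counts need not be stored separately.
    """
    index = defaultdict(dict)
    for doc_id, tokens in documents.items():
        for pos, term in enumerate(tokens):
            index[term].setdefault(doc_id, []).append(pos)
    return index

def search_or(index, terms):
    """Return the sorted IDs of documents containing any of the terms."""
    hits = set()
    for term in terms:
        hits.update(index.get(term, {}))  # adds the doc IDs (dict keys)
    return sorted(hits)
```

Keeping positions in the index is what would allow a distance constraint or a proximity-based ranking to be added later without rescanning the documents.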

FIG. 15A is a diagram showing the data structure of the document data 131. For each document ID, the document data 131 holds the document's title and body, and may also include a speech subject identifier for linking to the speech subject data 133 described later, and date and time information. Of these, only the body is mandatory. When there is no title, the beginning of the body can be used in its place.
In process 1211011, the opinion collection means 1211 divides the text into sentences using the sentence division means 124.
One way to implement the sentence division means is to apply the word segmentation/part-of-speech tagging means 126 to the text and then split at locations recognized as sentence ends (such as full stops).
In loop 1211012, the opinion collection means 1211 performs the following processes 1211020 to 1211023 on each of the resulting sentences (S).
In process 1211020, the opinion collection means 1211 calculates the relevance R between the sentence S and the topic P, and stores R in the opinion data storage area 1215003. The relevance R takes a higher value the more types of the topic's content words appear within a narrower range.

An example of the calculation method follows.
Suppose N types of content words from the topic are contained in the sentence S. For each integer J from 1 to N, let D(J) be the number of words in the shortest span of S that contains J types of content words (the number of words from the first occurrence position to the last), and compute r(J) = J × J ÷ D(J). One good approach is to take the maximum of r(J) over J from 1 to N as the relevance R. If the N types of content words appear consecutively, then D(N) = N and r(N) = N × N ÷ N = N, that is, the number of content word types. The farther apart they appear, the lower the value.
If K of the N types (where K > N/2) appear consecutively and the rest appear far away, then r(K) = K, whereas for J > K, D(J) becomes large and r(J) < K is expected, so the relevance R is expected to be K.

The relevance calculation is illustrated with a concrete example. Let A to Z each denote some word, and let the sentence S be "ABCDEFGABCD". Let the content words of the topic P be A, B, F, and K. S contains A, B, and F, so it contains N = 3 types of content words. To compute the relevance R of S to P, r(J) is therefore computed for J = 1, 2, 3 and the maximum is taken. For J = 1, the shortest span in which one type of word appears is always the word itself, so D(1) = 1 and r(1) = 1 × 1 ÷ 1 = 1. For J = 2, the shortest span for A and B is 2, for A and F it is 3 (FGA, since order does not matter), and for B and F it is 4 (FGAB). The shortest span for two types of content words is thus the 2 of A and B, so D(2) = 2 and r(2) = 2 × 2 ÷ D(2) = 2. For J = 3, the only combination of three types is A, B, and F. The shortest span in which all of A, B, and F appear is FGAB, of length 4, so D(3) = 4 and r(3) = 3 × 3 ÷ D(3) = 2.25. The maximum of r(J) is therefore 2.25, at J = 3. Now suppose XYZ were inserted between the G and the following A, giving "ABCDEFGXYZABCD". Then r(1) and r(2) are unchanged, but the shortest span containing all of A, B, and F is now the initial ABCDEF, of length 6, so D(3) = 6 and r(3) = 3 × 3 ÷ 6 = 1.5. In this case r(2) is the maximum and the relevance is R = 2. In other words, even when more types are present, if they appear far apart, a place where fewer types appear compactly may win.
For simplicity, r(J) = J × (J ÷ D(J)) was used here. However, multiplying (J ÷ D(J)) directly by J may let the shortest span length D(J) have too strong an effect; to prevent this, a good alternative is to take the square root (power 0.5) of (J ÷ D(J)) before multiplying by J. Alternatively, a threshold D0 on the span length can be introduced so that differences in span length are ignored when the words appear within that length. In that case, define J1 = MAX(D0, J) and D1 = MAX(D0, D(J)) and use r(J) = J × (J1 ÷ D1).
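The relevance calculation and its two variants can be checked with a small brute-force sketch. The function names `min_span` and `relevance`, the `variant` and `d0` parameters, and the choice to return 0.0 when no content word is present are assumptions made for this illustration; the exhaustive search over word subsets is only practical for short topics.

```python
from itertools import combinations
from math import sqrt

def min_span(sentence, words):
    """Length of the shortest window of `sentence` containing every word in `words`."""
    best = None
    for start in range(len(sentence)):
        seen = set()
        for end in range(start, len(sentence)):
            if sentence[end] in words:
                seen.add(sentence[end])
            if len(seen) == len(words):
                length = end - start + 1
                if best is None or length < best:
                    best = length
                break  # a longer window from this start cannot be shorter
    return best

def relevance(sentence, topic_words, variant="plain", d0=1):
    """R = max over J of r(J), with r(J) chosen by `variant`."""
    present = sorted(set(topic_words) & set(sentence))
    scores = [0.0]  # assumed value when no content word occurs
    for j in range(1, len(present) + 1):
        # D(J): shortest span over every choice of J content-word types.
        d = min(min_span(sentence, set(c)) for c in combinations(present, j))
        if variant == "sqrt":
            r = j * sqrt(j / d)
        elif variant == "threshold":
            r = j * max(d0, j) / max(d0, d)
        else:
            r = j * j / d
        scores.append(r)
    return max(scores)

print(relevance(list("ABCDEFGABCD"), list("ABFK")))     # 2.25
print(relevance(list("ABCDEFGXYZABCD"), list("ABFK")))  # 2.0
```

The two printed values reproduce the worked example: 2.25 for "ABCDEFGABCD" and 2 once XYZ is inserted.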

In process 1211021, for a sentence S that is relevant to the topic P, the opinion collection means 1211 determines approval or disapproval of the topic by referring to the approval/disapproval expression data 1216001 (FIG. 7A) and the negative expression data 1216003 (FIG. 7C). Whether a sentence is relevant can be determined, for example, by comparing the relevance R with a predetermined threshold. The opinion collection means 1211 stores the approval/disapproval item (+1 or -1) in the opinion data storage area 1215003. A sentence with an approval expression and no disapproval expression is judged as approval; conversely, a sentence with a disapproval expression and no approval expression is judged as disapproval. When a sentence contains multiple negative expressions, the judgment is reversed if their number is odd.
In process 1211022, for a sentence S that is relevant to the topic P and has been judged as approval or disapproval, the opinion collection means 1211 determines whether a reason or evidence is present by referring to the reason/evidence expression data 1216002 (FIG. 7B). It stores a reason/evidence score in the opinion data storage area 1215003 according to the presence or absence of a reason or evidence.
In process 1211023, for these sentences S, the opinion collection means 1211 identifies the speech subject by matching against the speech expression data 1216004 (FIG. 7D), and stores the speech subject identifier in the opinion data storage area 1215003. For example, the speech subject can be identified by treating the syntactic element that is the grammatical subject of the speech expression as the speech subject. When the name collation means 127 can resolve it to an entry in the speech subject data 133, that entry is taken as the speech subject. When the named entity extraction means 128 can identify the date and time of the statement, that date and time is taken as the statement's date and time.
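The approval/disapproval judgment of process 1211021 could be sketched as below. The token-level lexicon lookup, the function name `judge_stance`, and the decision to return `None` when a sentence has both or neither kind of expression are assumptions. The sketch also applies the parity rule to any number of negations, so a single negation flips the stance, which is one reading of the rule in the text.

```python
def judge_stance(tokens, stance_lexicon, negation_words):
    """Return +1 (approval), -1 (disapproval), or None if undecided.

    stance_lexicon: {phrase: +1 or -1}, in the spirit of FIG. 7A (hypothetical entries)
    negation_words: set of negation phrases, in the spirit of FIG. 7C
    """
    has_pro = any(stance_lexicon.get(t) == 1 for t in tokens)
    has_con = any(stance_lexicon.get(t) == -1 for t in tokens)
    if has_pro == has_con:
        return None  # both or neither present: left undecided in this sketch
    stance = 1 if has_pro else -1
    negations = sum(1 for t in tokens if t in negation_words)
    if negations % 2 == 1:
        stance = -stance  # an odd number of negations reverses the judgment
    return stance
```

Real negation scope is more subtle than counting tokens, so a production system would presumably use the grammatical information that accompanies each lexicon entry.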

FIG. 15C shows an example of the speech subject data 133. As described above, the speech subject data consists of a name, aliases, an affiliated organization (or parent organization) identifier, and the like. When a predicate of a speech expression shown in FIG. 7D appears in or around the sentence S, the opinion collection means 1211 takes its grammatical subject and identifies the speech subject by matching it against the names and aliases in the speech subject data. When the speech subject cannot be identified in this way, the opinion collection means 1211, as described later, takes the speech subject registered for the document in the document data (FIG. 15A), if any, as the estimated speech subject.
To identify the date and time, the opinion collection means 1211 matches patterns such as "<number>年<number>月<number>日" (year, month, day) against the surrounding context and, on success, takes the matched value as the date and time of the statement. When no match is obtained, the opinion collection means 1211 likewise takes the date and time registered for the document in the document data (FIG. 15A), if any, as the estimated date and time of the statement.
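The date pattern match with a fallback to the document's registered date might be sketched like this. The regular expression, the function name `extract_date`, and the `default` parameter (standing in for the date registered in the document data) are assumptions; the sketch looks only at the pattern itself and ignores the surrounding context mentioned in the text.

```python
import re
from datetime import date

# <number>年<number>月<number>日; \d also matches full-width digits in str patterns.
DATE_PATTERN = re.compile(r"(\d{1,4})年(\d{1,2})月(\d{1,2})日")

def extract_date(text, default=None):
    """Return the first date found in `text`, or `default` if none is found."""
    m = DATE_PATTERN.search(text)
    if m:
        year, month, day = (int(g) for g in m.groups())
        try:
            return date(year, month, day)
        except ValueError:
            pass  # impossible date such as 13月: treat as no match
    return default
```

Passing the document-level date as `default` mirrors the fallback described above for sentences in which no date expression is found.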

After repeating processes 1211020 to 1211023 and leaving loop 1211012, the opinion collection means 1211 uses default values for sentences whose speech subject or statement date and time could not be identified. That is, in process 1211013, if the document contains at least one sentence that is relevant to the topic P and can be judged as approval or disapproval, the default speech subject, the default statement date and time, and the title for this document (I) are obtained from the document data 131 (FIG. 15A).
After repeating processes 1211010 to 1211013 and leaving loop 1211003, the opinion collection means 1211, in process 1211004, sorts the obtained opinion data (FIG. 3B) in descending order with the relevance R to the topic P (relevance score) as the first sort key, and then in descending order with the presence or absence of a reason or evidence (reason/evidence score) as the second key. When the number of collected opinions exceeds the upper limit specified as an option, the entries ranked beyond that number after sorting are discarded.
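The final ordering and truncation in process 1211004 amount to a two-key descending sort. In the sketch below, the dictionary keys `relevance` and `reason_evidence` and the function name `rank_opinions` are hypothetical stand-ins for the corresponding fields of the opinion data in FIG. 3B.

```python
def rank_opinions(opinions, limit=None):
    """Sort by relevance score, then by reason/evidence score, both descending,
    and keep at most `limit` entries when a limit is given."""
    ranked = sorted(
        opinions,
        key=lambda o: (-o["relevance"], -o["reason_evidence"]),
    )
    return ranked if limit is None else ranked[:limit]
```

Because Python's sort is stable, opinions that tie on both keys keep their original collection order.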

FIG. 7A is a diagram showing an example of the approval/disapproval expression data 1216001 used in opinion collection. It lists phrases together with their polarity; here approval is 1 and disapproval is -1.
FIG. 7B is a diagram showing an example of the reason/evidence expression data 1216002 used in opinion collection. It includes phrases used to present reasons or evidence, together with their grammatical information.
FIG. 7C is a diagram showing an example of the negative expression data 1216003 used in opinion collection. It includes phrases used to express negation, together with their grammatical information.
FIG. 7D is a diagram showing an example of the speech expression data 1216004 used in opinion collection. It includes phrases used to express a statement, together with their grammatical information.
In each of FIGS. 7A to 7D, the left side shows the Japanese version and the right side shows the English version.

FIG. 8 illustrates the procedure by which the value importance calculation means 1213, for a given speaking subject and a given value, collects the subject's past statements and calculates from them the degree of importance the subject places on that value.
First, in process 1213000, the importance calculation means 1213 clears the importance value to be obtained (V) to zero.
In process 1213001, the importance calculation means 1213 applies the sentence structure analysis means 125 to the given value and stores the resulting sentence structure (VS) in the value sentence structure storage area 1215021. If collation with the promotion/suppression phrase data 1216012 (FIG. 9B) shows that the main part of the top-level syntax element is a promotion or suppression word, the syntax element that is the target of the promotion or suppression is made the new top-level syntax element. In the case of suppression, the value inversion flag (Rev) is turned on. The initial value of Rev is off.

FIG. 16 shows an example of the data stored in the value sentence structure storage area 1215021 when the value is "progress of medicine" (医学の進歩). In the initial structure output by the sentence structure analysis means, the top-level element (syntax element No. 1) is a simple sentence whose main part is the predicate "progress", and syntax element No. 2, its agent, is a term whose main part is the noun "medicine". The promotion/suppression phrase data 1216012 (FIG. 9B) shows that "progress" promotes its agent, so syntax element No. 2 (the term whose main part is "medicine"), which is the target of the promotion, is taken as the top-level syntax element. Because "progress" acts in the promoting direction, the inversion flag remains off.
The sentence structure analysis means 125 builds the sentence structure by referring to the sentence structure analysis dictionary 135.
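The normalization in process 1213001 might be sketched as below. The dictionary-based element layout, the table contents, and the function name `normalize_value` are assumptions made for illustration; only the rule itself (promote the target element to the top, set Rev on suppression) comes from the description.

```python
# Hypothetical table in the spirit of FIG. 9B:
# word -> (target roles in priority order, promotion/suppression coefficient).
PROMOTION_SUPPRESSION = {
    "progress": (["agent"], 1.0),   # promotes its agent (coefficient value assumed)
    "hinder": (["object"], -1.0),   # suppresses its object (coefficient value assumed)
}


def normalize_value(element, table=PROMOTION_SUPPRESSION):
    """Return (top_element, rev) for a parsed value.

    `element` is a dict of the form {"head": str, "args": {role: element}}.
    If the head is a promotion/suppression word, its target becomes the new
    top-level element, and rev is True when the word is a suppressor.
    """
    entry = table.get(element["head"])
    if entry is None:
        return element, False
    roles, coefficient = entry
    for role in roles:  # roles are tried in priority order
        target = element.get("args", {}).get(role)
        if target is not None:
            return target, coefficient < 0
    return element, False
```

For the value "progress of medicine", the sketch returns the "medicine" term with the flag off, matching the FIG. 16 example.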

FIG. 17 shows an example of this dictionary. In addition to each word and its part-of-speech information, the sentence structure analysis dictionary 135 contains, for words that denote an action, the semantic roles involved (such as the agent and the object of the action) and the search rule for the term that fills each role. For "progress" (進歩), for example, the semantic role is the subject, and its search rule is the agent rule. The search rule list attached to the dictionary gives the priority order of the particles used when searching for an agent, and the matching term is sought in that order. For "医学の進歩" (progress of medicine), the fourth particle, "の", matches, so "medicine" is taken as the agent. When a causative auxiliary is attached, as in "進歩させる" (to cause to progress), the term is instead searched for according to the rule for the object of the action.
In process 1213002, the importance calculation means 1213 passes to the document search means 122 the search condition that the statement was made by the given speaking subject, has it run a search over the search index data 132, and thereby obtains a list of the IDs of documents containing that subject's statements.
In process 1213003, as a measure for the case where the list of document IDs falls short of a predetermined number, the importance calculation means 1213 refers to the affiliated (parent) organization identifier in the speaking subject data 133 (FIG. 15C). If one is registered, it runs a search on the condition that the speaking subject is that parent organization and uses the resulting list of document IDs to make up the shortfall. This process is not essential, but it is effective as a remedy when few documents are retrieved. Because it can also have side effects, it is desirable that whether to perform it be configurable as an option.
Next, in loop 1213004, the importance calculation means 1213 executes processes 1213011 through 1213013 for each retrieved document ID (I).
In process 1213011, the importance calculation means 1213 passes document ID (I) to the document entity acquisition means 123 and acquires the text (body) of the document from the document data 131 (FIG. 15A).
In process 1213012, the importance calculation means 1213 splits the text into sentences using the sentence dividing means 124.
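The optional fallback to the parent organization in process 1213003 could look roughly like this. The callable `search`, the mapping `parent_of`, and the parameter names are hypothetical stand-ins for the document search means 122 and the speaking subject data 133; topping the list up only to the required number is one reading of "making up the shortfall".

```python
from typing import Callable, Dict, Iterable, List


def collect_document_ids(
    subject: str,
    search: Callable[[str], Iterable[str]],  # hypothetical: subject -> document IDs
    parent_of: Dict[str, str],               # hypothetical: subject -> parent organization
    min_docs: int,
    use_parent_fallback: bool = True,        # the option mentioned in the text
) -> List[str]:
    """Collect document IDs for a subject, topping up from the parent organization."""
    ids = list(search(subject))
    if use_parent_fallback and len(ids) < min_docs:
        parent = parent_of.get(subject)
        if parent is not None:
            for doc_id in search(parent):
                if len(ids) >= min_docs:
                    break
                if doc_id not in ids:
                    ids.append(doc_id)
    return ids
```

Keeping the fallback behind a flag reflects the remark that the step can have side effects, for example attributing an organization's position to an individual.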

Next, loop 1213013 is entered, and the importance calculation means 1213 performs process 1213021 and loop 1213022 on each of the resulting sentences S.
In process 1213021, the importance calculation means 1213 applies the sentence structure analysis means 125 to sentence S, obtains the syntax structure data, and stores it in the sentence structure storage area 1215023. How the syntax structure is built is explained concretely, using an example sentence, in the description of FIG. 10 below. Sentence structure analysis is based on grammar and on the sentence structure analysis dictionary 135 (FIG. 17). Loop 1213022 is then entered, and the importance calculation means 1213 performs processes 1213031 to 1213033 on each syntax element P of the syntax structure that is semantically similar to the value.
In process 1213031, the importance calculation means 1213 calculates the semantic similarity Sim between the sentence structure of the value (VS) and syntax element P. Here VS is the structure obtained in process 1213001 by applying the sentence structure analysis means 125 to the given value and stored in the value sentence structure storage area 1215021.

The method of calculating the semantic similarity Sim is described in detail below.
First, the main part of the top-level syntax element of the sentence structure VS is compared with the main part of syntax element P, to check whether the two are identical or form a pair registered in the semantic similarity phrase pair data 1216011. If they are identical, Sim = 1; if they form a registered pair, Sim is the similarity value recorded in the data. If the value consists of the top-level element alone, the calculation ends here. If the value has a term, Sim = 0 unless syntax element P also has a term with the same role; if it does, the similarity between the two terms is calculated and multiplied into Sim. If the value has several terms, syntax element P must have a term with the same role for every one of them, and the similarities of all corresponding term pairs are calculated and multiplied together. If the value carries an adnominal modifier, syntax element P must carry a corresponding adnominal modifier, and the similarity between the two is calculated and multiplied in.
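A minimal sketch of this Sim calculation follows. The `Element` class, the pair table layout, and the recursive treatment of terms are assumptions for illustration; the two coefficients (1.0 and 0.2) are the ones quoted for "medicine" in this description, and the English glosses stand in for the Japanese entries.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Element:
    head: str                                                    # main part
    args: Dict[str, "Element"] = field(default_factory=dict)     # role -> term
    adnominal: Optional["Element"] = None                        # adnominal modifier


# Stand-in for the semantic similarity phrase pair data 1216011.
PAIRS = {
    frozenset(("medicine", "medical care")): 1.0,
    frozenset(("medicine", "drug")): 0.2,
}


def head_similarity(a: str, b: str, pairs=PAIRS) -> float:
    if a == b:
        return 1.0
    return pairs.get(frozenset((a, b)), 0.0)


def similarity(value: Element, elem: Element, pairs=PAIRS) -> float:
    """Sim between the value structure VS and a syntax element P."""
    sim = head_similarity(value.head, elem.head, pairs)
    for role, value_term in value.args.items():
        elem_term = elem.args.get(role)
        if elem_term is None:          # P lacks a term with the same role
            return 0.0
        sim *= similarity(value_term, elem_term, pairs)
    if value.adnominal is not None:
        if elem.adnominal is None:     # P lacks the corresponding modifier
            return 0.0
        sim *= similarity(value.adnominal, elem.adnominal, pairs)
    return sim
```

Terms on the P side that the value does not mention are ignored here, since the text only imposes conditions in the direction from the value to P.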

For the syntax structure of the example sentence in FIG. 10, described later, suppose the value is "progress of medicine". The syntax structure VS of the value is then a term whose main part is "medicine", so the only syntax element with a positive Sim is term No. 10. Its main part is the same word, so Sim = 1.0. If the main part of syntax element No. 10 were "medical care" (医療), Sim would again be 1.0, because the semantic similarity phrase pair data 1216011 gives it a similarity coefficient of 1.0 with "medicine". If the main part were "drug" (薬剤), whose similarity coefficient is 0.2, Sim would be 0.2.
In process 1213032, the importance calculation means 1213 uses the context factor calculation means 1214 to calculate the context factor CtxFactor of syntax element P within sentence S. The calculation is explained in detail later with reference to FIGS. 10 and 11.
In process 1213033, the importance calculation means 1213 calculates the value support (s) of syntax element P from the semantic similarity Sim and the context factor CtxFactor and adds it to the importance value (V) being computed. If the value inversion flag (Rev) is on, s is subtracted instead. One preferred way to calculate the value support (s) is as the product of the semantic similarity Sim and the context factor CtxFactor.
The triple loop (1213022, 1213013, 1213004) is then exited, and in process 1213005 the importance calculation means 1213 outputs the resulting importance value V.
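The accumulation over the triple loop can be summarized in a short sketch. The nested-list input format and the two callables are assumptions made here; the product s = Sim × CtxFactor is the preferred method named in the text, and subtraction is applied when Rev is on.

```python
from typing import Callable, Iterable


def accumulate_importance(
    documents: Iterable[Iterable[Iterable[object]]],  # documents -> sentences -> syntax elements
    value_structure: object,
    rev: bool,
    similarity: Callable[[object, object], float],    # hypothetical Sim function
    context_factor: Callable[[object], float],        # hypothetical CtxFactor function
) -> float:
    """Accumulate the importance V over all past statements of one speaking subject."""
    v = 0.0                                    # process 1213000: clear V
    for sentences in documents:                # loop 1213004: each document
        for elements in sentences:             # loop 1213013: each sentence S
            for element in elements:           # loop 1213022: each syntax element P
                sim = similarity(value_structure, element)
                if sim <= 0.0:
                    continue                   # only elements similar to the value count
                s = sim * context_factor(element)
                v += -s if rev else s
    return v                                   # process 1213005: output V
```

Because the context factor can be negative, a statement that argues against the value lowers V even when Rev is off.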

FIG. 9A shows an example of the semantic similarity phrase pair data 1216011 used in the importance calculation. The data contains pairs of semantically similar phrases and their similarity coefficients. A similarity coefficient is a real number greater than 0 and at most 1.0; the larger the value, the more similar the phrases.
FIG. 9B shows an example of the promotion/suppression phrase data 1216012 used in the importance calculation. It contains phrases that have a promoting or suppressing effect, the target roles that indicate what each phrase promotes or suppresses, and a coefficient indicating the degree of promotion or suppression. There are generally several target roles, listed in priority order. For "promote" (促進), for example, if the phrase is accompanied by a syntax element of the form "○○を" (the object), that element is taken as the promotion target; if there is none but there is a syntax element of the form "○○が" (the agent), that element is the promotion target. A positive coefficient denotes promotion and a negative coefficient suppression.

FIG. 10 shows an example of a sentence stored in the sentence storage area 1215020 while the importance is calculated by the method of FIG. 8, together with the contents of the sentence structure storage area 1215023 obtained by analyzing its syntax structure. The example sentence is "医学の発展を阻害する要因を一つ一つ取り除いていくために、我々は何をすべきか、じっくりと考えてみる必要があるのではなかろうか。" (roughly, "Should we not think carefully about what we ought to do in order to remove, one by one, the factors that hinder the development of medicine?"). An English example sentence is also shown for reference. The recorded structure states, among other things, that the top-level syntax element (element No. 1) is a simple sentence whose main-part predicate is "考える" (think) and that the object of the action is syntax element No. 2. It also records that the sentence-final auxiliary part is "てみる必要があるのではなかろうか" and that the modifier part (adverbial modifier) is "じっくりと" (carefully).
The sentence-final auxiliary part is obtained by taking, from the end of the sentence, as long a run as possible of expressions registered in the auxiliary part factor data 1216022 (FIG. 12B) and of function words such as auxiliary verbs, particles, and conjunctions. In this example, of "てみる／必要がある／の／で／は／なかろうか", the three parts "てみる", "必要がある", and "なかろうか" are registered in the auxiliary part factor data, and the remaining "の", "で", and "は" are a formal noun, an auxiliary verb, and a particle, respectively, taken as function words. Next, based on the syntax analysis dictionary (FIG. 17), "考える" is taken as a main part whose semantic roles are the object (what is thought) and the subject (who thinks). For the object, the clause running from the beginning of the sentence to "何をすべきか" (what we should do) is taken according to the clause search rule; no subject is found. An adverb that modifies the main part, such as "じっくりと", is added to the modifier part.

The analysis then moves to the part corresponding to what is thought, that is, the part from the beginning of the sentence to "何をすべきか". This part matches the purpose (A)-means (B) pattern "do B in order to A", so syntax element No. 2 is a complex sentence whose main part is "purpose-means", with the purpose registered as role 1 and the means as role 2. In this example these are syntax elements No. 6 and No. 3, respectively.
The part corresponding to the means is "我々は何をするべきか" (what should we do); the function words "べき" and "か" are taken from its end as the sentence-final auxiliary part. The syntax analysis dictionary then shows that the verb "する" (do) takes a subject and an object as semantic roles, and the search rules for the corresponding terms identify them as "我々" (we) and "何" (what). Syntax element No. 3 is therefore a simple sentence whose main-part predicate is "する"; the subject "我々" becomes syntax element No. 4 and the object "何" becomes syntax element No. 5.
Syntax element No. 6, the purpose, is a simple sentence whose main part is the predicate "取り除く" (remove). The object removed is syntax element No. 7, a term whose main part is the noun "要因" (factor). Term No. 7 carries an adnominal modifier, namely syntax element No. 8, whose main part is the predicate "阻害する" (hinder). The agent of No. 8 is the factor of No. 7, and the object hindered is syntax element No. 9. Syntax element No. 9 is a simple sentence whose main part is the predicate "発展" (development), and its subject is term No. 10, whose main part is "医学" (medicine). English sentences are analyzed in the same way, so their description is omitted.

FIG. 11 shows the contents of the context factor calculation work area 1215024 while the importance is calculated for the same example sentence as in FIG. 10. The syntax element numbers correspond to those in FIG. 10. For each syntax element, columns are provided for the main part factor, the auxiliary part factor, and the modifier part factor, along with a column for calculating the context factor.
The main part factor column stores the calculation formula corresponding to the main part of the syntax element, looked up in the main part factor data 1216021. The auxiliary part factor column stores the value calculated by referring to the auxiliary part factor data 1216022. A simple and preferred way to calculate the auxiliary part factor is to match the auxiliary part from its beginning, by longest match, against the expressions registered in the data and to multiply together the factors of the expressions matched. If nothing matches, the default value 1.0 is used. In the figure, default values are shown in parentheses.
The modifier part factor column stores the corresponding value if a matching entry is found in the modifier part factor data 1216023; otherwise the default value 1.0 is used.
The formula in the main part factor column calculates the context factor of a syntax element from the context factors of its terms (lower-level syntax elements). The auxiliary part factor relates to the auxiliary part of the syntax structure, and the modifier part factor relates to its modifier part. The context factor is calculated by designating a syntax element and working from it toward the top of the syntax structure.
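The longest-match calculation of the auxiliary part factor might be sketched as follows. The table entries and their values are hypothetical (the figure's actual entries are not reproduced in the text), and skipping unmatched characters as function words is an assumption of this sketch.

```python
from typing import Dict

# Hypothetical stand-in for the auxiliary part factor data 1216022 (FIG. 12B).
AUX_TABLE: Dict[str, float] = {
    "かもしれない": 0.5,   # hedged expression: small absolute value (assumed)
    "ない": -1.0,          # plain negation: negative value (assumed)
    "べき": 1.2,           # strong assertion (assumed)
}


def auxiliary_factor(aux: str, table: Dict[str, float] = AUX_TABLE, default: float = 1.0) -> float:
    """Scan the auxiliary part from its start, always preferring the longest
    registered expression, and multiply the factors of all matches."""
    keys = sorted(table, key=len, reverse=True)   # longest expressions first
    factor, matched, i = 1.0, False, 0
    while i < len(aux):
        for key in keys:
            if aux.startswith(key, i):
                factor *= table[key]
                matched = True
                i += len(key)                     # the matched span is consumed
                break
        else:
            i += 1                                # unregistered function word: skip
    return factor if matched else default
```

Consuming the matched span is what prevents a short entry such as "ない" from being applied again inside a longer entry that contains it.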

For the example sentence, if the value is "progress of medicine", the syntax element whose semantic similarity to the value is positive is term No. 10 (the term corresponding to "医学", medicine), as noted in the description of process 1213031 in FIG. 8. The method of calculating the context factor of syntax element No. 10 is therefore explained here.
First, syntax element No. 10 is given a context factor of 1.0. Next, the syntax element that is determined solely by No. 10 is searched for and found to be No. 9. Its main part factor is calculated as the context factor of No. 10 × 1.0 = 1.0. Repeating the same step gives, in order, (1) a context factor of -1.0 for No. 8, (2) -1.0 for No. 7, (3) 1.0 for No. 6, (4) 1.0 for No. 2, and (5) approximately 0.67 for No. 1. Steps (1) to (5) are explained in detail below.
(1) No. 8 is a syntax element of the form "○○を阻害する" (hinder ○○), whose main part factor is (-1) × (factor of No. 9, the object hindered = 1.0), which gives -1.0. (2) No. 7 is adnominally modified by No. 8, so it inherits the factor of No. 8 and becomes -1.0. (3) No. 6 is "remove No. 7", and its main part factor is calculated as -1.0 × (factor of No. 7, the object removed = -1.0) = 1.0.
(4) No. 2 is "(do No. 3) for (the purpose of No. 6)", and its main part factor is the maximum (Max) of the factors of No. 6 and No. 3. Here No. 6 is 1.0 and No. 3 is 0.0, so the main part factor is 1.0. (5) For No. 1, the main part is "think No. 2", so the main part factor is the factor of No. 2 × 0.8 = 0.8. The auxiliary part factor is calculated as 0.7 by referring to the auxiliary part factor data 1216022 (FIG. 12B), because the auxiliary part contains "なかろうか", and the factor of "じっくりと" is found to be 1.2 by referring to the modifier part factor data 1216023. Multiplying these gives the context factor 0.8 × 0.7 × 1.2 ≈ 0.67.
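The upward propagation in this worked example can be replayed with a few lines of code. The chain representation and the function name `propagate` are assumptions for illustration, as is the use of the default 1.0 for the auxiliary and modifier factors of the intermediate elements; the main part formulas and the factors of element No. 1 are those stated in the text.

```python
from typing import Callable, List, Tuple

# (element number, main part formula, auxiliary part factor, modifier part factor)
Step = Tuple[int, Callable[[float], float], float, float]

EXAMPLE_CHAIN: List[Step] = [
    (9, lambda c: 1.0 * c, 1.0, 1.0),      # "development of No. 10"
    (8, lambda c: -1.0 * c, 1.0, 1.0),     # "hinder No. 9"
    (7, lambda c: c, 1.0, 1.0),            # inherits the factor of its modifier No. 8
    (6, lambda c: -1.0 * c, 1.0, 1.0),     # "remove No. 7"
    (2, lambda c: max(c, 0.0), 1.0, 1.0),  # Max of No. 6 and No. 3 (No. 3 is 0.0)
    (1, lambda c: 0.8 * c, 0.7, 1.2),      # "think No. 2", with auxiliary and modifier factors
]


def propagate(chain: List[Step], start: float = 1.0) -> List[Tuple[int, float]]:
    """Carry the context factor from the matched element up to the top element."""
    ctx = start                            # element No. 10 starts at 1.0
    trace = []
    for number, main_formula, aux, mod in chain:
        ctx = main_formula(ctx) * aux * mod
        trace.append((number, ctx))
    return trace
```

The last entry of `propagate(EXAMPLE_CHAIN)` is the context factor of element No. 1, which equals 0.8 × 0.7 × 1.2 = 0.672 up to floating-point rounding.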

FIG. 12A shows an example of the main part factor data used in the context factor calculation. The main part factor data contains the type of syntax element (complex or simple sentence), the main part, the list of roles of its terms, and the main part factor. When the syntax element type is a complex sentence, the patterns, such as cause-result and purpose-means, are limited in number. When it is a simple sentence, a verb is the main part, and the agent, the object of the action, and so on are the roles of the terms. The main part factor is given as a formula computed from the values corresponding to the terms.
FIG. 12B shows an example of the auxiliary part factor data used in the context factor calculation. It contains auxiliary part expressions and the corresponding factor values. The more confidently an assertion is made, the larger the absolute value; the vaguer the expression, the smaller the absolute value. Negation gives a negative value. Entry No. 3, "ない" (not), is contained in entries No. 1 and No. 2, but the longer expression takes priority, so No. 3 is not applied to a portion already matched by No. 1 or No. 2.
FIG. 12C shows an example of the modifier part factor data used in the context factor calculation. It contains modifying expressions such as adverbs and their factor values. Intensifying words are given large values and hedging words small values.

As described above, the opinion collection system of this embodiment lets the user set, together with the topic of interest, a plurality of values related to that topic. It calculates how much importance each speaking subject who has expressed an opinion on the topic places on those values, determines the display position of the speaking subject from the calculated values, and presents the subject's opinions at that position. The collected opinions can thus be read and analyzed with an understanding of the background of each speaker's thinking, namely how much importance the speaker places on the values.

As shown in FIG. 13, the importance that even a single speaking subject places on a value may change over time. By providing the period division setting unit 2517 and treating each speaking subject together with a period as a pair, it is also possible to capture how the subject's background with respect to the values changes.

As a concrete application, Embodiment 1 targets the collection of opinions on social issues on which views are divided, but the topic may also concern products or services, or politics, diplomacy, or security.

FIG. 14A shows the case where the system is applied to collecting opinions on products and services.
For products and services, price and performance are typical values. Those expressing opinions can be expected to include people who prioritize price, people who prioritize performance, and people who balance the two. The opinion list display unit 2515 shows the results with price on the horizontal axis and performance on the vertical axis. A product with good performance but a high price tends to be rated well by performance-oriented people and poorly by price-oriented people. If, in that situation, a price-oriented person rates the product highly, a reader will likely want to look at that opinion.
Because the relevant aspects of performance differ from product to product, phrases considered semantically similar to "performance" (for example speed, strength, capacity, and safety) need to be registered in the semantic similarity phrase pair data 1216011.

FIG. 14B, by contrast, shows opinion analysis on a political, diplomatic, or security topic, for the case where the topic taken up is one on which nations disagree.
Suppose countries A and B are in conflict and country B opposes country A's claim P; the aim is to analyze what opinions are being voiced by parties other than the two countries concerned. Here the values could be the degree of importance placed on relations with country A, on the horizontal axis, and the degree of importance placed on relations with country B, on the vertical axis. This balance will differ from country to country and from person to person. If someone who usually favors country B supports claim P, or a country that usually favors country A voices opposition to claim P, the unexpectedness will attract interest.

C. Effects of the embodiment

According to this embodiment, for a set of documents that record statements, multiple opinions on a given topic can be read with an understanding of how much importance each speaker places on the multiple values related to the topic (that is, the background of the speaker's thinking). Each opinion can also be read with its unexpectedness and reliability in mind, which improves the quality and efficiency of opinion analysis.

D. Appendix

The present invention is not limited to the embodiments described above and includes various modifications. For example, the embodiments above are described in detail to make the invention easy to understand, and the invention is not necessarily limited to configurations that include every element described. Part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to that of a given embodiment. For part of the configuration of each embodiment, other configurations can be added, removed, or substituted.
Some or all of the configurations, functions, processing units, processing means, and the like described above may be implemented in hardware, for example by designing them as integrated circuits. They may also be implemented in software, with a processor interpreting and executing programs that realize the respective functions. Information such as the programs, tables, and files that realize each function can be placed in a memory, in a storage device such as a hard disk or an SSD (Solid State Drive), or on a recording medium such as an IC card, an SD card, or a DVD.
The control lines and information lines shown are those considered necessary for the explanation; not all control lines and information lines of a product are necessarily shown. In practice, almost all of the components may be considered to be connected to one another.

The opinion collection device, system, and method of the present invention can be provided by an opinion collection program for causing a computer to execute each of its procedures, a computer-readable recording medium on which the opinion collection program is recorded, a program product that includes the opinion collection program and can be loaded into the internal memory of a computer, a computer such as a server that includes the program, and the like.

DESCRIPTION OF SYMBOLS
100: opinion collection device, 110: arithmetic unit, 120: main storage unit, 121: opinion collection management unit, 1211: opinion collection means, 1212: means for collecting past statements of a speaking subject, 1213: value importance calculation means, 1214: context factor calculation means, 1215: opinion collection work area, 1216: opinion collection rules and data collection, 122: document search means, 123: document entity acquisition means, 124: sentence division means, 125: sentence structure analysis means, 126: word division / part-of-speech tagging means, 127: name identification means, 128: named entity extraction means, 130: auxiliary storage unit, 131: document data, 132: search index data, 133: speaking subject data, 134: value data, 135: sentence structure analysis dictionary, 140: input unit, 150: display unit, 160: communication unit.
1215001: topic storage area, 1215002: values storage area, 1215003: opinion data storage area, 1215004: speaking subject data storage area, 1215005: storage area for importance by speaking subject and value, 1215010: search condition storage area, 1215011: search result storage area, 1215012: document entity storage area, 1215013: bibliographic information storage area, 1215020: sentence storage area, 1215021: storage area for the sentence structure of a value, 1215022: value inversion flag storage area, 1215023: sentence structure storage area, 1215024: context factor calculation work area.
1216001: approval/opposition expression data, 1216002: reason/evidence expression data, 1216003: negation expression data, 1216004: statement expression data, 1216011: semantically similar phrase pair data, 1216012: promotion/suppression phrase data, 1216021: main part factor data, 1216022: auxiliary part factor data, 1216023: modifier part factor data.
200: opinion browsing and analysis support terminal, 210: arithmetic unit, 220: main storage unit, 221: opinion browsing and analysis support management unit, 2211: opinion browsing and analysis support means, 2211: opinion browsing and analysis means, 2212: work area, 230: auxiliary storage unit, 240: input unit, 250: display unit, 251: opinion browsing and analysis support screen, 2511: topic setting section, 2512: value setting section, 2513: option setting section, 2514: opinion list display section, 2515: opinion list display section, 2516: detail display section for individual opinions, 2517: time-period division setting section, 260: communication unit.
2212001: topic storage area, 2212002: values storage area, 2212003: opinion data storage area, 2212004: speaking subject data storage area, 2212005: storage area for importance data by speaking subject and value.
300: communication network, 400: printing means, 1000: opinion collection system.

Claims

An opinion collecting device comprising:
a storage unit that holds in advance a plurality of document data items including document content, and that holds a plurality of opinion data items each including document content and a speaking subject, and importance data by speaking subject and value; and
an arithmetic unit,

wherein the arithmetic unit:
receives, as input from a terminal, a topic that defines which documents are to be collected and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content and stores, in the storage unit, a plurality of opinion data items each including the document content and the speaking subject;
calculates, for each speaking subject included in the opinion data, an importance for each of the values;
creates the importance data by speaking subject and value from the calculated importances and stores it in the storage unit; and
displays the importance data by speaking subject and value on a display unit or outputs it to an output unit.

The opinion collecting device according to claim 1, wherein
the opinion data further includes a relevance to the topic, and
the arithmetic unit:
obtains the set of speaking subjects of the retrieved document content together with the relevance, which indicates the strength of the relationship with the topic; and
creates the plurality of opinion data items from a predetermined number of items selected in descending order of relevance.

The opinion collecting device according to claim 1, wherein
the document data further includes document content corresponding to a document ID, and the opinion data further includes a document ID and a relevance to the topic, and

the arithmetic unit:
divides the topic into words, creates a search condition for the topic from the list of content words obtained by removing particles, auxiliary verbs, and other function words, searches the document data, and obtains a list of document IDs that satisfy the search condition;
for each retrieved document ID, refers to the document data to obtain the document content corresponding to the document ID;
for each sentence into which the document content is divided, calculates the relevance between the sentence and the topic such that the value is higher the more content words constituting the topic appear in the sentence and the narrower the range in which they appear, and, for each sentence related to the topic, identifies the speaking subject by matching against data representing predetermined statement expressions; and
stores the statement data, including the document ID, the relevance, and the speaking subject, in the storage unit.
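
The relevance calculation in this claim can be illustrated with a minimal Python sketch. The claim only requires that the score rise with the number of topic content words found in a sentence and with the tightness of the range in which they appear; the specific formula below (coverage multiplied by density), the function names, and the assumption that text is already tokenized are all hypothetical choices made for illustration.

```python
def content_words(tokens, function_words):
    """Drop function words (particles, auxiliary verbs, etc.) from a token list."""
    return [t for t in tokens if t not in function_words]


def relevance(sentence_tokens, topic_words):
    """Score a sentence against the topic's content words.

    Assumed rule (not taken from the source): coverage * density, where
    coverage = share of distinct topic words found in the sentence, and
    density  = distinct matched words / width of the span containing them.
    More matches and a narrower span both raise the score.
    """
    topic = set(topic_words)
    positions = [i for i, tok in enumerate(sentence_tokens) if tok in topic]
    if not positions:
        return 0.0
    matched = {sentence_tokens[i] for i in positions}
    span = positions[-1] - positions[0] + 1
    coverage = len(matched) / len(topic)
    density = len(matched) / span
    return coverage * density


topic = content_words(["should", "we", "restart", "nuclear", "power"],
                      {"should", "we"})
tight = ["restart", "nuclear", "power", "now"]
loose = ["nuclear", "is", "a", "debated", "power", "source"]
print(relevance(tight, topic), relevance(loose, topic))
```

With this rule, a sentence containing all topic words side by side scores highest, and a sentence that mentions only some of them far apart scores lower.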

The opinion collecting device according to claim 3, wherein
the arithmetic unit,
for document content whose speaking subject and statement date and time cannot be identified, refers to the document data and obtains, for each sentence, the default speaking subject of the document that contains the sentence.

The opinion collecting device according to claim 4, wherein
the arithmetic unit
sorts the obtained opinion data in descending order using the relevance to the topic as the first sort key, and/or, when the number of collected document contents exceeds an upper limit specified as an option, discards the items ranked below that limit.
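
The sort-and-truncate step can be sketched in Python as follows. The `Opinion` record and its field names are hypothetical stand-ins for the opinion data; only the behavior (descending order by relevance, optional upper limit) comes from the claim.

```python
from dataclasses import dataclass


@dataclass
class Opinion:
    # Hypothetical record layout for one opinion data item.
    doc_id: str
    speaker: str
    relevance: float


def top_opinions(opinions, limit=None):
    """Sort by relevance (descending) and keep at most `limit` items."""
    ranked = sorted(opinions, key=lambda o: o.relevance, reverse=True)
    return ranked if limit is None else ranked[:limit]


sample = [
    Opinion("d1", "A", 0.2),
    Opinion("d2", "B", 0.9),
    Opinion("d3", "C", 0.5),
]
print([o.doc_id for o in top_opinions(sample, limit=2)])
```

Because Python's `sorted` is stable, items with equal relevance keep their original order, which leaves room for secondary sort keys to be applied beforehand.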

The opinion collecting device according to claim 1, wherein
the arithmetic unit,
in the process of calculating the importance:
refers to the document data and retrieves past document data of the speaking subject;
divides the retrieved past document data into sentences;
parses each divided sentence to obtain the syntax elements that make up its syntactic structure;
calculates, for each syntax element of the parsing result, a semantic similarity (Sim) with the value;
calculates, for the sentence containing the syntax element, a context factor representing the influence of the context on the degree of support for the value;
calculates an importance (V) of the sentence with respect to the value from the semantic similarity (Sim) and the context factor; and
calculates the importance that the speaking subject places on the value by accumulating the importance (V) of each sentence with respect to the value.
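
The accumulation described here can be sketched in Python. Following the abstract, each occurrence contributes the product of its semantic similarity (Sim) and its context factor, and the products are summed over the speaking subject's past statements. The data layout (a list of sentences, each a list of `(sim, context_factor)` pairs) and the function names are assumptions for illustration.

```python
def sentence_importance(elements):
    """Importance V of one sentence: sum of Sim * context factor per syntax element."""
    return sum(sim * context_factor for sim, context_factor in elements)


def speaker_importance(sentences):
    """Importance of a value for one speaking subject, accumulated over sentences."""
    return sum(sentence_importance(elements) for elements in sentences)


past_statements = [
    [(1.0, 0.5), (0.0, 1.0)],  # one strong match in a mildly supportive context
    [(0.5, -1.0)],             # a weaker match in an opposing context
]
print(speaker_importance(past_statements))
```

A negative context factor lets statements that argue against a value reduce the accumulated importance rather than increase it.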

The opinion collecting device according to claim 6, wherein
the arithmetic unit
obtains the semantic similarity (Sim) by checking whether the main part of the topmost syntax element of the sentence structure of the value matches the main part of each syntax element, or whether the two correspond to a similar phrase pair registered in predetermined semantically similar phrase pair data that indicates the semantic similarity between phrases.
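
A minimal Python sketch of this lookup follows. It assumes that an exact match of the two main parts yields a similarity of 1.0, that registered phrase pairs carry their own similarity score, and that unregistered pairs yield 0.0; these numeric conventions and the dictionary layout are hypothetical.

```python
def semantic_similarity(value_head, element_head, similar_pairs):
    """Return Sim between the value's main part and a syntax element's main part.

    similar_pairs maps an unordered pair of phrases (a frozenset) to a score.
    """
    if value_head == element_head:
        return 1.0  # assumed score for an exact match
    return similar_pairs.get(frozenset((value_head, element_head)), 0.0)


pairs = {frozenset(("safety", "security")): 0.8}
print(semantic_similarity("safety", "security", pairs))
```

Keying the table by `frozenset` makes the lookup independent of the order in which the two phrases are given.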

The opinion collecting device according to claim 6, wherein
the storage unit includes:
main part factor data that defines, for the main part of a syntax element, a formula for calculating its own context factor from the context factor of the lower-level syntax element;
auxiliary part factor data that quantifies the influence on the context factor of the auxiliary part corresponding to the sentence-final expression of a syntax element; and
modifier part factor data that quantifies the influence of the modifier part of a syntax element on the context factor, and

the arithmetic unit,
in the process of calculating the context factor:
assigns a main part factor, an auxiliary part factor, and a modifier part factor to each syntax element based on these data;
calculates the main part factor of each syntax element, from the designated syntax element up through the higher-level syntax elements, according to the formula given by the main part factor; and
calculates the context factor by multiplying the calculated main part factor value by the auxiliary part factor and the modifier part factor.
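
One possible reading of this bottom-up calculation is sketched below in Python. Each syntax element carries a main part formula (a function of the lower element's context factor) plus numeric auxiliary and modifier factors. Applying the multiplication at every level, starting from an initial value of 1.0, is an assumption; the claim does not fix these details, and the class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SyntaxElement:
    # Main part formula: maps the lower element's context factor to this element's main value.
    main: Callable[[float], float]
    auxiliary: float = 1.0  # e.g. a sentence-final negation could be -1.0 (assumed)
    modifier: float = 1.0   # e.g. an intensifier could be 1.5 (assumed)


def context_factor(path):
    """Compute the context factor along a path from the designated element to the top."""
    factor = 1.0  # assumed neutral starting value
    for element in path:
        factor = element.main(factor) * element.auxiliary * element.modifier
    return factor


path = [
    SyntaxElement(main=lambda x: x),                                 # designated element
    SyntaxElement(main=lambda x: -x, auxiliary=-1.0, modifier=1.5),  # negated opposing verb, intensified
]
print(context_factor(path))
```

In this example an opposing main part and a negating auxiliary part cancel each other, so the intensified result is positive.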

The opinion collecting device according to claim 6, wherein
the arithmetic unit:
obtains the syntactic structure of the value;
when the main part of the topmost syntax element of that structure corresponds to a predetermined promotion/suppression phrase relating to the promotion or suppression of an event, takes the syntax element that is the target of the promotion or suppression as the topmost syntax element and, in the case of suppression, sets a value inversion flag; and
calculates a value support (s) for each syntax element from the semantic similarity (Sim) and the context factor, and adds the value support (s) to the importance (V), or subtracts it when the value inversion flag is set.
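
The inversion handling can be sketched in Python as follows. The word lists, function names, and the use of the product Sim × context factor as the value support (s) are assumptions for illustration; the add-or-subtract behavior follows the claim.

```python
def normalize_value(head, target, promote_words, suppress_words):
    """Return (effective top element, inversion flag) for a value expression.

    head   : main part of the value's topmost syntax element
    target : the element that is promoted or suppressed by `head`
    """
    if head in suppress_words:
        return target, True
    if head in promote_words:
        return target, False
    return head, False


def accumulate(supports, inverted):
    """Add each value support s = Sim * context factor to V, or subtract if inverted."""
    total = 0.0
    for sim, context_factor in supports:
        s = sim * context_factor
        total += -s if inverted else s
    return total


top, inverted = normalize_value("reduce", "accidents", {"promote"}, {"reduce"})
print(top, inverted, accumulate([(1.0, 0.5)], inverted))
```

For a value phrased as a suppression (here, reducing accidents), a statement that supports the suppressed event counts against the value, which is why the sign is flipped.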

The opinion collecting device according to claim 1, wherein
the display unit,
referring to the importance data by speaking subject and value, displays each speaking subject at coordinates whose horizontal-axis value is that subject's importance for a first value and whose vertical-axis value is that subject's importance for a second value.
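
The two-axis mapping can be sketched in Python by converting the importance table into plot coordinates. The nested-dictionary layout, the default of 0.0 for a missing score, and the sample names are assumptions; rendering the points is left to whatever display layer is used.

```python
def map_speakers(importance, x_value, y_value):
    """Return {speaker: (x, y)} using two chosen values as the axes.

    importance: {speaker: {value_name: importance_score}}
    """
    return {
        speaker: (scores.get(x_value, 0.0), scores.get(y_value, 0.0))
        for speaker, scores in importance.items()
    }


importance = {
    "Speaker A": {"economy": 2.5, "safety": -1.0},
    "Speaker B": {"economy": 0.5, "safety": 3.0},
}
print(map_speakers(importance, "economy", "safety"))
```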

The opinion collecting device according to claim 1, wherein
the display unit,
referring to the opinion data, displays each document content in association with the display position of the speaking subject of that document content.

The opinion collecting device according to claim 1, wherein
the statement data further includes a time-period division, and
the arithmetic unit
creates importance data by speaking subject and value for each time-period division, stores it in the storage unit, and displays on the display unit or outputs to the output unit how the importance that the speaking subject places on the values changes.
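
Tracking how importance changes across divisions can be sketched in Python. The sketch assumes that each division is a period label such as a decade and that each record already carries a per-sentence importance; the record layout and function names are hypothetical.

```python
from collections import defaultdict


def importance_by_period(records):
    """Accumulate importance per (speaker, period, value).

    records: iterable of (speaker, period, value, sentence_importance)
    """
    table = defaultdict(float)
    for speaker, period, value, v in records:
        table[(speaker, period, value)] += v
    return dict(table)


def trend(table, speaker, value):
    """Return [(period, importance)] for one speaker and value, ordered by period label."""
    return sorted(
        (period, v)
        for (s, period, val), v in table.items()
        if s == speaker and val == value
    )


records = [
    ("A", "1990s", "safety", 0.5),
    ("A", "1990s", "safety", 0.25),
    ("A", "2000s", "safety", -0.5),
    ("B", "1990s", "safety", 1.0),
]
print(trend(importance_by_period(records), "A", "safety"))
```

Sorting by the label works here because decade labels such as "1990s" and "2000s" order correctly as strings; other labeling schemes would need an explicit ordering.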

An opinion collection system comprising:
a terminal; and
an opinion collection device connected to the terminal via a communication network,

wherein the opinion collection device includes:
a storage unit that holds in advance a plurality of document data items including document content, and that holds a plurality of opinion data items each including document content and a speaking subject, and importance data by speaking subject and value; and
an arithmetic unit,

and wherein the arithmetic unit:
receives, as input from the terminal, a topic that defines which documents are to be collected and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content and stores, in the storage unit, a plurality of opinion data items each including the document content and the speaking subject;
calculates, for each speaking subject included in the opinion data, an importance for each of the values;
creates the importance data by speaking subject and value from the calculated importances and stores it in the storage unit; and
displays the importance data by speaking subject and value on a display unit or outputs it to an output unit.

The opinion collection system according to claim 13, wherein
the opinion collection device transmits the opinion data and the importance data by speaking subject and value to the terminal, and
the terminal includes:
an input unit for inputting the topic and the one or more values;
a terminal storage unit that stores the opinion data and the importance data by speaking subject and value received from the opinion collection device;
a display unit that displays, or an output unit that outputs, the opinion data and the importance data by speaking subject and value; and
a terminal arithmetic unit that performs communication, storage, and display processing.

An opinion collection method in an opinion collection device,
the opinion collection device including:
a storage unit that holds in advance a plurality of document data items including document content, and that holds a plurality of opinion data items each including document content and a speaking subject, and importance data by speaking subject and value; and
an arithmetic unit,

wherein, in the method, the arithmetic unit:
receives, as input from a terminal, a topic that defines which documents are to be collected and one or more values considered to influence the judgment of the pros and cons of the topic;
searches the document data for document content related to the received topic;
obtains the set of speaking subjects of the retrieved document content and stores, in the storage unit, a plurality of opinion data items each including the document content and the speaking subject;
calculates, for each speaking subject included in the opinion data, an importance for each of the values;
creates the importance data by speaking subject and value from the calculated importances and stores it in the storage unit; and
displays the importance data by speaking subject and value on a display unit or outputs it to an output unit.