JP6452025B2

JP6452025B2 - Display system, display method, and display program

Info

Publication number: JP6452025B2
Application number: JP2014139365A
Authority: JP
Inventors: 洋介本橋; 健児青木; 伊藤　貴之; 貴之伊藤; はるか末松; 佐也香八木
Original assignee: OCHANOMIZU UNIVERSITY; NEC Corp
Current assignee: OCHANOMIZU UNIVERSITY; NEC Corp
Priority date: 2014-07-07
Filing date: 2014-07-07
Publication date: 2019-01-16
Anticipated expiration: 2034-07-07
Also published as: JP2016018301A

Description

本発明は、表示システム、表示方法および表示プログラムに関する。 The present invention relates to a display system, a display method, and a display program.

データの表示方法として、３ＤＰａｒａｌｌｅｌＣｏｏｒｄｉｎａｔｅｓやヒートマップでデータを表示する方法がある。 As a data display method, there is a method of displaying data by 3D Parallel Coordinates or a heat map.

一般に、ある属性をＸ軸としたヒートマップでは、Ｙ軸は１つの属性のみを表す。身長をＸ軸とした場合、Ｙ軸には、例えば、体重という１つの属性のみが表される。 In general, in a heat map with an attribute as the X axis, the Y axis represents only one attribute. When the height is the X axis, for example, only one attribute of weight is represented on the Y axis.

なお、属性には、連続型変数の属性と、カテゴリ型変数の属性とがある。連続型変数の属性値は、数値である。また、カテゴリ型変数の属性値は、項目である。上記の例における「体重」は、連続型変数の属性である。カテゴリ型変数の属性の例として、例えば、「性別」が挙げられ、その属性値は「男性」または「女性」という項目である。ヒートマップのＹ軸で表される属性は「性別」等のカテゴリ型変数の属性であってもよい。その場合、Ｙ軸には、「男性」および「女性」という各項目が表される。例えば、身長をＸ軸とした場合、身長を表すＸ軸を区切った区分毎に、「男性」という項目、「女性」という項目のカウント値がヒートマップとして表現される。 The attributes include a continuous variable attribute and a categorical variable attribute. The attribute value of the continuous variable is a numerical value. The attribute value of the categorical variable is an item. “Weight” in the above example is an attribute of a continuous variable. As an example of the attribute of the categorical variable, for example, “sex” can be cited, and the attribute value is an item of “male” or “female”. The attribute represented by the Y axis of the heat map may be an attribute of a categorical variable such as “gender”. In this case, the items “male” and “female” are represented on the Y axis. For example, when the height is the X axis, the count value of the item “male” and the item “female” is expressed as a heat map for each section obtained by dividing the X axis representing the height.

図１５は、Ｙ軸が連続型変数の属性を表すヒートマップの例を示す模式図である。図１５に示す例では、「時間」をＸ軸とし、「客数」をＹ軸としている。「客数」は、連続型変数の属性である。 FIG. 15 is a schematic diagram illustrating an example of a heat map in which the Y axis represents the attribute of a continuous variable. In the example illustrated in FIG. 15, “time” is the X axis, and “number of customers” is the Y axis. “Number of customers” is an attribute of a continuous variable.

図１６は、Ｙ軸がカテゴリ型変数の属性を表すヒートマップの例を示す模式図である。図１６に示す例では、「時間」をＸ軸とし、「店舗コード」をＹ軸としている。「店舗コード」は、カテゴリ型変数の属性であり、その属性値は「店舗Ａ」、「店舗Ｂ」、・・・等の項目である。図１６に示す例では、予め、例えば、時間帯と、「店舗Ａ」等の項目とを対応付けたデータの集合が与えられ、個々の時間帯に対応する「店舗Ａ」のカウント値、「店舗Ｂ」のカウント値等がヒートマップとして表現される。図１６に示す例においても、Ｙ軸は、「店舗コード」という１つの属性のみを表している。 FIG. 16 is a schematic diagram illustrating an example of a heat map in which the Y-axis represents an attribute of a categorical variable. In the example shown in FIG. 16, “time” is the X axis, and “store code” is the Y axis. “Store code” is an attribute of a categorical variable, and the attribute value is an item such as “Store A”, “Store B”,. In the example shown in FIG. 16, for example, a set of data in which a time zone is associated with an item such as “Store A” is given in advance, and the count value of “Store A” corresponding to each time zone, “ The count value of “Store B” is expressed as a heat map. Also in the example illustrated in FIG. 16, the Y axis represents only one attribute “store code”.

また、特許文献１には、画素データを、Ｋ−ｍｅａｎｓ法に基づくクラスタリング処理によって、複数のクラスタに分類することが記載されている。 Patent Document 1 describes that pixel data is classified into a plurality of clusters by clustering processing based on the K-means method.

また、特許文献２には、外れ値検出可視化ツールとしてヒートマップを用いることが記載されている。 Patent Document 2 describes using a heat map as an outlier detection visualization tool.

特開２００９−２１２６０５号公報（段落００６６参照）JP 2009-212605 A (see paragraph 0066) 特開２００９−７６０５６号公報（段落００１６参照）JP 2009-76056 A (see paragraph 0016)

前述のように、一般に、ある属性をＸ軸としたヒートマップでは、Ｙ軸は１つの属性のみを表す。従って、ヒートマップの観察者は、１つの属性の変化の状況しか把握できない。そのため、観察者は、複数の属性において、変化のパターンが類似する属性を容易に把握することができなかった。 As described above, generally, in a heat map in which a certain attribute is an X axis, the Y axis represents only one attribute. Accordingly, the observer of the heat map can grasp only the change state of one attribute. For this reason, the observer cannot easily grasp an attribute having a similar change pattern among a plurality of attributes.

そこで、本発明は、複数の属性において、変化のパターンが類似する属性を観察者が容易に把握できるようにする表示システム、表示方法および表示プログラムを提供することを一つの目的とする。 Therefore, an object of the present invention is to provide a display system, a display method, and a display program that enable an observer to easily grasp attributes having a similar change pattern among a plurality of attributes.

本発明による表示システムは、互いに対応付けられた複数の属性の属性値の組の集合が与えられ、複数の属性のうち１つの属性が指定されたときに、指定されていない属性の属性値を、指定された属性の属性値の区分に従って集計した集計データを、指定されていない属性毎に生成する集計データ生成手段と、集計データをクラスタリングするクラスタリング手段と、指定された属性をＸ軸として、クラスタリングの結果得られたクラスタ毎に、集計データをヒートマップとして表示する表示手段と、集計データの選択を受け付ける選択受付手段と、選択された集計データに基づいて、表示対象とする集計データを絞り込む絞り込み手段とを備え、絞り込み手段が、選択された集計データとして表されるベクトルと、他の集計データとして表される各ベクトルとの距離に基づいて、表示対象とする集計データを絞り込むことを特徴とする。 The display system according to the present invention is provided with a set of attribute value pairs of a plurality of attributes associated with each other, and when one attribute is specified among the plurality of attributes, the attribute values of attributes not specified are displayed. The aggregated data generating means for generating the aggregated data aggregated according to the attribute value category of the designated attribute for each attribute not designated, the clustering means for clustering the aggregated data, and the designated attribute as the X axis, For each cluster obtained as a result of clustering, display means for displaying the aggregated data as a heat map, selection accepting means for accepting selection of aggregated data, and narrowing down the aggregated data to be displayed based on the selected aggregated data A narrowing-down means, and the narrowing-down means displays a vector represented as the selected aggregated data and other aggregated data. Based on the distances between the vectors, characterized in that narrow down the aggregated data to be displayed.

また、本発明による表示方法は、互いに対応付けられた複数の属性の属性値の組の集合が与えられ、複数の属性のうち１つの属性が指定されたときに、指定されていない属性の属性値を、指定された属性の属性値の区分に従って集計した集計データを、指定されていない属性毎に生成し、集計データをクラスタリングし、指定された属性をＸ軸として、クラスタリングの結果得られたクラスタ毎に、集計データをヒートマップとして表示し、集計データの選択を受け付け、選択された集計データに基づいて、表示対象とする集計データを絞り込み、表示対象とする集計データを絞り込むときに、選択された集計データとして表されるベクトルと、他の集計データとして表される各ベクトルとの距離に基づいて、表示対象とする集計データを絞り込むことを特徴とする。 Further, the display method according to the present invention is provided with a set of attribute value pairs of a plurality of attributes that are associated with each other, and when one attribute among the plurality of attributes is specified, the attributes of the attributes that are not specified Aggregated data that is aggregated according to the attribute value category of the specified attribute is generated for each attribute that is not specified, the aggregated data is clustered, and the specified attribute is used as the X axis. Displays aggregate data as a heat map for each cluster , accepts selection of aggregate data, selects aggregate data to be displayed based on the selected aggregate data, and selects when narrowing the aggregate data to be displayed The aggregate data to be displayed is narrowed down based on the distance between the vector represented as the aggregated data and each vector represented as other aggregated data. Characterized in that the writing.

また、本発明による表示プログラムは、コンピュータに、互いに対応付けられた複数の属性の属性値の組の集合が与えられ、複数の属性のうち１つの属性が指定されたときに、指定されていない属性の属性値を、指定された属性の属性値の区分に従って集計した集計データを、指定されていない属性毎に生成する集計データ生成処理、集計データをクラスタリングするクラスタリング処理、指定された属性をＸ軸として、クラスタリングの結果得られたクラスタ毎に、集計データをヒートマップとして表示する表示処理、集計データの選択を受け付ける処理、および、選択された集計データに基づいて、表示対象とする集計データを絞り込む処理を実行させ、表示対象とする集計データを絞り込む処理で、選択された集計データとして表されるベクトルと、他の集計データとして表される各ベクトルとの距離に基づいて、表示対象とする集計データを絞り込ませることを特徴とする。 Further, the display program according to the present invention is not specified when a set of attribute values of a plurality of attributes associated with each other is given to the computer and one attribute is specified among the plurality of attributes. Aggregated data generation process that generates aggregated data for each attribute that is not specified, aggregated data that is aggregated according to the attribute value category of the specified attribute , clustering process that clusters the aggregated data , and specified attribute X For each cluster obtained as a result of clustering, display processing that displays aggregated data as a heat map, processing that accepts selection of aggregated data, and aggregated data to be displayed based on the selected aggregated data It is displayed as the selected aggregate data in the process of narrowing down the aggregate data to be displayed by executing the narrowing process. And vector, based on the distance between the vector represented as other aggregate data, and characterized in that narrowed aggregated data to be displayed.

本発明によれば、観察者が、複数の属性において、変化のパターンが類似する属性を容易に把握できる。 According to the present invention, an observer can easily grasp an attribute having a similar change pattern among a plurality of attributes.

本発明の第１の実施形態の表示システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the display system of the 1st Embodiment of this invention. 入力データの例を示す模式図である。It is a schematic diagram which shows the example of input data. 第１の実施形態の表示システムの処理経過の例を示すフローチャートである。It is a flowchart which shows the example of process progress of the display system of 1st Embodiment. 第１の実施形態における表示システムの表示画面の例を示す模式図である。It is a schematic diagram which shows the example of the display screen of the display system in 1st Embodiment. ステップＳ１で「日付」が指定された場合における他の各属性（図２に例示する「降水量」等の各属性）の集計データを示す模式図である。It is a schematic diagram which shows the total data of other each attribute (Each attribute such as “Precipitation” illustrated in FIG. 2) when “Date” is designated in Step S1. 図５に模式的に示す集計データに対してステップＳ３の処理を行った後の状態を示す模式図である。It is a schematic diagram which shows the state after performing the process of step S3 with respect to the total data typically shown in FIG. 図６に模式的に示す集計データに対してステップＳ４のクラスタリング処理を実行した結果を示す模式図である。It is a schematic diagram which shows the result of having performed the clustering process of step S4 with respect to the total data typically shown in FIG. ステップＳ５で表示されるヒートマップの例を示す模式図である。It is a schematic diagram which shows the example of the heat map displayed by step S5. クラスタ毎に表示された集計データのヒートマップの例を示す図である。It is a figure which shows the example of the heat map of the total data displayed for every cluster. 本発明の第２の実施形態の表示システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the display system of the 2nd Embodiment of this invention. 第２の実施形態における表示システムの表示画面の例を示す模式図である。It is a schematic diagram which shows the example of the display screen of the display system in 2nd Embodiment. 図８に例示するヒートマップの表示後に表示対象とする集計データを絞り込んだ場合のヒートマップの表示例を示す模式図である。It is a schematic diagram which shows the example of a heat map display at the time of narrowing down the total data made into display object after displaying the heat map illustrated in FIG. 本発明の各実施形態に係るコンピュータの構成例を示す概略ブロック図である。It is a schematic block diagram which shows the structural example of the computer which concerns on each embodiment of this invention. 本発明の表示システムの最小構成の例を示すブロック図である。It is a block diagram which shows the example of the minimum structure of the display system of this invention. Ｙ軸が連続型変数の属性を表すヒートマップの例を示す模式図である。It is a schematic diagram which shows the example of the heat map in which the Y-axis represents the attribute of a continuous type variable. Ｙ軸がカテゴリ型変数の属性を表すヒートマップの例を示す模式図である。It is a schematic diagram which shows the example of the heat map in which a Y-axis represents the attribute of a categorical variable.

以下、図面を参照して本発明の実施形態を説明する。以下、本発明の表示システムを操作する者を操作者と記す。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. Hereinafter, a person who operates the display system of the present invention is referred to as an operator.

実施形態１．
図１は、本発明の第１の実施形態の表示システムの構成例を示すブロック図である。本実施形態の表示システム１は、指定受付手段２と、集計データ生成手段３と、分解手段４と、クラスタリング手段５と、表示手段６とを備える。 Embodiment 1. FIG.
FIG. 1 is a block diagram showing a configuration example of a display system according to the first embodiment of the present invention. The display system 1 of the present embodiment includes a designation receiving unit 2, a total data generation unit 3, a decomposition unit 4, a clustering unit 5, and a display unit 6.

表示システム１には、互いに対応付けられた複数の属性の属性値の組の集合が入力データとして入力される。図２は、入力データの例を示す模式図である。入力データは、図２に示すように、テーブルとして表現することができる。図２に示す例では、属性として、「日付」、「降水量」、「気温」、「販売促進キャンペーン」、「客数」、「風速」を例示しているが、属性の例は、図２に示す例に限定されない。図２に示す例では、「日付」として年月日を示しているが、「年」、「月」、「日」がそれぞれ独立した属性であってもよい。図２に示す各行において、「日付」、「降水量」、「気温」、「販売促進キャンペーン」、「客数」、「風速」の属性値が互いに対応付けられている。すなわち、図２に示す個々の行は、属性値の組を表している。図２に示す行番号は、属性値の組のＩＤである。また、「日付」等の各属性は、テーブルの列に対応する。図２に例示する属性のうち、「販売促進キャンペーン」は、カテゴリ型変数の属性であり、他の属性は、連続型変数の属性である。以下の説明では、「祭り」、「テレビコマーシャル」、「なし」、「おまけ配布」、「クーポン配布」、「雑誌広告」、「新聞広告」の各項目が、「販売促進キャンペーン」の属性値であるものとして説明する。なお、入力データには、連続型変数の属性のみ、あるいは、カテゴリ型変数の属性のみがふくまれていてもよい。 A set of attribute value pairs of a plurality of attributes associated with each other is input to the display system 1 as input data. FIG. 2 is a schematic diagram illustrating an example of input data. The input data can be expressed as a table as shown in FIG. In the example shown in FIG. 2, “date”, “precipitation”, “temperature”, “sales promotion campaign”, “number of customers”, and “wind speed” are illustrated as attributes. It is not limited to the example shown. In the example shown in FIG. 2, the date is shown as “date”, but “year”, “month”, and “day” may be independent attributes. In each row shown in FIG. 2, the attribute values of “date”, “precipitation”, “temperature”, “sales promotion campaign”, “number of customers”, and “wind speed” are associated with each other. That is, each row shown in FIG. 2 represents a set of attribute values. The row number shown in FIG. 2 is an ID of a set of attribute values. Each attribute such as “date” corresponds to a column of the table. Among the attributes exemplified in FIG. 2, “sales promotion campaign” is an attribute of a categorical variable, and the other attributes are attributes of a continuous variable. In the following explanation, the items “Festival”, “TV Commercial”, “None”, “Bonus Distribution”, “Coupon Distribution”, “Magazine Advertising”, and “Newspaper Advertising” are attribute values of “Sales Promotion Campaign”. It is assumed that Note that the input data may include only the attributes of continuous variables or only the attributes of categorical variables.

指定受付手段２は、入力データに含まれる複数の属性のうち１つの属性の指定を、入力デバイス（図１において図示略）を介して操作者から受け付ける。操作者によって指定された属性を、指定属性と記す。指定受付手段２は、指定属性を、ヒートマップのＸ軸に対応する属性として決定する。 The designation accepting unit 2 accepts designation of one attribute among a plurality of attributes included in the input data from an operator via an input device (not shown in FIG. 1). The attribute specified by the operator is referred to as a specified attribute. The designation receiving means 2 determines the designated attribute as an attribute corresponding to the X axis of the heat map.

集計データ生成手段３は、指定受付手段２が操作者から属性の指定を受け付けると、指定属性以外の属性の属性値を指定属性の属性値の区分に従って集計した集計データを生成する。集計データ生成手段３は、集計データを指定属性以外の属性毎に生成する。指定属性として指定されたときの区分は、属性毎に予め定められている。連続型変数の属性の場合、取り得る属性値を所定間隔で区切ることによって個々の区分を定めておけばよい。例えば、「日付」の場合、「２０１４年６月１日〜２０１４年６月３０日」、「２０１４年７月１日〜２０１４年７月３１日」等ように１カ月毎に属性値を区切ることによって、区間を予め定めておけばよい。また、例えば、「気温」の場合「０℃〜５℃」、「５℃〜１０℃」等のように、一定間隔で属性値を区切ることによって区間を予め定めておけばよい。カテゴリ型変数の場合には、個々の属性値を１つの区分として定めておけばよい。 When the designation receiving unit 2 receives the designation of the attribute from the operator, the total data generating unit 3 generates total data obtained by totaling attribute values of attributes other than the designated attribute according to the attribute value classification of the designated attribute. The total data generation unit 3 generates total data for each attribute other than the designated attribute. The classification when designated as a designated attribute is predetermined for each attribute. In the case of attributes of continuous variables, the individual divisions may be determined by dividing possible attribute values at predetermined intervals. For example, in the case of “date”, attribute values are delimited every month, such as “June 1, 2014 to June 30, 2014”, “July 1, 2014 to July 31, 2014”, and the like. Thus, the section may be determined in advance. Further, for example, in the case of “temperature”, a section may be determined in advance by dividing attribute values at regular intervals, such as “0 ° C. to 5 ° C.” and “5 ° C. to 10 ° C.”. In the case of a categorical variable, each attribute value may be determined as one category.

集計データ生成手段３は、連続型変数の属性に関して集計データを生成する場合、指定属性の属性値の区分毎に、区分に対応する属性値の統計値を集計値として算出することによって、集計データを生成する。連続型変数の属性に関しては、区分に対応する属性値の統計値を算出することが、区分に従った属性値の集計に該当する。以下、この統計値が平均値である場合を例にして説明する。 When the aggregate data generation means 3 generates aggregate data regarding the attribute of the continuous variable, the aggregate data is calculated by calculating the statistical value of the attribute value corresponding to the category as the aggregate value for each attribute value category of the designated attribute. Is generated. Regarding the attributes of continuous variables, calculating the statistical value of the attribute value corresponding to the category corresponds to the aggregation of attribute values according to the category. Hereinafter, the case where this statistical value is an average value will be described as an example.

例えば、指定属性が「日付」であるとする。集計データ生成手段３は、連続型変数である降水量の集計データを生成する場合、図２に例示する入力データを参照し、「２０１４年６月１日〜２０１４年６月３０日」という区分に対応する降水量の平均値を集計値として算出し、「２０１４年７月１日〜２０１４年７月３１日」等の他の区分に関しても同様に、区分毎に、降水量の平均値を算出する。集計データ生成手段３は、区分毎に得られた降水量の平均値を要素とするベクトルを、降水量の集計データとする。 For example, assume that the designated attribute is “date”. When the aggregated data generation means 3 generates aggregated data of precipitation, which is a continuous variable, the input data illustrated in FIG. 2 is referred to, and the classification “June 1, 2014 to June 30, 2014” For the other categories such as “July 1, 2014 to July 31, 2014”, the average value of precipitation corresponding to is calculated for each category. calculate. The total data generation means 3 uses a vector whose element is the average value of precipitation obtained for each category as total data of precipitation.

また、例えば、指定属性が「気温」であるとする。集計データ生成手段３は、連続型変数である降水量の集計データを生成する場合、図２に例示する入力データを参照し、「０℃〜５℃」という区分に対応する降水量の平均値を集計値として算出し、「５℃〜１０℃」等の他の区分に関しても同様に、区分毎に、降水量の平均値を算出する。集計データ生成手段３は、区分毎に得られた降水量の平均値を要素とするベクトルを、降水量の集計データとする。 Also, for example, assume that the designated attribute is “temperature”. The aggregate data generation means 3 refers to the input data illustrated in FIG. 2 when generating aggregate data of precipitation that is a continuous variable, and calculates the average value of precipitation corresponding to the category of “0 ° C. to 5 ° C.” Is calculated as an aggregate value, and similarly for other categories such as “5 ° C. to 10 ° C.”, an average value of precipitation is calculated for each category. The total data generation means 3 uses a vector whose element is the average value of precipitation obtained for each category as total data of precipitation.

上記の例から分かるように、連続型変数の属性の集計データは、区分数の次元のベクトルとして表される。集計データ生成手段３は、平均値を区分の順に並べたベクトルを、集計データとする。 As can be seen from the above example, the aggregated data of the attributes of continuous variables is represented as a vector of the number of divisions. The total data generation means 3 uses a vector in which the average values are arranged in the order of classification as the total data.

上記の２つの例では、降水量の集計データを生成する場合を例にして説明したが、集計データ生成手段３は、降水量以外の他の連続型変数の属性に関しても、同様に集計データを生成する。 In the above two examples, the case of generating the precipitation total data has been described as an example. However, the total data generation unit 3 similarly calculates the total data for the attributes of the continuous variables other than the precipitation. Generate.

集計データ生成手段３は、カテゴリ型変数の属性に関して集計データを生成する場合、指定属性の属性値の区分毎に、区分に対応するカテゴリ型変数の属性値である項目の数をカウントし、そのカウント値を集計値として算出することによって、集計データを生成する。カテゴリ型変数の属性に関しては、区分に対応する項目の数を項目別にカウントすることが、区分に従った属性値の集計に該当する。 The aggregate data generation means 3 counts the number of items that are attribute values of the categorical variable corresponding to the category for each category of the attribute value of the designated attribute when generating aggregate data regarding the attribute of the categorical variable, By calculating the count value as the total value, the total data is generated. Regarding the attributes of categorical variables, counting the number of items corresponding to a category corresponds to the aggregation of attribute values according to the category.

例えば、指定属性が「日付」であるとする。集計データ生成手段３は、カテゴリ型変数である「販売促進キャンペーン」の集計データを生成する場合、図２に例示する入力データを参照し、「２０１４年６月１日〜２０１４年６月３０日」という区分に対応する「祭り」という項目（属性値）の数、「テレビコマーシャル」という項目の数等を、項目毎にそれぞれカウントする。集計データ生成手段３は、「２０１４年７月１日〜２０１４年７月３１日」等の他の区分に関しても同様に、区分毎に、区分に対応する各項目の数をカウントする。この結果、１つの項目について、区分数と同数のカウント値が得られ、そのカウント値を区分の順に並べたベクトルが得られる。このベクトルの次元数は、区分数と同数である。集計データ生成手段３は、項目毎のベクトルの集合を、「販売促進キャンペーン」の集計データとする。 For example, assume that the designated attribute is “date”. When generating the total data of the “sales promotion campaign” that is a categorical variable, the total data generation unit 3 refers to the input data illustrated in FIG. 2 and reads “June 1, 2014 to June 30, 2014”. The number of items (attribute values) “festival” corresponding to the category “,” the number of items “TV commercial”, and the like are counted for each item. Similarly, other data such as “July 1, 2014 to July 31, 2014” is counted by the total data generation unit 3 for each category and counts the number of items corresponding to the category. As a result, the same number of count values as the number of sections is obtained for one item, and a vector in which the count values are arranged in the order of the sections is obtained. The number of dimensions of this vector is the same as the number of sections. The total data generation means 3 uses a set of vectors for each item as total data for the “sales promotion campaign”.

また、例えば、指定属性が「気温」であるとする。集計データ生成手段３は、カテゴリ型変数である「販売促進キャンペーン」の集計データを生成する場合、図２に例示する入力データを参照し、「０℃〜５℃」という区分に対応する「祭り」という項目（属性値）の数、「テレビコマーシャル」という項目の数等を、項目毎にそれぞれカウントする。集計データ生成手段３は、「５℃〜１０℃」等の他の区分に関しても同様に、区分毎に、区分に対応する各項目の数をカウントする。上記の例と同様に、１つの項目について、区分数と同数のカウント値が得られ、そのカウント値を区分の順に並べたベクトルが得られる。このベクトルの次元数は、区分数と同数である。集計データ生成手段３は、項目毎のベクトルの集合を、「販売促進キャンペーン」の集計データとする。 Also, for example, assume that the designated attribute is “temperature”. When generating the total data of the “sales promotion campaign” which is a categorical variable, the total data generation means 3 refers to the input data illustrated in FIG. 2 and corresponds to the “Festival” corresponding to the category “0 ° C. to 5 ° C.”. The number of items (attribute values) ", the number of items" TV commercial ", etc. are counted for each item. Similarly, the total data generation unit 3 counts the number of items corresponding to each category for each category such as “5 ° C. to 10 ° C.”. Similar to the above example, the count value equal to the number of sections is obtained for one item, and a vector in which the count values are arranged in the order of the sections is obtained. The number of dimensions of this vector is the same as the number of sections. The total data generation means 3 uses a set of vectors for each item as total data for the “sales promotion campaign”.

上記の２つの例では、「販売促進キャンペーン」の集計データを生成する場合を例にして説明したが、集計データ生成手段３は、他のカテゴリ型変数の属性に関しても、同様に集計データを生成する。 In the above two examples, the case where the aggregated data of “sales promotion campaign” is generated has been described as an example. However, the aggregated data generation unit 3 similarly generates aggregated data for the attributes of other categorical variables. To do.

分解手段４は、属性毎に生成された集計データを、連続型変数の集計データと、カテゴリ型変数の集計データとに分類する。分解手段４は、１つのベクトルとして表されている集計データを、連続型変数の集計データとして分類し、複数のベクトルの集合として表されている集計データを、カテゴリ型変数の集計データとして分類すればよい。分解手段４は、分類した集計データに対して、連続型変数の属性の集計データであるのか、カテゴリ型変数の属性の集計データであるのかを示すラベル情報を付加してもよい。以下、分解手段４が、分類した集計データにこのようなラベル情報を付加するものとして説明する。 The decomposing means 4 classifies the aggregate data generated for each attribute into continuous variable aggregate data and categorical variable aggregate data. The disassembling means 4 classifies the aggregated data represented as one vector as aggregated data of continuous variables, and classifies the aggregated data represented as a set of a plurality of vectors as aggregated data of categorical variables. That's fine. The disassembling unit 4 may add label information indicating whether the data is the aggregated data of the attribute of the continuous variable or the aggregated data of the attribute of the categorical variable to the classified aggregated data. Hereinafter, description will be made assuming that the decomposing means 4 adds such label information to the classified total data.

さらに、分解手段４は、カテゴリ型変数の属性の集計データを項目毎に分解する。前述のように、カテゴリ型変数の属性の集計データは、項目毎のベクトルの集合である。分解手段４は、この集合を、１つ１つのベクトルに分解すればよい。分解手段４は、この分解後の集計データ（項目毎の個々のベクトル）にも、カテゴリ型変数の属性のデータであることを示すラベル情報を引き継がせる。 Further, the decomposing means 4 decomposes the aggregate data of the attribute of the categorical variable for each item. As described above, the aggregate data of the attributes of the categorical variable is a set of vectors for each item. The decomposition means 4 may decompose this set into individual vectors. The disassembling means 4 can also inherit the label information indicating that the data is attribute data of the categorical variable in the aggregated data (individual vectors for each item) after the disassembly.

連続型変数の属性の集計データとして得られるベクトルの次元と、カテゴリ型変数の属性の集計データを項目毎に分解した結果得られるベクトルの次元は、共通である。 The dimension of the vector obtained as the aggregated data of the attributes of the continuous variable and the dimension of the vector obtained as a result of decomposing the aggregated data of the attributes of the categorical variable for each item are common.

クラスタリング手段５は、連続型変数の属性の集計データとして得られる各ベクトル、および、カテゴリ型変数の属性の集計データを項目毎に分解した結果得られるベクトルをそれぞれ正規化する。正規化の方法は、特に限定されない。例えば、クラスタリング手段５は、ベクトル毎に、ベクトル内の要素が０〜１の範囲の値になるように、ベクトル内の要素を変換してもよい。また、例えば、クラスタリング手段５は、ベクトル毎に、ベクトル内の個々の要素からそれらの要素の平均値を減算し、各減算結果を要素の分散で除算してもよい。 The clustering means 5 normalizes each vector obtained as aggregate data of continuous variable attributes and a vector obtained as a result of decomposing the aggregate data of categorical variable attributes for each item. The normalization method is not particularly limited. For example, the clustering unit 5 may convert the elements in the vector so that the elements in the vector are in the range of 0 to 1 for each vector. For example, for each vector, the clustering unit 5 may subtract the average value of the elements from the individual elements in the vector and divide each subtraction result by the variance of the elements.

そして、クラスタリング手段５は、連続型変数の属性の集計データを正規化したベクトルと、カテゴリ型変数の属性の集計データを項目毎に分解し、さらに正規化して得られたベクトルとを対象にして、クラスタリングを行う。クラスタリングの結果得られるクラスタには、連続型変数の属性のラベル情報が付加されたベクトルと、カテゴリ型変数の属性のラベル情報が付加されたベクトルとが混在していてよい。 Then, the clustering unit 5 targets the vector obtained by normalizing the aggregated data of the attributes of the continuous variable and the vector obtained by decomposing the aggregated data of the attributes of the categorical variable for each item and further normalizing. Perform clustering. A cluster obtained as a result of clustering may include a vector to which label information of attribute of continuous variable is added and a vector to which label information of attribute of categorical variable is added.

クラスタリング手段５は、例えば、Ｋ−ｍｅａｎｓ法によって、各ベクトルをクラスタリングすればよい。なお、クラスタリングの方法はＫ−ｍｅａｎｓ法に限定されず、クラスタリング手段５は、他の方法で各ベクトルをクラスタリングしてもよい。 The clustering means 5 may cluster each vector by, for example, the K-means method. Note that the clustering method is not limited to the K-means method, and the clustering means 5 may cluster each vector by another method.

表示手段６は、クラスタリングの結果得られたクラスタ毎に、集計データ（分解や正規化が行われたベクトル）をヒートマップとして表示する。このとき、表示手段６は、指定受付手段２に対して指定された属性をＸ軸とし、他の属性や項目（カテゴリ型変数の属性の項目）をＹ軸としてヒートマップを表示する。表示手段６は、ヒートマップとともに属性名あるいは項目名も表示する。データにラベル情報を付加している場合、表示手段６は、ラベル情報も表示する。 The display means 6 displays the aggregated data (vector subjected to decomposition and normalization) as a heat map for each cluster obtained as a result of clustering. At this time, the display means 6 displays the heat map with the attribute designated to the designation receiving means 2 as the X axis and other attributes and items (category variable attribute items) as the Y axis. The display means 6 displays the attribute name or item name together with the heat map. When label information is added to the data, the display means 6 also displays label information.

指定受付手段２、集計データ生成手段３、分解手段４、クラスタリング手段５、表示手段６は、例えば、入力デバイスおよびディスプレイ装置をするコンピュータのＣＰＵによって実現される。この場合、ＣＰＵは、例えば、コンピュータのプログラム記憶装置（図１において図示略）等のプログラム記録媒体から表示プログラムを読み込み、その表示プログラムに従って、指定受付手段２、集計データ生成手段３、分解手段４、クラスタリング手段５、表示手段６として動作すればよい。なお、表示手段６に関しては、ディスプレイ装置に表示を実行させる部分が、ＣＰＵによって実現される。また、各手段が、それぞれ別々のハードウェアで実現されていてもよい。 The designation receiving unit 2, the total data generating unit 3, the decomposing unit 4, the clustering unit 5, and the display unit 6 are realized by, for example, a CPU of a computer that performs an input device and a display device. In this case, for example, the CPU reads a display program from a program recording medium such as a program storage device (not shown in FIG. 1) of the computer, and in accordance with the display program, the designation receiving means 2, the total data generating means 3, and the decomposing means 4 The clustering means 5 and the display means 6 may be operated. In addition, regarding the display means 6, the part which makes a display apparatus perform a display is implement | achieved by CPU. Each means may be realized by separate hardware.

図３は、第１の実施形態の表示システム１の処理経過の例を示すフローチャートである。互いに対応付けられた複数の属性の属性値の組の集合（例えば、図２参照）が入力データとして、予め表示システムに入力されているものとする。 FIG. 3 is a flowchart illustrating an example of processing progress of the display system 1 according to the first embodiment. Assume that a set (for example, see FIG. 2) of a set of attribute values of a plurality of attributes associated with each other is input in advance to the display system as input data.

最初に、指定受付手段２は、入力データに含まれる複数の属性のうち１つの属性の指定を操作者から受け付ける（ステップＳ１）。図４は、第１の実施形態における表示システム１の表示画面の例を示す模式図である。表示画面は、ヒートマップ表示領域２１と、属性指定領域２２とを有する。図４では、まだ、ヒートマップが表示されていない状態を示している。属性指定領域２２は、ヒートマップのＸ軸に対応する属性の指定を行うための領域である。例えば、属性指定領域２２には、入力データに含まれる各属性がそれぞれラジオボタンとともに表示される。この場合、操作者が指定する属性のラジオボタンをクリック等したときに、指定受付手段２は、その属性が指定されたと判定し、その属性を、ヒートマップのＸ軸に対応する属性として決定する。なお、例えば、表示手段６が、図４に例示する表示画面をディスプレイ装置に表示させる。 First, the designation accepting unit 2 accepts designation of one attribute among a plurality of attributes included in the input data from the operator (step S1). FIG. 4 is a schematic diagram illustrating an example of a display screen of the display system 1 according to the first embodiment. The display screen has a heat map display area 21 and an attribute designation area 22. FIG. 4 shows a state where the heat map is not yet displayed. The attribute designation area 22 is an area for designating an attribute corresponding to the X axis of the heat map. For example, in the attribute designation area 22, each attribute included in the input data is displayed together with a radio button. In this case, when the operator clicks a radio button of an attribute designated, the designation receiving unit 2 determines that the attribute is designated, and decides that attribute as an attribute corresponding to the X axis of the heat map. . For example, the display unit 6 displays the display screen illustrated in FIG. 4 on the display device.

次に、集計データ生成手段３は、指定属性（ステップＳ１で指定された属性）以外の属性毎に、集計データを作成する（ステップＳ２）。 Next, the total data generation unit 3 creates total data for each attribute other than the specified attribute (the attribute specified in step S1) (step S2).

既に説明したように、連続型変数の属性に関して集計データを生成する場合、集計データ生成手段３は、指定属性の属性値の区分毎に、区分に対応する属性値の統計値を集計値として算出することによって、集計データとなるベクトルを生成する。 As already described, when generating the aggregated data regarding the attribute of the continuous variable, the aggregated data generating unit 3 calculates the statistical value of the attribute value corresponding to the category as the aggregated value for each category of the attribute value of the specified attribute. By doing so, a vector to be aggregated data is generated.

また、カテゴリ型変数の属性に関して集計データを生成する場合、集計データ生成手段３は、指定属性の属性値の区分毎に、区分に対応するカテゴリ型変数の項目（カテゴリ型変数の属性値）の数をカウントし、カウント値を区分順に並べたベクトルを項目毎に得る。集計データ生成手段３は、このベクトルの集合を、カテゴリ型変数の属性の集計データと定める。 Further, when generating the aggregated data regarding the attribute of the categorical variable, the aggregated data generating unit 3 stores the category type variable item (attribute value of the categorical variable) corresponding to the category for each attribute value category of the specified attribute. The number is counted, and a vector in which the count values are arranged in the order of division is obtained for each item. The total data generation means 3 determines this set of vectors as total data of the attribute of the categorical variable.

図５は、ステップＳ１で「日付」が指定された場合における他の各属性（図２に例示する「降水量」等の各属性）の集計データを示す模式図である。集計データは、ベクトル、またはベクトルの集合として得られるが、図５では、ヒートマップの表示態様で集計データを模式的に示している。図５に示す個々の段が、１つのベクトルに対応する。これらの点は、後述の、図６、図７においても同様である。「降水量」、「気温」、「客数」および「風速」は、連続型変数の属性であるので、その集計データは１つのベクトルで表される。「販売促進キャンペーン」は、カテゴリ型変数の属性であるので、その集計データは複数のベクトルの集合で表される（図５に示す例では、複数の段で表される）。また、図５では、「日付」が指定された場合を例にして示しているが、指定属性（Ｘ軸に該当する属性）は、「日付」に限定されず、任意の属性でよい。 FIG. 5 is a schematic diagram showing total data of other attributes (each attribute such as “precipitation” illustrated in FIG. 2) when “date” is designated in step S1. The aggregate data is obtained as a vector or a set of vectors. In FIG. 5, the aggregate data is schematically shown in a heat map display mode. Each stage shown in FIG. 5 corresponds to one vector. These points are the same in FIGS. 6 and 7 described later. Since “precipitation”, “temperature”, “number of customers”, and “wind speed” are attributes of continuous variables, the aggregated data is represented by one vector. Since “sales promotion campaign” is an attribute of a categorical variable, the aggregated data is represented by a set of a plurality of vectors (in the example shown in FIG. 5, it is represented by a plurality of stages). FIG. 5 shows an example in which “date” is designated, but the designated attribute (attribute corresponding to the X axis) is not limited to “date”, and may be an arbitrary attribute.

次に、分解手段４は、属性毎に生成された集計データを、連続型変数の集計データと、カテゴリ型変数の集計データとに分類し、カテゴリ型変数の属性の集計データを項目毎に分解する（ステップＳ３）。このとき、分解手段４は、分類した集計データに、連続型変数の属性の集計データであるのか、カテゴリ型変数の属性の集計データであるのかを示すラベル情報を付加してもよい。分解手段４は、分解後の集計データ（項目毎の個々のベクトル）にも、カテゴリ型変数の属性のデータであることを示すラベル情報を引き継がせる。以下、カテゴリ型変数の属性の集計データを項目毎に分解した結果得られるデータを、項目の集計データと記す場合がある。 Next, the decomposing means 4 classifies the aggregate data generated for each attribute into continuous variable aggregate data and categorical variable aggregate data, and decomposes the categorical variable attribute aggregate data for each item. (Step S3). At this time, the decomposing means 4 may add label information indicating whether the data is the aggregated data of the attribute of the continuous variable or the aggregated data of the attribute of the categorical variable to the classified aggregated data. The disassembling means 4 can also inherit the label information indicating the attribute data of the categorical variable in the aggregated data (individual vectors for each item) after the disassembly. Hereinafter, data obtained as a result of disassembling the aggregate data of the attribute of the categorical variable for each item may be referred to as item aggregate data.

図５に示す例では、カテゴリ型変数の属性の集計データは、「祭り」、「テレビコマーシャル」、「なし」、・・・等の７つの項目毎に得られたベクトルの集合である。従って、分解手段４は、そのベクトルの集合を、個々の項目に対応する７つのベクトルに分解すればよい。 In the example shown in FIG. 5, the aggregate data of the attribute of the categorical variable is a set of vectors obtained for each of seven items such as “festival”, “TV commercial”, “none”,. Therefore, the decomposition unit 4 may decompose the set of vectors into seven vectors corresponding to individual items.

図６は、図５に模式的に示す集計データに対して、ステップＳ３の処理を行った後の状態を示す模式図である。図６に示す「Ｐ」は、連続型変数の属性の集計データであることを示すラベル情報である。図６に示す「Ｑ」は、カテゴリ型変数の属性のデータであることを示すラベル情報である。 FIG. 6 is a schematic diagram showing the state after the processing of step S3 is performed on the aggregate data schematically shown in FIG. “P” shown in FIG. 6 is label information indicating that it is aggregated data of attributes of continuous variables. “Q” shown in FIG. 6 is label information indicating attribute data of a categorical variable.

次に、クラスタリング手段５は、連続型変数の属性の集計データと、各項目の集計データに対して、クラスタリングを行う（ステップＳ４）。具体的には、クラスタリング手段５は、まず、連続型変数の属性の集計データとして得られる各ベクトル、および、各項目の集計データとして得られる各ベクトルをそれぞれ正規化する。そして、クラスタリング手段５は、連続型変数の属性の集計データを正規化した各ベクトルと、個々の項目の集計データを正規化した各ベクトルとを対象にして、クラスタリングを行う。クラスタリング手段５は、例えば、Ｋ−ｍｅａｎｓ法によって、各ベクトルをクラスタリングすればよい。 Next, the clustering means 5 performs clustering on the aggregated data of the attributes of the continuous variables and the aggregated data of each item (Step S4). Specifically, the clustering means 5 first normalizes each vector obtained as the aggregated data of the attributes of the continuous variable and each vector obtained as the aggregated data of each item. Then, the clustering means 5 performs clustering for each vector obtained by normalizing the aggregate data of the attributes of the continuous variable and each vector obtained by normalizing the aggregate data of the individual items. The clustering means 5 may cluster each vector by, for example, the K-means method.

図７は、図６に模式的に示す集計データに対して、ステップＳ４のクラスタリング処理を実行した結果を示す模式図である。図７では、連続型変数の属性の集計データ、および、各項目の集計データが、７つのクラスタに分類した場合を例示している。 FIG. 7 is a schematic diagram illustrating a result of performing the clustering process in step S4 on the aggregate data schematically illustrated in FIG. FIG. 7 illustrates a case where the aggregated data of the attributes of the continuous variable and the aggregated data of each item are classified into 7 clusters.

次に、表示手段６は、クラスタ毎に、集計データをヒートマップとして、表示する（ステップＳ５）。ステップＳ５において、表示手段６は、ステップＳ１で指定された属性をＸ軸（横軸）とし、他の属性や項目をＹ軸（縦軸）としてヒートマップを表示する。表示手段６は、ディスプレイ装置に表示させる画面（図４参照）内のヒートマップ表示領域２１にヒートマップを表示させる。図８は、ステップＳ５で表示されるヒートマップの例を示す模式図である。 Next, the display unit 6 displays the aggregated data as a heat map for each cluster (step S5). In step S5, the display means 6 displays the heat map with the attribute designated in step S1 as the X axis (horizontal axis) and other attributes and items as the Y axis (vertical axis). The display means 6 displays the heat map in the heat map display area 21 in the screen (see FIG. 4) displayed on the display device. FIG. 8 is a schematic diagram illustrating an example of a heat map displayed in step S5.

本実施形態の表示システム１は、連続型変数の属性の集計データや、カテゴリ型変数の属性の各項目の集計データをクラスタリングし、その結果得られるクラスタ毎に集計データをヒートマップとして表示する。従って、表示システム１の操作者（すなわち、ヒートマップの観察者）は、複数の属性において、変化のパターンが類似する属性を容易に把握することができる。また、カテゴリ型変数の属性の集計データは、その属性の個々の項目（属性値）毎の集計データに分解される。従って、連続型変数の属性や、カテゴリ型変数の属性の各項目に関して、変化のパターンが類似している属性および項目を操作者が容易に把握できる。 The display system 1 of the present embodiment clusters the aggregated data of continuous variable attributes and the aggregated data of each item of categorical variable attributes, and displays the aggregated data as a heat map for each cluster obtained as a result. Therefore, an operator of the display system 1 (that is, an observer of the heat map) can easily grasp an attribute having a similar change pattern among a plurality of attributes. Further, the total data of the attribute of the categorical variable is decomposed into total data for each item (attribute value) of the attribute. Therefore, for each item of the attribute of the continuous variable and the attribute of the categorical variable, the operator can easily grasp the attributes and items having similar change patterns.

図９は、クラスタ毎に表示された集計データのヒートマップの例を示す図である。図９に示すヒートマップが表示された場合、操作者は、７月から９月頃に関して、図９に示す５つの属性の集計値がいずれも高くなると容易に把握することができる。また、操作者は、図９に示す上段のクラスタのヒートマップから、売上が客数に比例する商品は商品Ａのみであり、そのクラスタに含まれていない商品Ｂ，Ｃ，Ｄ・・・等（図示略）に関しては、売上が客数に比例しないと判断することができる。また、例えば、操作者は、図９に示す下段のクラスタのヒートマップから、湿度と気温が高いと、最高不快指数も高くなる傾向にあるということを把握できる。 FIG. 9 is a diagram illustrating an example of a heat map of the total data displayed for each cluster. When the heat map shown in FIG. 9 is displayed, the operator can easily grasp that the total values of the five attributes shown in FIG. Further, from the heat map of the upper cluster shown in FIG. 9, the operator has only the product A whose sales are proportional to the number of customers, and the products B, C, D... With respect to (not shown), it can be determined that sales are not proportional to the number of customers. Further, for example, the operator can grasp from the heat map of the lower cluster shown in FIG. 9 that the highest discomfort index tends to increase when the humidity and temperature are high.

実施形態２．
図１０は、本発明の第２の実施形態の表示システムの構成例を示すブロック図である。第１の実施形態における要素と同様の要素については、図１と同一の符号を付し、説明を省略する。本実施形態の表示システム１は、第１の実施形態における各要素（図１参照）に加え、選択受付手段７と、絞り込み手段８とを備える。 Embodiment 2. FIG.
FIG. 10 is a block diagram illustrating a configuration example of a display system according to the second embodiment of this invention. The same elements as those in the first embodiment are denoted by the same reference numerals as those in FIG. The display system 1 of the present embodiment includes a selection receiving unit 7 and a narrowing-down unit 8 in addition to the elements in the first embodiment (see FIG. 1).

選択受付手段７は、集計データの選択を、入力デバイス（図１０において図示略）を介して操作者から受け付ける。選択候補となる集計データは、指定属性以外の連続型変数の属性の集計データや、指定属性以外のカテゴリ型変数の属性の各項目の集計データである。 The selection accepting unit 7 accepts selection of aggregate data from an operator via an input device (not shown in FIG. 10). Aggregated data that is a selection candidate is aggregated data of attributes of continuous variables other than specified attributes, and aggregated data of items of attributes of categorical variables other than specified attributes.

図１１は、第２の実施形態における表示システム１の表示画面の例を示す模式図である。第２の実施形態における表示画面は、ヒートマップ表示領域２１および属性指定領域２２に加えて、集計データ選択領域２３を含む。例えば、表示手段６が、図１１に例示する表示画面をディスプレイ装置に表示させる。例えば、表示手段６は、指定属性以外の連続型変数の各属性や、指定属性以外のカテゴリ型変数の属性の各項目が、それぞれラジオボタンとともに表示させる。この場合、操作者が、連続型変数の属性のうちの１つ、あるいは、カテゴリ型変数の属性の各項目のうちの１つに対応するラジオボタンをクリックしたときに、集計データ選択領域２３は、クリックされたラジオボタンに対応する属性または項目の集計データが選択されたと判定する。 FIG. 11 is a schematic diagram illustrating an example of a display screen of the display system 1 according to the second embodiment. The display screen in the second embodiment includes a total data selection area 23 in addition to the heat map display area 21 and the attribute designation area 22. For example, the display unit 6 displays the display screen illustrated in FIG. 11 on the display device. For example, the display unit 6 displays each attribute of the continuous variable other than the designated attribute and each item of the attribute of the categorical variable other than the designated attribute together with the radio button. In this case, when the operator clicks a radio button corresponding to one of the attributes of the continuous variable or one of the items of the attribute of the categorical variable, the total data selection area 23 is Then, it is determined that the total data of the attribute or item corresponding to the clicked radio button is selected.

絞り込み手段８は、操作者に選択された集計データに基づいて、表示対象とする集計データを絞り込む。第１の実施形態で述べたように、連続型変数の属性の集計データも、カテゴリ型変数の属性の各項目の集計データも、ベクトルとして定められる。絞り込み手段８は、選択された集計データに該当するベクトルと、他の集計データに該当するベクトルとの距離を、選択された集計データ以外の集計データ毎に算出し、その距離に基づいて、表示対象とする集計データを絞り込む。ここで、ベクトル同士の距離を算出する際に用いるベクトルは、クラスタリング手段５によって正規化が行われたベクトルである。 The narrowing down means 8 narrows down the aggregate data to be displayed based on the aggregate data selected by the operator. As described in the first embodiment, the aggregate data of the attributes of the continuous variable and the aggregate data of each item of the attribute of the categorical variable are determined as vectors. The narrowing-down means 8 calculates the distance between the vector corresponding to the selected aggregate data and the vector corresponding to the other aggregate data for each aggregate data other than the selected aggregate data, and displays based on the distance. Narrow down the target aggregated data. Here, the vector used when calculating the distance between the vectors is a vector that has been normalized by the clustering means 5.

絞り込み手段８は、操作者に選択された集計データを表示対象として定める。さらに、絞り込み手段８は、選択された集計データに該当するベクトルと、他の集計データに該当するベクトルとの距離が、閾値以下であることを条件として、当該他の集計データを表示対象として定める。すなわち、選択された集計データに該当するベクトルと、他の集計データに該当するベクトルとの距離が閾値を超えている場合、当該他の集計を表示対象から除外する。この結果、表示対象となる集計データが絞り込まれる。 The narrowing-down means 8 determines the aggregate data selected by the operator as a display target. Further, the narrowing-down means 8 determines the other aggregate data as a display target on the condition that the distance between the vector corresponding to the selected aggregate data and the vector corresponding to the other aggregate data is equal to or less than a threshold value. . That is, when the distance between the vector corresponding to the selected total data and the vector corresponding to the other total data exceeds the threshold, the other total is excluded from the display target. As a result, the aggregate data to be displayed is narrowed down.

表示手段６は、絞り込み手段８によって絞り込まれた集計データのみ（すなわち、表示対象として定められた集計データのみ）を、ヒートマップとして表示する。 The display means 6 displays only the aggregated data narrowed down by the narrowing-down means 8 (that is, only the aggregated data determined as a display target) as a heat map.

図１２は、図８に例示するヒートマップの表示後に、表示対象とする集計データを絞り込んだ場合のヒートマップの表示例を示す模式図である。表示対象とする集計データが絞り込まれることによって、ヒートマップとして表示される属性や項目の数が減少する。なお、図１２に示す例では、図８に示すクラスタ１，２，５に属する集計データが全て表示される場合を示している。しかし、集計データの絞り込みの結果、絞り込み前のクラスタに属する集計データの一部が表示対象から除外される場合もある。 FIG. 12 is a schematic diagram illustrating a display example of the heat map when the aggregate data to be displayed is narrowed down after the heat map illustrated in FIG. 8 is displayed. By narrowing down the aggregate data to be displayed, the number of attributes and items displayed as a heat map is reduced. In the example shown in FIG. 12, the total data belonging to the clusters 1, 2, and 5 shown in FIG. 8 are all displayed. However, as a result of narrowing down the aggregated data, some aggregated data belonging to the cluster before narrowing down may be excluded from the display target.

また、前述の閾値は、予め定められた固定値であってもよい。あるいは、操作者が閾値を選択できる構成であってもよい。図１１に示す例では、集計データ選択領域２３内に、操作者が閾値を選択するためのＧＵＩ（Graphic User Interface）として、スライダ２４が表示される場合を示している。選択受付手段７は、集計データの選択とともに、スライダ２４を介して閾値の選択も受け付けてよい。なお、閾値を選択するためのＧＵＩは、スライダ２４でなくてもよい。また、閾値が固定値である場合、閾値を選択するためのＧＵＩは表示されなくてよい。 Further, the above-described threshold value may be a predetermined fixed value. Or the structure which an operator can select a threshold value may be sufficient. In the example illustrated in FIG. 11, the slider 24 is displayed in the total data selection area 23 as a GUI (Graphic User Interface) for the operator to select a threshold value. The selection accepting means 7 may accept the selection of the threshold value via the slider 24 together with the selection of the total data. Note that the GUI for selecting the threshold value may not be the slider 24. When the threshold is a fixed value, the GUI for selecting the threshold need not be displayed.

選択受付手段７および絞り込み手段８は、例えば、表示プログラムに従って動作するＣＰＵによって実現される。また、選択受付手段７および絞り込み手段８を含む各手段が、それぞれ別々のハードウェアで実現されていてもよい。 The selection receiving means 7 and the narrowing-down means 8 are realized by a CPU that operates according to a display program, for example. Each means including the selection receiving means 7 and the narrowing-down means 8 may be realized by separate hardware.

第２の実施形態の表示システム１は、操作者によって選択された集計データと類似する集計データを絞り込んで、ヒートマップとして表示する。従って、操作者は、着目している集計データと類似する集計データと変化のパターンが類似している属性や項目をさらに容易に把握することができる。 The display system 1 of the second embodiment narrows down the aggregate data similar to the aggregate data selected by the operator and displays it as a heat map. Accordingly, the operator can more easily grasp attributes and items whose change pattern is similar to the aggregate data similar to the aggregate data of interest.

図１３は、本発明の各実施形態に係るコンピュータの構成例を示す概略ブロック図である。コンピュータ１０００は、ＣＰＵ１００１と、主記憶装置１００２と、補助記憶装置１００３と、インタフェース１００４と、ディスプレイ装置１００５と、入力デバイス１００６とを備える。入力デバイス１００６は、例えば、ラジオボタンやスライダ２４を操作するためのマウス等である。 FIG. 13 is a schematic block diagram illustrating a configuration example of a computer according to each embodiment of the present invention. The computer 1000 includes a CPU 1001, a main storage device 1002, an auxiliary storage device 1003, an interface 1004, a display device 1005, and an input device 1006. The input device 1006 is, for example, a radio button or a mouse for operating the slider 24.

上述の表示システム１は、コンピュータ１０００に実装される。表示システム１の動作は、プログラム（表示プログラム）の形式で補助記憶装置１００３に記憶されている。ＣＰＵ１００１は、プログラムを補助記憶装置１００３から読み出して主記憶装置１００２に展開し、そのプログラムに従って上記の処理を実行する。 The display system 1 described above is mounted on the computer 1000. The operation of the display system 1 is stored in the auxiliary storage device 1003 in the form of a program (display program). The CPU 1001 reads out the program from the auxiliary storage device 1003, develops it in the main storage device 1002, and executes the above processing according to the program.

補助記憶装置１００３は、一時的でない有形の媒体の一例である。一時的でない有形の媒体の他の例として、インタフェース１００４を介して接続される磁気ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、半導体メモリ等が挙げられる。また、このプログラムが通信回線によってコンピュータ１０００に配信される場合、配信を受けたコンピュータ１０００がそのプログラムを主記憶装置１００２に展開し、上記の処理を実行してもよい。 The auxiliary storage device 1003 is an example of a tangible medium that is not temporary. Other examples of the tangible medium that is not temporary include a magnetic disk, a magneto-optical disk, a CD-ROM, a DVD-ROM, and a semiconductor memory connected via the interface 1004. When this program is distributed to the computer 1000 via a communication line, the computer 1000 that has received the distribution may develop the program in the main storage device 1002 and execute the above processing.

また、プログラムは、前述の処理の一部を実現するためのものであってもよい。さらに、プログラムは、補助記憶装置１００３に既に記憶されている他のプログラムとの組み合わせで前述の処理を実現する差分プログラムであってもよい。 The program may be for realizing a part of the above-described processing. Furthermore, the program may be a differential program that realizes the above-described processing in combination with another program already stored in the auxiliary storage device 1003.

次に、本発明の最小構成について説明する。図１４は、本発明の表示システムの最小構成の例を示すブロック図である。本発明の表示システムは、集計データ生成手段５３と、クラスタリング手段５５と、表示手段５６とを備える。 Next, the minimum configuration of the present invention will be described. FIG. 14 is a block diagram showing an example of the minimum configuration of the display system of the present invention. The display system of the present invention includes total data generation means 53, clustering means 55, and display means 56.

集計データ生成手段５３（例えば、集計データ生成手段３）は、互いに対応付けられた複数の属性の属性値の組の集合が与えられ、複数の属性のうち１つの属性が指定されたときに、指定されていない属性の属性値を、指定された属性の属性値の区分に従って集計した集計データを、指定されていない属性毎に生成する。 The aggregate data generation means 53 (for example, the aggregate data generation means 3) is given a set of attribute value pairs of a plurality of attributes associated with each other, and when one attribute is designated among the plurality of attributes, Aggregated data obtained by aggregating attribute values of attributes that are not specified according to attribute value classifications of the specified attributes is generated for each attribute that is not specified.

クラスタリング手段５５（例えば、クラスタリング手段５）は、集計データをクラスタリングする。 Clustering means 55 (for example, clustering means 5) clusters the aggregated data.

表示手段５６（例えば、表示手段６）は、指定された属性をＸ軸として、クラスタリングの結果得られたクラスタ毎に、集計データをヒートマップとして表示する。 The display unit 56 (for example, the display unit 6) displays the aggregated data as a heat map for each cluster obtained as a result of clustering using the designated attribute as the X axis.

そのような構成により、表示システムの操作者（換言すれば、ヒートマップの観察者）は、複数の属性において、変化のパターンが類似する属性を容易に把握できる。 With such a configuration, an operator of the display system (in other words, an observer of the heat map) can easily grasp an attribute having a similar change pattern among a plurality of attributes.

また、集計データ生成手段５３が、指定されていない属性のうち、連続型変数の属性に関しては、指定された属性の属性値の区分毎に、対応する属性値の統計値を集計値として算出し、指定されていない属性のうち、カテゴリ型変数の属性に関しては、指定された属性の属性値の区分毎に、対応するカテゴリ型変数の属性の属性値である個々の項目のカウント値を、集計値として算出し、カテゴリ型変数の属性の集計データを、項目毎に分解する分解手段（例えば、分解手段４）を備え、クラスタリング手段５５が、分解後のカテゴリ型変数の属性の集計データと、連続型変数の属性の集計データとをクラスタリングする構成であってもよい。 In addition, the aggregate data generation unit 53 calculates the statistical value of the corresponding attribute value as the aggregate value for each attribute value category for the attribute of the continuous variable among the attributes that are not specified. Among the attributes that are not specified, for categorical variable attributes, the count value of each item that is the attribute value of the corresponding categorical variable attribute is aggregated for each attribute value category of the specified attribute. A decomposing unit (e.g., decomposing unit 4) that decomposes the categorical variable attribute for each item, and the clustering unit 55 calculates the categorical variable attribute total data after decomposing, A configuration may be used in which the aggregated data of the attributes of the continuous variable is clustered.

また、複数の属性のうちの１つの属性の指定を受け付ける指定受付手段（例えば、指定受付手段２）を備える構成であってもよい。 Moreover, the structure provided with the designation | designated reception means (for example, designation | designated reception means 2) which receives designation | designated of one attribute among several attributes may be sufficient.

また、集計データの選択を受け付ける選択受付手段（例えば、選択受付手段７）と、選択された集計データに基づいて、表示対象とする集計データを絞り込む絞り込み手段（絞り込み手段８）とを備える構成であってもよい。 In addition, the configuration includes a selection receiving unit (for example, a selection receiving unit 7) that receives selection of total data, and a narrowing unit (narrowing unit 8) that narrows down the total data to be displayed based on the selected total data. There may be.

また、絞り込み手段が、選択された集計データとして表されるベクトルと、他の集計データとして表される各ベクトルとの距離に基づいて、表示対象とする集計データを絞り込む構成であってもよい。 Further, the narrowing-down means may be configured to narrow down the aggregate data to be displayed based on the distance between the vector represented as the selected aggregate data and each vector represented as other aggregate data.

また、絞り込み手段が、選択された集計データを表示対象として定めるとともに、選択された集計データとして表されるベクトルと、他の集計データとして表されるベクトルとの距離が閾値以下であることを条件に、当該他の集計データを表示対象として定める構成であってもよい。 In addition, the narrowing down means that the selected aggregated data is set as a display target, and the distance between the vector represented as the selected aggregated data and the vector represented as other aggregated data is equal to or less than a threshold. In addition, the other aggregated data may be determined as a display target.

また、選択受付手段が、集計データの選択とともに、閾値の選択も受け付ける構成であってもよい。 Further, the selection accepting unit may be configured to accept selection of a threshold value together with selection of aggregate data.

本発明は、ヒートマップを表示する表示システムに好適に適用される。 The present invention is suitably applied to a display system that displays a heat map.

１表示システム
２指定受付手段
３集計データ生成手段
４分解手段
５クラスタリング手段
６表示手段
７選択受付手段
８絞り込み手段 DESCRIPTION OF SYMBOLS 1 Display system 2 Specification reception means 3 Total data generation means 4 Decomposition means 5 Clustering means 6 Display means 7 Selection reception means 8 Refinement means

Claims

When a set of attribute value pairs of a plurality of attributes that are associated with each other is given, and one attribute of the plurality of attributes is specified, the attribute values of the attributes that are not specified are Aggregated data generating means for generating aggregated data aggregated according to the attribute value category for each of the attributes not specified;
Clustering means for clustering the aggregated data;
Display means for displaying the aggregated data as a heat map for each cluster obtained as a result of clustering using the designated attribute as the X axis ;
Selection accepting means for accepting selection of aggregate data;
And a narrowing means for narrowing down the aggregate data to be displayed based on the selected aggregate data,
The narrowing means is
Narrow down the aggregate data to be displayed based on the distance between the vector represented as the selected aggregate data and each vector represented as other aggregate data
Display system comprising a call.

The aggregate data generation means
Among the attributes that are not specified, for continuous variable attributes, the statistical value of the corresponding attribute value is calculated as an aggregate value for each attribute value category of the specified attribute,
Among the attributes that are not specified, for categorical variable attributes, the count value of each item that is the attribute value of the corresponding categorical variable attribute is aggregated for each category of the attribute value of the specified attribute. As a value,
Disassembling means for decomposing the aggregate data of the attribute of the categorical variable for each item,
Clustering means
The display system according to claim 1, wherein the aggregated data of the attribute of the categorical variable after decomposition and the aggregated data of the attribute of the continuous variable are clustered.

The display system according to claim 1, further comprising designation receiving means for accepting designation of one of a plurality of attributes.

Narrowing means
The selected aggregate data is defined as a display target, and the other data is provided on the condition that the distance between the vector represented as the selected aggregate data and the vector represented as the other aggregate data is equal to or less than a threshold value. Set the aggregate data as the display target
The display system according to any one of claims 1 to 3 .

The display system according to claim 4 , wherein the selection accepting unit accepts selection of a threshold value as well as selection of aggregate data.

When a set of attribute value pairs of a plurality of attributes that are associated with each other is given, and one attribute of the plurality of attributes is specified, the attribute values of the attributes that are not specified are Generate aggregate data aggregated according to the attribute value category for each of the attributes not specified,
Clustering the aggregated data;
The aggregated data is displayed as a heat map for each cluster obtained as a result of clustering using the designated attribute as the X axis ,
Accept selection of aggregate data,
Based on the selected aggregate data, filter the aggregate data to be displayed,
When narrowing down the aggregate data to be displayed, narrow down the aggregate data to be displayed based on the distance between the vector represented as the selected aggregate data and each vector represented as other aggregate data. Characteristic display method.

On the computer,
When a set of attribute value pairs of a plurality of attributes that are associated with each other is given, and one attribute of the plurality of attributes is specified, the attribute values of the attributes that are not specified are Aggregated data generation processing for generating aggregated data aggregated according to attribute value categories for each of the attributes not specified,
A clustering process for clustering the aggregated data ;
Display processing for displaying the aggregated data as a heat map for each cluster obtained as a result of clustering using the designated attribute as the X-axis ,
A process for accepting selection of aggregate data, and
Based on the selected aggregate data, the process to narrow down the aggregate data to be displayed is executed,
In the process of narrowing down the aggregated data to be displayed, the aggregated data to be displayed is narrowed down based on the distance between the vector represented as the selected aggregated data and each vector represented as other aggregated data
Because of the display program.