WO2015163322A1 - Data analysis device, data analysis method, and program - Google Patents

Data analysis device, data analysis method, and program Download PDF

Info

Publication number
WO2015163322A1
WO2015163322A1 PCT/JP2015/062123 JP2015062123W WO2015163322A1 WO 2015163322 A1 WO2015163322 A1 WO 2015163322A1 JP 2015062123 W JP2015062123 W JP 2015062123W WO 2015163322 A1 WO2015163322 A1 WO 2015163322A1
Authority
WO
WIPO (PCT)
Prior art keywords
prediction
rule
prediction rule
actual measurement
measurement value
Prior art date
Application number
PCT/JP2015/062123
Other languages
French (fr)
Japanese (ja)
Inventor
勇気 小阪
虎 王
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Priority to JP2016514948A priority Critical patent/JPWO2015163322A1/en
Publication of WO2015163322A1 publication Critical patent/WO2015163322A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Definitions

  • the present invention is based on a patent application of the People's Republic of China: application number 20141167977.2 (filed on April 24, 2014), and the entire description of the application is incorporated herein by reference.
  • the present invention relates to a data analysis device, a data analysis method, and a program, and more particularly, to a data analysis device, a data analysis method, and a program that simultaneously analyze relationships between a plurality of objective variables and a plurality of explanatory variables.
  • the results of future predictions derived by analyzing a large amount of accumulated data are beginning to be used for corporate decision making. For example, in stores such as supermarkets and convenience stores, the number of purchases of each product is adjusted based on the demand prediction result of each product. For example, when the demand prediction of each product is performed, the relationship between the product / customer information such as the attribute of each product and the attribute of the customer who sold each product, and the sales performance of each product is analyzed. At this time, the sales performance value of each product is used as the objective variable, while the attributes (price, manufacturer) of each product, the attributes of the customer who sold each product (age, gender), etc. are used as the explanatory variables. It is done.
  • multitask analysis In other words, in multi-task analysis, after learning prediction rules that express the relationship between each objective variable and multiple explanatory variables, the values of multiple explanatory variables are input to the learned prediction rules, so that Calculate the predicted value.
  • Non-Patent Document 1 An example of multitask analysis technology is described in Non-Patent Document 1.
  • a prediction rule (hereinafter referred to as “prediction rule”) expressed by explanatory variables related to all objective variables in common based on measured values of a plurality of objective variables and measured values of a plurality of explanatory variables.
  • “Common prediction rule”) and a prediction rule for each objective variable represented by an explanatory variable related to each objective variable hereinafter referred to as “individual prediction rule”.
  • actual values of each explanatory variable are input to the learned common prediction rule and individual prediction rule, and a prediction value is calculated for each objective variable.
  • Non-Patent Document 2 describes a convex optimization method for minimizing an objective function.
  • Non-Patent Documents 1 and 2 above are incorporated herein by reference. The following analysis was made by the present inventors.
  • the number of objective variables ranges from thousands to tens of thousands
  • the number of prediction rules also ranges from thousands to tens of thousands, making it difficult for the user to check whether each prediction rule is valid. .
  • An object of the present invention is to provide a data analysis apparatus, a data analysis method, and a program that contribute to such a demand.
  • a multitask type data analysis apparatus includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and an object of a prediction target.
  • a storage unit for holding a third actual measurement value that is an actual measurement value of the explanatory variable corresponding to the variable is provided.
  • the data analysis device uses the first actual measurement value and the second actual measurement value, and a common prediction rule that is a prediction rule represented by explanatory variables related to the plurality of objective variables in common.
  • group comprising prediction rules for each objective variable represented by explanatory variables related to each objective variable, and prediction rules for each group when the prediction rules included in the individual prediction rules are grouped
  • a prediction rule learning unit for learning prediction rules is provided.
  • the computer includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, and a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables; A step of holding a third actual measurement value, which is an actual measurement value of the explanatory variable corresponding to the target variable to be predicted, in the storage unit. Further, in the data analysis method, the computer uses the first actual measurement value and the second actual measurement value read from the storage unit, and is an explanatory variable commonly related to the plurality of objective variables.
  • a group of the common prediction rule that is a prediction rule represented by, the individual prediction rule that consists of the prediction rule for each objective variable represented by the explanatory variable related to each objective variable, and the prediction rule included in the individual prediction rule A step of learning a group-specific prediction rule including a prediction rule for each group and recording it in the storage unit.
  • a program for causing a computer to execute multitask type data analysis includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and an objective variable to be predicted.
  • the computer is caused to execute a process of holding a third actual measurement value, which is an actual measurement value of the corresponding explanatory variable, in the storage unit.
  • the program uses the first actual measurement value and the second actual measurement value read from the storage unit, and the prediction rule is expressed by an explanatory variable commonly related to the plurality of objective variables.
  • the computer is caused to execute a process of learning a prediction rule for each group composed of prediction rules and recording it in the storage unit.
  • the program can be provided as a program product recorded in a non-transitory computer-readable storage medium.
  • the data analysis apparatus, data analysis method, and program according to the present invention can reduce the number of prediction rules while preventing a decrease in prediction accuracy in multi-task type data analysis.
  • FIG. 1 is a block diagram illustrating the configuration of a data analysis apparatus 10 according to an embodiment.
  • the data analysis device 10 is a multitask type data analysis device, and includes a storage unit 14, a prediction rule learning unit 15B, and a prediction value calculation unit 15C.
  • the storage unit 14 includes a first actual measurement value 14A that is an actual measurement value of a plurality of objective variables, a second actual measurement value 14B that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and a prediction target.
  • a third actual measurement value 14C that is an actual measurement value of the explanatory variable corresponding to the objective variable is held.
  • the prediction rule learning unit 15B uses the first actual measurement value 14A and the second actual measurement value 14B, and a common prediction rule 14D that is a prediction rule expressed by explanatory variables related to a plurality of objective variables in common.
  • a common prediction rule 14D that is a prediction rule expressed by explanatory variables related to a plurality of objective variables in common.
  • individual prediction rules 14E consisting of prediction rules for each objective variable represented by explanatory variables related to each objective variable
  • the prediction rule 14F is calculated.
  • the prediction rule learning unit 15B preferably groups the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule 14E belong to the same group.
  • the prediction value calculation unit 15C calculates the prediction value 14G of the target variable to be predicted using the common prediction rule 14D and the group-specific prediction rule 14F calculated by the prediction rule learning unit 15B and the third actual measurement value 14C. .
  • the data analysis apparatus 10 it is possible to reduce the number of prediction rules while preventing a decrease in prediction accuracy in multitask type data analysis. This is because, according to the data analysis apparatus 10, instead of the individual prediction rule 14E composed of the prediction rules for each objective variable, common with the group-specific prediction rule 14F for each group when the prediction rules included in the individual prediction rule 14E are grouped.
  • the prediction value 14G of the target variable to be predicted can be calculated using the prediction rule 14D.
  • the number of prediction rules included in the group-specific prediction rule 14F is the number of prediction rules included in the individual prediction rule 14E. This is because it can be greatly reduced.
  • the user determines the validity of the prediction rule used to derive the prediction result based on a relatively small number of prediction rules (common prediction rule 14D, group-specific prediction rule 14F). It becomes possible.
  • FIG. 2 is a block diagram illustrating an example of the configuration of the data analysis apparatus 20 according to the present embodiment.
  • the data analyzer 20 shown in FIG. 2 performs multitask analysis. That is, the data analysis apparatus 20 inputs the actual measurement values 24A of the plurality of objective variables and the actual measurement values 24B of the plurality of explanatory variables, and learns prediction rules (24D to 24F) representing the relationship between the objective variable and the explanatory variables. When the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted is input, the predicted value 24G for each target variable to be predicted is calculated and output.
  • the data analysis apparatus 20 of the present embodiment relates to a prediction rule (referred to as “common prediction rule 24D”) represented by explanatory variables that are commonly related to all objective variables, and to each objective variable.
  • a prediction rule for each objective variable represented by the explanatory variable (referred to as “individual prediction rule 24E”) and similar individual prediction rules are grouped, and a prediction rule for each group 24F is calculated by recalculating the prediction rule for each group. Then, when the measured value of the explanatory variable is input based on the common prediction rule 24D and the group-specific prediction rule 24F, the predicted value 24G for each target variable to be predicted is calculated and output.
  • the data analysis apparatus 20 includes a communication interface (I / F) unit 21, an operation input unit 22, a screen display unit 23, a storage unit 24, and a processor 25 as hardware. .
  • the communication I / F unit 21 has a dedicated data communication circuit, and performs data communication with various devices (not shown) connected via a communication line (not shown).
  • the operation input unit 22 includes an operation input device such as a keyboard and a mouse, detects an operator's operation, and outputs it to the processor 25.
  • the screen display unit 23 includes a screen display device such as an LCD (Liquid Crystal Display) or a PDP (Plasma Display Panel), and displays various information such as an operation menu and a selection result on the screen in response to an instruction from the processor 25. .
  • the storage unit 24 has a storage device such as a hard disk or a semiconductor memory, and stores processing information and programs required for various processes in the processor 25.
  • the program is a program that realizes various processing units (25A to 25C) by being read and executed by the processor 25.
  • the program is read in advance from an external device (not shown) or a computer-readable storage medium (not shown) via a data input / output function such as the communication I / F unit 21 and stored in the storage unit 24. Also good.
  • the main processing information recorded in the storage unit 24 includes measured values 24A of a plurality of objective variables, measured values 24B of a plurality of explanatory variables, and measured values 24C of explanatory variables corresponding to the target variable to be predicted.
  • the common prediction rule 24D, the individual prediction rule 24E, the group-specific prediction rule 24F, and the prediction value 24G are included.
  • the measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables are classified according to the type of the objective variable.
  • the data divided according to the type of the objective variable may be a list in which the actual measured value of the objective variable and the actual measured value of the corresponding explanatory variable are paired.
  • the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted is the actual measured value of the explanatory variable corresponding to the target variable to be predicted.
  • the common prediction rule 24D is a prediction rule represented by explanatory variables related to all the objective variables in common.
  • the common prediction rule 24 ⁇ / b> D may be a list in which an explanatory variable name commonly related to all objective variables and a value representing an influence exerted by the explanatory variable on the objective variable are paired.
  • the individual prediction rule 24E is a prediction rule for each objective variable represented by an explanatory variable related to each objective variable.
  • the individual prediction rule 24E is a list composed of triples of an objective variable name, an explanatory variable name related to the objective variable, and a value representing the influence of the explanatory variable on the objective variable. May be.
  • the group-specific prediction rule 24F is a group-specific prediction rule when similar individual prediction rules are grouped.
  • the group-specific prediction rule 24F may be configured by information in which a group ID and a group-specific prediction rule are paired, and information indicating the individual prediction rule 24E belonging to each group ID.
  • the predicted value 24G may be a list in which the target variable to be predicted and the predicted result are paired.
  • the processor 25 includes a microprocessor such as a CPU (Central Processing Unit) and its peripheral circuits.
  • the processor 25 reads the program from the storage unit 24 and executes it, thereby realizing various processing units by cooperating the hardware and the program.
  • the main processing units realized by the processor 25 include an input unit 25A, a prediction rule learning unit 25B, and a predicted value calculation unit 25C.
  • the input unit 25A inputs from the communication I / F unit 21 or the operation input unit 22 the measured values 24A of a plurality of objective variables and the measured values 24C of explanatory variables corresponding to the objective variable to be predicted. To store.
  • the prediction rule learning unit 25B uses the measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables, and the common prediction rule 24D represented by the explanatory variables related to all the objective variables, The individual prediction rule 24E represented by the explanatory variable related to each objective variable is learned, and further, the group-specific prediction rule 24F calculated by grouping similar individual prediction rules is learned and stored in the storage unit 24 To do.
  • the predicted value calculation unit 25C reads the common prediction rule 24D, the group-specific prediction rule 24F, and the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted from the storage unit 24, and the common prediction rule 24D and the group-specific prediction
  • the measured value 24C of the explanatory variable corresponding to the objective variable to be predicted is input to the rule 24F, and the predicted value 24G for each objective variable to be predicted is calculated and stored in the storage unit 24.
  • the predicted value calculation unit 25C reads the predicted value 24G from the storage unit 24 and outputs it to the screen display unit 23 or outputs it to the outside through the communication I / F unit 21. Further, the predicted value calculation unit 25C reads the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F from the storage unit 24, and outputs them to the screen display unit 23 or externally through the communication I / F unit 21. Output.
  • FIG. 3 is a flowchart showing an operation of the data analysis apparatus 20 as an example.
  • the operation of the data analysis apparatus 20 includes two phases, a learning phase and a prediction phase.
  • the input unit 25A inputs measured values 24A of a plurality of objective variables and measured values 24B of a plurality of explanatory variables corresponding to the measured values 24A from the communication I / F unit 21 or the operation input unit 22, and stores the storage unit 24. (Step S11).
  • the prediction rule learning unit 25B reads the measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables from the storage unit 24, and sets the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F. All are learned simultaneously (step S12).
  • the data analysis device 20 performs the following operations.
  • the input unit 25A inputs the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted from the communication I / F unit 21 or the operation input unit 22, and stores it in the storage unit 24 (step S21).
  • the predicted value calculation unit 25C reads the common prediction rule 24D and the group-specific prediction rule 24F from the storage unit 24, inputs the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted, and performs the desired prediction A predicted value for each variable is calculated (step S22).
  • the predicted value calculation unit 25C outputs, to the screen display unit 23, the one selected by the user from the predicted value 24G, the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F, or Output to the outside through the communication I / F unit 21 (step S23).
  • the prediction accuracy is greatly reduced by obtaining the group-specific prediction rules 24F calculated by grouping the prediction rules learned for each objective variable. Without making it possible, the number of prediction rules can be reduced.
  • the input unit 25A receives the measured values 24A of a plurality of objective variables and the measured values 24B of a plurality of explanatory variables as inputs.
  • the vector X_nt is an M-dimensional column vector representing the n-th observation vector of the target variable type t.
  • Y_nt is the nth actually measured value of the target variable type t.
  • N_t represents the number of actually measured values of the target variable type t.
  • T represents the number of types of objective variables.
  • M represents the number of explanatory variables.
  • ⁇ T ⁇ represents transposition.
  • the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F are represented by a column vector p_t, a matrix Q, and a matrix F, respectively.
  • the M-dimensional column vector p_t represents a common prediction rule for the task t.
  • a method for calculating the respective prediction rules 24D to 24F is as follows.
  • the common prediction rule indicates an explanatory variable that is commonly related to the objective variables of all tasks, and the degree of influence of each explanatory variable on the objective variable differs for each task. Therefore, the common prediction rule is defined for each task.
  • the matrix Q is a matrix indicating the individual prediction rule 24E.
  • the matrix Q represents an M ⁇ T size matrix of [q1q2... Q_t... Q_T].
  • the vector q_t is an M-dimensional column vector and represents an individual prediction rule for the task t.
  • the matrix F is a matrix indicating the group-specific prediction rule 24F.
  • the matrix F represents an M ⁇ K sized matrix [f1f2... F_k.
  • the vector f_k is an M-dimensional column vector and represents the kth group-specific prediction rule.
  • K represents the number of groups when the individual prediction rule 24E is divided into groups.
  • the matrix G represents a T ⁇ K sized matrix [g_1 ⁇ ⁇ T ⁇ ; g_2 ⁇ ⁇ T ⁇ ;...; G_t ⁇ ⁇ T ⁇ ; ...; g_T ⁇ ⁇ T ⁇ ].
  • the vector g_t is a K-dimensional column vector.
  • the vector g_t represents to which group the individual prediction rule of the objective variable type t belongs.
  • the prediction rule learning unit 25B learns the vector p_t and the matrices Q, F, and G at the same time. Specifically, the vector p_t and the matrices Q, F, and G may be learned by minimizing a predetermined objective function.
  • the prediction rule learning unit 25B can use an objective function represented by the following formula (1) as an example.
  • ⁇ _1, ⁇ _2, ⁇ _3, and ⁇ _4 are parameters that adjust the degree of influence of each term.
  • ⁇ _t represents the sum of t.
  • Equation (1) The purpose of introducing each term of Equation (1) is as follows.
  • the first term aims to reduce the error between the prediction result using the prediction rule and the actual measurement value.
  • the second term is intended to reduce the number of types of explanatory variables that are effective for prediction in common with respect to tasks with respect to the common prediction rule.
  • the third term is aimed at reducing the number of types of explanatory variables that are effective for prediction with respect to the group-specific prediction rule.
  • the purpose of the fourth term is to make the types of explanatory variables effective for prediction different between the group-specific prediction rule and the common prediction rule.
  • the purpose of the fifth term is to group so that prediction rules similar to each other among a plurality of individual prediction rules belong to the same group-specific prediction rule.
  • the fourth term is directly effective in making the types of explanatory variables effective for prediction differ between the individual prediction rule and the common prediction rule, but the group-specific prediction rule is an individual prediction rule. Therefore, if the type of explanatory variable that works for prediction differs between the individual prediction rule and the common prediction rule, the type of explanatory variable that works for prediction also differs between the group-specific prediction rule and the co-prediction rule. It is thought that it becomes.
  • the definition of the norm in the formula (1) is as follows. If W is a d-dimensional column vector,
  • _1
  • w_d represents a d-dimensional value of the vector W, and
  • _ ⁇ max (
  • the prediction rule learning unit 25B calculates matrices P, Q, F, and G that minimize the objective function given by Expression (1).
  • the prediction rule learning unit 25B can calculate the matrices P, Q, F, and G that minimize Equation (1) by using the convex optimization method described in Non-Patent Document 2.
  • the input unit 25A inputs the nth actually measured value X′_nt of the explanatory variable corresponding to the target variable t to be predicted.
  • the vector X′_nt is an M-dimensional vector.
  • the predicted value calculation unit 25C calculates the predicted value Y′_nt of the objective variable of X′_nt using the following equation (2).
  • the predicted value calculation unit 25C outputs the predicted value Y '_nt calculated based on equation (2).
  • [Form 1] The data analysis apparatus according to the first aspect is as described above.
  • [Form 2] A prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value is provided.
  • [Form 3] The prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group.
  • the prediction rule learning unit based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The data analysis device according to mode 3, wherein the group-specific prediction rule and the grouping rule are learned.
  • the prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method.
  • the predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value.
  • a second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group
  • the data analysis device according to aspect 4 or 5 including at least one of the items.
  • the data analysis apparatus according to mode 6, wherein the predetermined objective function is a weighted sum of a plurality of terms among the first term to the fifth term.
  • the prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule.
  • the data analysis device according to any one of Embodiments 3 to 7, which calculates a predicted value of an objective variable.
  • the data analysis method according to the second viewpoint is as described above.
  • a ninth aspect includes a prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value.
  • the data analysis method described in 1. [Form 11]
  • the prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group.
  • the prediction rule learning unit based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The data analysis method according to claim 11, wherein the group-specific prediction rule and the grouping rule are learned.
  • the prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method.
  • the predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value.
  • a second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group 14.
  • the data analysis method is a weighted sum of a plurality of terms among the first term to the fifth term.
  • the prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule.
  • the data analysis method according to any one of forms 11 to 15, wherein a predicted value of the objective variable is calculated.
  • the program is related to the third viewpoint.
  • a mode 17 including a prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value. The program described in.
  • the prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group. , The program according to Form 18.
  • the prediction rule learning unit based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The program according to mode 19, which learns group-specific prediction rules and the grouping rules.
  • the prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method.
  • the predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value.
  • a second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group
  • the prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule.
  • Non-Patent Documents 1 and 2 are incorporated herein by reference.
  • the embodiment can be changed and adjusted based on the basic technical concept.
  • various combinations or selections of various disclosed elements are possible within the scope of the claims of the present invention. It is. That is, the present invention of course includes various variations and modifications that could be made by those skilled in the art according to the entire disclosure including the claims and the technical idea.
  • any numerical value or small range included in the range should be construed as being specifically described even if there is no specific description.

Abstract

 The present invention reduces the number of prediction rules while preventing a reduction in prediction accuracy pertaining to multitask-type analysis in which relations between a plurality of objective variables and a plurality of explanatory variables are analyzed simultaneously. This multitask-type data analysis device is provided with: a storage unit for holding first measured values which are the measured values of the plurality of objective variables, second measured values which are the measured values of the plurality of explanatory variables corresponding to the plurality of objective variables, and third measured values which are the measured values of explanatory variables corresponding to objective variables to be predicted; and a prediction rule learning unit that uses the first and second measured values to calculate a common prediction rule, which is the prediction rule expressed by an explanatory variable relating in common to the plurality of objective variables, individual prediction rules, which comprise by-objective-variable prediction rules expressed by an explanatory variable relating to each objective variable, and by-group prediction rules, which, when prediction rules included in the individual prediction rules have been grouped, are formed from prediction rules for each group.

Description

データ分析装置、データ分析方法およびプログラムData analysis apparatus, data analysis method and program
[関連出願についての記載]
 本発明は、中華人民共和国特許出願:出願番号201410167977.2(2014年4月24日出願)に基づくものであり、同出願の全記載内容は引用をもって本書に組み込み記載されているものとする。
 本発明は、データ分析装置、データ分析方法およびプログラムに関し、特に、複数の目的変数と複数の説明変数との関係性を同時に分析するデータ分析装置、データ分析方法およびプログラムに関する。
[Description of related applications]
The present invention is based on a patent application of the People's Republic of China: application number 20141167977.2 (filed on April 24, 2014), and the entire description of the application is incorporated herein by reference.
The present invention relates to a data analysis device, a data analysis method, and a program, and more particularly, to a data analysis device, a data analysis method, and a program that simultaneously analyze relationships between a plurality of objective variables and a plurality of explanatory variables.
 蓄積された膨大なデータを分析して導き出された将来予測の結果が、企業の意思決定に活用され始めている。例えば、スーパー、コンビニエンスストアなどの店舗では、各商品の需要予測結果に基づいて各商品の仕入れ数を調整している。例えば、各商品の需要予測を行う場合、各商品の属性や各商品を販売した顧客の属性などの商品・顧客情報と、各商品の販売実績との関係を分析する。このとき、目的変数として、各商品の販売実績値が用いられ、一方、説明変数として、各商品の属性(価格、製造メーカー)、各商品を販売した顧客の属性(年齢、性別)などが用いられる。 The results of future predictions derived by analyzing a large amount of accumulated data are beginning to be used for corporate decision making. For example, in stores such as supermarkets and convenience stores, the number of purchases of each product is adjusted based on the demand prediction result of each product. For example, when the demand prediction of each product is performed, the relationship between the product / customer information such as the attribute of each product and the attribute of the customer who sold each product, and the sales performance of each product is analyzed. At this time, the sales performance value of each product is used as the objective variable, while the attributes (price, manufacturer) of each product, the attributes of the customer who sold each product (age, gender), etc. are used as the explanatory variables. It is done.
 上記のデータ分析において、複数の目的変数を別々に扱って各目的変数と複数の説明変数との関係性を表す予測ルールを独立に学習する代わりに、複数の目的変数間の関係性も考慮しつつ、各目的変数と複数の説明変数との関係性を表す予測ルールを学習することによって予測精度を向上する技術が開発されている。このようなアプローチは、「マルチタスク型分析」と呼ばれている。すなわち、マルチタスク型分析では、各目的変数と複数の説明変数との関係性を表す予測ルールを学習した後、学習した予測ルールに複数の説明変数の値を入力することにより、各目的変数の予測値を算出する。 In the above data analysis, instead of learning the prediction rule that expresses the relationship between each objective variable and multiple explanatory variables by handling multiple objective variables separately, the relationship between multiple objective variables is also considered. On the other hand, a technique for improving the prediction accuracy by learning a prediction rule representing the relationship between each objective variable and a plurality of explanatory variables has been developed. Such an approach is called “multitask analysis”. In other words, in multi-task analysis, after learning prediction rules that express the relationship between each objective variable and multiple explanatory variables, the values of multiple explanatory variables are input to the learned prediction rules, so that Calculate the predicted value.
 マルチタスク型分析技術の一例が、非特許文献1に記載されている。非特許文献1に記載された技術では、複数の目的変数の実測値と複数の説明変数の実測値に基づいて、すべての目的変数に共通して関係する説明変数によって表される予測ルール(以下、「共通予測ルール」という。)と、各目的変数に対してそれぞれ関係する説明変数によって表される目的変数別の予測ルール(以下、「個別予測ルール」という。)を学習する。次に、学習した共通予測ルールと個別予測ルールに、各説明変数の実測値を入力して、目的変数毎に予測値を算出する。 An example of multitask analysis technology is described in Non-Patent Document 1. In the technique described in Non-Patent Document 1, a prediction rule (hereinafter referred to as “prediction rule”) expressed by explanatory variables related to all objective variables in common based on measured values of a plurality of objective variables and measured values of a plurality of explanatory variables. , “Common prediction rule”) and a prediction rule for each objective variable represented by an explanatory variable related to each objective variable (hereinafter referred to as “individual prediction rule”). Next, actual values of each explanatory variable are input to the learned common prediction rule and individual prediction rule, and a prediction value is calculated for each objective variable.
 また、関連技術として、非特許文献2には、目的関数を最小化するための凸最適化方法が記載されている。 Also, as related technology, Non-Patent Document 2 describes a convex optimization method for minimizing an objective function.
 上記非特許文献1、2の全開示内容は、本書に引用をもって繰り込み記載されているものとする。以下の分析は、本発明者によってなされたものである。 The entire contents disclosed in Non-Patent Documents 1 and 2 above are incorporated herein by reference. The following analysis was made by the present inventors.
 マルチタスク型のデータ分析において、機械が予測した結果のみならず、実用上は、機械がどのようにして予測結果を導いたのかを表すことが求められる。なぜなら、意思決定をする際には、予測結果だけを確認するだけではなく、予測結果を導いた予測ルールの妥当性が重要となるからである。 In multi-task type data analysis, not only the results predicted by the machine, but also practically it is required to express how the machine derived the prediction result. This is because, when making a decision, not only the prediction result is confirmed, but also the validity of the prediction rule that led to the prediction result is important.
 機械がどのようにして予測結果を導いたのかをユーザに知らせるには、ユーザに対して予測ルールを表示して提供する必要がある。しかしながら、目的変数の数が数千~数万種類に及ぶと、予測ルールの数も数千~数万種類となるため、ユーザは各予測ルールが妥当か否かを確認することが困難となる。 In order to inform the user how the machine has derived the prediction result, it is necessary to display and provide the prediction rule to the user. However, if the number of objective variables ranges from thousands to tens of thousands, the number of prediction rules also ranges from thousands to tens of thousands, making it difficult for the user to check whether each prediction rule is valid. .
 したがって、目的変数の数が膨大な場合でも、予測精度を大きく低下させることなく学習される予測ルールの個数を削減することが重要となるが、現状では、そのような技術は確立されていない。 Therefore, even when the number of objective variables is enormous, it is important to reduce the number of prediction rules learned without greatly reducing the prediction accuracy, but at present, such a technique has not been established.
 そこで、マルチタスク型のデータ分析において、予測精度の低下を防ぎつつ予測ルールの数を削減することが要望される。本発明の目的は、かかる要望に寄与するデータ分析装置、データ分析方法およびプログラムを提供することにある。 Therefore, in multi-task type data analysis, it is desired to reduce the number of prediction rules while preventing a decrease in prediction accuracy. An object of the present invention is to provide a data analysis apparatus, a data analysis method, and a program that contribute to such a demand.
 本発明の第1の視点によると、マルチタスク型のデータ分析装置が提供される。前記データ分析装置は、複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を保持する記憶部を備えている。また、前記データ分析装置は、前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習する予測ルール学習部を備えている。 According to the first aspect of the present invention, a multitask type data analysis apparatus is provided. The data analysis apparatus includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and an object of a prediction target. A storage unit for holding a third actual measurement value that is an actual measurement value of the explanatory variable corresponding to the variable is provided. In addition, the data analysis device uses the first actual measurement value and the second actual measurement value, and a common prediction rule that is a prediction rule represented by explanatory variables related to the plurality of objective variables in common. By group, comprising prediction rules for each objective variable represented by explanatory variables related to each objective variable, and prediction rules for each group when the prediction rules included in the individual prediction rules are grouped A prediction rule learning unit for learning prediction rules is provided.
 本発明の第2の視点によると、コンピュータがマルチタスク型のデータ分析を行うデータ分析方法が提供される。前記データ分析方法は、前記コンピュータが、複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を記憶部に保持する工程を含む。また、前記データ分析方法は、前記コンピュータが、前記記憶部から読み出された前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習して前記記憶部に記録する工程を含む。 According to the second aspect of the present invention, there is provided a data analysis method in which a computer performs multitask data analysis. In the data analysis method, the computer includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, and a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables; A step of holding a third actual measurement value, which is an actual measurement value of the explanatory variable corresponding to the target variable to be predicted, in the storage unit. Further, in the data analysis method, the computer uses the first actual measurement value and the second actual measurement value read from the storage unit, and is an explanatory variable commonly related to the plurality of objective variables. A group of the common prediction rule that is a prediction rule represented by, the individual prediction rule that consists of the prediction rule for each objective variable represented by the explanatory variable related to each objective variable, and the prediction rule included in the individual prediction rule A step of learning a group-specific prediction rule including a prediction rule for each group and recording it in the storage unit.
 本発明の第3の視点によると、マルチタスク型のデータ分析をコンピュータに実行させるプログラムが提供される。前記プログラムは、複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を記憶部に保持する処理を前記コンピュータに実行させる。また、前記プログラムは、前記記憶部から読み出された前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習して前記記憶部に記録する処理を前記コンピュータに実行させる。なお、プログラムは、非一時的なコンピュータ可読記録媒体(non-transitory computer-readable storage medium)に記録されたプログラム製品として提供することができる。 According to the third aspect of the present invention, there is provided a program for causing a computer to execute multitask type data analysis. The program includes a first actual measurement value that is an actual measurement value of a plurality of objective variables, a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and an objective variable to be predicted. The computer is caused to execute a process of holding a third actual measurement value, which is an actual measurement value of the corresponding explanatory variable, in the storage unit. In addition, the program uses the first actual measurement value and the second actual measurement value read from the storage unit, and the prediction rule is expressed by an explanatory variable commonly related to the plurality of objective variables. For each group when the prediction rules included in the individual prediction rule are grouped, and the individual prediction rule composed of the prediction rule for each objective variable represented by the explanatory variable related to each objective variable. The computer is caused to execute a process of learning a prediction rule for each group composed of prediction rules and recording it in the storage unit. The program can be provided as a program product recorded in a non-transitory computer-readable storage medium.
 本発明に係るデータ分析装置、データ分析方法およびプログラムによると、マルチタスク型のデータ分析において、予測精度の低下を防ぎつつ予測ルールの数を削減することが可能となる。 The data analysis apparatus, data analysis method, and program according to the present invention can reduce the number of prediction rules while preventing a decrease in prediction accuracy in multi-task type data analysis.
一実施形態に係るデータ分析装置の構成を一例として示すブロック図である。It is a block diagram which shows the structure of the data analyzer which concerns on one Embodiment as an example. 第1の実施形態に係るデータ分析装置の構成を一例として示すブロック図である。It is a block diagram which shows the structure of the data analyzer which concerns on 1st Embodiment as an example. 第1の実施形態に係るデータ分析装置の動作を一例として示すフロー図である。It is a flowchart which shows operation | movement of the data analyzer which concerns on 1st Embodiment as an example.
 はじめに、一実施形態の概要について説明する。なお、この概要に付記する図面参照符号は、専ら理解を助けるための例示であり、本発明を図示の態様に限定することを意図するものではない。 First, an outline of one embodiment will be described. Note that the reference numerals of the drawings attached to this summary are merely examples for facilitating understanding, and are not intended to limit the present invention to the illustrated embodiment.
 図1は、一実施形態に係るデータ分析装置10の構成を例示するブロック図である。図1を参照すると、データ分析装置10は、マルチタスク型のデータ分析装置であって、記憶部14、予測ルール学習部15B、および、予測値算出部15Cを備えている。 FIG. 1 is a block diagram illustrating the configuration of a data analysis apparatus 10 according to an embodiment. Referring to FIG. 1, the data analysis device 10 is a multitask type data analysis device, and includes a storage unit 14, a prediction rule learning unit 15B, and a prediction value calculation unit 15C.
 記憶部14は、複数の目的変数の実測値である第1の実測値14Aと、当該複数の目的変数に対応する複数の説明変数の実測値である第2の実測値14Bと、予測対象の目的変数に対応する説明変数の実測値である第3の実測値14Cを保持する。 The storage unit 14 includes a first actual measurement value 14A that is an actual measurement value of a plurality of objective variables, a second actual measurement value 14B that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and a prediction target. A third actual measurement value 14C that is an actual measurement value of the explanatory variable corresponding to the objective variable is held.
 予測ルール学習部15Bは、第1の実測値14Aと第2の実測値14Bを用いて、複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルール14Dと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルール14Eと、個別予測ルール14Eに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルール14Fを算出する。ここで、予測ルール学習部15Bは、個別予測ルール14Eに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するように当該複数の予測ルールをグループ化することが好ましい。 The prediction rule learning unit 15B uses the first actual measurement value 14A and the second actual measurement value 14B, and a common prediction rule 14D that is a prediction rule expressed by explanatory variables related to a plurality of objective variables in common. By group consisting of individual prediction rules 14E consisting of prediction rules for each objective variable represented by explanatory variables related to each objective variable, and prediction rules for each group when the prediction rules included in the individual prediction rules 14E are grouped The prediction rule 14F is calculated. Here, the prediction rule learning unit 15B preferably groups the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule 14E belong to the same group.
 予測値算出部15Cは、予測ルール学習部15Bにより算出された共通予測ルール14Dおよびグループ別予測ルール14Fと、第3の実測値14Cを用いて、予測対象の目的変数の予測値14Gを算出する。 The prediction value calculation unit 15C calculates the prediction value 14G of the target variable to be predicted using the common prediction rule 14D and the group-specific prediction rule 14F calculated by the prediction rule learning unit 15B and the third actual measurement value 14C. .
 かかるデータ分析装置10によると、マルチタスク型のデータ分析において、予測精度の低下を防ぎつつ予測ルールの数を減らすことが可能となる。なぜなら、データ分析装置10によると、目的変数別の予測ルールから成る個別予測ルール14Eの代わりに、個別予測ルール14Eに含まれる予測ルールをグループ化したときの各グループに対するグループ別予測ルール14Fと共通予測ルール14Dを用いて、予測対象の目的変数の予測値14Gを算出することができ、このとき、グループ別予測ルール14Fに含まれる予測ルールの数は個別予測ルール14Eに含まれる予測ルールの数よりも大幅に少なくすることができるからである。 According to the data analysis apparatus 10, it is possible to reduce the number of prediction rules while preventing a decrease in prediction accuracy in multitask type data analysis. This is because, according to the data analysis apparatus 10, instead of the individual prediction rule 14E composed of the prediction rules for each objective variable, common with the group-specific prediction rule 14F for each group when the prediction rules included in the individual prediction rule 14E are grouped. The prediction value 14G of the target variable to be predicted can be calculated using the prediction rule 14D. At this time, the number of prediction rules included in the group-specific prediction rule 14F is the number of prediction rules included in the individual prediction rule 14E. This is because it can be greatly reduced.
 したがって、データ分析装置10によると、ユーザは、予測結果の導出に用いられた予測ルールの妥当性を、比較的少数の予測ルール(共通予測ルール14D、グループ別予測ルール14F)に基づいて判断することが可能となる。  Therefore, according to the data analysis apparatus 10, the user determines the validity of the prediction rule used to derive the prediction result based on a relatively small number of prediction rules (common prediction rule 14D, group-specific prediction rule 14F). It becomes possible. *
<実施形態1>
 次に、第1の実施形態に係るデータ分析装置について、図面を参照して詳細に説明する。図2は、本実施形態のデータ分析装置20の構成を一例として示すブロック図である。
<Embodiment 1>
Next, the data analysis apparatus according to the first embodiment will be described in detail with reference to the drawings. FIG. 2 is a block diagram illustrating an example of the configuration of the data analysis apparatus 20 according to the present embodiment.
 図2に記載したデータ分析装置20は、マルチタスク型分析を行う。すなわち、データ分析装置20は、複数の目的変数の実測値24Aと複数の説明変数の実測値24Bとを入力して、目的変数と説明変数の関係性を表す予測ルール(24D~24F)を学習し、予測対象の目的変数に対応する説明変数の実測値24Cを入力すると、予測対象の目的変数ごとの予測値24Gを算出して出力する。 The data analyzer 20 shown in FIG. 2 performs multitask analysis. That is, the data analysis apparatus 20 inputs the actual measurement values 24A of the plurality of objective variables and the actual measurement values 24B of the plurality of explanatory variables, and learns prediction rules (24D to 24F) representing the relationship between the objective variable and the explanatory variables. When the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted is input, the predicted value 24G for each target variable to be predicted is calculated and output.
 特に、本実施形態のデータ分析装置20は、すべての目的変数に共通して関係する説明変数によって表される予測ルール(「共通予測ルール24D」という。)、各目的変数に対してそれぞれ関係する説明変数によって表される目的変数別の予測ルール(「個別予測ルール24E」という。)、類似する個別予測ルールをグループ化して、グループ別に予測ルールを算出し直したグループ別予測ルール24Fを学習して、共通予測ルール24Dとグループ別予測ルール24Fに基づいて、説明変数の実測値を入力すると、予測したい目的変数毎の予測値24Gを算出して出力する。 In particular, the data analysis apparatus 20 of the present embodiment relates to a prediction rule (referred to as “common prediction rule 24D”) represented by explanatory variables that are commonly related to all objective variables, and to each objective variable. A prediction rule for each objective variable represented by the explanatory variable (referred to as “individual prediction rule 24E”) and similar individual prediction rules are grouped, and a prediction rule for each group 24F is calculated by recalculating the prediction rule for each group Then, when the measured value of the explanatory variable is input based on the common prediction rule 24D and the group-specific prediction rule 24F, the predicted value 24G for each target variable to be predicted is calculated and output.
 図2を参照すると、データ分析装置20は、ハードウェアとして、通信インターフェース(I/F:Interface)部21、操作入力部22、画面表示部23、記憶部24、および、プロセッサ25を備えている。 Referring to FIG. 2, the data analysis apparatus 20 includes a communication interface (I / F) unit 21, an operation input unit 22, a screen display unit 23, a storage unit 24, and a processor 25 as hardware. .
 通信I/F部21は、専用のデータ通信回路を有し、通信回線(非図示)を介して接続された図示しない各種装置との間でデータ通信を行う。操作入力部22は、キーボード、マウスなどの操作入力装置を有し、オペレータの操作を検出してプロセッサ25に出力する。画面表示部23は、LCD(Liquid Crystal Display)、PDP(Plasma Display Panel)などの画面表示装置を有し、プロセッサ25からの指示に応じて、操作メニュー、選定結果などの各種情報を画面表示する。 The communication I / F unit 21 has a dedicated data communication circuit, and performs data communication with various devices (not shown) connected via a communication line (not shown). The operation input unit 22 includes an operation input device such as a keyboard and a mouse, detects an operator's operation, and outputs it to the processor 25. The screen display unit 23 includes a screen display device such as an LCD (Liquid Crystal Display) or a PDP (Plasma Display Panel), and displays various information such as an operation menu and a selection result on the screen in response to an instruction from the processor 25. .
 記憶部24は、ハードディスク、半導体メモリなどの記憶装置を有し、プロセッサ25での各種処理に必要とされる処理情報およびプログラムを記憶する。プログラムは、プロセッサ25に読み込まれて実行されることにより各種処理部(25A~25C)を実現するプログラムである。プログラムは、通信I/F部21などのデータ入出力機能を介して外部装置(非図示)やコンピュータ読取可能な記憶媒体(非図示)から予め読み込まれて記憶部24に保存されるようにしてもよい。 The storage unit 24 has a storage device such as a hard disk or a semiconductor memory, and stores processing information and programs required for various processes in the processor 25. The program is a program that realizes various processing units (25A to 25C) by being read and executed by the processor 25. The program is read in advance from an external device (not shown) or a computer-readable storage medium (not shown) via a data input / output function such as the communication I / F unit 21 and stored in the storage unit 24. Also good.
 記憶部24に記録される主な処理情報には、複数の目的変数の実測値24Aと、複数の説明変数の実測値24Bと、予測対象となる目的変数に対応する説明変数の実測値24Cと、共通予測ルール24Dと、個別予測ルール24Eと、グループ別予測ルール24Fと、予測値24Gとが含まれる。 The main processing information recorded in the storage unit 24 includes measured values 24A of a plurality of objective variables, measured values 24B of a plurality of explanatory variables, and measured values 24C of explanatory variables corresponding to the target variable to be predicted. The common prediction rule 24D, the individual prediction rule 24E, the group-specific prediction rule 24F, and the prediction value 24G are included.
 複数の目的変数の実測値24Aと複数の説明変数の実測値24Bは、目的変数の種類別に分けられている。目的変数の種類別に分けられたデータは、目的変数の実測値と、対応する説明変数の実測値とが対になったリストであってもよい。 The measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables are classified according to the type of the objective variable. The data divided according to the type of the objective variable may be a list in which the actual measured value of the objective variable and the actual measured value of the corresponding explanatory variable are paired.
 予測対象となる目的変数に対応する説明変数の実測値24Cは、予測対象となる目的変数に対応する説明変数の実測値である。 The actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted is the actual measured value of the explanatory variable corresponding to the target variable to be predicted.
 共通予測ルール24Dは、すべての目的変数に共通して関係する説明変数によって表される予測ルールである。共通予測ルール24Dは、すべての目的変数に共通して関係する説明変数名とその説明変数が目的変数に与える影響力を表す値が対になって構成されたリストであってもよい。 The common prediction rule 24D is a prediction rule represented by explanatory variables related to all the objective variables in common. The common prediction rule 24 </ b> D may be a list in which an explanatory variable name commonly related to all objective variables and a value representing an influence exerted by the explanatory variable on the objective variable are paired.
 個別予測ルール24Eは、各目的変数に対してそれぞれ関係する説明変数によって表される目的変数別の予測ルールである。個別予測ルール24Eは、目的変数名とその目的変数に対して関係する説明変数名と、その説明変数がその目的変数に与える影響力を表す値が3つ組になって構成されたリストであってもよい。 The individual prediction rule 24E is a prediction rule for each objective variable represented by an explanatory variable related to each objective variable. The individual prediction rule 24E is a list composed of triples of an objective variable name, an explanatory variable name related to the objective variable, and a value representing the influence of the explanatory variable on the objective variable. May be.
 グループ別予測ルール24Fは、類似する個別予測ルールをグループ化したときの、グループ別の予測ルールである。グループ別予測ルール24Fは、グループIDとグループ別の予測ルールが対になった情報と、各グループIDに属する個別予測ルール24Eを表す情報とによって構成してもよい。 The group-specific prediction rule 24F is a group-specific prediction rule when similar individual prediction rules are grouped. The group-specific prediction rule 24F may be configured by information in which a group ID and a group-specific prediction rule are paired, and information indicating the individual prediction rule 24E belonging to each group ID.
 予測値24Gは、予測対象となる目的変数と予測した結果が対になったリストとしてもよい。 The predicted value 24G may be a list in which the target variable to be predicted and the predicted result are paired.
 プロセッサ25は、CPU(Central Processing Unit)などのマイクロプロセッサとその周辺回路とを有する。プロセッサ25は、記憶部24からプログラムを読み込んで実行することにより、上記ハードウェアとプログラムとを協働させて各種処理部を実現する。プロセッサ25で実現される主な処理部には、入力部25A、予測ルール学習部25Bおよび予測値算出部25Cが含まれる。 The processor 25 includes a microprocessor such as a CPU (Central Processing Unit) and its peripheral circuits. The processor 25 reads the program from the storage unit 24 and executes it, thereby realizing various processing units by cooperating the hardware and the program. The main processing units realized by the processor 25 include an input unit 25A, a prediction rule learning unit 25B, and a predicted value calculation unit 25C.
 入力部25Aは、通信I/F部21または操作入力部22から、複数の目的変数の実測値24A、予測対象となる目的変数に対応する説明変数の実測値24Cを入力して、記憶部24に格納する。 The input unit 25A inputs from the communication I / F unit 21 or the operation input unit 22 the measured values 24A of a plurality of objective variables and the measured values 24C of explanatory variables corresponding to the objective variable to be predicted. To store.
 予測ルール学習部25Bは、複数の目的変数の実測値24Aと複数の説明変数の実測値24Bを用いて、すべての目的変数に共通して関係する説明変数によって表される共通予測ルール24Dと、各目的変数に対してそれぞれ関係する説明変数によって表される個別予測ルール24Eを学習し、さらに、類似する個別予測ルールをグループ化して計算したグループ別予測ルール24Fを学習し、記憶部24に保存する。 The prediction rule learning unit 25B uses the measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables, and the common prediction rule 24D represented by the explanatory variables related to all the objective variables, The individual prediction rule 24E represented by the explanatory variable related to each objective variable is learned, and further, the group-specific prediction rule 24F calculated by grouping similar individual prediction rules is learned and stored in the storage unit 24 To do.
 予測値算出部25Cは、共通予測ルール24Dとグループ別予測ルール24Fと、予測対象となる目的変数に対応する説明変数の実測値24Cとを記憶部24から読み込み、共通予測ルール24Dとグループ別予測ルール24Fに、予測対象となる目的変数に対応する説明変数の実測値24Cを入力して、予測したい目的変数毎の予測値24Gを算出し、記憶部24に保存する。 The predicted value calculation unit 25C reads the common prediction rule 24D, the group-specific prediction rule 24F, and the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted from the storage unit 24, and the common prediction rule 24D and the group-specific prediction The measured value 24C of the explanatory variable corresponding to the objective variable to be predicted is input to the rule 24F, and the predicted value 24G for each objective variable to be predicted is calculated and stored in the storage unit 24.
 また、予測値算出部25Cは、記憶部24から予測値24Gを読み込み、画面表示部23に出力し、あるいは、通信I/F部21を通じて外部に出力する。また、予測値算出部25Cは、記憶部24から共通予測ルール24D、個別予測ルール24E、グループ別予測ルール24Fを読み込み、画面表示部23に出力し、あるいは、通信I/F部21を通じて外部に出力する。 The predicted value calculation unit 25C reads the predicted value 24G from the storage unit 24 and outputs it to the screen display unit 23 or outputs it to the outside through the communication I / F unit 21. Further, the predicted value calculation unit 25C reads the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F from the storage unit 24, and outputs them to the screen display unit 23 or externally through the communication I / F unit 21. Output.
 次に、本実施形態に係るデータ分析装置20の動作について、図面参照して説明する。図3は、データ分析装置20の動作を一例として示すフロー図である。 Next, the operation of the data analysis apparatus 20 according to the present embodiment will be described with reference to the drawings. FIG. 3 is a flowchart showing an operation of the data analysis apparatus 20 as an example.
 図3を参照すると、本実施形態に係るデータ分析装置20の動作には、学習フェーズおよび予測フェーズの2つのフェーズが含まれる。 Referring to FIG. 3, the operation of the data analysis apparatus 20 according to the present embodiment includes two phases, a learning phase and a prediction phase.
 まず、学習フェーズでは、データ分析装置20は、以下の動作を行う。入力部25Aは、複数の目的変数の実測値24Aと、当該実測値24Aに対応する複数の説明変数の実測値24Bを、通信I/F部21または操作入力部22から入力し、記憶部24に格納する(ステップS11)。 First, in the learning phase, the data analysis apparatus 20 performs the following operations. The input unit 25A inputs measured values 24A of a plurality of objective variables and measured values 24B of a plurality of explanatory variables corresponding to the measured values 24A from the communication I / F unit 21 or the operation input unit 22, and stores the storage unit 24. (Step S11).
 次に、予測ルール学習部25Bは、記憶部24から複数の目的変数の実測値24Aと複数の説明変数の実測値24Bを読み出し、共通予測ルール24D、個別予測ルール24Eおよびグループ別予測ルール24Fのすべてを同時に学習する(ステップS12)。 Next, the prediction rule learning unit 25B reads the measured values 24A of the plurality of objective variables and the measured values 24B of the plurality of explanatory variables from the storage unit 24, and sets the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F. All are learned simultaneously (step S12).
 一方、予測フェーズでは、データ分析装置20は、以下の動作を行う。まず、入力部25Aは、予測対象となる目的変数に対応する説明変数の実測値24Cを通信I/F部21または操作入力部22から入力し、記憶部24に格納する(ステップS21)。 On the other hand, in the prediction phase, the data analysis device 20 performs the following operations. First, the input unit 25A inputs the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted from the communication I / F unit 21 or the operation input unit 22, and stores it in the storage unit 24 (step S21).
 次に、予測値算出部25Cは、共通予測ルール24Dとグループ別予測ルール24Fを記憶部24から読み込み、予測対象となる目的変数に対応する説明変数の実測値24Cを入力して、予測したい目的変数毎の予測値を算出する(ステップS22)。 Next, the predicted value calculation unit 25C reads the common prediction rule 24D and the group-specific prediction rule 24F from the storage unit 24, inputs the actual measured value 24C of the explanatory variable corresponding to the target variable to be predicted, and performs the desired prediction A predicted value for each variable is calculated (step S22).
 次に、予測値算出部25Cは、予測値24G、共通予測ルール24D、個別予測ルール24E、グループ別予測ルール24Fのうちのユーザに選択されたものを、画面表示部23に出力し、あるいは、通信I/F部21を通じて外部に出力する(ステップS23)。 Next, the predicted value calculation unit 25C outputs, to the screen display unit 23, the one selected by the user from the predicted value 24G, the common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F, or Output to the outside through the communication I / F unit 21 (step S23).
 本実施形態のデータ分析装置20によると、目的変数が多いときでも、目的変数毎に学習される予測ルールをグルーピングすることによって算出されるグループ別予測ルール24Fを求めることによって、予測精度を大きく低下させることなく、予測ルールの数を減らすことができる。 According to the data analysis apparatus 20 of the present embodiment, even when there are many objective variables, the prediction accuracy is greatly reduced by obtaining the group-specific prediction rules 24F calculated by grouping the prediction rules learned for each objective variable. Without making it possible, the number of prediction rules can be reduced.
 次に、データ分析装置20の学習フェーズおよび予測フェーズにおける動作について、具体例に基づいてより詳細に説明する。以下では、下付きの添え字をアンダーラインを付して表現する。例えば、AをA_Bと表記する。また、上付きの添え字をハットを付して表現する。例えば、AをA^Bと表記する。 Next, operations in the learning phase and the prediction phase of the data analysis device 20 will be described in more detail based on specific examples. In the following, subscripts are expressed with an underline. For example, it referred to as A_B the A B. Also, superscripts are expressed with a hat. For example, A B is expressed as A ^ B.
(1)学習フェーズの詳細
[ステップS11]
 入力部25Aは、複数の目的変数の実測値24Aと複数の説明変数の実測値24Bを入力とする。入力された複数の目的変数の実測値24Aと複数の説明変数の実測値24Bを、それぞれ、X_ntとY_nt(n=1,2,…,N_t;t=1,…,T)とする。
(1) Details of the learning phase [Step S11]
The input unit 25A receives the measured values 24A of a plurality of objective variables and the measured values 24B of a plurality of explanatory variables as inputs. The input actual measurement values 24A and plural explanatory variables 24B are X_nt and Y_nt (n = 1, 2,..., N_t; t = 1,..., T), respectively.
 ここで、ベクトルX_ntは、目的変数の種類tのn番目の観測ベクトルを表すM次元の列ベクトルである。一方、Y_ntは、目的変数の種類tのn番目の実測値である。また、N_tは、目的変数の種類tの実測値の個数を表す。さらに、Tは、目的変数の種類の個数を表す。X_ntm(m=1,…,M)は、目的変数の種類tのn番目の観測ベクトルの説明変数mの実測値を表す。Mは、説明変数の個数を表す。行列X_tは、行ベクトルX_nt^{T}(n=1,2,…,N_t)を行ごとに整列させたN_t×Mサイズの行列を表す。ここで、^{T}は転置を表す。ベクトルY_tは、Y_nt(n=1,2,…,N_t)を行ごとに整列させたN_t×1サイズの列ベクトルを表す。 Here, the vector X_nt is an M-dimensional column vector representing the n-th observation vector of the target variable type t. On the other hand, Y_nt is the nth actually measured value of the target variable type t. N_t represents the number of actually measured values of the target variable type t. Further, T represents the number of types of objective variables. X_ntm (m = 1,..., M) represents an actual measurement value of the explanatory variable m of the n-th observation vector of the target variable type t. M represents the number of explanatory variables. The matrix X_t represents an N_t × M size matrix in which row vectors X_nt ^ {T} (n = 1, 2,..., N_t) are aligned for each row. Here, {T} represents transposition. The vector Y_t represents an N_t × 1 size column vector in which Y_nt (n = 1, 2,..., N_t) is aligned for each row.
[ステップS12]
 共通予測ルール24D、個別予測ルール24E、グループ別予測ルール24Fを、それぞれ、列ベクトルp_t、行列Q、行列Fによって表す。ここで、M次元の列ベクトルp_tはタスクtに対する共通予測ルールを表す。各予測ルール24D~24Fを算出するため方法は、次のとおりである。
[Step S12]
The common prediction rule 24D, the individual prediction rule 24E, and the group-specific prediction rule 24F are represented by a column vector p_t, a matrix Q, and a matrix F, respectively. Here, the M-dimensional column vector p_t represents a common prediction rule for the task t. A method for calculating the respective prediction rules 24D to 24F is as follows.
 行列Pは、共通予測ルールを示す行列であり、P=[p_1^{T};p_2^{T};…;p_T^{T}]で与えられるT×M行列である。共通予測ルールは、全てのタスクの目的変数に共通して関係する説明変数を示し、各説明変数が目的変数に及ぼす影響度合いは、タスクごとに異なるため、共通予測ルールはタスクごとに定義する。 The matrix P is a matrix indicating the common prediction rule, and is a T × M matrix given by P = [p_1 ^ {T}; p_2 ^ {T};...; P_T ^ {T}]. The common prediction rule indicates an explanatory variable that is commonly related to the objective variables of all tasks, and the degree of influence of each explanatory variable on the objective variable differs for each task. Therefore, the common prediction rule is defined for each task.
 行列Qは、個別予測ルール24Eを示す行列である。行列Qは、[q1q2…q_t…q_T]のM×Tサイズの行列を表す。ここで、ベクトルq_tは、M次元列ベクトルであり、タスクtの個別予測ルールを表す。 The matrix Q is a matrix indicating the individual prediction rule 24E. The matrix Q represents an M × T size matrix of [q1q2... Q_t... Q_T]. Here, the vector q_t is an M-dimensional column vector and represents an individual prediction rule for the task t.
 行列Fは、グループ別予測ルール24Fを示す行列である。行列Fは、M×Kサイズの行列[f1f2…f_k…f_K]を表す。ここで、ベクトルf_kは、M次元列ベクトルであり、k番目のグループ別予測ルールを表す。Kは、個別予測ルール24Eをグループに分けたときのグループの数を表す。 The matrix F is a matrix indicating the group-specific prediction rule 24F. The matrix F represents an M × K sized matrix [f1f2... F_k. Here, the vector f_k is an M-dimensional column vector and represents the kth group-specific prediction rule. K represents the number of groups when the individual prediction rule 24E is divided into groups.
 行列Gは、T×Kサイズの行列[g_1^{T};g_2^{T};…;g_t^{T};…;g_T^{T}]を表す。ベクトルg_tは、K次元列ベクトルである。ベクトルg_tは、目的変数の種類tの個別予測ルールが、いくつ目のグループに属するのかを表す。 The matrix G represents a T × K sized matrix [g_1 ^ {T}; g_2 ^ {T};...; G_t ^ {T}; ...; g_T ^ {T}]. The vector g_t is a K-dimensional column vector. The vector g_t represents to which group the individual prediction rule of the objective variable type t belongs.
 予測ルール学習部25Bは、ベクトルp_tおよび行列Q、F、Gを同時に学習する。具体的には、所定の目的関数を最小化することにより、ベクトルp_tおよび行列Q、F、Gの学習を行うようにしてもよい。 The prediction rule learning unit 25B learns the vector p_t and the matrices Q, F, and G at the same time. Specifically, the vector p_t and the matrices Q, F, and G may be learned by minimizing a predetermined objective function.
 予測ルール学習部25Bは、一例として、以下の式(1)で表される目的関数を用いることができる。 The prediction rule learning unit 25B can use an objective function represented by the following formula (1) as an example.
 Σ_t||X_t(p_t+Fg_t)-Y_t||^2
 +ρ_1||P||_(1,∞)
 +ρ_2||F||_1
 +ρ_3tr(PQ)
 +ρ_4tr(Q^{T}Q-2Q^{T}FG^{T}+GF^{T}FG^{T})
                     …(1)
Σ_t || X_t (p_t + Fg_t) −Y_t || ^ 2
+ Ρ_1 || P || _ (1, ∞)
+ Ρ_2 || F || _1
+ Ρ_3tr (PQ)
+ Ρ — 4tr (Q ^ {T} Q-2Q ^ {T} FG ^ {T} + GF ^ {T} FG ^ {T})
... (1)
 式(1)において、ρ_1、ρ_2、ρ_3、ρ_4は各項の影響度を調整するパラメータである。また、Σ_tは、tについての和を表す。 In Expression (1), ρ_1, ρ_2, ρ_3, and ρ_4 are parameters that adjust the degree of influence of each term. Σ_t represents the sum of t.
 式(1)の各項を導入した目的は次のとおりである。第1項は、予測ルールを用いた予測結果と実測値との誤差を小さくすることを目的とする。第2項は、共通予測ルールに関して、タスクに共通して予測に効く説明変数の種類数を減らすことを目的とする。第3項は、グループ別予測ルールに関して、予測に効く説明変数の種類数を減らすことを目的とする。第4項は、グループ別予測ルールと共通予測ルールとの間で予測に効く説明変数の種類が異なるようにすることを目的とする。第5項は、複数の個別予測ルールのうちの、互いに類似する予測ルールが同一のグループ別予測ルールに属するようにグループ化することを目的とする。ここで、第4項は、直接的には、個別予測ルールと共通予測ルールとの間で、予測に効く説明変数の種類が異なるようにする効果があるが、グループ別予測ルールは個別予測ルールから導き出されるため、個別予測ルールと共通予測ルールとの間で予測に効く説明変数の種類が異なれば、グループ別予測ルールと共予測ルールとの間でも、予測に効く説明変数の種類が異なるようになると考えられる。 The purpose of introducing each term of Equation (1) is as follows. The first term aims to reduce the error between the prediction result using the prediction rule and the actual measurement value. The second term is intended to reduce the number of types of explanatory variables that are effective for prediction in common with respect to tasks with respect to the common prediction rule. The third term is aimed at reducing the number of types of explanatory variables that are effective for prediction with respect to the group-specific prediction rule. The purpose of the fourth term is to make the types of explanatory variables effective for prediction different between the group-specific prediction rule and the common prediction rule. The purpose of the fifth term is to group so that prediction rules similar to each other among a plurality of individual prediction rules belong to the same group-specific prediction rule. Here, the fourth term is directly effective in making the types of explanatory variables effective for prediction differ between the individual prediction rule and the common prediction rule, but the group-specific prediction rule is an individual prediction rule. Therefore, if the type of explanatory variable that works for prediction differs between the individual prediction rule and the common prediction rule, the type of explanatory variable that works for prediction also differs between the group-specific prediction rule and the co-prediction rule. It is thought that it becomes.
 また、式(1)におけるノルムの定義は、次のとおりである。Wを、d次元の列ベクトルとすると、||W||_1=|w_1|+|w_2|+…+|w_d|である。ここで、w_dはベクトルWのd次元の値、|・|は絶対値を表す。また、||W||_∞=max(|w_1|,|w_2|,…,|w_d|)である。さらに、行列Aをd次元×T次元の行列とし、a^{i}をi番目の行ベクトルとすると、||A||_(1,∞)は||A||_(1,∞)=(Σ_{i=1}^{d}||a^{i}||_{∞})を示す。ここで、||a^{i}||_{∞}=max(|a^{i}_1|,|a^{i}_2|,…,|a^{i}_T|とする。 In addition, the definition of the norm in the formula (1) is as follows. If W is a d-dimensional column vector, || W || _1 = | w_1 | + | w_2 | + ... + | w_d |. Here, w_d represents a d-dimensional value of the vector W, and | · | represents an absolute value. Further, || W || _∞ = max (| w_1 |, | w_2 |,..., | W_d |). Further, if the matrix A is a d-dimensional × T-dimensional matrix and a ^ {i} is the i-th row vector, || A || _ (1, ∞) is || A || _ (1, ∞ ) = (Σ_ {i = 1} ^ {d} || a ^ {i} || _ {∞}). Here, || a ^ {i} || _ {∞} = max (| a ^ {i} _1 |, | a ^ {i} _2 |,..., | A ^ {i} _T |.
 予測ルール学習部25Bは、式(1)で与えられる目的関数を最小化する行列P、Q、F、Gを算出する。予測ルール学習部25Bは、一例として、非特許文献2に記載された凸最適化方法を用いることによって、式(1)を最小化する行列P、Q、F、Gを算出することができる。 The prediction rule learning unit 25B calculates matrices P, Q, F, and G that minimize the objective function given by Expression (1). As an example, the prediction rule learning unit 25B can calculate the matrices P, Q, F, and G that minimize Equation (1) by using the convex optimization method described in Non-Patent Document 2.
(2)予測フェーズの詳細
[ステップS21]
 入力部25Aは、予測対象となる目的変数tに対応する説明変数のn番目の実測値X’_ntを入力する。ベクトルX’_ntは、M次元ベクトルである。
(2) Details of prediction phase [step S21]
The input unit 25A inputs the nth actually measured value X′_nt of the explanatory variable corresponding to the target variable t to be predicted. The vector X′_nt is an M-dimensional vector.
[ステップS22]
 予測値算出部25Cは、以下の式(2)を用いて、X’_ntの目的変数の予測値Y’_ntを算出する。
[Step S22]
The predicted value calculation unit 25C calculates the predicted value Y′_nt of the objective variable of X′_nt using the following equation (2).
 Y’_nt=(p_t+Fg_t)^{T}X’_nt   (2) Y'_nt = (p_t + Fg_t) ^ {T} X'_nt (2)
[ステップS23]
 次に、予測値算出部25Cは、式(2)に基づいて算出した予測値Y_ntを出力する。
[Step S23]
Then, the predicted value calculation unit 25C outputs the predicted value Y '_nt calculated based on equation (2).
 なお、本発明において、下記の形態が可能である。
[形態1]
 上記第1の視点に係るデータ分析装置のとおりである。
[形態2]
 前記予測ルール学習部により学習された共通予測ルールおよびグループ別予測ルールと、前記第3の実測値を用いて、前記予測対象の目的変数の予測値を算出する予測値算出部を備える、形態1に記載のデータ分析装置。
[形態3]
 前記予測ルール学習部は、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するように該複数の予測ルールをグループ化するグループ化ルールをさらに学習する、形態2に記載のデータ分析装置。
[形態4]
 前記予測ルール学習部は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを含む所定の目的関数に基づいて、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態3に記載のデータ分析装置。
[形態5]
 前記予測ルール学習部は、前記所定の目的関数を凸最適化方法に基づいて最小化することにより、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態4に記載のデータ分析装置。
[形態6]
 前記所定の目的関数は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づく予測値と前記第1の実測値との誤差を小さくするための第1の項、前記共通予測ルールを学習するための第2の項、前記グループ別予測ルールを学習するための第3の項、前記グループ別予測ルールと前記共通予測ルールとの間で予測に効く説明変数の種類が異なるものとなるようにするための第4の項、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するようにするための第5の項のうちの少なくともいずれかの項を含む、形態4または5に記載のデータ分析装置。
[形態7]
 前記所定の目的関数は、前記第1の項ないし第5の項のうちの複数の項の重み付きの和である、形態6に記載のデータ分析装置。
[形態8]
 前記予測値算出部は、前記第3の実測値、ならびに、前記予測ルール学習部により学習された前記共通予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づいて、前記予測対象の目的変数の予測値を算出する、形態3ないし7のいずれか一に記載のデータ分析装置。
[形態9]
 上記第2の視点に係るデータ分析方法のとおりである。
[形態10]
 前記予測ルール学習部により学習された共通予測ルールおよびグループ別予測ルールと、前記第3の実測値を用いて、前記予測対象の目的変数の予測値を算出する予測値算出部を備える、形態9に記載のデータ分析方法。
[形態11]
 前記予測ルール学習部は、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するように該複数の予測ルールをグループ化するグループ化ルールをさらに学習する、形態10に記載のデータ分析方法。
[形態12]
 前記予測ルール学習部は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを含む所定の目的関数に基づいて、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態11に記載のデータ分析方法。
[形態13]
 前記予測ルール学習部は、前記所定の目的関数を凸最適化方法に基づいて最小化することにより、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態12に記載のデータ分析方法。
[形態14]
 前記所定の目的関数は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づく予測値と前記第1の実測値との誤差を小さくするための第1の項、前記共通予測ルールを学習するための第2の項、前記グループ別予測ルールを学習するための第3の項、前記グループ別予測ルールと前記共通予測ルールとの間で予測に効く説明変数の種類が異なるものとなるようにするための第4の項、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するようにするための第5の項のうちの少なくともいずれかの項を含む、形態12または13に記載のデータ分析方法。
[形態15]
 前記所定の目的関数は、前記第1の項ないし第5の項のうちの複数の項の重み付きの和である、形態14に記載のデータ分析方法。
[形態16]
 前記予測値算出部は、前記第3の実測値、ならびに、前記予測ルール学習部により学習された前記共通予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づいて、前記予測対象の目的変数の予測値を算出する、形態11ないし15のいずれか一に記載のデータ分析方法。
[形態17]
 上記第3の視点に係るプログラムのとおりである。
[形態18]
 前記予測ルール学習部により学習された共通予測ルールおよびグループ別予測ルールと、前記第3の実測値を用いて、前記予測対象の目的変数の予測値を算出する予測値算出部を備える、形態17に記載のプログラム。
[形態19]
 前記予測ルール学習部は、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するように該複数の予測ルールをグループ化するグループ化ルールをさらに学習する、形態18に記載のプログラム。
[形態20]
 前記予測ルール学習部は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを含む所定の目的関数に基づいて、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態19に記載のプログラム。
[形態21]
 前記予測ルール学習部は、前記所定の目的関数を凸最適化方法に基づいて最小化することにより、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、形態20に記載のプログラム。
[形態22]
 前記所定の目的関数は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づく予測値と前記第1の実測値との誤差を小さくするための第1の項、前記共通予測ルールを学習するための第2の項、前記グループ別予測ルールを学習するための第3の項、前記グループ別予測ルールと前記共通予測ルールとの間で予測に効く説明変数の種類が異なるものとなるようにするための第4の項、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するようにするための第5の項のうちの少なくともいずれかの項を含む、形態20または21に記載のプログラム。
[形態23]
 前記所定の目的関数は、前記第1の項ないし第5の項のうちの複数の項の重み付きの和である、形態22に記載のプログラム。
[形態24]
 前記予測値算出部は、前記第3の実測値、ならびに、前記予測ルール学習部により学習された前記共通予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づいて、前記予測対象の目的変数の予測値を算出する、形態19ないし23のいずれか一に記載のプログラム。
In the present invention, the following modes are possible.
[Form 1]
The data analysis apparatus according to the first aspect is as described above.
[Form 2]
A prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value is provided. The data analysis device described in 1.
[Form 3]
The prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group. The data analysis apparatus according to mode 2.
[Form 4]
The prediction rule learning unit, based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The data analysis device according to mode 3, wherein the group-specific prediction rule and the grouping rule are learned.
[Form 5]
The prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method. The data analysis device according to claim 4, wherein
[Form 6]
The predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value. , A second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group The data analysis device according to aspect 4 or 5, including at least one of the items.
[Form 7]
The data analysis apparatus according to mode 6, wherein the predetermined objective function is a weighted sum of a plurality of terms among the first term to the fifth term.
[Form 8]
The prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule. The data analysis device according to any one of Embodiments 3 to 7, which calculates a predicted value of an objective variable.
[Form 9]
The data analysis method according to the second viewpoint is as described above.
[Mode 10]
A ninth aspect includes a prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value. The data analysis method described in 1.
[Form 11]
The prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group. The data analysis method according to claim 10.
[Form 12]
The prediction rule learning unit, based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The data analysis method according to claim 11, wherein the group-specific prediction rule and the grouping rule are learned.
[Form 13]
The prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method. The data analysis method according to claim 12, wherein
[Form 14]
The predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value. , A second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group 14. The data analysis method according to form 12 or 13, comprising at least one of the items.
[Form 15]
15. The data analysis method according to claim 14, wherein the predetermined objective function is a weighted sum of a plurality of terms among the first term to the fifth term.
[Form 16]
The prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule. The data analysis method according to any one of forms 11 to 15, wherein a predicted value of the objective variable is calculated.
[Form 17]
The program is related to the third viewpoint.
[Form 18]
A mode 17 including a prediction value calculation unit that calculates a prediction value of the target variable to be predicted using the common prediction rule and the group-specific prediction rule learned by the prediction rule learning unit and the third actually measured value. The program described in.
[Form 19]
The prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group. , The program according to Form 18.
[Mode 20]
The prediction rule learning unit, based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The program according to mode 19, which learns group-specific prediction rules and the grouping rules.
[Form 21]
The prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method. The program according to claim 20, wherein
[Form 22]
The predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value. , A second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group The program according to the form 20 or 21, including at least one of the items.
[Form 23]
The program according to the form 22, wherein the predetermined objective function is a weighted sum of a plurality of terms among the first term to the fifth term.
[Form 24]
The prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule. The program according to any one of forms 19 to 23, which calculates a predicted value of an objective variable.
 なお、上記非特許文献1、2の全開示内容は、本書に引用をもって繰り込み記載されているものとする。本発明の全開示(請求の範囲を含む)の枠内において、さらにその基本的技術思想に基づいて、実施形態の変更・調整が可能である。また、本発明の請求の範囲の枠内において種々の開示要素(各請求項の各要素、各実施形態の各要素、各図面の各要素等を含む)の多様な組み合わせ、ないし、選択が可能である。すなわち、本発明は、請求の範囲を含む全開示、技術的思想にしたがって当業者であればなし得るであろう各種変形、修正を含むことは勿論である。特に、本書に記載した数値範囲については、当該範囲内に含まれる任意の数値ないし小範囲が、別段の記載のない場合でも具体的に記載されているものと解釈されるべきである。 It should be noted that all the disclosed contents of Non-Patent Documents 1 and 2 are incorporated herein by reference. Within the scope of the entire disclosure (including claims) of the present invention, the embodiment can be changed and adjusted based on the basic technical concept. Further, various combinations or selections of various disclosed elements (including each element of each claim, each element of each embodiment, each element of each drawing, etc.) are possible within the scope of the claims of the present invention. It is. That is, the present invention of course includes various variations and modifications that could be made by those skilled in the art according to the entire disclosure including the claims and the technical idea. In particular, with respect to the numerical ranges described in this document, any numerical value or small range included in the range should be construed as being specifically described even if there is no specific description.
10、20  データ分析装置
14、24  記憶部
14A  第1の実測値
14B  第2の実測値
14C  第3の実測値
14D、24D  共通予測ルール
14E、24E  個別予測ルール
14F、24F  グループ別予測ルール
14G  予測値
15B、25B  予測ルール学習部
15C、25C  予測値算出部
21  通信I/F部
22  操作入力部
23  画面表示部
24A  目的変数の実測値
24B  説明変数の実測値
24C  予測対象の目的変数に対応する説明変数の実測値
24G  予測値
25  プロセッサ
25A  入力部
10, 20 Data analysis device 14, 24 Storage unit 14A First measured value 14B Second measured value 14C Third measured value 14D, 24D Common prediction rule 14E, 24E Individual prediction rule 14F, 24F Group-specific prediction rule 14G Prediction Value 15B, 25B Prediction rule learning unit 15C, 25C Predicted value calculation unit 21 Communication I / F unit 22 Operation input unit 23 Screen display unit 24A Target variable actual value 24B Explanation variable actual value 24C Corresponds to target variable to be predicted Measured value of explanatory variable 24G Predicted value 25 Processor 25A Input section

Claims (10)

  1.  マルチタスク型のデータ分析装置であって、
     複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を保持する記憶部と、
     前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習する予測ルール学習部と、を備える、データ分析装置。
    A multitasking data analysis device,
    A first actual measurement value that is an actual measurement value of a plurality of objective variables; a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables; and an explanatory variable that corresponds to an objective variable to be predicted. A storage unit that holds a third actual measurement value that is an actual measurement value of
    Using the first actual measurement value and the second actual measurement value, a common prediction rule that is a prediction rule represented by an explanatory variable commonly related to the plurality of objective variables, and an explanation related to each objective variable Prediction rule learning that learns individual prediction rules composed of prediction rules for each objective variable represented by variables and prediction rules for each group when the prediction rules included in the individual prediction rules are grouped A data analysis device.
  2.  前記予測ルール学習部により学習された共通予測ルールおよびグループ別予測ルールと、前記第3の実測値を用いて、前記予測対象の目的変数の予測値を算出する予測値算出部を備える、請求項1に記載のデータ分析装置。 The prediction value calculation part which calculates the prediction value of the objective variable of the prediction object using the common prediction rule and group-specific prediction rule learned by the prediction rule learning part, and the 3rd actual measurement value is provided. The data analysis apparatus according to 1.
  3.  前記予測ルール学習部は、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するように該複数の予測ルールをグループ化するグループ化ルールをさらに学習する、請求項2に記載のデータ分析装置。 The prediction rule learning unit further learns a grouping rule for grouping the plurality of prediction rules so that prediction rules similar to each other among the plurality of prediction rules included in the individual prediction rule belong to the same group. The data analysis apparatus according to claim 2.
  4.  前記予測ルール学習部は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを含む所定の目的関数に基づいて、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、請求項3に記載のデータ分析装置。 The prediction rule learning unit, based on a predetermined objective function including the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule, the common prediction rule, the individual prediction rule, The data analysis apparatus according to claim 3, wherein the group-specific prediction rule and the grouping rule are learned.
  5.  前記予測ルール学習部は、前記所定の目的関数を凸最適化方法に基づいて最小化することにより、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールを学習する、請求項4に記載のデータ分析装置。 The prediction rule learning unit learns the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule by minimizing the predetermined objective function based on a convex optimization method. The data analysis apparatus according to claim 4.
  6.  前記所定の目的関数は、前記共通予測ルール、前記個別予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づく予測値と前記第1の実測値との誤差を小さくするための第1の項、前記共通予測ルールを学習するための第2の項、前記グループ別予測ルールを学習するための第3の項、前記グループ別予測ルールと前記共通予測ルールとの間で予測に効く説明変数の種類が異なるものとなるようにするための第4の項、前記個別予測ルールに含まれる複数の予測ルールのうちの互いに類似する予測ルールが同一のグループに属するようにするための第5の項のうちの少なくともいずれかの項を含む、請求項4または5に記載のデータ分析装置。 The predetermined objective function includes a first prediction function for reducing an error between a predicted value based on the common prediction rule, the individual prediction rule, the group-specific prediction rule, and the grouping rule and the first actual measurement value. , A second term for learning the common prediction rule, a third term for learning the group-specific prediction rule, an explanation effective for prediction between the group-specific prediction rule and the common prediction rule A fourth term for making the types of variables different, and a fifth term for making prediction rules similar to each other among a plurality of prediction rules included in the individual prediction rule belong to the same group The data analysis device according to claim 4, comprising at least one of the following items.
  7.  前記所定の目的関数は、前記第1の項ないし第5の項のうちの複数の項の重み付きの和である、請求項6に記載のデータ分析装置。 The data analysis apparatus according to claim 6, wherein the predetermined objective function is a weighted sum of a plurality of terms among the first term to the fifth term.
  8.  前記予測値算出部は、前記第3の実測値、ならびに、前記予測ルール学習部により学習された前記共通予測ルール、前記グループ別予測ルール、および、前記グループ化ルールに基づいて、前記予測対象の目的変数の予測値を算出する、請求項3ないし7のいずれか1項に記載のデータ分析装置。 The prediction value calculation unit is configured to determine the prediction target based on the third actual measurement value, the common prediction rule learned by the prediction rule learning unit, the group-specific prediction rule, and the grouping rule. The data analysis apparatus according to claim 3, wherein a predicted value of the objective variable is calculated.
  9.  コンピュータがマルチタスク型のデータ分析を行うデータ分析方法であって、
     前記コンピュータが、複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を記憶部に保持する工程と、
     前記記憶部から読み出された前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習して前記記憶部に記録する工程と、を含む、データ分析方法。
    A data analysis method in which a computer performs multitasking data analysis,
    The computer uses a first actual measurement value that is an actual measurement value of a plurality of objective variables, a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables, and an objective variable to be predicted. Holding a third actual measurement value, which is an actual measurement value of the corresponding explanatory variable, in the storage unit;
    A common prediction rule, which is a prediction rule represented by explanatory variables related to the plurality of objective variables in common, using the first actual measurement value and the second actual measurement value read from the storage unit; By group, comprising prediction rules for each objective variable represented by explanatory variables related to each objective variable, and prediction rules for each group when the prediction rules included in the individual prediction rules are grouped Learning a prediction rule and recording it in the storage unit.
  10.  マルチタスク型のデータ分析をコンピュータに実行させるプログラムであって、
     複数の目的変数の実測値である第1の実測値と、前記複数の目的変数に対応する複数の説明変数の実測値である第2の実測値と、予測対象の目的変数に対応する説明変数の実測値である第3の実測値を記憶部に保持する処理と、
     前記記憶部から読み出された前記第1の実測値と前記第2の実測値を用いて、前記複数の目的変数に共通して関係する説明変数によって表される予測ルールである共通予測ルールと、各目的変数に関係する説明変数によって表される目的変数別の予測ルールから成る個別予測ルールと、前記個別予測ルールに含まれる予測ルールをグループ化したときの各グループに対する予測ルールから成るグループ別予測ルールを学習して前記記憶部に記録する処理と、を前記コンピュータに実行させる、プログラム。
    A program that causes a computer to perform multitasking data analysis,
    A first actual measurement value that is an actual measurement value of a plurality of objective variables; a second actual measurement value that is an actual measurement value of a plurality of explanatory variables corresponding to the plurality of objective variables; and an explanatory variable that corresponds to an objective variable to be predicted. A process of holding a third actual measurement value, which is an actual measurement value, in the storage unit;
    A common prediction rule, which is a prediction rule represented by explanatory variables related to the plurality of objective variables in common, using the first actual measurement value and the second actual measurement value read from the storage unit; By group, comprising prediction rules for each objective variable represented by explanatory variables related to each objective variable, and prediction rules for each group when the prediction rules included in the individual prediction rules are grouped The program which makes the said computer perform the process which learns a prediction rule and records it on the said memory | storage part.
PCT/JP2015/062123 2014-04-24 2015-04-21 Data analysis device, data analysis method, and program WO2015163322A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2016514948A JPWO2015163322A1 (en) 2014-04-24 2015-04-21 Data analysis apparatus, data analysis method and program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410167977.2A CN105095616A (en) 2014-04-24 2014-04-24 Data analysis device and data analysis method
CN201410167977.2 2014-04-24

Publications (1)

Publication Number Publication Date
WO2015163322A1 true WO2015163322A1 (en) 2015-10-29

Family

ID=54332486

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/062123 WO2015163322A1 (en) 2014-04-24 2015-04-21 Data analysis device, data analysis method, and program

Country Status (3)

Country Link
JP (1) JPWO2015163322A1 (en)
CN (1) CN105095616A (en)
WO (1) WO2015163322A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220013881A (en) * 2020-07-27 2022-02-04 옴니어스 주식회사 Method, system and non-transitory computer-readable recording medium for recognizing attributes of products by using multi task learning

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3779835A4 (en) * 2018-03-27 2021-12-29 Culture Convenience Club Co., Ltd. Device, method and program for analyzing customer attribute information
CN115203180A (en) * 2022-05-16 2022-10-18 北京航空航天大学 Data blood relationship generation method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013519138A (en) * 2010-02-01 2013-05-23 グーグル インコーポレイテッド Join embedding for item association

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013519138A (en) * 2010-02-01 2013-05-23 グーグル インコーポレイテッド Join embedding for item association

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TOMOAKI IWASE ET AL.: "Simultaneous Prediction of Multiple Physical Properties Using Multi- task Learning", IEICE TECHNICAL REPORT, vol. 113, no. 476, 27 February 2014 (2014-02-27), pages 9 - 13 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220013881A (en) * 2020-07-27 2022-02-04 옴니어스 주식회사 Method, system and non-transitory computer-readable recording medium for recognizing attributes of products by using multi task learning
KR102584289B1 (en) * 2020-07-27 2023-10-05 옴니어스 주식회사 Method, system and non-transitory computer-readable recording medium for recognizing attributes of products by using multi task learning

Also Published As

Publication number Publication date
CN105095616A (en) 2015-11-25
JPWO2015163322A1 (en) 2017-04-20

Similar Documents

Publication Publication Date Title
US10324453B2 (en) Space for materials selection
JP6843882B2 (en) Learning from historical logs and recommending database operations for data assets in ETL tools
JP6182242B1 (en) Machine learning method, computer and program related to data labeling model
JP7405775B2 (en) Computer-implemented estimating methods, estimating devices, electronic equipment and storage media
JP2017091329A (en) Database analysis device and database analysis method
WO2015163322A1 (en) Data analysis device, data analysis method, and program
JP6696568B2 (en) Item recommendation method, item recommendation program and item recommendation device
US11804283B2 (en) Predicting molecular properties of molecular variants using residue-specific molecular structural features
KR102022944B1 (en) Method for generating big data analysis report automatically and device performing the same
US20160371596A1 (en) Evaluation system, evaluation method, and computer-readable storage medium
US9792706B2 (en) Graph processing system, graph processing method, and non-transitory computer readable medium
US20120253998A1 (en) Cost calculation device, cost calculation method, and cost calculation program
JP5584917B2 (en) Data analysis system and data analysis method
JP6290716B2 (en) Environmental impact assessment system and method
US8805809B2 (en) Autotransform system
JP6151435B2 (en) Warehouse profit simulation device
JP2020205026A (en) Information processor, information processing system, model learning method
JP4419814B2 (en) Service quality evaluation support equipment
Su et al. On time-varying factor models: estimation and testing
US20180253515A1 (en) Characterizing model performance using hierarchical feature groups
US20210004737A1 (en) Journey Mapping
JP2018180711A (en) Calculator selecting test case and selection method of test case
JP6938052B2 (en) Analytical method of claims and its equipment
US10454497B1 (en) Reducing a size of multiple data sets
JPWO2020065876A1 (en) Assortment support device, assortment support method, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15783067

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2016514948

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15783067

Country of ref document: EP

Kind code of ref document: A1