WO2014081012A1 - Data analysis assistance processing system and method - Google Patents

Data analysis assistance processing system and method Download PDF

Info

Publication number
WO2014081012A1
WO2014081012A1 PCT/JP2013/081513 JP2013081513W WO2014081012A1 WO 2014081012 A1 WO2014081012 A1 WO 2014081012A1 JP 2013081513 W JP2013081513 W JP 2013081513W WO 2014081012 A1 WO2014081012 A1 WO 2014081012A1
Authority
WO
WIPO (PCT)
Prior art keywords
analysis
data
visualization
processing
input data
Prior art date
Application number
PCT/JP2013/081513
Other languages
French (fr)
Japanese (ja)
Inventor
正裕 本林
古川 直広
中野 定樹
Original Assignee
株式会社日立製作所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社日立製作所 filed Critical 株式会社日立製作所
Publication of WO2014081012A1 publication Critical patent/WO2014081012A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/20Drawing from basic elements, e.g. lines or circles
    • G06T11/206Drawing of charts or graphs

Definitions

  • the present invention relates to a data analysis support processing system and method, and relates to a technology for supporting data analysis.
  • Patent Literature 1 Japanese Patent Application Laid-Open No. 2004-151561 discloses a technique that makes it easy to select an analysis method by extracting registered analysis setting information using information obtained by abstracting data as a key, and using the extracted information.
  • Patent Document 2 discloses a technology that supports the execution of a composite analysis in which a plurality of analyzes are combined using a past analysis history.
  • An object of the present invention is to provide a data analysis support method and system capable of solving the above-described problems and automatically constructing a process for interpolating between the input data and the result output / visualized image. There is to do.
  • This data analysis support processing method is A data analysis support processing method in a system that executes predetermined analysis processing on input data and visualizes the processing result, When the identification information of the input data selected by the user and the identification information of the visualization method are input via the input device, the setting item candidates for visualization predicted from the identification information of the input data are displayed. Thus, by specifying the input data and the result output / visualization image, it is possible to automatically predict and display the setting items for visualization that interpolate between them.
  • the analysis processing is composed of a combination of predetermined analysis processing units, With respect to the analysis processing history and the visualization history for the input data, an analysis pattern indicating a transition of the analysis processing unit is analyzed, and an analysis processing candidate for the selected input data is predicted according to the analysis pattern. Thereby, the candidate of the setting item for a highly appropriate analysis process and visualization can be estimated.
  • the setting item for visualization is data to be displayed in a table column or data to be a graph axis. Thereby, it is possible to present support information that allows the user to easily select setting items for data to be displayed in the columns of the table and data to be graph axes.
  • a data analysis support method and system capable of automatically constructing a process for interpolating between input data and result output / visualized image. Can do.
  • FIG. 1 It is a figure which shows the image of visualization template preparation of 1st embodiment. It is a figure which shows an example of the data structure of the visualization template of 1st embodiment. It is a figure which shows an example of the visualization template of 1st embodiment. It is a visualization content editing support information generation image (1) when “bar graph” is selected as the visualization component of the first embodiment. It is a visualization content editing support information generation image (2) when “bar graph” is selected as the visualization component of the first embodiment. It is an example of the flowchart which shows the procedure in which the visualization content edit assistance information of S206 of 1st embodiment is set to a table
  • FIG. 1 is a block diagram of the analysis support system of the present embodiment.
  • the analysis support system (computer system) described in FIG. 1 includes a server 101, a computer 102, a display 103, an input device 104, networks 105 and 106, and a database 1004 (DB) 107.
  • the server 101 and the computer 102 are connected to each other via the network 105, and the server 101 and the DB 107 are connected to each other via the network 106.
  • the server 101 and the computer 102 are used when the user 100 performs analysis work. Furthermore, functions related to the analysis work are provided, a history of function execution is collected, and a template described later is extracted and recommended according to the situation to assist the user 100 in the analysis work.
  • the server 101 and the computer 102 can use a general PC as an example.
  • the server 101 and the computer 102 include a processor, a memory, and an interface.
  • the processor executes various processes by processing a program stored in the memory.
  • the memory stores a program and data for executing processing.
  • the interface is connected to the input device 104 such as a keyboard and a mouse, connected to the display 103, connected to the server 101 and the computer 102 via the network 105, and connected to the server 101 via the network 106. And those connected to the DB 107 and the like.
  • the DB 107 is a database that holds various data such as information related to companies, various statistical data, time-series data such as sensors, and Web access logs.
  • the DB 107 may be configured to be included in the server 101, or may be configured to be stored in an external storage device and connected to the server 101 via the network 106.
  • the server 101 includes, for example, an input data creation unit 111, an analysis processing execution unit 112, a data visualization unit 113, a processing procedure recording unit 114, a processing construction unit 115, a processing procedure execution unit 116, and a template recommendation unit (processing procedure recommendation unit) 117. , A processing procedure analysis unit 118, a template DB 119, and a processing procedure DB 120.
  • the input data creation unit 111, the analysis process execution unit 112, the data visualization unit 113, the process procedure recording unit 114, the process construction unit 115, the process procedure execution unit 116, the template recommendation unit 117, and the process procedure analysis unit 118 are, for example, programs The function of each unit is realized by being stored in the memory and executed by the processor.
  • the input data creation unit 111 extracts desired data from the DB 107 in accordance with an instruction from the user 100 or the processing procedure execution unit 116, and performs input data creation processing on the extracted data to create analysis target data.
  • the analysis processing execution unit 112 performs data processing on the analysis target data created by the input data creation unit 111 according to an instruction from the user 100 or the processing procedure execution unit 116, and creates processing result data.
  • the data visualization unit 113 performs visualization processing on the processing result data created by the analysis processing execution unit 112 according to an instruction from the user 100 or the processing procedure execution unit 116, and visualizes the processing result data.
  • the processing procedure recording unit 114 records each processing of the input data creation unit 111, the analysis processing execution unit 112, and the data visualization unit 113 in the processing procedure DB 120.
  • the process construction unit 115 creates visualized content editing support information and constructs an analysis process.
  • the processing procedure execution unit 116 retrieves the processing procedure from the processing procedure DB 120 according to the instruction of the user 100, and instructs the input data creation unit 111, the analysis processing execution unit 112, and the data visualization unit 113 according to the contents of the processing procedure to perform analysis processing.
  • the processing procedure recommendation unit 117 extracts a processing procedure from the processing procedure DB 120 based on a predetermined criterion, and presents the processing procedure to the user 100 using an appropriate output device such as a display.
  • the computer 102 includes an input data selection unit 121, a visualization template editing unit 122, and an analysis processing procedure creation unit 123.
  • the input data selection unit 121 displays a dialog for selecting input data in accordance with an instruction from the user 100 and holds the selection result of the user.
  • the visualization template editing unit 122 displays a dialog for editing the visualization template in accordance with an instruction from the user 100 and holds the editing result of the user.
  • the analysis processing procedure creation unit 123 displays a dialog for creating an analysis processing procedure in accordance with an instruction from the user 100, and holds the analysis processing procedure created by the user.
  • Each dialog is displayed on the display 103, for example, and each instruction from the user, a selection result, an editing result, and the like are input using the input device 104.
  • Each unit of the server 101 and the computer 102 may be configured by one apparatus or may be appropriately distributed.
  • the visualization template editing unit 122 and the analysis processing procedure creation unit 123 are collectively referred to as a data analysis support processing unit.
  • the configuration of the above-described analysis support system is not limited to the first embodiment, and can be applied to other embodiments.
  • FIG. 2 is a flowchart illustrating an example of a procedure according to the first embodiment. This flowchart schematically shows the operation of the entire system including the computer 102 and the server 101. More details will be described later.
  • the computer 102 for example, the analysis processing procedure creation unit 123) displays the analysis processing procedure creation dialog shown in FIG. 3-A on the display 103 (S202). ).
  • the computer 102 (for example, the analysis processing procedure creation unit 123) causes the user 100 to select an analysis processing unit such as “data selection”, “aggregation”, and “filtering” in the analysis processing unit selection unit 301A using the input device 104.
  • the analysis processing procedure to be created is sent to the server 101.
  • the server 101 for example, the analysis process execution unit 112) executes the analysis process according to the received analysis process procedure, and sends the execution result to the computer 102 (S203).
  • the computer 102 displays a visualized content creation dialog shown in FIG.
  • the computer 102 uses the input device 104 to select an item included in the visualization component selection unit 301B and adds it to the visualization content display unit 302B
  • the computer 102 creates visualization content (S204).
  • the visualized content for example, the execution result of the received analysis process is visualized according to the selected item (for example, a table or a graph).
  • the computer 102 displays the visualized content as a result of S204 on the display 103 (S209).
  • the server (for example, the processing procedure recording unit 114) records the analysis processing procedure and the visualization processing procedure in the processing procedure DB 120 (S210).
  • analysis processing procedure creation dialog An example of the analysis processing procedure creation dialog is shown in FIG.
  • a user creates an analysis processing procedure
  • an analysis processing procedure is created by selecting items included in the analysis processing unit selection unit 301A and adding them to the analysis processing unit sequence display unit 302A.
  • the processing procedure recommendation unit 117 can extract a processing procedure from the processing procedure DB 120 according to the creation status at the start of analysis processing procedure creation, or can recommend it to the user.
  • the analysis processing procedure refers to a series of analysis processing units (for example, the minimum unit of analysis processing, which may be an appropriate processing unit).
  • the analysis processing unit indicates processing such as “data selection”, “aggregation”, “filtering”, “calculation”, and “editing” as displayed in the analysis processing unit selection unit 301A.
  • FIG. 2 An example of the visualization content creation dialog is shown in FIG.
  • the visualization content is created by selecting an item (visualization component) included in the visualization component selection unit 301B and adding it to the visualization content display unit 302B.
  • the processing procedure recommendation unit 117 can recommend the visualization component to the user in accordance with the analysis processing execution result obtained as a result of S203 from the processing procedure DB 120 when the visualization content creation starts or in the middle.
  • the analysis support function of this embodiment is used (S201)
  • the computer 102 displays, for example, an input data selection dialog shown in FIG. 5 and a visualized content edit dialog exemplified in FIG. 6-A on the display 103. (S205).
  • the computer 102 sends the data selected by the user in the input data selection dialog using the input device 104 and the visualized part information created in the visualization content editing dialog to the server 101, and the processing construction unit 115 of the server 101 selects the selected data.
  • visualization content editing support information for example, setting item candidates for visualization
  • Edit the visualized content by operating the input device 104 by the user using the parameter setting dialog related to the table illustrated in FIG. 6B or the parameter setting dialog related to the graph illustrated in FIG. 6C and the visualized content editing support information.
  • a desired setting item is selected from the displayed setting item candidates.
  • the computer 102 sends the editing result to the server 101, and the processing construction unit 115 constructs an analysis process based on the selection data and the visualization content editing result (S207).
  • the analysis process execution unit 112 executes the constructed analysis process and sends the execution result to the computer 102 (S208).
  • the visualization content as a result of S208 is displayed on the display 103 (S209).
  • the visualization content here displays, for example, the processing result according to the specified visualization component information and setting items.
  • the processing procedure recording unit 114 records the analysis processing procedure and the visualization processing procedure in the processing procedure DB 120 (S210).
  • Fig. 4-A shows an example of the data structure and data of the analysis processing procedure
  • Fig. 4-B shows an example of the data structure and data of the visualization processing procedure. Such data is recorded in the processing procedure DB 120.
  • Fig. 5 shows the input data selection dialog.
  • the data displayed in the input data list display area 501 is selected by the user 100 using the input device 104, and the selection of the input data is determined by pressing the OK button 502.
  • Fig. 6-A is an example of a visualization content edit dialog.
  • the user edits the visualized content, the user selects the item included in the visualized part selecting unit 601A using the input device 104 and adds the selected item to the visualized content display unit 602A, thereby editing the visualized content.
  • the computer 102 displays a table parameter setting dialog (FIG. 6B) for setting parameters related to the table, and when any one of the graphs is selected, the computer 102 102 displays a graph parameter setting dialog (FIG. 6C) for setting parameters relating to the graph.
  • the table parameter setting dialog illustrated in FIG. 6B includes, for example, a front side column candidate list display unit (601B), a front side column edit unit (602B), a front side column pattern list display unit (603B), and a front side column.
  • a candidate list display section (604B) and a selected column display section for head front column (605B) are included.
  • an OK button (606B) and a cancel button (608B) are included.
  • the graph parameter setting dialog illustrated in FIG. 6C includes, for example, a viewpoint list display unit (601C), a viewpoint editing unit (602C), an X axis pattern list display unit (603C), and an X axis candidate list display unit ( 604C), an X-axis pattern editing unit (605C), a Y-axis pattern list display unit (606C), a Y-axis candidate list display unit (607C), and a Y-axis pattern editing unit (608C). Also, an OK button (609C) and a cancel button (610C) are included.
  • FIG. 7A, FIG. 7B, FIG. 7C, and FIG. 7D create data to be displayed in each part of the table parameter setting dialog (FIG. 6B) and the graph parameter setting dialog (FIG. 6C).
  • FIG. 8-A shows the procedure for.
  • the analysis pattern is a set of two or more analysis processing units included in succession in an analysis processing procedure (for example, 801 in FIG. 8-A) as shown by 802 in FIG. 8-A.
  • a description will be given as a set of two consecutive analysis processing units.
  • analysis processing units are arranged in the order of processing.
  • the processing procedure analysis unit 118 acquires an unprocessed analysis processing procedure M related to the analysis pattern creation from the processing procedure DB 120 (S701).
  • the processing procedure analysis unit 118 takes out a set of the Nth (N is an integer of 1 or more) analysis processing unit and the N + 1th analysis processing unit of the analysis processing procedure M, and creates an analysis pattern (S702).
  • the processing procedure analysis unit 118 repeats S701 and S702 until there is no unprocessed analysis processing procedure in the processing procedure DB 120 (S703).
  • the processing procedure analysis unit 118 obtains a probability (transition probability) that the created analysis pattern appears in the analysis processing procedure.
  • the transition probability indicates the probability that the analysis processing unit serving as the end point of the analysis pattern is executed next to the analysis processing unit serving as the starting point of the analysis pattern.
  • the processing procedure analysis unit 118 may store the analysis pattern in the template DB 119.
  • FIG. 8-A shows an image of creating an analysis pattern.
  • the analysis processing procedures A to E (801) are processed, and eight or more analysis patterns (802) are created.
  • 803 in FIG. 8A is an example of the analysis processing unit, and the numbers in the circles 801 and 802 correspond to the numbers in the table.
  • FIG. 8-B shows an example of the analysis pattern data structure
  • FIG. 8-C shows an example of the analysis pattern data.
  • the visualization template indicates the relationship between the visualization process (901 in FIG. 9-A, the data structure and the example of the content is 904 in FIG. 9-A) and the analysis process result data (902 in FIG. 9-A). It is data.
  • the processing procedure analysis unit 118 acquires the analysis processing procedure group G (905 in FIG. 9A) and the unprocessed visualization processing procedure M (904 in FIG. 9A) regarding the creation of the visualization template from the processing procedure DB 120 (S704). .
  • the processing procedure analysis unit 118 acquires the analysis processing procedure Q corresponding to the analysis processing ID of the visualization processing procedure M from the analysis processing procedure group G (S705).
  • a set of visualization components of the last analysis processing unit N of the analysis processing procedure Q and visualization processing procedure M is created, and a visualization template T is created (S706).
  • the visualization template T includes information shown in FIG. 9-B, for example.
  • the processing procedure analysis unit 118 generalizes the selected column of the visualization template T and updates the visualization template T (S707). S704 to S707 are repeated until there is no unprocessed analysis processing procedure in the processing procedure DB 120 (S708). Note that the processing procedure analysis unit 118 may store the visualization template T in the template DB 119.
  • Fig. 9-A shows the image of creating the visualization template
  • Fig. 9-B shows an example of the data structure of the visualization template
  • Fig. 9-C shows an example of the visualization template.
  • the visualization template includes at least immediately preceding processing (902 in FIG. 9-A), selection data information (information such as column names and column elements of the input data selected in 903 in FIG. 9-A), and processing result data information ( Information such as data string names and data elements at the time of executing 902 in FIG. 9A.
  • FIG. 7C shows an example of a procedure for generating visualization content editing support information applicable to selection data and visualization component information selected by the user.
  • the process construction unit 115 acquires an analysis pattern and a visualization template from the template DB 119 (S711).
  • the processing construction unit 115 generates visualization content editing support information applicable to the selection data and visualization component information (S712). Details will be described later.
  • the processing construction unit 115 sends the visualized content editing support information rearranged according to a predetermined criterion to the computer 102 (713).
  • Fig. 7-D shows the detailed procedure of S712.
  • the predetermined criterion is “in order of increasing transition probability”. However, for example, “in order of decreasing number of analysis processing units from“ input data selection ”to visualization processing” may be used. Other criteria may be used. These can be switched by a setting file or the like.
  • J and K are parameters
  • N is an analysis processing unit
  • L and L0 are lists
  • M is an analysis processing unit to be processed or a visualization process.
  • the determination of whether or not it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
  • the number of rows of data D1 as a result of applying analysis processing unit M to data D with parameter P is zero.
  • FIGS. 10A and 10B show visualization content editing support information generation images when “bar graph” is selected as the visualization component.
  • the visualization content editing support information is used to construct a processing procedure for filling a space between “input data selection” and “bar graph” based on a predetermined standard using an analysis pattern and a visualization template.
  • Information For example, the processing construction unit 115 applies the selected input data (here, the processing unit identification number “1”) to the selected visualization component (here, a bar graph) based on the analysis pattern and the visualization template acquired from the template DB 119.
  • One or a plurality of processing procedure candidates are obtained.
  • a process procedure candidate is obtained by following the process transition indicated by the analysis pattern and the transition from the final analysis processing unit indicated by the visualization template to the visualization part.
  • a plurality of processing procedure candidates can be arranged (ranked) in accordance with a predetermined criterion such as an order of increasing transition probability. The same is true when a component other than the bar graph is selected as the visualization component.
  • FIG. 11 is a flowchart showing a procedure in which the visualized content editing support information in S206 is set in the table or graph parameter setting dialog.
  • the computer 102 acquires the selection result (selected visualization component) of the visualization component selection unit 601A in the visualization content editing dialog, and sends it to the server 101 together with the selection data (selected input data) (S2061).
  • the visualization template editing unit 122 receives the visualization content editing support information created and sent by the process construction unit 115 of the server 101 in S7121 to S7134 (S2062).
  • the visualization template editing unit 122 prepares a parameter setting dialog for a table or a graph (see, for example, FIG. 6-B and FIG. 6-C) according to the selected visualization part, and sets a value in each list display unit ( S2063).
  • the visualization template editing unit 122 acquires the selection result of each list display unit, and proceeds to S207 (S2064).
  • the 12-A and 12-B show an image of setting content editing support information in the graph parameter setting dialog in S2063.
  • the server 101 executes the processing of S7121 to S7134 to generate an analysis processing procedure in which the display component becomes “bar graph”, a series group of visualization templates, and visualization content editing support Information.
  • the viewpoint candidate list display unit 601C displays the viewpoint selection column of the visualization template and the columns included in the analysis processing result data until immediately before the visualization processing (12A).
  • the X axis pattern list display section 603C displays the X axis pattern of the selected column of the visualization template (12B).
  • the X-axis candidate list display unit 604C displays columns included in the analysis process result data up to immediately before the visualization process (12C).
  • the Y axis pattern list display section 606C displays the Y axis pattern of the selected column of the visualization template (12D).
  • the Y-axis candidate list display unit 607C displays columns included in the analysis process result data up to immediately before the visualization process (12E).
  • 13A and 13B show images for setting content editing support information in the table parameter setting dialog in S2063.
  • the server 101 executes the processing of S7121 to S7134 to generate an analysis processing procedure in which the display component becomes “table”, a series of visualization templates, and visualization content editing support Information.
  • the front side column candidate list display unit 601B displays the front side selected columns of the visualization template and the columns included in the analysis processing result data until immediately before the visualization processing (13A).
  • the front head row pattern list display section 603B displays the front head pattern of the selected column of the visualization template (13B).
  • the head column candidate list display section 604B columns included in the analysis processing result data until immediately before the visualization processing are displayed (13C).
  • the selection column indicating the data used for visualization in the past and the data of the past processing result are displayed on each list display unit to assist the selection by the user.
  • FIG. 14 is a detailed procedure of S207.
  • the predetermined criterion is “order in which the transition probability is large”, but for example, “order in which the number of analysis processing units from“ input data selection ”to visualization processing is small” may be considered. This can be switched by a setting file or the like.
  • K is a parameter
  • N is an analysis processing unit
  • L is a list
  • M is an analysis processing unit or visualization process to be processed
  • G is a visualization process.
  • the process construction unit 115 executes the following process. Empty list L
  • the analysis processing procedure generated as a result of the above processing is data combining the analysis processing procedure and the visualization processing procedure.
  • FIG. 15 shows an image of the result of executing the analysis processing procedure generated as a result of the above processing.
  • the determination of whether or not it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
  • the user can create the desired visualization content only by selecting the input data and editing the visualization content, without being aware of the analysis procedure for filling in between them. It is possible to reduce the labor of analysis work.
  • FIG. 16 shows a visualized content edit dialog according to the second embodiment.
  • the visualized content edit dialog according to the second embodiment further includes a visualized content case selection unit (605A).
  • An example (for example, a visualization image) visualized in the past is shown in the visualization content example selection unit (605A).
  • the user edits the visualized content, the user selects the visualized content case displayed on the visualized content case selecting unit 605A, selects items included in the visualized part selecting unit 601A as necessary, and displays the visualized content display unit.
  • the visualization content is edited by adding to 602A.
  • Each visualization component of the visualization content example is linked to the analysis processing procedure and the visualization processing procedure shown in FIGS. 4A and 4B. Therefore, the visualization desired by the user is performed by performing the same processing as in the first embodiment. Content can be generated.
  • the user when the user wants to use the existing visualized content, the user simply selects the data and the existing content and creates the desired visualized content without being aware of the analysis procedure for filling in between them. It becomes possible to do. In addition, it is possible to reduce the labor of analysis work of the user 100.
  • Applicability determination of the analysis processing unit M for the data D in S7130 and S2080 in the first embodiment is often determined by whether or not the data D includes the column name or element specified in the parameter P.
  • the applicable range of the analysis processing unit M becomes small.
  • the applicable range of the analysis processing unit M is expanded by taking into account the similarity based on the number of elements and the appearance frequency of the column names and elements of the parameter P. For example, it is determined whether to apply the past analysis process to the selected input data based on the similarity between the data used in the past analysis process and the selected input data. More specifically, the determination as to whether it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
  • the similarity between columns is assumed to be similar when, for example, the difference is calculated in descending order of the appearance frequency or appearance rate of each element in the column, and the sum is the similarity, and the difference is smaller than a predetermined threshold.
  • the similarity of elements is assumed to be similar when the difference in appearance frequency or appearance rate is smaller than a threshold value. Similar definitions can be changed according to the purpose. It is not applicable in the following cases.
  • 17A and 17B show an example of the data structure and data of the analysis pattern of the third embodiment.
  • FIG. 18 illustrates an example of similarity calculation, parameters P and data D having the following configuration, and the flow of similarity calculation for each column and each element when the threshold is 30.
  • the column “A” in the parameter P includes 40% of the element ⁇ , 35% of the element ⁇ , 15% of the element ⁇ , 10% of the element ⁇ , and so on.
  • Data D includes columns “B”, “C”, “D”...,
  • column “B” includes four elements B1 to B4. B1 is 45%, B2 is 30%, B3 is 15%, and B4 is It is included at a rate of 5%.
  • the column “C” includes two elements C1 and C2, and both C1 and C2 are included in a ratio of 50%.
  • the column “D” includes the elements D1 to D4..., And D1 is 5%, D2 is 4%, D3 is 4%, D4 is 3%,.
  • the similarity of each column is calculated as follows.
  • the similarity AB between the columns “A” and “B” is
  • 15.
  • the similarity AC between the columns “A” and “C” is
  • 15.
  • the similarity AD between the columns “A” and “D” is
  • the column similar to the column “A” is the column “B”, and the element of the column “B” most similar to the element ⁇ is B1.
  • the elements are rearranged in descending order of the ratio of each column, and the ratio of the largest elements, the second largest elements, and so on are subtracted.
  • the third largest element is the fourth
  • Subtraction between large elements employs a method of subtracting zero.
  • the application range of the analysis processing unit M can be expanded by taking into account the similarity based on the number of elements and the appearance frequency of the column names and elements of the parameter P.
  • Configuration example 1 An analysis support method and system for automatically generating a processing procedure necessary for realizing a visualization method designated for input data when a user designates input data and a visualization method.
  • Configuration example 2 A visualization content creation support method and system for supporting specification of a visualization method by designating a table column and a graph axis.
  • Configuration example 3 A visualization content creation support method and system that supports specification of an analysis procedure and a visualization method by specifying existing content.
  • this invention is not limited to the above-mentioned Example, Various modifications are included.
  • the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described.
  • a part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of one embodiment.
  • Each of the above-described configurations, functions, processing units, processing means, and the like may be realized by hardware by designing a part or all of them with, for example, an integrated circuit.
  • Each of the above-described configurations, functions, and the like may be realized by software by interpreting and executing a program that realizes each function by the processor.
  • Information such as programs, tables, and files for realizing each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.
  • the control lines and information lines indicate what is considered necessary for the explanation, and not all the control lines and information lines on the product are necessarily shown. Actually, it may be considered that almost all the components are connected to each other.

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

In order to designate input data and result output/visualization images to thereby automatically construct processing for performing an interpolation therebetween, input data and result output/visualization images are inputted to thereby automatically construct processing for performing an interpolation therebetween. For example, when in a system for executing predetermined analysis processing on the basis of input data and visualizing a processing result, identification information relating to input data selected by a user via an input device, and identification information relating to a visualization method are inputted, setting item candidates for visualization, which are predicted from the identification information relating to the input data, are displayed.

Description

データ分析支援処理システム及び方法Data analysis support processing system and method
 本発明は、データ分析支援処理システム及び方法に係り、データ分析を支援する技術に関する。 The present invention relates to a data analysis support processing system and method, and relates to a technology for supporting data analysis.
 業務システムなどから蓄積される企業内の膨大なデータを、蓄積、分析、加工して、企業の意思決定に活用するビジネスインテリジェンスという手法がある。この手法は、複数のデータベース(DB)を横断し、多種、大量のデータを高速に分析可能である。この手法は、高速化のために、高度に構造化された分析対象データを分析の目的に応じて予め構築する必要があり、分析内容と必要なデータが決まっている定型分析に強い。 There is a method called business intelligence that stores, analyzes, and processes a huge amount of corporate data accumulated from business systems, etc., and uses it for corporate decision making. This method can analyze a large amount of data at a high speed across a plurality of databases (DB). This method is highly resistant to routine analysis in which highly structured analysis target data needs to be constructed in advance according to the purpose of analysis in order to increase the speed, and analysis contents and necessary data are determined.
 一方、分析対象データやデータに対する分析処理や可視化処理が固定ではない場合には、試行錯誤のたびに,分析対象データを作り変えると共に、分析処理、可視化処理を変更しなければならない。このようなデータ分析に関連する背景技術として、特許文献1や特許文献2がある。特許文献1には、データを抽象化した情報をキーとして登録済みの分析設定情報を抽出し、それを活用することで分析手法の選択を容易にする技術が開示されている。また、特許文献2には、過去の分析履歴を利用し、複数の分析を組合せた複合分析の実行を支援する技術が開示されている。 On the other hand, if the analysis process and visualization process for the analysis target data and data are not fixed, the analysis process data and the visualization process must be changed at each trial and error. As background art related to such data analysis, there are Patent Literature 1 and Patent Literature 2. Japanese Patent Application Laid-Open No. 2004-151561 discloses a technique that makes it easy to select an analysis method by extracting registered analysis setting information using information obtained by abstracting data as a key, and using the extracted information. Patent Document 2 discloses a technology that supports the execution of a composite analysis in which a plurality of analyzes are combined using a past analysis history.
特開2010-205218号公報JP 2010-205218 A 特開2005-157896号公報JP 2005-157896 A
 上述の背景技術に見るように、分析を行う際には(1)データ選択(2)分析手法の選択(3)分析実行(4)結果出力・可視化という手順で行う場合が一般的である。このような分析手順を記録し、再利用する技術では、過去の分析手順を再現し、また利用者が分析手順を作成する際に利用者の目的に合致した分析手法を推薦し得る。しかし利用者、特に分析を業務としない利用者にとっては、どのような手順や手法を用いて分析するかはさほど重要でなく、利用者が指定したデータの分析結果にのみ興味がある、つまり、利用者は(1)と(4)のみを選択するだけで、(2)(3)については利用者に意識させないような分析支援技術が求められている。
 本発明の目的は、上記の課題を解決し、入力データ及び結果出力・可視化イメージを指定することにより、その間を補間する処理を自動的に構築することが可能なデータ分析支援方法及びシステムを提供することにある。
As seen in the background art described above, when performing analysis, it is common to perform a procedure of (1) data selection (2) analysis method selection (3) analysis execution (4) result output / visualization. In the technique of recording and reusing such an analysis procedure, a past analysis procedure can be reproduced, and when the user creates an analysis procedure, an analysis method that matches the purpose of the user can be recommended. However, for users, especially those who do not conduct analysis, it is not so important what kind of procedure or method is used for analysis, and only interested in the analysis result of the data specified by the user, There is a need for an analysis support technology that allows the user to select only (1) and (4) and not to make the user aware of (2) and (3).
An object of the present invention is to provide a data analysis support method and system capable of solving the above-described problems and automatically constructing a process for interpolating between the input data and the result output / visualized image. There is to do.
 上記課題を解決するために、例えば特許請求の範囲に記載の構成を採用する。
[適用例1]
 本データ分析支援処理方法は、
 入力データに対して所定の分析処理を実行し、処理結果を可視化するシステムにおけるデータ分析支援処理方法であって、
 入力装置を介して利用者が選択した入力データの識別情報と、可視化方法の識別情報を入力すると、該入力データの識別情報から予測される、可視化のための設定項目の候補を表示する。
 これにより、入力データ及び結果出力・可視化イメージを指定することにより、その間を補間する可視化のための設定項目を自動的に予測して表示することが可能である。
In order to solve the above problems, for example, the configuration described in the claims is adopted.
[Application Example 1]
This data analysis support processing method is
A data analysis support processing method in a system that executes predetermined analysis processing on input data and visualizes the processing result,
When the identification information of the input data selected by the user and the identification information of the visualization method are input via the input device, the setting item candidates for visualization predicted from the identification information of the input data are displayed.
Thus, by specifying the input data and the result output / visualization image, it is possible to automatically predict and display the setting items for visualization that interpolate between them.
[適用例2]
 上記データ分析支援処理方法において、
 前記設定項目の候補の中から選択された設定項目に従い、前記可視化方法で、入力データに基づく処理結果を表示する。
 これにより、処理結果を、指定された通りに可視化できる。
[Application Example 2]
In the above data analysis support processing method,
In accordance with the setting item selected from the setting item candidates, the processing result based on the input data is displayed by the visualization method.
As a result, the processing result can be visualized as specified.
[適用例3]
 上記データ分析支援処理方法において、
 選択された入力データに対する過去の可視化表示又はその概略をさらに表示する。
 これにより、利用者が細かい設定なく可視化方法を設定できるよう支援情報を提供できる。
[Application Example 3]
In the above data analysis support processing method,
A past visualization display or summary of the selected input data is further displayed.
Thereby, support information can be provided so that the user can set the visualization method without fine settings.
[適用例4]
 上記データ分析支援処理方法において、
 入力データに対する分析処理の履歴及び可視化の履歴に基づき、選択された入力データに対する分析処理及び可視化のための設定項目の候補を予測する。
 これにより、妥当性の高い分析処理及び可視化のための設定項目の候補を予測できる。
[Application Example 4]
In the above data analysis support processing method,
Based on the history of analysis processing and the history of visualization of input data, candidate setting items for analysis processing and visualization of the selected input data are predicted.
Thereby, the candidate of the setting item for a highly appropriate analysis process and visualization can be estimated.
[適用例5]
 上記データ分析支援処理方法において、
 前記分析処理は、所定の分析処理単位の組合せで構成され、
 入力データに対する分析処理の履歴及び可視化の履歴について、分析処理単位の遷移を示す分析パターンを解析し、選択された入力データに対する分析処理の候補を該分析パターンに従い予測する。
 これにより、妥当性の高い分析処理及び可視化のための設定項目の候補を予測できる。
[Application Example 5]
In the above data analysis support processing method,
The analysis processing is composed of a combination of predetermined analysis processing units,
With respect to the analysis processing history and the visualization history for the input data, an analysis pattern indicating a transition of the analysis processing unit is analyzed, and an analysis processing candidate for the selected input data is predicted according to the analysis pattern.
Thereby, the candidate of the setting item for a highly appropriate analysis process and visualization can be estimated.
[適用例6]
 上記データ分析支援処理方法において、
 過去の分析処理に用いたデータと、選択された入力データとの類似度に基づいて、該過去の分析処理を、選択された入力データに適用するか否かを判断する。
 これにより、分析処理の入力データへの適用可能性をより詳細に判断できる。
[Application Example 6]
In the above data analysis support processing method,
Based on the similarity between the data used in the past analysis process and the selected input data, it is determined whether to apply the past analysis process to the selected input data.
Thereby, the applicability of the analysis process to the input data can be determined in more detail.
[適用例7]
 上記データ分析支援処理方法において、
 前記可視化のための設定項目は、表の列に表示するデータ、又は、グラフの軸とするデータである。
 これにより、利用者が表の列に表示するデータ、グラフの軸とするデータの設定項目を簡易に選択できる支援情報を提示できる。
[Application Example 7]
In the above data analysis support processing method,
The setting item for visualization is data to be displayed in a table column or data to be a graph axis.
Thereby, it is possible to present support information that allows the user to easily select setting items for data to be displayed in the columns of the table and data to be graph axes.
 本発明の代表的な一形態によれば、入力データ及び結果出力・可視化イメージを指定することにより、その間を補間する処理を自動的に構築することが可能なデータ分析支援方法及びシステムを提供するができる。 According to a typical embodiment of the present invention, there is provided a data analysis support method and system capable of automatically constructing a process for interpolating between input data and result output / visualized image. Can do.
分析支援システムのブロック図である。It is a block diagram of an analysis support system. 第一の実施の形態の作業1の手順の一例を示すフローチャートである。It is a flowchart which shows an example of the procedure of the operation | work 1 of 1st embodiment. 第一の実施の形態の分析処理手順作成ダイアログの一例である。It is an example of the analysis process procedure creation dialog of 1st embodiment. 第一の実施の形態の可視化コンテンツ作成ダイアログの一例である。It is an example of the visualization content creation dialog of 1st embodiment. 第一の実施の形態の分析処理手順のデータ構造(上図)、及びデータの一例(下図)である。It is a data structure (upper figure) of an analysis processing procedure of a first embodiment, and an example of data (lower figure). 第一の実施の形態の可視化処理手順のデータ構造(上図)、及びデータの一例(下図)である。It is the data structure (upper figure) of the visualization process procedure of 1st embodiment, and an example (lower figure) of data. 第一の実施の形態の入力データ選択用ダイアログの一例である。It is an example of the dialog for input data selection of 1st embodiment. 第一の実施の形態の可視化コンテンツ編集ダイアログの一例である。It is an example of the visualization content edit dialog of 1st embodiment. 第一の実施の形態の可視化コンテンツ編集ダイアログの表に関するパラメータ設定ダイアログの一例である。It is an example of the parameter setting dialog regarding the table | surface of the visualization content edit dialog of 1st embodiment. 第一の実施の形態の可視化コンテンツ編集ダイアログのグラフに関するパラメータ設定ダイアログの一例である。It is an example of the parameter setting dialog regarding the graph of the visualization content edit dialog of 1st embodiment. 第一の実施の形態の分析パターン作成手順の一例を示すフローチャートである。It is a flowchart which shows an example of the analysis pattern creation procedure of 1st embodiment. 第一の実施の形態の可視化テンプレート作成手順の一例を示すフローチャートである。It is a flowchart which shows an example of the visualization template preparation procedure of 1st embodiment. 第一の実施の形態における、利用者が選択した選択データ、可視化部品情報に適用可能な可視化コンテンツ編集支援情報を生成する手順の一例であるIt is an example of the procedure which produces | generates the visualization content edit assistance information applicable to the selection data and visualization components information which the user selected in 1st embodiment. 図7-CのS712の手順を詳細化した手順である。This is a detailed procedure of S712 in FIG. 7-C. 第一の実施の形態の分析パターン作成のイメージを示す図である。It is a figure which shows the image of creation of the analysis pattern of 1st embodiment. 第一の実施の形態の分析パターンのデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the analysis pattern of 1st embodiment. 第一の実施の形態の分析パターンのデータの一例を示す図である。It is a figure which shows an example of the data of the analysis pattern of 1st embodiment. 第一の実施の形態の可視化テンプレート作成のイメージを示す図である。It is a figure which shows the image of visualization template preparation of 1st embodiment. 第一の実施の形態の可視化テンプレートのデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the visualization template of 1st embodiment. 第一の実施の形態の可視化テンプレートの一例を示す図である。It is a figure which shows an example of the visualization template of 1st embodiment. 第一の実施の形態の可視化部品として「棒グラフ」が選択された場合の可視化コンテンツ編集支援情報生成イメージ(1)である。It is a visualization content editing support information generation image (1) when “bar graph” is selected as the visualization component of the first embodiment. 第一の実施の形態の可視化部品として「棒グラフ」が選択された場合の可視化コンテンツ編集支援情報生成イメージ(2)である。It is a visualization content editing support information generation image (2) when “bar graph” is selected as the visualization component of the first embodiment. 第一の実施の形態のS206の可視化コンテンツ編集支援情報が表またはグラフパラメータ設定ダイアログにセットされる手順を示すフローチャートの一例である。It is an example of the flowchart which shows the procedure in which the visualization content edit assistance information of S206 of 1st embodiment is set to a table | surface or a graph parameter setting dialog. 第一の実施の形態のS2063のグラフパラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージ(1)である。It is an image (1) for setting content editing support information in the graph parameter setting dialog in S2063 of the first embodiment. 第一の実施の形態のS2063のグラフパラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージ(2)である。It is an image (2) for setting content editing support information in the graph parameter setting dialog in S2063 of the first embodiment. 第一の実施の形態のS2063の表パラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージ(1)である。It is an image (1) for setting content editing support information in the table parameter setting dialog in S2063 of the first embodiment. 第一の実施の形態のS2063の表パラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージ(2)である。It is an image (2) for setting content editing support information in the table parameter setting dialog in S2063 of the first embodiment. 第一の実施の形態のS207の手順を詳細化した手順の一例である。It is an example of the procedure which detailed the procedure of S207 of 1st embodiment. 第一の実施の形態のS2071~S2082の処理の結果生成される分析処理手順データの一例及び分析処理手順データを実行結果のイメージを示す図である。It is a figure which shows an example of the execution result of an example of the analysis process procedure data produced | generated as a result of the process of S2071-S2082 of 1st embodiment, and analysis process procedure data. 第二の実施の形態の可視化コンテンツ編集ダイアログの一例である。It is an example of the visualization content edit dialog of 2nd embodiment. 第三の実施の形態の分析パターンのデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the analysis pattern of 3rd embodiment. 第三の実施の形態の分析パターンのデータの一例を示す図である。It is a figure which shows an example of the data of the analysis pattern of 3rd embodiment. 第三の実施の形態の類似度算出の一例を示す図である。It is a figure which shows an example of similarity calculation of 3rd embodiment.
 以下、本発明の実施の形態について、図面を参照しながら説明する。
(第一の実施の形態)
 図1は、本実施の形態の分析支援システムのブロック図である。
 図1に記載の分析支援システム(計算機システム)は、サーバ101、計算機102、ディスプレイ103、入力装置104、ネットワーク105、106及びデータベース1004(DB)107を備える。
 サーバ101と計算機102はネットワーク105を介して互いに接続され、サーバ101とDB107はネットワーク106を介して互いに接続される。
 サーバ101、計算機102は、利用者100が分析作業を行う際に利用される。さらに、分析作業に関わる機能を提供し、機能実行時の履歴を収集し、後述するテンプレートを状況に合わせて抽出・推奨することにより利用者100の分析作業を支援する。
Hereinafter, embodiments of the present invention will be described with reference to the drawings.
(First embodiment)
FIG. 1 is a block diagram of the analysis support system of the present embodiment.
The analysis support system (computer system) described in FIG. 1 includes a server 101, a computer 102, a display 103, an input device 104, networks 105 and 106, and a database 1004 (DB) 107.
The server 101 and the computer 102 are connected to each other via the network 105, and the server 101 and the DB 107 are connected to each other via the network 106.
The server 101 and the computer 102 are used when the user 100 performs analysis work. Furthermore, functions related to the analysis work are provided, a history of function execution is collected, and a template described later is extracted and recommended according to the situation to assist the user 100 in the analysis work.
 また、サーバ101及び計算機102は、一例として一般的なPCを用いることができる。サーバ101及び計算機102は、プロセッサ、メモリ及びインターフェースを備える。プロセッサは、メモリに記憶されたプログラムを処理することによって、各種処理を実行する。メモリは、処理を実行するためのプログラム及びデータを記憶する。インターフェースは、キーボード、マウスなどの入力装置104に接続するもの、ディスプレイ103に接続するもの、ネットワーク105を介して、サーバ101、計算機102を相互に接続するもの、及び、ネットワーク106を介してサーバ101とDB107などに接続するもの、などを備える。
 DB107は、例えば、企業に関わる情報、各種統計データ、センサ等の時系列データ、Webアクセスログなどの各種データを保持するデータベースである。DB107は、サーバ101に含まれるように構成してもよいし、外部のストレージ装置に格納され、ネットワーク106を介してサーバ101に接続されるように構成してもよい。
 サーバ101は、例えば、入力データ作成部111、分析処理実行部112、データ可視化部113、処理手順記録部114、処理構築部115、処理手順実行部116、テンプレート推奨部(処理手順推奨部)117、処理手順分析部118、テンプレートDB119、及び、処理手順DB120を備える。入力データ作成部111、分析処理実行部112、データ可視化部113、処理手順記録部114、処理構築部115、処理手順実行部116、テンプレート推奨部117、及び、処理手順分析部118は例えば、プログラムであってメモリに記憶され、プロセッサによって実行されて各部の機能が実現される。
 入力データ作成部111は、利用者100あるいは処理手順実行部116の指示によりDB107から所望のデータを抽出し、抽出したデータに入力データ作成処理を施すことにより、分析対象データを作成する。
 分析処理実行部112は、利用者100あるいは処理手順実行部116の指示により、入力データ作成部111が作成した分析対象データにデータ処理を施し、処理結果データを作成する。
The server 101 and the computer 102 can use a general PC as an example. The server 101 and the computer 102 include a processor, a memory, and an interface. The processor executes various processes by processing a program stored in the memory. The memory stores a program and data for executing processing. The interface is connected to the input device 104 such as a keyboard and a mouse, connected to the display 103, connected to the server 101 and the computer 102 via the network 105, and connected to the server 101 via the network 106. And those connected to the DB 107 and the like.
The DB 107 is a database that holds various data such as information related to companies, various statistical data, time-series data such as sensors, and Web access logs. The DB 107 may be configured to be included in the server 101, or may be configured to be stored in an external storage device and connected to the server 101 via the network 106.
The server 101 includes, for example, an input data creation unit 111, an analysis processing execution unit 112, a data visualization unit 113, a processing procedure recording unit 114, a processing construction unit 115, a processing procedure execution unit 116, and a template recommendation unit (processing procedure recommendation unit) 117. , A processing procedure analysis unit 118, a template DB 119, and a processing procedure DB 120. The input data creation unit 111, the analysis process execution unit 112, the data visualization unit 113, the process procedure recording unit 114, the process construction unit 115, the process procedure execution unit 116, the template recommendation unit 117, and the process procedure analysis unit 118 are, for example, programs The function of each unit is realized by being stored in the memory and executed by the processor.
The input data creation unit 111 extracts desired data from the DB 107 in accordance with an instruction from the user 100 or the processing procedure execution unit 116, and performs input data creation processing on the extracted data to create analysis target data.
The analysis processing execution unit 112 performs data processing on the analysis target data created by the input data creation unit 111 according to an instruction from the user 100 or the processing procedure execution unit 116, and creates processing result data.
 データ可視化部113は、利用者100あるいは処理手順実行部116の指示により、分析処理実行部112が作成した処理結果データに可視化処理を施し、可視化する。
 処理手順記録部114は、入力データ作成部111、分析処理実行部112、データ可視化部113の各処理を処理手順DB120に記録する。
 処理構築部115は、可視化コンテンツ編集支援情報を作成し、分析処理を構築する。
 処理手順実行部116は、利用者100の指示により、処理手順DB120から処理手順を取り出し、処理手順の内容に従って入力データ作成部111、分析処理実行部112、データ可視化部113に指示し、分析処理の実行を制御する。
 処理手順推奨部117は、予め定められた基準で処理手順DB120から処理手順を抽出し、ディスプレイなど適宜の出力装置により利用者100に提示する。
 計算機102は、入力データ選択部121、可視化テンプレート編集部122、分析処理手順作成部123を備える。
The data visualization unit 113 performs visualization processing on the processing result data created by the analysis processing execution unit 112 according to an instruction from the user 100 or the processing procedure execution unit 116, and visualizes the processing result data.
The processing procedure recording unit 114 records each processing of the input data creation unit 111, the analysis processing execution unit 112, and the data visualization unit 113 in the processing procedure DB 120.
The process construction unit 115 creates visualized content editing support information and constructs an analysis process.
The processing procedure execution unit 116 retrieves the processing procedure from the processing procedure DB 120 according to the instruction of the user 100, and instructs the input data creation unit 111, the analysis processing execution unit 112, and the data visualization unit 113 according to the contents of the processing procedure to perform analysis processing. Control the execution of
The processing procedure recommendation unit 117 extracts a processing procedure from the processing procedure DB 120 based on a predetermined criterion, and presents the processing procedure to the user 100 using an appropriate output device such as a display.
The computer 102 includes an input data selection unit 121, a visualization template editing unit 122, and an analysis processing procedure creation unit 123.
 入力データ選択部121は、利用者100の指示により、入力データを選択するためのダイアログを表示し、利用者の選択結果を保持する。
 可視化テンプレート編集部122は、利用者100の指示により、可視化テンプレートを編集するためのダイアログを表示し、利用者の編集結果を保持する。
 分析処理手順作成部123は、利用者100の指示により、分析処理手順を作成するためのダイアログを表示し、利用者が作成した分析処理手順を保持する。
 各ダイアログは、例えばディスプレイ103に表示され、入力装置104を用いて利用者からの各指示、選択結果、編集結果等が入力される。
 サーバ101及び計算機102の各部は、ひとつの装置で構成されてもよいし、適宜分散して構成されてもよい。なお、本明細書において、例えば、サーバ101の処理手順記録部114、処理構築部115、処理手順実行部116、テンプレート推奨部(処理手順推奨部)117及び処理手順分析部118と、計算機102の可視化テンプレート編集部122及び分析処理手順作成部123とを併せて、データ分析支援処理部と称する。
上述の分析支援システムの構成は、第一の実施の形態に限らず、他の実施の形態にも適用できる。
The input data selection unit 121 displays a dialog for selecting input data in accordance with an instruction from the user 100 and holds the selection result of the user.
The visualization template editing unit 122 displays a dialog for editing the visualization template in accordance with an instruction from the user 100 and holds the editing result of the user.
The analysis processing procedure creation unit 123 displays a dialog for creating an analysis processing procedure in accordance with an instruction from the user 100, and holds the analysis processing procedure created by the user.
Each dialog is displayed on the display 103, for example, and each instruction from the user, a selection result, an editing result, and the like are input using the input device 104.
Each unit of the server 101 and the computer 102 may be configured by one apparatus or may be appropriately distributed. In this specification, for example, the processing procedure recording unit 114, the processing construction unit 115, the processing procedure execution unit 116, the template recommendation unit (processing procedure recommendation unit) 117, the processing procedure analysis unit 118, and the computer 102 of the server 101 The visualization template editing unit 122 and the analysis processing procedure creation unit 123 are collectively referred to as a data analysis support processing unit.
The configuration of the above-described analysis support system is not limited to the first embodiment, and can be applied to other embodiments.
 図2は、第一の実施の形態の手順の一例を示すフローチャートである。
 本フローチャートは、計算機102とサーバ101を含むシステム全体の動作を概略的に示す。より詳細については後述する。
 まず、本実施の形態の分析支援機能を利用しない場合(S201)、計算機102(例えば分析処理手順作成部123)は、ディスプレイ103に図3-Aに示す分析処理手順作成ダイアログを表示する(S202)。計算機102(例えば分析処理手順作成部123)は、ユーザ100が入力装置104を用いて、分析処理単位選択部301Aの「データ選択」「集約」「フィルタリング」等の分析処理単位を選択することによって作成される分析処理手順をサーバ101に送付する。サーバ101(例えば分析処理実行部112)では、受信した分析処理手順に従い分析処理を実行し、実行結果を計算機102に送付する(S203)。
FIG. 2 is a flowchart illustrating an example of a procedure according to the first embodiment.
This flowchart schematically shows the operation of the entire system including the computer 102 and the server 101. More details will be described later.
First, when the analysis support function of this embodiment is not used (S201), the computer 102 (for example, the analysis processing procedure creation unit 123) displays the analysis processing procedure creation dialog shown in FIG. 3-A on the display 103 (S202). ). The computer 102 (for example, the analysis processing procedure creation unit 123) causes the user 100 to select an analysis processing unit such as “data selection”, “aggregation”, and “filtering” in the analysis processing unit selection unit 301A using the input device 104. The analysis processing procedure to be created is sent to the server 101. The server 101 (for example, the analysis process execution unit 112) executes the analysis process according to the received analysis process procedure, and sends the execution result to the computer 102 (S203).
 次に、計算機102は、ディスプレイ103に図3-Bに示す可視化コンテンツ作成ダイアログをディスプレイ103に表示する。ユーザ100が入力装置104を用いて可視化部品選択部301Bに含まれる項目を選択し、可視化コンテンツ表示部302Bに追加することにより、計算機102は可視化コンテンツを作成する(S204)。可視化コンテンツには例えば受信した分析処理の実行結果が、選択された項目(例えば表やグラフ)に従い可視化されている。計算機102は、ディスプレイ103にS204の結果の可視化コンテンツを表示する(S209)。サーバ(例えば処理手順記録部114)が分析処理手順、可視化処理手順を処理手順DB120に記録する(S210)。 Next, the computer 102 displays a visualized content creation dialog shown in FIG. When the user 100 uses the input device 104 to select an item included in the visualization component selection unit 301B and adds it to the visualization content display unit 302B, the computer 102 creates visualization content (S204). In the visualized content, for example, the execution result of the received analysis process is visualized according to the selected item (for example, a table or a graph). The computer 102 displays the visualized content as a result of S204 on the display 103 (S209). The server (for example, the processing procedure recording unit 114) records the analysis processing procedure and the visualization processing procedure in the processing procedure DB 120 (S210).
 図3-Aに分析処理手順作成ダイアログの一例をしめす。
 利用者が分析処理手順を作成する際には、分析処理単位選択部301Aに含まれる項目を選択し、分析処理単位シーケンス表示部302Aに追加することにより、分析処理手順を作成する。分析処理手順作成の開始時、または途中で処理手順推奨部117が処理手順DB120から作成の状況に合わせて処理手順を抽出し、利用者に推薦することもできる。
 ここで分析処理手順とは、分析処理単位(例えば、分析処理の最小単位。適宜の処理単位でもよい)の系列を指す。分析処理単位は具体的には、分析処理単位選択部301Aに表示されているような、「データ選択」「集約」「フィルタリング」「演算」「編集」のような処理を指す。
An example of the analysis processing procedure creation dialog is shown in FIG.
When a user creates an analysis processing procedure, an analysis processing procedure is created by selecting items included in the analysis processing unit selection unit 301A and adding them to the analysis processing unit sequence display unit 302A. The processing procedure recommendation unit 117 can extract a processing procedure from the processing procedure DB 120 according to the creation status at the start of analysis processing procedure creation, or can recommend it to the user.
Here, the analysis processing procedure refers to a series of analysis processing units (for example, the minimum unit of analysis processing, which may be an appropriate processing unit). Specifically, the analysis processing unit indicates processing such as “data selection”, “aggregation”, “filtering”, “calculation”, and “editing” as displayed in the analysis processing unit selection unit 301A.
 図3-Bに可視化コンテンツ作成ダイアログの一例を示す。利用者が可視化コンテンツを作成する際には、可視化部品選択部301Bに含まれる項目(可視化部品)を選択し、可視化コンテンツ表示部302Bに追加することにより、可視化コンテンツを作成する。可視化コンテンツ作成の開始時、または途中で処理手順推奨部117が処理手順DB120からS203の結果えられる分析処理実行結果に合わせて可視化部品を利用者に推薦することもできる。
 図2に戻り、フローチャートの説明を続ける。本実施の形態の分析支援機能を利用する場合(S201)、計算機102は、ディスプレイ103に、例えば図5に示す入力データ選択ダイアログ、及び、図6-Aに例示する可視化コンテンツ編集ダイアログを表示する(S205)。計算機102は、利用者が入力装置104を利用して入力データ選択ダイアログで選択したデータ及び可視化コンテンツ編集ダイアログで作成した可視化部品情報をサーバ101に送付し、サーバ101の処理構築部115が選択データ(選択された入力データ)及び可視化部品情報に基づき、後述する可視化コンテンツ編集支援情報(例えば、可視化のための設定項目の候補)を作成し、計算機102に送付する(S206)。図6-Bに例示する表に関するパラメータ設定ダイアログ、又は、図6-Cに例示するグラフに関するパラメータ設定ダイアログと、可視化コンテンツ編集支援情報を利用してユーザによる入力装置104の操作により可視化コンテンツを編集する。例えば、表示された設定項目の候補の中から所望の設定項目が選択される。計算機102は、編集結果をサーバ101に送付し、処理構築部115が選択データ、可視化コンテンツ編集結果に基づき分析処理を構築する(S207)。分析処理実行部112が構築された分析処理を実行し、実行結果を計算機102に送付する(S208)。ディスプレイ103にS208の結果の可視化コンテンツを表示する(S209)。ここでの可視化コンテンツは、例えば、指定された可視化部品情報及び設定項目に従い処理結果が、表示される。処理手順記録部114が分析処理手順、可視化処理手順を処理手順DB120に記録する(S210)。
An example of the visualization content creation dialog is shown in FIG. When the user creates the visualization content, the visualization content is created by selecting an item (visualization component) included in the visualization component selection unit 301B and adding it to the visualization content display unit 302B. The processing procedure recommendation unit 117 can recommend the visualization component to the user in accordance with the analysis processing execution result obtained as a result of S203 from the processing procedure DB 120 when the visualization content creation starts or in the middle.
Returning to FIG. 2, the description of the flowchart will be continued. When the analysis support function of this embodiment is used (S201), the computer 102 displays, for example, an input data selection dialog shown in FIG. 5 and a visualized content edit dialog exemplified in FIG. 6-A on the display 103. (S205). The computer 102 sends the data selected by the user in the input data selection dialog using the input device 104 and the visualized part information created in the visualization content editing dialog to the server 101, and the processing construction unit 115 of the server 101 selects the selected data. Based on (selected input data) and visualization component information, visualization content editing support information (for example, setting item candidates for visualization) to be described later is created and sent to the computer 102 (S206). Edit the visualized content by operating the input device 104 by the user using the parameter setting dialog related to the table illustrated in FIG. 6B or the parameter setting dialog related to the graph illustrated in FIG. 6C and the visualized content editing support information. To do. For example, a desired setting item is selected from the displayed setting item candidates. The computer 102 sends the editing result to the server 101, and the processing construction unit 115 constructs an analysis process based on the selection data and the visualization content editing result (S207). The analysis process execution unit 112 executes the constructed analysis process and sends the execution result to the computer 102 (S208). The visualization content as a result of S208 is displayed on the display 103 (S209). The visualization content here displays, for example, the processing result according to the specified visualization component information and setting items. The processing procedure recording unit 114 records the analysis processing procedure and the visualization processing procedure in the processing procedure DB 120 (S210).
 図4-Aに分析処理手順のデータ構造及びデータの一例を、図4-Bに可視化処理手順のデータ構造及びデータの一例を示す。このようなデータが処理手順DB120に記録される。 Fig. 4-A shows an example of the data structure and data of the analysis processing procedure, and Fig. 4-B shows an example of the data structure and data of the visualization processing procedure. Such data is recorded in the processing procedure DB 120.
 図5は入力データ選択用ダイアログである。入力データ一覧表示領域501に表示されたデータを、入力装置104を用いてユーザ100が選択し、OKボタン502を押下げることで入力データの選択を決定する。 Fig. 5 shows the input data selection dialog. The data displayed in the input data list display area 501 is selected by the user 100 using the input device 104, and the selection of the input data is determined by pressing the OK button 502.
 図6-Aは可視化コンテンツ編集ダイアログの一例である。ユーザが可視化コンテンツを編集する際には、入力装置104を用いて可視化部品選択部601Aに含まれる項目を選択し、可視化コンテンツ表示部602Aに追加することにより、可視化コンテンツを編集する。例えば可視化部品選択部601Aで表を選択した場合、計算機102は表に関するパラメータ設定のための表パラメータ設定ダイアログ(図6-B)を表示し、グラフのうちいずれかを選択した場合には、計算機102はグラフに関するパラメータ設定のためのグラフパラメータ設定ダイアログ(図6-C)を表示する。 Fig. 6-A is an example of a visualization content edit dialog. When the user edits the visualized content, the user selects the item included in the visualized part selecting unit 601A using the input device 104 and adds the selected item to the visualized content display unit 602A, thereby editing the visualized content. For example, when a table is selected by the visualization component selection unit 601A, the computer 102 displays a table parameter setting dialog (FIG. 6B) for setting parameters related to the table, and when any one of the graphs is selected, the computer 102 102 displays a graph parameter setting dialog (FIG. 6C) for setting parameters relating to the graph.
 図6-Bに例示する表パラメータ設定ダイアログは、例えば、表側列候補一覧表示部(601B)と、表側列編集部(602B)と、表頭列パターン一覧表示部(603B)と、表頭列候補一覧表示部(604B)と、表頭列用選択列表示部(605B)とを含む。また、OKボタン(606B)とキャンセルボタン(608B)を含む。 The table parameter setting dialog illustrated in FIG. 6B includes, for example, a front side column candidate list display unit (601B), a front side column edit unit (602B), a front side column pattern list display unit (603B), and a front side column. A candidate list display section (604B) and a selected column display section for head front column (605B) are included. Also, an OK button (606B) and a cancel button (608B) are included.
 図6-Cに例示するグラフパラメータ設定ダイアログは、例えば、視点一覧表示部(601C)と、視点編集部(602C)と、X軸パターン一覧表示部(603C)と、X軸候補一覧表示部(604C)と、X軸パターン編集部(605C)と、Y軸パターン一覧表示部(606C)と、Y軸候補一覧表示部(607C)と、Y軸パターン編集部(608C)とを含む。また、OKボタン(609C)とキャンセルボタン(610C)を含む。 The graph parameter setting dialog illustrated in FIG. 6C includes, for example, a viewpoint list display unit (601C), a viewpoint editing unit (602C), an X axis pattern list display unit (603C), and an X axis candidate list display unit ( 604C), an X-axis pattern editing unit (605C), a Y-axis pattern list display unit (606C), a Y-axis candidate list display unit (607C), and a Y-axis pattern editing unit (608C). Also, an OK button (609C) and a cancel button (610C) are included.
 図7-A、図7-B、図7-C、図7-Dは表パラメータ設定ダイアログ(図6-B)、グラフパラメータ設定ダイアログ(図6-C)の各部で表示するデータを作成するための手順を示すフローチャートである。
 まず図7-Aの分析処理手順から分析パターンを作成する処理について、図8-Aを参照して説明する。ここで分析パターンとは図8-Aの802に示すような、分析処理手順(例えば図8-Aの801)に連続して含まれる2つ以上の分析処理単位の組である。ここでは、連続する2つの分析処理単位の組として説明する。分析処理手順は、分析処理単位を処理順に並べたものである。処理手順分析部118が処理手順DB120から、分析パターン作成に関して未処理の分析処理手順Mを取得する(S701)。処理手順分析部118は、分析処理手順MのN番目(Nは1以上の整数)の分析処理単位とN+1番目の分析処理単位の組を取り出し、分析パターンを作成する(S702)。処理手順分析部118は、処理手順DB120に未処理の分析処理手順がなくなるまでS701、S702を繰り返す(S703)。処理手順分析部118は、作成された分析パターンが分析処理手順内で出現する確率(遷移確率)を求める。遷移確率は、分析パターンの始点となる分析処理単位の次に、分析パターンの終点となる分析処理単位が実行される確率を示す。なお、処理手順分析部118は、テンプレートDB119に分析パターンを記憶してもよい。
7A, FIG. 7B, FIG. 7C, and FIG. 7D create data to be displayed in each part of the table parameter setting dialog (FIG. 6B) and the graph parameter setting dialog (FIG. 6C). It is a flowchart which shows the procedure for.
First, processing for creating an analysis pattern from the analysis processing procedure of FIG. 7A will be described with reference to FIG. 8-A. Here, the analysis pattern is a set of two or more analysis processing units included in succession in an analysis processing procedure (for example, 801 in FIG. 8-A) as shown by 802 in FIG. 8-A. Here, a description will be given as a set of two consecutive analysis processing units. In the analysis processing procedure, analysis processing units are arranged in the order of processing. The processing procedure analysis unit 118 acquires an unprocessed analysis processing procedure M related to the analysis pattern creation from the processing procedure DB 120 (S701). The processing procedure analysis unit 118 takes out a set of the Nth (N is an integer of 1 or more) analysis processing unit and the N + 1th analysis processing unit of the analysis processing procedure M, and creates an analysis pattern (S702). The processing procedure analysis unit 118 repeats S701 and S702 until there is no unprocessed analysis processing procedure in the processing procedure DB 120 (S703). The processing procedure analysis unit 118 obtains a probability (transition probability) that the created analysis pattern appears in the analysis processing procedure. The transition probability indicates the probability that the analysis processing unit serving as the end point of the analysis pattern is executed next to the analysis processing unit serving as the starting point of the analysis pattern. Note that the processing procedure analysis unit 118 may store the analysis pattern in the template DB 119.
 図8-Aに分析パターン作成のイメージを示す。図8-Aでは分析処理手順A~E(801)を処理し、8以上の分析パターン(802)を作成している。図8-Aの803は分析処理単位の一例であり、801、802の円の中の数字と表中の数字が対応している。図8-Bに分析パターンのデータ構造の一例、また図8-Cに分析パターンのデータの一例を示す。 Fig. 8-A shows an image of creating an analysis pattern. In FIG. 8A, the analysis processing procedures A to E (801) are processed, and eight or more analysis patterns (802) are created. 803 in FIG. 8A is an example of the analysis processing unit, and the numbers in the circles 801 and 802 correspond to the numbers in the table. FIG. 8-B shows an example of the analysis pattern data structure, and FIG. 8-C shows an example of the analysis pattern data.
 次に図7-Bの可視化テンプレート作成に関する処理について、図9-Aに可視化テンプレート作成のイメージを引用しながら説明する。ここで可視化テンプレートとは、可視化処理(図9-Aの901、データ構造及び内容の例は図9-Aの904)と分析処理結果のデータ(図9-Aの902)との関連を示すデータである。処理手順分析部118が処理手順DB120から分析処理手順群G(図9-Aの905)と、可視化テンプレート作成に関して未処理の可視化処理手順M(図9-Aの904)を取得する(S704)。処理手順分析部118は、分析処理手順群Gから可視化処理手順Mの分析処理IDに該当する分析処理手順Qを取得する(S705)。分析処理手順Qの最後の分析処理単位Nと可視化処理手順Mの可視化部品の組を作成し、可視化テンプレートTを作成する(S706)。可視化テンプレートTは、例えば図9-Bに示す情報を含む。処理手順分析部118は、可視化テンプレートTの選択列を一般化し、可視化テンプレートTを更新する(S707)。処理手順DB120に未処理の分析処理手順がなくなるまでS704~S707を繰り返す(S708)。なお、処理手順分析部118は、テンプレートDB119に可視化テンプレートTを記憶してもよい。 Next, the processing related to visualization template creation in FIG. 7-B will be described with reference to the image of visualization template creation in FIG. 9-A. Here, the visualization template indicates the relationship between the visualization process (901 in FIG. 9-A, the data structure and the example of the content is 904 in FIG. 9-A) and the analysis process result data (902 in FIG. 9-A). It is data. The processing procedure analysis unit 118 acquires the analysis processing procedure group G (905 in FIG. 9A) and the unprocessed visualization processing procedure M (904 in FIG. 9A) regarding the creation of the visualization template from the processing procedure DB 120 (S704). . The processing procedure analysis unit 118 acquires the analysis processing procedure Q corresponding to the analysis processing ID of the visualization processing procedure M from the analysis processing procedure group G (S705). A set of visualization components of the last analysis processing unit N of the analysis processing procedure Q and visualization processing procedure M is created, and a visualization template T is created (S706). The visualization template T includes information shown in FIG. 9-B, for example. The processing procedure analysis unit 118 generalizes the selected column of the visualization template T and updates the visualization template T (S707). S704 to S707 are repeated until there is no unprocessed analysis processing procedure in the processing procedure DB 120 (S708). Note that the processing procedure analysis unit 118 may store the visualization template T in the template DB 119.
 ここでS707の一般化の例を示す。
 (1)Y軸列のリストに含まれる文字列に法則がある場合、パターンに置き換える。例:Y軸列リスト「2012年04月、2012年05月、…」の場合、Y軸「yyyy年MM月」。例:Y軸列リスト「S1000、S1001、S1002、…」の場合、Y軸「S####」(#は数字1文字を表す)
 (2)X軸列の内容により、カテゴリに置き換える。例:数値データの場合:数値データ列、日付データの場合:日付データ列、文字列の場合:文字列データ列、など。
Here, an example of generalization in S707 is shown.
(1) If there is a law in the character string included in the list of Y-axis columns, it is replaced with a pattern. Example: In the case of the Y axis column list “April 2012, May 2012,...”, The Y axis “yyyy year MM month”. Example: In the case of the Y-axis column list “S1000, S1001, S1002,...”, The Y-axis “S ####” (# represents one character)
(2) Replace with a category according to the contents of the X-axis row. Example: Numeric data: Numeric data string, Date data: Date data string, Character string: Character string data string, etc.
 図9-Aに可視化テンプレート作成のイメージ、図9―Bに可視化テンプレートのデータ構造の一例、図9―Cに可視化テンプレートの一例を示す。可視化テンプレートには少なくとも、直前処理(図9-Aの902)や選択データ情報(図9-Aの903で選択された入力データの列名、列の要素などの情報)、処理結果データ情報(図9-Aの902を実行した時点でのデータ列名、データ要素などの情報)が含まれる。 Fig. 9-A shows the image of creating the visualization template, Fig. 9-B shows an example of the data structure of the visualization template, and Fig. 9-C shows an example of the visualization template. The visualization template includes at least immediately preceding processing (902 in FIG. 9-A), selection data information (information such as column names and column elements of the input data selected in 903 in FIG. 9-A), and processing result data information ( Information such as data string names and data elements at the time of executing 902 in FIG. 9A.
 図7-Cは利用者が選択した選択データ、可視化部品情報に適用可能な可視化コンテンツ編集支援情報を生成する手順の一例である。
 まず、処理構築部115がテンプレートDB119から分析パターン、可視化テンプレートを取得する(S711)。処理構築部115が選択データと可視化部品情報に適用可能な可視化コンテンツ編集支援情報を生成する(S712)。詳細は後述する。処理構築部115が、予め決められた基準で並べ替えた可視化コンテンツ編集支援情報を計算機102に送付する(713)。
FIG. 7C shows an example of a procedure for generating visualization content editing support information applicable to selection data and visualization component information selected by the user.
First, the process construction unit 115 acquires an analysis pattern and a visualization template from the template DB 119 (S711). The processing construction unit 115 generates visualization content editing support information applicable to the selection data and visualization component information (S712). Details will be described later. The processing construction unit 115 sends the visualized content editing support information rearranged according to a predetermined criterion to the computer 102 (713).
 図7-DはS712の手順を詳細化した手順である。図7-Dでは予め決められた基準を「遷移確率が大きい順」としているが、他には例えば、「「入力データ選択」から可視化処理までの分析処理単位数が少ない順」などでもよいし、他の基準でもよい。これらは設定ファイルなどで切り替えられるものとする。 Fig. 7-D shows the detailed procedure of S712. In FIG. 7-D, the predetermined criterion is “in order of increasing transition probability”. However, for example, “in order of decreasing number of analysis processing units from“ input data selection ”to visualization processing” may be used. Other criteria may be used. These can be switched by a setting file or the like.
 以下の説明において、J、Kはパラメータ、Nは分析処理単位、L、L0はリスト、Mは処理対象の分析処理単位また可視化処理を表す。以下の処理は、処理構築部115が実行する。
 空のリストL、J=1、K=1、分析処理単位N=「入力データ選択」、空のリストL0とし、L0にNを追加、L0をLに追加する(S7121)。分析処理単位Nからの遷移確率がK番目に大きい分析処理単位また可視化処理をテンプレートDB119から取得し、Mとする(S7122)。NとMの組をまだ処理していなければS7124に進む。処理済の場合、K=K+1としてS7122に進む(S7123、S7126)。Mが可視化処理の場合、L0をリストL1にコピーし、L0にMを追加し、L0をLに登録し、L0にL1を代入しK=K+1としてS7122に進む。Mが可視化処理ではない場合、S7127に進む(S7124~S7126)。
 Mが空の場合、J=J-1、L0の先頭からJ番目までをリストL1にコピーし、K=1とし、J=0の場合S713に進む。J=0ではない場合S7122に進む(S7127、S7133、S7234)。
 Mが空ではない場合、選択データにL0に含まれる分析処理を適用した結果のデータをDとし、データDに分析処理単位Mを適用する(S7127~S7129)。適用可能な場合、L0にMを追加、N=M、J=J+1、K=1として、S7122に進む(S7130、S7131)。適用可能ではない場合、K=K+1としてS7122に進む(S7130、S7132)。
In the following description, J and K are parameters, N is an analysis processing unit, L and L0 are lists, and M is an analysis processing unit to be processed or a visualization process. The process construction unit 115 executes the following process.
Empty list L, J = 1, K = 1, analysis processing unit N = “input data selection”, empty list L0, N is added to L0, and L0 is added to L (S7121). An analysis processing unit or visualization process having the Kth largest transition probability from the analysis processing unit N is acquired from the template DB 119 and is set as M (S7122). If the set of N and M has not yet been processed, the process proceeds to S7124. If the processing has been completed, the process proceeds to S7122 with K = K + 1 (S7123, S7126). When M is a visualization process, L0 is copied to the list L1, M is added to L0, L0 is registered in L, L1 is substituted into L0, and K = K + 1 is set, and the process proceeds to S7122. If M is not a visualization process, the process proceeds to S7127 (S7124 to S7126).
If M is empty, J = J−1, the first to Jth of L0 are copied to the list L1, K = 1, and if J = 0, the process proceeds to S713. When J is not 0, the process proceeds to S7122 (S7127, S7133, S7234).
When M is not empty, the data obtained as a result of applying the analysis process included in L0 is selected as D, and the analysis processing unit M is applied to the data D (S7127 to S7129). If applicable, M is added to L0, N = M, J = J + 1, K = 1, and the process proceeds to S7122 (S7130, S7131). If not applicable, the process proceeds to S7122 with K = K + 1 (S7130, S7132).
 ここで適用可能かどうかの判定は、データDと分析処理単位MのパラメータPが以下の条件を満たすとき適用可能とする。
 (1)パラメータPが列名Aのとき、データDが列名Aを含む場合
 (2)パラメータPが列名Aと要素名αのとき、データDが列名Aを含み、列名Aが要素名αを含む場合。
 また以下の場合適用不可とする。
 分析処理単位MをパラメータPでデータDに適用した結果のデータD1の行数が0の場合。
Here, the determination of whether or not it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
(1) When parameter P is column name A and data D includes column name A (2) When parameter P is column name A and element name α, data D includes column name A and column name A is When element name α is included.
It is not applicable in the following cases.
When the number of rows of data D1 as a result of applying analysis processing unit M to data D with parameter P is zero.
 図10-A及び図10-Bに、可視化部品として「棒グラフ」が選択された場合の、可視化コンテンツ編集支援情報生成イメージ示す。可視化コンテンツ編集支援情報は図10-Aに示すように、「入力データ選択」から「棒グラフ」の間を埋める処理手順を、分析パターンと可視化テンプレートを用いて、予め決められた基準で構築するための情報である。
 例えば、処理構築部115は、テンプレートDB119から取得した分析パターンと可視化テンプレートに基づき、選択された入力データ(ここでは処理単位識別番号「1」)から選択された可視化部品(ここでは棒グラフ)への処理手順候補(可視化コンテンツ編集支援情報)をひとつ又は複数求める。例えば、分析パターンが示す処理の遷移と、可視化テンプレートが示す最終の分析処理単位から可視化部品への遷移をたどり、処理手順候補を求める。複数の処理手順候補は、遷移確率が大きい順などの予め定められた基準に従い並べる(順位付けする)ことができる。なお、可視化部品として棒グラフ以外が選択された場合も同様である。
FIGS. 10A and 10B show visualization content editing support information generation images when “bar graph” is selected as the visualization component. As shown in FIG. 10-A, the visualization content editing support information is used to construct a processing procedure for filling a space between “input data selection” and “bar graph” based on a predetermined standard using an analysis pattern and a visualization template. Information.
For example, the processing construction unit 115 applies the selected input data (here, the processing unit identification number “1”) to the selected visualization component (here, a bar graph) based on the analysis pattern and the visualization template acquired from the template DB 119. One or a plurality of processing procedure candidates (visualized content editing support information) are obtained. For example, a process procedure candidate is obtained by following the process transition indicated by the analysis pattern and the transition from the final analysis processing unit indicated by the visualization template to the visualization part. A plurality of processing procedure candidates can be arranged (ranked) in accordance with a predetermined criterion such as an order of increasing transition probability. The same is true when a component other than the bar graph is selected as the visualization component.
 図11は、S206の可視化コンテンツ編集支援情報が表またはグラフパラメータ設定ダイアログにセットされる手順を示すフローチャートである。まず、計算機102は、可視化コンテンツ編集ダイアログの可視化部品選択部601Aの選択結果(選択された可視化部品)を取得し、選択データ(選択された入力データ)と共にサーバ101に送付する(S2061)。サーバ101の処理構築部115がS7121~S7134で作成し、送付した可視化コンテンツ編集支援情報を、可視化テンプレート編集部122が受け取る(S2062)。可視化テンプレート編集部122は、選択された可視化部品に応じて表またはグラフ用のパラメータ設定ダイアログ(例えば図6-B、図6-C参照)を用意し、各一覧表示部に値をセットする(S2063)。可視化テンプレート編集部122は、各一覧表示部の選択結果を取得し、S207へ進む(S2064)。 FIG. 11 is a flowchart showing a procedure in which the visualized content editing support information in S206 is set in the table or graph parameter setting dialog. First, the computer 102 acquires the selection result (selected visualization component) of the visualization component selection unit 601A in the visualization content editing dialog, and sends it to the server 101 together with the selection data (selected input data) (S2061). The visualization template editing unit 122 receives the visualization content editing support information created and sent by the process construction unit 115 of the server 101 in S7121 to S7134 (S2062). The visualization template editing unit 122 prepares a parameter setting dialog for a table or a graph (see, for example, FIG. 6-B and FIG. 6-C) according to the selected visualization part, and sets a value in each list display unit ( S2063). The visualization template editing unit 122 acquires the selection result of each list display unit, and proceeds to S207 (S2064).
 図12-A、図12-Bに、S2063のグラフパラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージを示す。利用者100が「棒グラフ」を選択した場合、サーバ101がS7121~S7134の処理を実行して、表示部品が「棒グラフ」となる分析処理手順、可視化テンプレートの系列群を生成し、可視化コンテンツ編集支援情報とする。視点候補一覧表示部601Cには可視化テンプレートの視点選択列及び可視化処理の直前までの分析処理結果データに含まれる列を表示する(12A)。X軸パターン一覧表示部603Cには、可視化テンプレートの選択列のX軸パターンを表示する(12B)。X軸候補一覧表示部604Cには、可視化処理の直前までの分析処理結果データに含まれる列を表示する(12C)。Y軸パターン一覧表示部606Cには、可視化テンプレートの選択列のY軸パターンを表示する(12D)。Y軸候補一覧表示部607Cには、可視化処理の直前までの分析処理結果データに含まれる列を表示する(12E)。 12-A and 12-B show an image of setting content editing support information in the graph parameter setting dialog in S2063. When the user 100 selects “bar graph”, the server 101 executes the processing of S7121 to S7134 to generate an analysis processing procedure in which the display component becomes “bar graph”, a series group of visualization templates, and visualization content editing support Information. The viewpoint candidate list display unit 601C displays the viewpoint selection column of the visualization template and the columns included in the analysis processing result data until immediately before the visualization processing (12A). The X axis pattern list display section 603C displays the X axis pattern of the selected column of the visualization template (12B). The X-axis candidate list display unit 604C displays columns included in the analysis process result data up to immediately before the visualization process (12C). The Y axis pattern list display section 606C displays the Y axis pattern of the selected column of the visualization template (12D). The Y-axis candidate list display unit 607C displays columns included in the analysis process result data up to immediately before the visualization process (12E).
 図13-A、図13-Bに、S2063の表パラメータ設定ダイアログにコンテンツ編集支援情報をセットするイメージを示す。利用者100が「表」を選択した場合、サーバ101がS7121~S7134の処理を実行して、表示部品が「表」となる分析処理手順、可視化テンプレートの系列群を生成し、可視化コンテンツ編集支援情報とする。表側列候補一覧表示部601Bには可視化テンプレートの表側選択列及び可視化処理の直前までの分析処理結果データに含まれる列を表示する(13A)。表頭列パターン一覧表示部603Bには、可視化テンプレートの選択列の表頭パターンを表示する(13B)。表頭列候補一覧表示部604Bには、可視化処理の直前までの分析処理結果データに含まれる列を表示する(13C)。
 このように、過去に可視化に用いられたデータを示す選択列と、過去の処理結果のデータとを、各一覧表示部に表示して、ユーザによる選択を支援する。
13A and 13B show images for setting content editing support information in the table parameter setting dialog in S2063. When the user 100 selects “table”, the server 101 executes the processing of S7121 to S7134 to generate an analysis processing procedure in which the display component becomes “table”, a series of visualization templates, and visualization content editing support Information. The front side column candidate list display unit 601B displays the front side selected columns of the visualization template and the columns included in the analysis processing result data until immediately before the visualization processing (13A). The front head row pattern list display section 603B displays the front head pattern of the selected column of the visualization template (13B). In the head column candidate list display section 604B, columns included in the analysis processing result data until immediately before the visualization processing are displayed (13C).
As described above, the selection column indicating the data used for visualization in the past and the data of the past processing result are displayed on each list display unit to assist the selection by the user.
 図14は、S207の手順を詳細化した手順である。図14でも予め決められた基準を「遷移確率が大きい順」としているが、他には例えば、「「入力データ選択」から可視化処理までの分析処理単位数が少ない順」などが考えられる。これは設定ファイルなどで切り替えられるものとする。
 以下の説明において、Kはパラメータ、Nは分析処理単位、Lはリスト、Mは処理対象の分析処理単位また可視化処理、Gは可視化処理を表す。以下の処理は、処理構築部115が実行する。
 空のリストL、K=1、分析処理単位N=「入力データ選択」、可視化処理G=利用者100による可視化コンテンツ編集結果とし、LにNを追加する(S2071)。分析処理単位Nからの遷移確率がK番目に大きい分析処理単位または可視化処理をテンプレートDB119から取得し、Mとする(S2072)。NとMの組をまだ処理していなければS2075に進む。処理済の場合K=K+1としてS2072に進む(S2073、S2074)。M=Gの場合、LにMを追加し、分析処理として出力しS209に進む(S2075、S2076)。M=Gではなく、Mが空の場合K=K+1としてS2072に進む(S2077、S2074)。Mが空でない場合、選択データにLに含まれる分析処理を適用した結果のデータをDとし、データDに分析処理単位Mを適用する(S2077~S2079)。適用可能の場合、LにMを追加、N=M、K=1として、S2072に進む(S2080、S2081)。適用可能ではない場合、K=K+1としてS2072に進む(S2080、S2082)。上記の処理の結果生成される分析処理手順は、図15に示すように、分析処理手順と可視化処理手順を合わせたデータとなる。また、図15に上記の処理の結果生成される分析処理手順を実行した結果のイメージを示す。
FIG. 14 is a detailed procedure of S207. In FIG. 14, the predetermined criterion is “order in which the transition probability is large”, but for example, “order in which the number of analysis processing units from“ input data selection ”to visualization processing is small” may be considered. This can be switched by a setting file or the like.
In the following description, K is a parameter, N is an analysis processing unit, L is a list, M is an analysis processing unit or visualization process to be processed, and G is a visualization process. The process construction unit 115 executes the following process.
Empty list L, K = 1, analysis processing unit N = “input data selection”, visualization process G = visualized content editing result by user 100, and N is added to L (S2071). The analysis process unit or visualization process with the Kth largest transition probability from the analysis process unit N is acquired from the template DB 119 and is set as M (S2072). If the set of N and M has not yet been processed, the process proceeds to S2075. If it has been processed, the process proceeds to S2072 with K = K + 1 (S2073, S2074). If M = G, M is added to L, output as analysis processing, and the process proceeds to S209 (S2075, S2076). When M is not M = G but M is empty, K = K + 1 and the process proceeds to S2072 (S2077, S2074). If M is not empty, the data obtained as a result of applying the analysis processing included in L to the selected data is set as D, and the analysis processing unit M is applied to the data D (S2077 to S2079). If applicable, M is added to L, N = M, K = 1, and the process proceeds to S2072 (S2080, S2081). If not applicable, the process proceeds to S2072 with K = K + 1 (S2080, S2082). As shown in FIG. 15, the analysis processing procedure generated as a result of the above processing is data combining the analysis processing procedure and the visualization processing procedure. FIG. 15 shows an image of the result of executing the analysis processing procedure generated as a result of the above processing.
 ここで適用可能かどうかの判定は、データDと分析処理単位MのパラメータPが以下の条件を満たすとき適用可能とする。
 (1)パラメータPが列名Aのとき、データDが列名Aを含む場合
 (2)パラメータPが列名Aと要素名αのとき、データDが列名Aを含み、列名Aが要素名αを含む場合
 (3)(1)(2)以外の場合でも、分析処理単位MをパラメータPでデータDに適用した結果のデータD1の行数が1以上の場合
 第一の実施の形態によれば、利用者は入力データ選択及び可視化コンテンツ編集を行うだけで、その間を埋める分析手順については意識することなく、望みの可視化コンテンツを作成することが可能となるため、利用者100の分析作業の手間を削減することが可能となる。
 
Here, the determination of whether or not it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
(1) When parameter P is column name A and data D includes column name A (2) When parameter P is column name A and element name α, data D includes column name A and column name A is When the element name α is included (3) Even in cases other than (1) and (2), when the number of rows of the data D1 as a result of applying the analysis processing unit M to the data D with the parameter P is one or more According to the embodiment, the user can create the desired visualization content only by selecting the input data and editing the visualization content, without being aware of the analysis procedure for filling in between them. It is possible to reduce the labor of analysis work.
(第二の実施の形態)
 第一の実施の形態では利用者が可視化部品を選択し、並べることで可視化コンテンツを作成したが、第二の実施の形態では、既存のコンテンツを利用することで可視化コンテンツを作成する。
 図16は第二の実施の形態の可視化コンテンツ編集ダイアログである。第二の実施の形態の可視化コンテンツ編集ダイアログは可視化コンテンツ事例選択部(605A)をさらに含む。可視化コンテンツ事例選択部(605A)には、過去に可視化した例(例えば可視化イメージ)が示される。利用者が可視化コンテンツを編集する際に、可視化コンテンツ事例選択部605Aに表示される可視化コンテンツ事例を選択し、必要に応じて、可視化部品選択部601Aに含まれる項目を選択し、可視化コンテンツ表示部602Aに追加することにより、可視化コンテンツを編集する。
 可視化コンテンツ事例の各可視化部品は図4-A、Bに示す分析処理手順及び可視化処理手順と紐づいているため、第一の実施の形態と同様の処理を施すことにより、利用者が望む可視化コンテンツを生成することができる。
 第二の実施の形態によれば、利用者は既存の可視化コンテンツを利用したい場合、データと既存コンテンツを選択するだけで、その間を埋める分析手順については意識することなく、望みの可視化コンテンツを作成することが可能となる。また、利用者100の分析作業の手間を削減することが可能となる。
 
(Second embodiment)
In the first embodiment, the user creates a visualization content by selecting and arranging the visualization components. In the second embodiment, the visualization content is created by using the existing content.
FIG. 16 shows a visualized content edit dialog according to the second embodiment. The visualized content edit dialog according to the second embodiment further includes a visualized content case selection unit (605A). An example (for example, a visualization image) visualized in the past is shown in the visualization content example selection unit (605A). When the user edits the visualized content, the user selects the visualized content case displayed on the visualized content case selecting unit 605A, selects items included in the visualized part selecting unit 601A as necessary, and displays the visualized content display unit. The visualization content is edited by adding to 602A.
Each visualization component of the visualization content example is linked to the analysis processing procedure and the visualization processing procedure shown in FIGS. 4A and 4B. Therefore, the visualization desired by the user is performed by performing the same processing as in the first embodiment. Content can be generated.
According to the second embodiment, when the user wants to use the existing visualized content, the user simply selects the data and the existing content and creates the desired visualized content without being aware of the analysis procedure for filling in between them. It becomes possible to do. In addition, it is possible to reduce the labor of analysis work of the user 100.
(第三の実施の形態)
 第一の実施の形態のS7130、S2080でのデータDに対する分析処理単位Mの適用可能判定は、データDがパラメータPに指定された列名や要素を含むかどうかで判定されることが多いため、分析処理単位Mの適用可能範囲が小さくなってしまう。これに対し第三の実施の形態では、パラメータPの列名や要素を要素数や出現頻度に基づく類似度を加味することにより、分析処理単位Mの適用可能範囲を拡張する。
 例えば、過去の分析処理に用いたデータと、選択された入力データとの類似度に基づいて、該過去の分析処理を、選択された入力データに適用するか否かを判断する。
 より詳細には、適用可能かどうかの判定は、データDと分析処理単位MのパラメータPが以下の条件を満たすとき適用可能とする。
 (1)パラメータPが列名Aのとき、データDが列名Aを含む場合
 (2)パラメータPが列名Aと要素名αのとき、データDが列名Aを含み、列名Aが要素名αを含む場合
 (3)パラメータPが列名Aと要素名αのとき、データDが列名Aを含まない場合、または列名Aを含むが列Aが要素名αを含まない場合、分析処理単位Mのデータ情報から列名Aの情報を抽出し、列名Aと類似するデータDの列Bおよび要素名αと類似する列B中の要素を抽出する。類似列および類似要素を抽出可能な場合、適用可能とする。ここで列の類似は例えば、列の各要素の出現頻度または出現率が大きい順に差を計算し、その総和を類似度としその差が予め定められた閾値より小さい場合類似しているとする。また要素の類似は、出現頻度あるいは出現率の差が閾値より小さい場合類似しているとする。類似の定義は目的に応じて変更することが可能である。
 また以下の場合適用不可とする。
 分析処理単位MをパラメータPでデータDに適用した結果のデータD1の行数が0の場合。
(Third embodiment)
Applicability determination of the analysis processing unit M for the data D in S7130 and S2080 in the first embodiment is often determined by whether or not the data D includes the column name or element specified in the parameter P. The applicable range of the analysis processing unit M becomes small. On the other hand, in the third embodiment, the applicable range of the analysis processing unit M is expanded by taking into account the similarity based on the number of elements and the appearance frequency of the column names and elements of the parameter P.
For example, it is determined whether to apply the past analysis process to the selected input data based on the similarity between the data used in the past analysis process and the selected input data.
More specifically, the determination as to whether it is applicable is applicable when the data D and the parameter P of the analysis processing unit M satisfy the following conditions.
(1) When parameter P is column name A and data D includes column name A (2) When parameter P is column name A and element name α, data D includes column name A and column name A is When element name α is included (3) When parameter P is column name A and element name α, data D does not include column name A, or column name A is included but column A does not include element name α Then, the information of the column name A is extracted from the data information of the analysis processing unit M, and the elements in the column B of the data D similar to the column name A and the column B similar to the element name α are extracted. Applicable when similar columns and similar elements can be extracted. Here, the similarity between columns is assumed to be similar when, for example, the difference is calculated in descending order of the appearance frequency or appearance rate of each element in the column, and the sum is the similarity, and the difference is smaller than a predetermined threshold. The similarity of elements is assumed to be similar when the difference in appearance frequency or appearance rate is smaller than a threshold value. Similar definitions can be changed according to the purpose.
It is not applicable in the following cases.
When the number of rows of data D1 as a result of applying analysis processing unit M to data D with parameter P is zero.
 図17-A、17-Bに第三の実施の形態の分析パターンのデータ構造とデータの一例を示す。 17A and 17B show an example of the data structure and data of the analysis pattern of the third embodiment.
 図18に類似度算出の一例、パラメータP及びデータDが下記のような構成であり、閾値を30とした場合の各列、各要素の類似度算出の流れを説明する。
 パラメータPの列「A」が要素αを含む割合が40%、要素βを含む割合が35%、要素γを含む割合が15%、要素δを含む割合が10%、…とする。データDは列「B」「C」「D」…を含むとし、列「B」はB1~B4の4つの要素を含み、B1が45%、B2が30%、B3が15%、B4が5%の割合で含まれる。列「C」はC1、C2の二つの要素を含み、C1、C2ともに50%の割合で含まれる。列「D」はD1~D4…の要素を含み、D1が5%、D2が4%、D3が4%、D4が3%、…の割合で含まれるものとする。この場合、各列の類似度を以下のように算出する。列「A」「B」間の類似度A-Bは|40-45|+|35-30|+|15-15|+|10-5|=15となる。
列「A」「C」間の類似度A-Cは|40-50|+|35-50|+|15-0|+|10-0|=15となる。列「A」「D」間の類似度A-Dは|40-5|+|35-4|+|15-4|+|10-3|+…=168となる。閾値30の場合、列「A」に類似する列は列「B」であり、要素αに最も類似する列「B」の要素はB1である。
FIG. 18 illustrates an example of similarity calculation, parameters P and data D having the following configuration, and the flow of similarity calculation for each column and each element when the threshold is 30.
It is assumed that the column “A” in the parameter P includes 40% of the element α, 35% of the element β, 15% of the element γ, 10% of the element δ, and so on. Data D includes columns “B”, “C”, “D”..., And column “B” includes four elements B1 to B4. B1 is 45%, B2 is 30%, B3 is 15%, and B4 is It is included at a rate of 5%. The column “C” includes two elements C1 and C2, and both C1 and C2 are included in a ratio of 50%. The column “D” includes the elements D1 to D4..., And D1 is 5%, D2 is 4%, D3 is 4%, D4 is 3%,. In this case, the similarity of each column is calculated as follows. The similarity AB between the columns “A” and “B” is | 40−45 | + | 35−30 | + | 15−15 | + | 10−5 | = 15.
The similarity AC between the columns “A” and “C” is | 40−50 | + | 35−50 | + | 15−0 | + | 10−0 | = 15. The similarity AD between the columns “A” and “D” is | 40-5 | + | 35-4 | + | 15-4 | + | 10-3 | +. In the case of the threshold value 30, the column similar to the column “A” is the column “B”, and the element of the column “B” most similar to the element α is B1.
 ここで|X-Y|はX-Yの絶対値を表すものとする。類似度の算出方法は様々考えられるが、ここでは、各列の割合が大きい順に要素を並び替え、1番大きい要素同士、2番目に大きい要素同士、…の割合を引算し、列間の要素数が等しくない場合は、例えば類似度A-Cのように列「A」の要素数が4、列「C」の要素数が2の場合は、3番目に大きい要素同士、4番目に大きい要素同士の引算は0と引算する方式を採用している。この算出方法の場合、類似度が小さい方が類似していることとなり、閾値を下回る列がない場合は、類似列なしとする。
 第三の実施の形態によれば、パラメータPの列名や要素を要素数や出現頻度に基づく類似度を加味することにより、分析処理単位Mの適用範囲を拡張することが可能となる。
 
Here, | XY | represents the absolute value of XY. There are various methods for calculating the similarity, but here, the elements are rearranged in descending order of the ratio of each column, and the ratio of the largest elements, the second largest elements, and so on are subtracted. When the number of elements is not equal, for example, when the number of elements in the column “A” is 4 and the number of elements in the column “C” is 2, as in the degree of similarity AC, the third largest element is the fourth Subtraction between large elements employs a method of subtracting zero. In the case of this calculation method, the smaller similarity is similar, and if there is no column below the threshold value, there is no similar column.
According to the third embodiment, the application range of the analysis processing unit M can be expanded by taking into account the similarity based on the number of elements and the appearance frequency of the column names and elements of the parameter P.
(構成例)
[構成例1]
 利用者が入力データ、可視化方法を指定すると、入力データに対して指定された可視化方法の実現に必要な処理手順を自動生成する分析支援方法及びシステム。
[構成例2]
 表の列、グラフの軸を指定することで可視化方法の指定を支援する可視化コンテンツ作成支援方法及びシステム。
[構成例3]
 既存コンテンツを指定することで分析手順、可視化方法の指定を支援する可視化コンテンツ作成支援方法及びシステム。
[構成例4]
 分析処理の履歴、可視化処理の履歴を利用して、可視化方法の指定を支援するデータを生成する分析手順再構築方法及びシステム。
[構成例5]
 分析処理の履歴、可視化処理の履歴を分析し、分析処理の分解、再構築を行う構成例4の分析手順再構築方法及びシステム。
[構成例6]
 データ間の類似度を加味して、分析処理の適用の拡張を行う構成例4の分析手順再構築方法及びシステム。
(Configuration example)
[Configuration example 1]
An analysis support method and system for automatically generating a processing procedure necessary for realizing a visualization method designated for input data when a user designates input data and a visualization method.
[Configuration example 2]
A visualization content creation support method and system for supporting specification of a visualization method by designating a table column and a graph axis.
[Configuration example 3]
A visualization content creation support method and system that supports specification of an analysis procedure and a visualization method by specifying existing content.
[Configuration Example 4]
An analysis procedure restructuring method and system for generating data that supports the designation of a visualization method using the history of analysis processing and the history of visualization processing.
[Configuration Example 5]
The analysis procedure reconstruction method and system of the configuration example 4 which analyzes the analysis process history and the visualization process history, and decomposes and reconstructs the analysis process.
[Configuration Example 6]
The analysis procedure reconstruction method and system of Configuration Example 4 in which application of analysis processing is expanded in consideration of the similarity between data.
 なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれている。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加・削除・置換をすることが可能である。 In addition, this invention is not limited to the above-mentioned Example, Various modifications are included. For example, the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described. Further, a part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of one embodiment. Further, it is possible to add, delete, and replace other configurations for a part of the configuration of each embodiment.
 また、上記の各構成、機能、処理部、処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、SSD(Solid State Drive)等の記録装置、または、ICカード、SDカード、DVD等の記録媒体に置くことができる。
 また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。
Each of the above-described configurations, functions, processing units, processing means, and the like may be realized by hardware by designing a part or all of them with, for example, an integrated circuit. Each of the above-described configurations, functions, and the like may be realized by software by interpreting and executing a program that realizes each function by the processor. Information such as programs, tables, and files for realizing each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.
Further, the control lines and information lines indicate what is considered necessary for the explanation, and not all the control lines and information lines on the product are necessarily shown. Actually, it may be considered that almost all the components are connected to each other.
101 サーバ
102 計算機
103 ディスプレイ
104 入力装置
105 ネットワーク
106 ネットワーク
107 DB
111 入力データ作成部
112 分析処理実行部
113 データ可視化部
114 処理手順記録部
115 処理構築部
116 処理手順実行部
117 処理手順推奨部
118 処理手順分析部
119 テンプレートDB
120 処理手順DB
101 Server 102 Computer 103 Display 104 Input Device 105 Network 106 Network 107 DB
111 Input Data Creation Unit 112 Analysis Processing Execution Unit 113 Data Visualization Unit 114 Processing Procedure Recording Unit 115 Processing Construction Unit 116 Processing Procedure Execution Unit 117 Processing Procedure Recommendation Unit 118 Processing Procedure Analysis Unit 119 Template DB
120 processing procedure DB

Claims (14)

  1.  入力データに対して所定の分析処理を実行し、処理結果を可視化するシステムにおけるデータ分析支援処理方法であって、
     入力装置を介して利用者が選択した入力データの識別情報と、可視化方法の識別情報を入力すると、該入力データの識別情報から予測される、可視化のための設定項目の候補を表示するデータ分析支援処理方法。
    A data analysis support processing method in a system that executes predetermined analysis processing on input data and visualizes the processing result,
    Data analysis that displays candidate setting items for visualization predicted from the identification information of the input data when the identification information of the input data selected by the user and the identification information of the visualization method are input via the input device Support processing method.
  2.  前記設定項目の候補の中から選択された設定項目に従い、前記可視化方法で、入力データに基づく処理結果を表示する請求項1に記載のデータ分析支援処理方法。 The data analysis support processing method according to claim 1, wherein a processing result based on input data is displayed by the visualization method according to a setting item selected from the setting item candidates.
  3.  選択された入力データに対する過去の可視化表示又はその概略をさらに表示する請求項1に記載のデータ分析支援処理方法。 The data analysis support processing method according to claim 1, further displaying a past visualization display or an outline of the selected input data.
  4.  入力データに対する分析処理の履歴及び可視化の履歴に基づき、選択された入力データに対する分析処理及び可視化のための設定項目の候補を予測する請求項1に記載のデータ分析支援処理方法。 The data analysis support processing method according to claim 1, wherein candidates of setting items for analysis processing and visualization of the selected input data are predicted based on the history of analysis processing and visualization history of the input data.
  5.  前記分析処理は、所定の分析処理単位の組合せで構成され、
     入力データに対する分析処理の履歴及び可視化の履歴について、分析処理単位の遷移を示す分析パターンを解析し、選択された入力データに対する分析処理の候補を該分析パターンに従い予測する請求項1に記載のデータ分析支援処理方法。
    The analysis processing is composed of a combination of predetermined analysis processing units,
    The data according to claim 1, wherein an analysis pattern indicating a transition of an analysis processing unit is analyzed with respect to an analysis processing history and a visualization history of input data, and an analysis processing candidate for the selected input data is predicted according to the analysis pattern. Analysis support processing method.
  6.  過去の分析処理に用いたデータと、選択された入力データとの類似度に基づいて、該過去の分析処理を、選択された入力データに適用するか否かを判断する請求項5に記載のデータ分析支援処理方法。 6. The method according to claim 5, wherein whether or not to apply the past analysis processing to the selected input data is determined based on the similarity between the data used for the past analysis processing and the selected input data. Data analysis support processing method.
  7.  前記可視化のための設定項目は、表の列に表示するデータ、又は、グラフの軸とするデータである請求項1に記載のデータ分析支援処理方法。 The data analysis support processing method according to claim 1, wherein the setting item for visualization is data displayed in a table column or data used as a graph axis.
  8.  入力データに対して所定の分析処理を実行した処理結果を可視化するデータ可視化部と、
     入力装置を介して利用者が選択した入力データの識別情報と、可視化方法の識別情報を入力すると、該入力データの識別情報から予測される、可視化のための設定項目の候補を表示するデータ分析支援処理部と
    を備えたシステム。
    A data visualization unit that visualizes the result of executing a predetermined analysis process on input data;
    Data analysis that displays candidate setting items for visualization predicted from the identification information of the input data when the identification information of the input data selected by the user and the identification information of the visualization method are input via the input device A system including a support processing unit.
  9.  前記データ分析支援処理部は、前記設定項目の候補の中から選択された設定項目に従い、前記可視化方法で、入力データに基づく処理結果を表示する請求項8に記載のシステム。 The system according to claim 8, wherein the data analysis support processing unit displays a processing result based on input data by the visualization method according to a setting item selected from the setting item candidates.
  10.  前記データ分析支援処理部は、選択された入力データに対する過去の可視化表示又はその概略をさらに表示する請求項8に記載のシステム。 The system according to claim 8, wherein the data analysis support processing unit further displays a past visualization display or an outline of the selected input data.
  11.  前記データ分析支援処理部は、入力データに対する分析処理の履歴及び可視化の履歴に基づき、選択された入力データに対する分析処理及び可視化のための設定項目の候補を予測する請求項8に記載のシステム。 The system according to claim 8, wherein the data analysis support processing unit predicts a setting item candidate for analysis processing and visualization of selected input data based on a history of analysis processing and visualization history of the input data.
  12.  前記分析処理は、所定の分析処理単位の組合せで構成され、
     前記データ分析支援処理部は、入力データに対する分析処理の履歴及び可視化の履歴について、分析処理単位の遷移を示す分析パターンを解析し、選択された入力データに対する分析処理の候補を該分析パターンに従い予測する請求項8に記載のシステム。
    The analysis processing is composed of a combination of predetermined analysis processing units,
    The data analysis support processing unit analyzes an analysis pattern indicating a transition of an analysis processing unit with respect to an analysis processing history and a visualization history of input data, and predicts an analysis processing candidate for the selected input data according to the analysis pattern. The system according to claim 8.
  13.  前記データ分析支援処理部は、過去の分析処理に用いたデータと、選択された入力データとの類似度に基づいて、該過去の分析処理を、選択された入力データに適用するか否かを判断する請求項12に記載のシステム。 The data analysis support processing unit determines whether to apply the past analysis processing to the selected input data based on the similarity between the data used for the past analysis processing and the selected input data. The system of claim 12 for determining.
  14.  前記データ分析支援処理部は、前記可視化のための設定項目は、表の列に表示するデータ、又は、グラフの軸とするデータである請求項8に記載のシステム。 The system according to claim 8, wherein in the data analysis support processing unit, the setting item for visualization is data displayed in a table column or data used as a graph axis.
PCT/JP2013/081513 2012-11-26 2013-11-22 Data analysis assistance processing system and method WO2014081012A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2012257486A JP6025520B2 (en) 2012-11-26 2012-11-26 Data analysis support processing system and method
JP2012-257486 2012-11-26

Publications (1)

Publication Number Publication Date
WO2014081012A1 true WO2014081012A1 (en) 2014-05-30

Family

ID=50776188

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2013/081513 WO2014081012A1 (en) 2012-11-26 2013-11-22 Data analysis assistance processing system and method

Country Status (2)

Country Link
JP (1) JP6025520B2 (en)
WO (1) WO2014081012A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016013099A1 (en) * 2014-07-25 2016-01-28 株式会社日立製作所 Feature data management system and feature data management method
US9898464B2 (en) 2014-11-19 2018-02-20 Kabushiki Kaisha Toshiba Information extraction supporting apparatus and method
US10444742B2 (en) 2016-02-09 2019-10-15 Kabushiki Kaisha Toshiba Material recommendation apparatus
US10936806B2 (en) 2015-11-04 2021-03-02 Kabushiki Kaisha Toshiba Document processing apparatus, method, and program
US11037062B2 (en) 2016-03-16 2021-06-15 Kabushiki Kaisha Toshiba Learning apparatus, learning method, and learning program
US11481663B2 (en) 2016-11-17 2022-10-25 Kabushiki Kaisha Toshiba Information extraction support device, information extraction support method and computer program product

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021196912A (en) * 2020-06-15 2021-12-27 ヤフー株式会社 Execution apparatus, execution method, and execution program

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000348037A (en) * 1999-06-02 2000-12-15 Nippon Telegr & Teleph Corp <Ntt> Method/device for converting meaning frame form of user 's request, method/device for reconstituting data, method/ device for visualizing data, meaning frame form converting program of user's request and data reconstitution program and recording medium with visualizing program recorded
JP2006065482A (en) * 2004-08-25 2006-03-09 Toshiba Corp Multi-dimensional data display device and method, multi-dimensional data display program
JP2010250769A (en) * 2009-04-20 2010-11-04 Mitsubishi Electric Corp Device, method and program for visualization of analysis result

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030154443A1 (en) * 2002-02-13 2003-08-14 Ncr Corporation Visual discovery tool
US8892501B2 (en) * 2009-05-08 2014-11-18 Sap Se Capturing OLAP analysis thread as refreshable business intelligence data
US8825649B2 (en) * 2010-07-21 2014-09-02 Microsoft Corporation Smart defaults for data visualizations

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000348037A (en) * 1999-06-02 2000-12-15 Nippon Telegr & Teleph Corp <Ntt> Method/device for converting meaning frame form of user 's request, method/device for reconstituting data, method/ device for visualizing data, meaning frame form converting program of user's request and data reconstitution program and recording medium with visualizing program recorded
JP2006065482A (en) * 2004-08-25 2006-03-09 Toshiba Corp Multi-dimensional data display device and method, multi-dimensional data display program
JP2010250769A (en) * 2009-04-20 2010-11-04 Mitsubishi Electric Corp Device, method and program for visualization of analysis result

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016013099A1 (en) * 2014-07-25 2016-01-28 株式会社日立製作所 Feature data management system and feature data management method
JPWO2016013099A1 (en) * 2014-07-25 2017-04-27 株式会社日立製作所 Feature data management system and feature data management method
US9898464B2 (en) 2014-11-19 2018-02-20 Kabushiki Kaisha Toshiba Information extraction supporting apparatus and method
US10936806B2 (en) 2015-11-04 2021-03-02 Kabushiki Kaisha Toshiba Document processing apparatus, method, and program
US10444742B2 (en) 2016-02-09 2019-10-15 Kabushiki Kaisha Toshiba Material recommendation apparatus
US11037062B2 (en) 2016-03-16 2021-06-15 Kabushiki Kaisha Toshiba Learning apparatus, learning method, and learning program
US11481663B2 (en) 2016-11-17 2022-10-25 Kabushiki Kaisha Toshiba Information extraction support device, information extraction support method and computer program product

Also Published As

Publication number Publication date
JP2014106611A (en) 2014-06-09
JP6025520B2 (en) 2016-11-16

Similar Documents

Publication Publication Date Title
JP6025520B2 (en) Data analysis support processing system and method
JP6319271B2 (en) Event analysis device, event analysis system, event analysis method, and event analysis program
JP7054051B2 (en) Software robot definition information generation system, software robot definition information generation method, and program
JP5984215B2 (en) Dynamic manual creation device and method, and computer program
JP2015028736A (en) Tabular format multi-dimensional data conversion method and apparatus
JP2017111657A (en) Design support apparatus, design support method, and design support program
JP2021117798A (en) Molecular design support system, method for predicting molecular characteristic value, and molecular design support program
JP2017045080A (en) Business flow specification regeneration method
JP2005062963A (en) Time series action pattern display device, method, and program
JP4747034B2 (en) Verification scenario creation program, recording medium, verification scenario creation device, and verification scenario creation method
JP2016151908A (en) Personal information anonymization support device
JP2009187295A (en) Business process analysis program, method and device
JP7257168B2 (en) data analyzer
JP7157586B2 (en) Data analysis device and data analysis method
JP7162516B2 (en) Monitoring screen generation device and monitoring screen generation method
JP6802109B2 (en) Software specification analyzer and software specification analysis method
JP7345446B2 (en) Data display program and programmable display
JP7205007B1 (en) Support device, operation system, display data creation device, support method, display data creation method, support program, and display data creation program
KR101886986B1 (en) Engineering tool program and engineering tool
JP3505138B2 (en) Structure editing device, video structure editing device, and computer-readable recording medium storing a program for causing a computer to execute as each means of the device
WO2023119609A1 (en) Information processing device, layout assistance method, and assistance program
JP7346671B2 (en) Data conversion device and data conversion method
JP6824915B2 (en) Data management system and data management method
JP2011070369A (en) Device and method for integrating database
JP6586207B1 (en) Waveform information processing apparatus, waveform information processing method, and waveform information processing program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13857011

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13857011

Country of ref document: EP

Kind code of ref document: A1