WO2019052339A1 - Data exploration management method and system, electronic device, and storage medium - Google Patents

Data exploration management method and system, electronic device, and storage medium

Info

Publication number
WO2019052339A1
Authority
WO
WIPO (PCT)
Prior art keywords
program code
data
module
storage
storage process
Prior art date
Application number
PCT/CN2018/102993
Other languages
English (en)
French (fr)
Inventor
林宇
Original Assignee
深圳市宇数科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市宇数科技有限公司
Priority to US16/632,908 (published as US11907211B2)
Publication of WO2019052339A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/90335 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/242 Query formulation
    • G06F16/2433 Query languages
    • G06F16/2448 Query languages for particular applications; for extensibility, e.g. user defined types
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/242 Query formulation
    • G06F16/2433 Query languages
    • G06F16/2443 Stored procedures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/30 Creation or generation of source code

Definitions

  • The present invention relates to the field of electronic information technology, and in particular to a data exploration management method, system, electronic device, and storage medium.
  • Data exploration must take into account not only the final result but also the data trail across the whole process.
  • The data exploration process often requires several methods to be combined. The researcher selects a method based on knowledge and experience, writes the program code for that method, runs the code to obtain a result, and then selects the next method according to that result, repeating until the final result is obtained and no further method is selected.
  • The first object of the present invention is to provide a data exploration management method that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
  • The second object of the present invention is to provide a data exploration management system that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
  • The third object of the present invention is to provide an electronic device that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
  • The fourth object of the present invention is to provide a computer-readable storage medium that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
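  • A data exploration management method includes the following steps:
  • Obtain data: obtain the data input by the user, where the data includes data content and exploration variables;
  • Select a pre-storage process: obtain the pre-storage process selected by the user, and explore the data through the pre-storage process;
  • Generate program code: obtain the operations, methods, and flow program code of the pre-storage process, and generate and save the output program code;
  • Run the program code: run the output program code, then obtain and save the running result.
  • The method further includes the following step:
  • Display the result: display the pre-storage process, the output program code, and the running result.
  • The data content includes a database, a data table, and a data file.
  • The pre-storage process includes nodes, paths, methods, and flow program code. The nodes and paths constitute operations, the methods include pre-storage methods, and the flow program code is used to execute the pre-storage process.
  • The pre-storage method includes a statistical method and method program code; the method program code is used to execute the pre-storage method.
  • The flow program code calls the method program code to generate the output program code.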
  • A data exploration management system includes a pre-storage method module, a pre-storage process module, a data acquisition module, a process selection module, a program code generation module, a program code execution module, and a result display module. The pre-storage method module is connected to the pre-storage process module; the pre-storage process module and the data acquisition module are connected to the process selection module; the process selection module is connected to the program code generation module; the program code generation module is connected to the program code execution module; and the program code execution module is connected to the result display module.
  • The data acquisition module acquires the data input by the user; the process selection module acquires the pre-storage processes in the pre-storage process module according to the data; the pre-storage process module acquires, according to the pre-storage process, the pre-storage methods and method program code in the pre-storage method module; the process selection module acquires the pre-storage process selected by the user; the program code generation module generates and saves the output program code; the program code execution module runs the output program code and obtains and saves the running result; and the result display module displays the pre-storage process, the output program code, and the running result.
  • An electronic device includes a processor, a memory, and a program, where the program is stored in the memory and configured to be executed by the processor, and the program includes instructions for performing the data exploration management method described above.
  • A computer-readable storage medium stores a computer program that, when executed by a processor, performs the data exploration management method described above.
  • The invention provides a data exploration management method and also relates to a data exploration management system, an electronic device, and a computer-readable storage medium. By selecting a pre-storage process, obtaining its operations, methods, and flow program code, generating and saving the output program code, running the output program code, obtaining and saving the running result, and displaying the pre-storage process, the output program code, and the running result, the invention realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
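  • FIG. 1 is a flowchart of a data exploration management method according to the present invention.
  • FIG. 2 is a flowchart of a data exploration according to the present invention.
  • FIG. 3 is a flowchart of the first embodiment of the present invention.
  • FIG. 4 is a flowchart of the two-sample mean comparison process in the first embodiment of the present invention.
  • FIG. 5 is a module connection diagram of a data exploration management system according to the present invention.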
  • As shown in FIGS. 1 and 2, a data exploration management method includes the following steps:
  • Obtain data: obtain the data input by the user, where the data includes data content and exploration variables;
  • Select a pre-storage process: obtain the pre-storage process selected by the user, and explore the data through the pre-storage process;
  • Generate program code: obtain the operations, methods, and flow program code of the pre-storage process, and generate and save the output program code;
  • Run the program code: run the output program code, then obtain and save the running result;
  • Display the result: display the pre-storage process, the output program code, and the running result.
  • The data content includes a database, a data table, and a data file, and the data content is called by the program code.
  • The pre-storage process is selected as follows: the system provides a process selection interface, and the user selects the pre-storage process to be executed in that interface. The system thereby obtains the pre-storage process selected by the user and explores the data through it.
  • The pre-storage process includes nodes, paths, methods, and flow program code. The nodes and paths constitute operations, the methods include pre-storage methods, and the flow program code is used to execute the pre-storage process. The pre-storage method includes a statistical method and method program code, and the method program code is used to execute the pre-storage method.
  • The flow program code calls the method program code to generate the output program code.
  • The steps of selecting a pre-storage process and generating program code together build a standardized exploration process. A single data exploration needs several processes, and each process uses several methods. As shown in FIG. 2, suppose the data exploration runs three processes, process A, process B, and process C, where process A uses methods a1, a2, and a3. The pre-storage processes then include process A, process B, and process C. When the data exploration is performed and these three pre-storage processes are selected, methods a1, a2, and a3 of process A are obtained automatically, together with the method program code and the flow program code, and the output program code of process A is generated from them.
  • The first embodiment of the present invention is used for scientific research data exploration. As shown in FIG. 3 and FIG. 4, the steps for exploring hypertension research data are as follows:
  • Obtain data: obtain the systolic and diastolic blood pressure data content and the exploration variables input by the user for the hypertensive patient group and the normal control group;
  • Select a pre-storage process: obtain the two-sample mean comparison process selected by the user, and explore the data through that process;
  • Generate program code: obtain the operations, methods, and flow program code of the two-sample mean comparison process, and generate and save the corresponding output program code;
  • Run the program code: run the output program code of the two-sample mean comparison process, then obtain and save the corresponding running result;
  • Display the result: display the two-sample mean comparison process, the corresponding output program code, and the corresponding running result.
  • Normality test: determine whether the sample size of the data content is greater than 5000. If so, apply the Kolmogorov-Smirnov test to the data content and output the normality test result; otherwise apply the Shapiro-Wilk test and output the normality test result;
  • Homogeneity of variance test: if the normality test result shows that the data content follows a normal distribution, apply the F test to the data content and output the homogeneity of variance result; otherwise apply the Wilcoxon rank-sum test to the data content;
  • Mean comparison: if the variances are homogeneous, apply the t test to the data content; otherwise apply the Welch approximate t test.
  • In summary, the systolic and diastolic blood pressure data of the hypertensive patient group and the normal control group are compared with the t test when the data are normally distributed and the variances are homogeneous;
  • the Welch approximate t test is used when the data are normally distributed but the variances are not homogeneous;
  • the Wilcoxon rank-sum test is used when the data are not normally distributed.
  • The pre-storage methods used by the two-sample mean comparison process are the Kolmogorov-Smirnov test, the Shapiro-Wilk test, the Wilcoxon rank-sum test, the F test, the Welch approximate t test, and the t test.
  • The second embodiment of the present invention is used for health data exploration. It builds on the first embodiment by replacing the processes and methods for exploring hypertension research data with the corresponding processes and methods for health data, so that users can carry out exploratory research on health data.
  • The third embodiment of the present invention is used for educational data exploration. It builds on the first embodiment by replacing the processes and methods for exploring hypertension research data with the corresponding processes and methods for educational data, so that users can carry out exploratory research on educational data.
  • A data exploration management system includes a pre-storage method module, a pre-storage process module, a data acquisition module, a process selection module, a program code generation module, a program code execution module, and a result display module. The pre-storage method module is connected to the pre-storage process module; the pre-storage process module and the data acquisition module are connected to the process selection module; the process selection module is connected to the program code generation module; the program code generation module is connected to the program code execution module; and the program code execution module is connected to the result display module.
  • The data acquisition module acquires the data input by the user; the process selection module acquires the pre-storage processes in the pre-storage process module according to the data; the pre-storage process module acquires, according to the pre-storage process, the pre-storage methods and method program code in the pre-storage method module; the process selection module acquires the pre-storage process selected by the user; the program code generation module generates and saves the output program code; the program code execution module runs the output program code and obtains and saves the running result; and the result display module displays the pre-storage process, the output program code, and the running result.
  • An electronic device includes a processor, a memory, and a program, where the program is stored in the memory and configured to be executed by the processor, and the program includes instructions for performing the data exploration management method described above. A computer-readable storage medium stores a computer program that, when executed by a processor, performs the data exploration management method described above.
  • By selecting a pre-storage process, the invention obtains the operations, methods, and flow program code of the pre-storage process, generates and saves the output program code, runs the output program code, obtains and saves the running result, and displays the pre-storage process, the output program code, and the running result.
  • This realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Debugging And Monitoring (AREA)

Abstract

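A data exploration management method: obtain the data input by the user, where the data includes data content and exploration variables; obtain the pre-storage process selected by the user and explore the data through the pre-storage process; obtain the operations, methods, and flow program code of the pre-storage process, and generate and save the output program code; run the output program code, obtain and save the running result, and display the pre-storage process, the output program code, and the running result. Through scientific management of the data exploration process, the method ensures the repeatability of the process and enables real-time calling, sharing, and reuse of its operations and methods.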

Description

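Data exploration management method and system, electronic device, and storage medium
Technical Field
The present invention relates to the field of electronic information technology, and in particular to a data exploration management method, system, electronic device, and storage medium.
Background
With the rapid development of science and technology and the continuous generation of massive scientific data, more and more data exploration and technological innovation is carried out on the basis of data.
Data comes in many types and large volumes and is updated quickly, which places higher demands on data exploration: it must take into account not only the final result but also the data trail across the whole process. At present, the data exploration process often requires several methods to be combined. The researcher selects a method based on knowledge and experience, writes the program code for that method, runs the code to obtain a result, and then selects the next method according to that result, repeating until the final result is obtained and no further method is selected.
As a result, the current data exploration process varies from person to person, its intermediate steps lack the necessary management, the process is poorly repeatable, and the reliability of the whole process and its results cannot be verified or audited. Moreover, because the process lacks complete information about operations and methods, researchers cannot apply the same operations and methods to new research, nor feed the initial data into other operations and methods to study what new results those changes would produce.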
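Summary of the Invention
To overcome the deficiencies of the prior art, the first object of the present invention is to provide a data exploration management method that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
The second object of the present invention is to provide a data exploration management system that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
The third object of the present invention is to provide an electronic device that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
The fourth object of the present invention is to provide a computer-readable storage medium that realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.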
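The first object of the present invention is achieved by the following technical solution:
A data exploration management method, including the following steps:
Obtain data: obtain the data input by the user, where the data includes data content and exploration variables;
Select a pre-storage process: obtain the pre-storage process selected by the user, and explore the data through the pre-storage process;
Generate program code: obtain the operations, methods, and flow program code of the pre-storage process, and generate and save the output program code;
Run the program code: run the output program code, then obtain and save the running result.
Further, the method also includes the following step:
Display the result: display the pre-storage process, the output program code, and the running result.
Further, the data content includes a database, a data table, and a data file.
Further, the pre-storage process includes nodes, paths, methods, and flow program code; the nodes and paths constitute operations, the methods include pre-storage methods, and the flow program code is used to execute the pre-storage process.
Further, the pre-storage method includes a statistical method and method program code, and the method program code is used to execute the pre-storage method.
Further, the flow program code calls the method program code to generate the output program code.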
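The second object of the present invention is achieved by the following technical solution:
A data exploration management system, including a pre-storage method module, a pre-storage process module, a data acquisition module, a process selection module, a program code generation module, a program code execution module, and a result display module. The pre-storage method module is connected to the pre-storage process module; the pre-storage process module and the data acquisition module are connected to the process selection module; the process selection module is connected to the program code generation module; the program code generation module is connected to the program code execution module; and the program code execution module is connected to the result display module;
The data acquisition module acquires the data input by the user; the process selection module acquires the pre-storage processes in the pre-storage process module according to the data; the pre-storage process module acquires, according to the pre-storage process, the pre-storage methods and method program code in the pre-storage method module; the process selection module acquires the pre-storage process selected by the user; the program code generation module generates and saves the output program code; the program code execution module runs the output program code and obtains and saves the running result; and the result display module displays the pre-storage process, the output program code, and the running result.
The third object of the present invention is achieved by the following technical solution:
An electronic device, including: a processor;
a memory; and a program, where the program is stored in the memory and configured to be executed by the processor, and the program includes instructions for performing the data exploration management method described above.
The fourth object of the present invention is achieved by the following technical solution:
A computer-readable storage medium storing a computer program that, when executed by a processor, performs the data exploration management method described above.
Compared with the prior art, the beneficial effects of the present invention are as follows:
The invention provides a data exploration management method and also relates to a data exploration management system, an electronic device, and a computer-readable storage medium. By selecting a pre-storage process, obtaining its operations, methods, and flow program code, generating and saving the output program code, running the output program code, obtaining and saving the running result, and displaying the pre-storage process, the output program code, and the running result, the invention realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.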
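Brief Description of the Drawings
The drawings described here provide a further understanding of the invention and form part of this application. The illustrative embodiments of the invention and their descriptions are used to explain the invention and do not unduly limit it. In the drawings:
FIG. 1 is a flowchart of a data exploration management method according to the present invention;
FIG. 2 is a flowchart of a data exploration according to the present invention;
FIG. 3 is a flowchart of the first embodiment of the present invention;
FIG. 4 is a flowchart of the two-sample mean comparison process in the first embodiment of the present invention;
FIG. 5 is a module connection diagram of a data exploration management system according to the present invention.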
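Detailed Description
The invention is further described below with reference to the drawings and specific embodiments. Provided they do not conflict, the embodiments and technical features described below may be combined freely to form new embodiments.
As shown in FIGS. 1 and 2, a data exploration management method includes the following steps:
Obtain data: obtain the data input by the user, where the data includes data content and exploration variables;
Select a pre-storage process: obtain the pre-storage process selected by the user, and explore the data through the pre-storage process;
Generate program code: obtain the operations, methods, and flow program code of the pre-storage process, and generate and save the output program code;
Run the program code: run the output program code, then obtain and save the running result;
Display the result: display the pre-storage process, the output program code, and the running result.
The data content includes a database, a data table, and a data file, and the data content is called by the program code.
The pre-storage process is selected as follows: the system provides a process selection interface, and the user selects the pre-storage process to be executed in that interface. The system thereby obtains the pre-storage process selected by the user and explores the data through it.
The pre-storage process includes nodes, paths, methods, and flow program code. The nodes and paths constitute operations, the methods include pre-storage methods, and the flow program code is used to execute the pre-storage process. The pre-storage method includes a statistical method and method program code, and the method program code is used to execute the pre-storage method.
The flow program code calls the method program code to generate the output program code.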
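The five steps above can be sketched as a small pipeline that saves everything needed to repeat a run. This is a minimal illustration rather than the patented implementation: the names `ExplorationData`, `PreStoredProcess`, `ExplorationRecord`, and `explore`, and the convention that the output program code defines a function called `run`, are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ExplorationData:
    """Step 1: user input, i.e. data content plus exploration variables."""
    content: Dict[str, List[float]]  # stands in for a database, table, or file
    variables: List[str]


@dataclass
class PreStoredProcess:
    """Step 2: the process the user selected (hypothetical layout)."""
    name: str
    flow_code: str  # source text that defines a function named `run`


@dataclass
class ExplorationRecord:
    """What is saved so the exploration can be repeated and audited."""
    process_name: str
    output_code: str
    result: object


def explore(data: ExplorationData, process: PreStoredProcess) -> ExplorationRecord:
    # Step 3: generate and save the output program code (here, the flow code as is).
    output_code = process.flow_code
    # Step 4: run the output program code and keep the running result.
    namespace: dict = {}
    exec(output_code, namespace)
    result = namespace["run"](data.content, data.variables)
    # Step 5 (display) is left to the caller, which receives all three items.
    return ExplorationRecord(process.name, output_code, result)


mean_process = PreStoredProcess(
    name="mean of each variable",
    flow_code=(
        "def run(content, variables):\n"
        "    return {v: sum(content[v]) / len(content[v]) for v in variables}\n"
    ),
)
data = ExplorationData({"sbp": [120.0, 140.0]}, ["sbp"])
record = explore(data, mean_process)
```

Because the record keeps the generated code next to its result, the same exploration can be rerun later on the same or different data, which is the repeatability the text aims for.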
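The steps of selecting a pre-storage process and generating program code together build a standardized exploration process. A single data exploration needs several processes, and each process uses several methods. As shown in FIG. 2, suppose the data exploration runs three processes, process A, process B, and process C, where process A uses methods a1, a2, and a3. That is, when the select and generate steps are carried out, the pre-storage processes include process A, process B, and process C. When the data exploration is performed and these three pre-storage processes are selected, methods a1, a2, and a3 of process A are obtained automatically, together with the method program code and the flow program code, and the output program code of process A is generated from them.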
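One way to picture how process A's output program code could be assembled from methods a1, a2, and a3 is shown below. It is a sketch under stated assumptions: the `METHOD_CODE` store, the `PreStoredProcess` fields, the `generate_output_code` function, and the toy bodies of a1, a2, and a3 are hypothetical and do not come from the patent, which does not say what the methods compute.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Hypothetical pre-storage method store: method name -> method program code.
METHOD_CODE: Dict[str, str] = {
    "a1": "def a1(xs):\n    return sorted(xs)\n",
    "a2": "def a2(xs):\n    return xs[len(xs) // 2]\n",
    "a3": "def a3(x):\n    return {'median': x}\n",
}


@dataclass
class PreStoredProcess:
    name: str
    nodes: List[str]              # operation steps
    paths: List[Tuple[str, str]]  # directed links between nodes
    methods: List[str]            # names of the pre-storage methods used
    flow_code: str                # flow program code that calls the methods


def generate_output_code(process: PreStoredProcess) -> str:
    """Join the method program code with the flow program code."""
    parts = [METHOD_CODE[name] for name in process.methods]
    parts.append(process.flow_code)
    return "\n".join(parts)


process_a = PreStoredProcess(
    name="A",
    nodes=["a1", "a2", "a3"],
    paths=[("a1", "a2"), ("a2", "a3")],
    methods=["a1", "a2", "a3"],
    flow_code="def run(xs):\n    return a3(a2(a1(xs)))\n",
)

# Generate the output program code for process A, then run it.
output_code = generate_output_code(process_a)
namespace: dict = {}
exec(output_code, namespace)
result = namespace["run"]([3.04, 1.0, 2.26])
```

Keeping method code and flow code as separate stored pieces means another process could reuse a1 or a2 without copying them, which matches the sharing and reuse goal stated in the text.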
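The first embodiment of the present invention is used for scientific research data exploration. As shown in FIG. 3 and FIG. 4, the steps for exploring hypertension research data are as follows:
Obtain data: obtain the systolic and diastolic blood pressure data content and the exploration variables input by the user for the hypertensive patient group and the normal control group;
Select a pre-storage process: obtain the two-sample mean comparison process selected by the user, and explore the data through that process;
Generate program code: obtain the operations, methods, and flow program code of the two-sample mean comparison process, and generate and save the corresponding output program code;
Run the program code: run the output program code of the two-sample mean comparison process, then obtain and save the corresponding running result.
Display the result: display the two-sample mean comparison process, the corresponding output program code, and the corresponding running result.
As shown in FIG. 4, when exploring hypertension research data, the two-sample mean comparison process consists of the following steps:
Normality test: determine whether the sample size of the data content is greater than 5000. If so, apply the Kolmogorov-Smirnov test to the data content and output the normality test result; otherwise apply the Shapiro-Wilk test and output the normality test result;
Homogeneity of variance test: if the normality test result shows that the data content follows a normal distribution, apply the F test to the data content and output the homogeneity of variance result; otherwise apply the Wilcoxon rank-sum test to the data content;
Mean comparison: if the homogeneity of variance result shows that the variances are homogeneous, apply the t test to the data content; otherwise apply the Welch approximate t test.
In summary, for the two-sample mean comparison process, the systolic and diastolic blood pressure data of the hypertensive patient group and the normal control group are compared with the t test when the data are normally distributed and the variances are homogeneous, with the Welch approximate t test when the data are normally distributed but the variances are not homogeneous, and with the Wilcoxon rank-sum test when the data are not normally distributed.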
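The branching in the two-sample mean comparison process can be written down as a small decision function. The sketch below covers only the choice of tests, not the statistics themselves; the function names and the boolean inputs `is_normal` and `variances_equal` (standing for the outcomes of the normality and F tests) are assumptions made for illustration.

```python
from typing import List

NORMALITY_THRESHOLD = 5000  # sample-size cut-off stated in the text


def normality_test(sample_size: int) -> str:
    """Choose the normality test from the sample size."""
    if sample_size > NORMALITY_THRESHOLD:
        return "Kolmogorov-Smirnov"
    return "Shapiro-Wilk"


def comparison_test(is_normal: bool, variances_equal: bool) -> str:
    """Choose the final two-sample test from the two earlier results."""
    if not is_normal:
        return "Wilcoxon rank-sum"
    return "t" if variances_equal else "Welch approximate t"


def planned_tests(sample_size: int, is_normal: bool, variances_equal: bool) -> List[str]:
    """List the tests the process would run, in order."""
    steps = [normality_test(sample_size)]
    if is_normal:
        steps.append("F")  # homogeneity of variance is only checked for normal data
    steps.append(comparison_test(is_normal, variances_equal))
    return steps
```

For example, `planned_tests(200, True, False)` walks through the Shapiro-Wilk test, the F test, and then the Welch approximate t test, while any non-normal sample goes straight from the normality test to the Wilcoxon rank-sum test.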
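Accordingly, in this embodiment the pre-storage methods used by the two-sample mean comparison process are the Kolmogorov-Smirnov test, the Shapiro-Wilk test, the Wilcoxon rank-sum test, the F test, the Welch approximate t test, and the t test.
For ease of explanation, this embodiment gives only the two-sample mean comparison process as an example. In practice, exploring hypertension research data involves many other processes. For example, the user may select the two-sample rate comparison process; the system then obtains its operations, methods, and flow program code, generates and saves the corresponding output program code, runs that code, obtains and saves the corresponding running result, and displays the two-sample rate comparison process, the output program code, and the running result, in order to compare whether the rates of the hypertensive patient group and the normal control group differ.
The second embodiment of the present invention is used for health data exploration. It builds on the first embodiment by replacing the processes and methods for exploring hypertension research data with the corresponding processes and methods for health data, so that users can carry out exploratory research on health data.
The third embodiment of the present invention is used for educational data exploration. It builds on the first embodiment by replacing the processes and methods for exploring hypertension research data with the corresponding processes and methods for educational data, so that users can carry out exploratory research on educational data.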
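A data exploration management system includes a pre-storage method module, a pre-storage process module, a data acquisition module, a process selection module, a program code generation module, a program code execution module, and a result display module. The pre-storage method module is connected to the pre-storage process module; the pre-storage process module and the data acquisition module are connected to the process selection module; the process selection module is connected to the program code generation module; the program code generation module is connected to the program code execution module; and the program code execution module is connected to the result display module.
The data acquisition module acquires the data input by the user; the process selection module acquires the pre-storage processes in the pre-storage process module according to the data; the pre-storage process module acquires, according to the pre-storage process, the pre-storage methods and method program code in the pre-storage method module; the process selection module acquires the pre-storage process selected by the user; the program code generation module generates and saves the output program code; the program code execution module runs the output program code and obtains and saves the running result; and the result display module displays the pre-storage process, the output program code, and the running result.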
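The module connections can be modeled as a directed graph to check that data can flow from input to display. This is an illustrative sketch only: the text says the modules are "connected" without fixing a direction, so the edge directions below are an assumption based on the described data flow, and `CONNECTIONS` and `module_order` are hypothetical names.

```python
from graphlib import TopologicalSorter
from typing import Dict, List

# Each key feeds into the modules in its list (direction is an assumption).
CONNECTIONS: Dict[str, List[str]] = {
    "pre-storage method module": ["pre-storage process module"],
    "pre-storage process module": ["process selection module"],
    "data acquisition module": ["process selection module"],
    "process selection module": ["program code generation module"],
    "program code generation module": ["program code execution module"],
    "program code execution module": ["result display module"],
    "result display module": [],
}


def module_order(connections: Dict[str, List[str]]) -> List[str]:
    """Return one order in which every module runs after the modules feeding it."""
    # TopologicalSorter expects node -> predecessors, so invert the edges.
    predecessors = {name: set() for name in connections}
    for source, targets in connections.items():
        for target in targets:
            predecessors[target].add(source)
    return list(TopologicalSorter(predecessors).static_order())


order = module_order(CONNECTIONS)
```

Under these assumed directions the result display module always comes last, and the process selection module always follows both the data acquisition module and the pre-storage process module.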
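An electronic device includes a processor, a memory, and a program, where the program is stored in the memory and configured to be executed by the processor, and the program includes instructions for performing the data exploration management method described above. A computer-readable storage medium stores a computer program that, when executed by a processor, performs the data exploration management method described above.
By selecting a pre-storage process, the invention obtains the operations, methods, and flow program code of the pre-storage process, generates and saves the output program code, runs the output program code, obtains and saves the running result, and displays the pre-storage process, the output program code, and the running result. This realizes scientific management of the data exploration process, ensures its repeatability, and enables real-time calling, sharing, and reuse of its operations and methods.
The embodiments above are only preferred embodiments of the present invention and do not limit its scope of protection. Any insubstantial change or substitution made by a person skilled in the art on the basis of the invention falls within the scope of protection claimed.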

Claims (9)

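  1. A data exploration management method, characterized by comprising the following steps:
    obtaining data: obtaining the data input by the user, the data including data content and exploration variables;
    selecting a pre-storage process: obtaining the pre-storage process selected by the user, and exploring the data through the pre-storage process;
    generating program code: obtaining the operations, methods, and flow program code of the pre-storage process, and generating and saving output program code;
    running the program code: running the output program code, and obtaining and saving a running result.
  2. The data exploration management method according to claim 1, characterized by further comprising the following step:
    displaying the result: displaying the pre-storage process, the output program code, and the running result.
  3. The data exploration management method according to claim 1, characterized in that the data content includes a database, a data table, and a data file.
  4. The data exploration management method according to claim 1, characterized in that the pre-storage process includes nodes, paths, methods, and flow program code, the nodes and paths constitute operations, the methods include pre-storage methods, and the flow program code is used to execute the pre-storage process.
  5. The data exploration management method according to claim 4, characterized in that the pre-storage method includes a statistical method and method program code, and the method program code is used to execute the pre-storage method.
  6. The data exploration management method according to claim 5, characterized in that the flow program code calls the method program code to generate the output program code.
  7. A data exploration management system, characterized by comprising a pre-storage method module, a pre-storage process module, a data acquisition module, a process selection module, a program code generation module, a program code execution module, and a result display module; the pre-storage method module is connected to the pre-storage process module, the pre-storage process module and the data acquisition module are connected to the process selection module, the process selection module is connected to the program code generation module, the program code generation module is connected to the program code execution module, and the program code execution module is connected to the result display module;
    the data acquisition module acquires the data input by the user, the process selection module acquires the pre-storage processes in the pre-storage process module according to the data, the pre-storage process module acquires, according to the pre-storage process, the pre-storage methods and method program code in the pre-storage method module, the process selection module acquires the pre-storage process selected by the user, the program code generation module generates and saves the output program code, the program code execution module runs the output program code and obtains and saves the running result, and the result display module displays the pre-storage process, the output program code, and the running result.
  8. An electronic device, characterized by comprising: a processor;
    a memory; and a program, wherein the program is stored in the memory and configured to be executed by the processor, and the program includes instructions for performing the method according to any one of claims 1 to 6.
  9. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, performs the method according to any one of claims 1 to 6.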
PCT/CN2018/102993 2017-09-13 2018-08-29 Data exploration management method and system, electronic device, and storage medium WO2019052339A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/632,908 US11907211B2 (en) 2017-09-13 2018-08-29 Data exploration management method and system, electronic device, and storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710822644.2 2017-09-13
CN201710822644.2A CN107766424B (zh) 2017-09-13 2017-09-13 Data exploration management method and system, electronic device, and storage medium

Publications (1)

Publication Number Publication Date
WO2019052339A1 (zh) 2019-03-21

Family

ID=61265216

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/102993 WO2019052339A1 (zh) 2017-09-13 2018-08-29 Data exploration management method and system, electronic device, and storage medium

Country Status (3)

Country Link
US (1) US11907211B2 (zh)
CN (1) CN107766424B (zh)
WO (1) WO2019052339A1 (zh)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
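CN103886203A (zh) * 2014-03-24 2014-06-25 美商天睿信息系统(北京)有限公司 Automatic modeling system based on indicator prediction and method thereof
CN106599230A (zh) * 2016-12-19 2017-04-26 北京天元创新科技有限公司 Method and system for distributed data mining model evaluation
CN106845942A (zh) * 2017-01-22 2017-06-13 成都生命基线科技有限公司 Project publishing system in a bioanalysis platform
CN107038167A (zh) * 2016-02-03 2017-08-11 普华诚信信息技术有限公司 Big data mining and analysis system based on model evaluation and analysis method thereof
CN107766424A (zh) * 2017-09-13 2018-03-06 深圳市宇数科技有限公司 Data exploration management method and system, electronic device, and storage medium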

Also Published As

Publication number Publication date
US11907211B2 (en) 2024-02-20
US20200167352A1 (en) 2020-05-28
CN107766424B (zh) 2020-09-15
CN107766424A (zh) 2018-03-06

Similar Documents

Publication Publication Date Title
WO2019052339A1 (zh) 一种数据探索管理方法、系统、电子设备及存储介质
CN103744680B (zh) 一种业务流程处理方法及装置
US9720656B2 (en) Method and apparatus for enabling agile development of services in cloud computing and traditional environments
US9304746B2 (en) Creating a user model using component based approach
CN108388515A (zh) 测试数据生成方法、装置、设备以及计算机可读存储介质
CN107733710A (zh) 链路调用关系的构建方法、装置、计算机设备及存储介质
CN113010359B (zh) 总线测试系统生成方法、系统、设备及存储介质
CN104881343A (zh) 一种测试方法和测试系统
CN109376153A (zh) 一种基于NiFi的数据写入图数据库的系统及方法
CN107885965B (zh) 一种数据探索发现方法、系统、电子设备及存储介质
CN113806429A (zh) 基于大数据流处理框架的画布式日志分析方法
JP7090131B2 (ja) 情報を生成するための方法及び装置
CN104598661B (zh) 用于涉及it系统的项目的变更请求分析的方法和装置
CN112446625A (zh) 一种工艺线路生成系统、方法、平台及存储介质
CN109614271A (zh) 多个集群数据一致性的控制方法、装置、设备及存储介质
CN106940648A (zh) 一种自动加载测试项的方法、设备以及测试系统
CN107844363A (zh) 业务事务处理方法、装置、存储介质及设备
US9239669B2 (en) Common user interface view grouping and sharing framework in online applications
CN115857929A (zh) 资源数据处理方法、装置、计算机设备和存储介质
CN104424525B (zh) 辅助确定项目范围的方法和装置
CN115221852A (zh) 基于vxe-table的动态计算表格实现方法、装置和计算机设备
US20130132425A1 (en) Query definition apparatus and method for multi-tenant
KR102099362B1 (ko) 마이크로 서비스를 이용한 데이터 전처리 장치 및 방법
CN113946562A (zh) 一种数据协同链可视化设计平台
CN112035360A (zh) 中间件测试方法、装置、计算机设备及可读存储介质

Legal Events

Date Code Title Description
NENP Non-entry into the national phase. Ref country code: DE
32PN EP: public notification in the EP bulletin as address of the addressee cannot be established. Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 08/09/2020)
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 18856190; country of ref document: EP; kind code of ref document: A1
122 EP: PCT application non-entry in European phase. Ref document number: 18856190; country of ref document: EP; kind code of ref document: A1