WO2022121337A1 - Procédé et appareil d'exploration de données, dispositif électronique et support de stockage - Google Patents

Procédé et appareil d'exploration de données, dispositif électronique et support de stockage Download PDF

Info

Publication number
WO2022121337A1
WO2022121337A1 PCT/CN2021/109589 CN2021109589W WO2022121337A1 WO 2022121337 A1 WO2022121337 A1 WO 2022121337A1 CN 2021109589 W CN2021109589 W CN 2021109589W WO 2022121337 A1 WO2022121337 A1 WO 2022121337A1
Authority
WO
WIPO (PCT)
Prior art keywords
probed
probe
field
data table
fields
Prior art date
Application number
PCT/CN2021/109589
Other languages
English (en)
Chinese (zh)
Inventor
霍康
万月亮
火一莽
Original Assignee
北京锐安科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京锐安科技有限公司 filed Critical 北京锐安科技有限公司
Publication of WO2022121337A1 publication Critical patent/WO2022121337A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results

Definitions

  • the embodiments of the present application relate to the technical field of data analysis, for example, to a data exploration method, apparatus, electronic device, and storage medium.
  • a plurality of independent detection scripts are generally written manually to probe the raw data table by table to analyze the quality of the raw data; and a plurality of independent detection scripts are directly used to execute corresponding task-based queries.
  • every time data detection is performed repeated detection scripts need to be manually written, which will consume a lot of manpower and reduce data detection efficiency.
  • Embodiments of the present application provide a data exploration method, device, electronic device, and storage medium, so as to automatically complete the exploration and analysis of data quality in data tables of different database types, which is convenient to operate and improves the efficiency of data exploration.
  • an embodiment of the present application provides a data detection method, which is applied to a data table detection device, and the method includes:
  • the fields to be detected are respectively detected, and the detection results are determined, wherein the detection rules include field filling detection rules, feature value detection rules, field length detection rules, and field dictionary code detection rules. at least one of.
  • an embodiment of the present application further provides a data table detection device, including:
  • a target data table determination module configured to match at least one target data table from at least one database in a connected state according to the probe scope condition
  • a to-be-explored field determination module configured to acquire a data structure of the at least one target data table, and to determine a to-be-explored field in the at least one target data table according to the data structure of the at least one target data table;
  • the probe result determination module is configured to probe the fields to be probed respectively based on the preset probe rules, and determine the probe results, wherein the probe rules include field fill probe rules, feature value probe rules, and field length probe rules and at least one of the field dictionary code detection rules.
  • an embodiment of the present application further provides an electronic device, the electronic device comprising:
  • processors one or more processors
  • the one or more processors are configured to execute the one or more programs to implement the data detection method provided by any embodiment of the present application.
  • an embodiment of the present application further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the data provided by any embodiment of the present application is implemented Probing method.
  • FIG. 1 is a schematic flowchart of a data exploration method provided in Embodiment 1 of the present application;
  • FIG. 2 is a schematic structural diagram of a data table detection device provided in Embodiment 2 of the present application.
  • FIG. 3 is a schematic structural diagram of an electronic device provided in Embodiment 3 of the present application.
  • FIG. 1 is a flowchart of a data exploration method provided in Embodiment 1 of the present application, and this embodiment may be applied to a situation in which performance testing is performed in software testing.
  • the method may be performed by a data table look-up device, which may be implemented by means of software and/or hardware.
  • the data exploration method provided in the first embodiment of the present application can cope with the above-mentioned low data detection efficiency. Integrate the profiling results, and generate a profiling report by combining the profiling results and the data table to be profiled, which is convenient for data analysts to analyze the data quality of the profiling data.
  • the configuration file of the data table detection device is pre-configured, so that when the data detection method executes the data table detection device, the configuration parameters in the configuration file can be directly read, and the data can be successfully executed. Probing method.
  • the configuration file includes configuration parameters such as the data type of the source data, the database instance name, the database connection method, the database instance, the database user name, and the password.
  • the configuration file is read and executed, and at least one database is connected according to the connection mode of at least one database in the configuration file.
  • the database includes Oracle, mysql, mpp, hive, txt, excel, csv, word and other databases.
  • ADO Active Data Objects, Active Data Objects
  • the method includes the following steps.
  • the probe scope condition may be a probe condition for obtaining at least one target database set according to requirements. For example, when querying the all_tab_comments table to obtain a list of library tables, a table name filter condition of table_name may be added.
  • the target data table may be a database to be probed determined from a plurality of databases in a connected state according to a probe scope condition of the database. The number of at least one target data table may be one or more.
  • the probe scope condition and the matching mode may be acquired, a matching instruction may be generated based on the probe scope condition and the matching mode, and the matching instruction may be executed to determine at least one target data table in at least one database,
  • the matching mode includes any one of exact matching parameters, fuzzy matching parameters, exact exclusion parameters and fuzzy exclusion parameters.
  • the exact match parameter is extract_match
  • the fuzzy match parameter is fuzzy_match
  • the exact exclusion parameter is exact_not_match
  • the fuzzy exclusion parameter is fuzzy_not_match.
  • the matching instruction may be a database matching instruction generated based on the matching mode and the conditions of the detection range, and the matching instruction is used to determine at least one target data table to be detected.
  • At least one target data table to be probed can be determined based on the matching mode and the database matching instruction generated based on the probe range condition.
  • S120 Acquire the data structure of the at least one target data table, and determine the fields to be probed in the at least one target data table according to the data structure of the at least one target data table.
  • the data structure may be a way of storing the data to be probed in the data table.
  • the fields to be probed may be fields in the data structure of the target data table, such as fields such as field name, field description, field type, and field length. For example, multiple fields in the data structure can be obtained, preset fields to be explored can be obtained as fields to be probed, all fields can be used as fields to be probed, and fields to be probed can also be set according to actual conditions.
  • the number of the fields to be probed is determined, and when the number of the fields to be probed is greater than a preset number, the data of the fields to be probed is sampled, and the sampled data is determined as the to-be-explored field.
  • the probed data corresponding to the probed field. For example, when the number of fields to be probed in the data table exceeds the preset number of data, and there is a large amount of duplicate data in the data of the fields to be probed, random sampling is performed on the data in the fields to be probed, and the sampled fields are Conduct data exploration.
  • the random sampling method can ensure the validity of the exploration results, and reducing the amount of data can also reduce the computational complexity of the data exploration and improve the efficiency of the exploration.
  • the detection rules include field filling detection rules, feature value detection rules, field length detection rules, and field dictionary code detection at least one of the rules.
  • the detection rule is a detection indicator for the detection and analysis of the field to be explored, and the detection rule includes at least one of a field filling detection rule, a feature value detection rule, a field length detection rule, and a field dictionary code detection rule.
  • the detection rule includes at least one of a field filling detection rule, a feature value detection rule, a field length detection rule, and a field dictionary code detection rule.
  • set profiling rules in advance.
  • the field to be probed is probed and analyzed according to the probe rules, and the probe result of the field to be probed is determined.
  • the detection result includes at least one of the detection filling rate, the feature value coincidence rate, the maximum field length, and the dictionary code corresponding to the field to be detected.
  • check whether the data corresponding to the fields to be checked is filled determine the number of filled fields, and determine the percentage of the filled fields in the fields to be checked to the number of fields to be checked, and determine the Probe fill rate.
  • the number of padding fields may be the number of fields with padding values in the fields to be probed.
  • the determined probe fill rate is presented in a probe report.
  • the formula to calculate the probe fill rate could be:
  • R1 represents the probe fill rate
  • N1 represents the number of padding fields in the fields to be probed
  • M1 represents the number of fields to be probed.
  • the validity of at least one feature value corresponding to the field to be probed is probed, the percentage of the number of valid feature values in the field to be probed to the number of fields to be probed is determined, and the Eigenvalue coincidence rate.
  • the feature value means content capable of identifying real-world entity information, and each feature value has a unique corresponding feature value type.
  • the eigenvalue conformity rate is to detect the conformity of the normative content of the analysis data, and when the eigenvalue conforms to the norm, it is an effective eigenvalue. For example, before calculating the feature value coincidence rate, the feature type corresponding to the feature value in the current field to be probed may be identified to determine the feature value in the current field to be probed.
  • the feature type corresponding to the feature value in the field to be probed is hotel address according to the field description (for example, hotel location, hotel address, hotel details) in the field to be probed, and the current to be probed is determined by the hotel address.
  • the only feature value in the field is Hotel.
  • the feature type corresponding to the feature value in the field to be probed is the license plate number, and the unique feature value in the field to be probed is determined by the license plate number. for the license plate.
  • a feature value of the field to be probed is acquired, and a feature value check is performed on the feature value.
  • the feature type that can perform feature value verification is preset as a preset feature type, and when it is determined that the feature type of the current feature value belongs to the preset feature type, the feature value verification is performed on the feature value belonging to the preset feature type.
  • the preset eigenvalue type is an eigenvalue type conforming to a regular expression.
  • the feature value in the to-be-explored field corresponding to the feature type is determined according to a predefined verification method (for example, the check_carnum verification method) for the feature. value to check.
  • the coincidence rate of the feature value of the current field to be probed is calculated according to the valid feature value of the successful verification and all the feature values participating in the verification. In one embodiment, when the obtained eigenvalue coincidence rate does not meet the preset threshold, the eigenvalue coincidence rate is displayed in the generated exploration report. Among them, the calculation formula of the eigenvalue coincidence rate can be:
  • R2 represents the eigenvalue coincidence rate
  • N2 represents the field number of valid eigenvalues in the to-be-explored field
  • M2 represents the number of all eigenvalues in the to-be-explored field.
  • the content length corresponding to the field to be probed is probed, and the maximum value of the field length of the field to be probed is determined according to the content length.
  • the maximum value of the field length includes the longest value of the field and the shortest value of the field. For example, determine the content length of data corresponding to multiple fields to be probed, and compare the content lengths of multiple fields to determine the longest value or shortest value of multiple fields to be probed, and then generate a probe structure report displayed in.
  • the description information of the field to be searched is searched, and the dictionary code corresponding to the field to be searched is determined according to the description information.
  • the dictionary code may be a gender code, a certificate type code, or the like.
  • a preset identification method is used to identify the dictionary code in the description information in the field to be probed, and the identification result is displayed in the probe report.
  • the preset identification code may be a neural network identification model, or the identification result may be determined according to the input identification information.
  • the dictionary code can be displayed in the probe report in an enumerated manner.
  • the profiling results are integrated, and based on the integrated profiling results and the target Data tables generate profiling reports.
  • the exploration report can be displayed in the form of an Excel table.
  • a probe report includes a probe catalog summary and a probe detail.
  • the general table of exploration catalogue includes the target data table, the data quantity of each target data table, the number of fields, the quantity of feature types, and the feature type information.
  • the probe directory summary table is used to represent the overall statistical information of the current data results to be probed, which is convenient for data analysts to understand the data table to be probed and the basic information of each field in the data table in the current data probe and analysis process.
  • the probe detailed list includes: probe analysis results of the fields to be probed and sample data of the probe analysis results.
  • the probe list is used to represent the detailed information of multiple probe results in the probe analysis results of the current field to be probed, which is convenient for opening and convenient for data analysts to analyze and optimize multiple fields to be probed through the detailed information of the probe results. data performance.
  • At least one target data table is matched from at least one database in a connected state according to the probe scope condition; the data structure of the at least one target data table is acquired, and the target data table is determined according to the data structure of the at least one target data table Describe the fields to be probed in at least one target data table; based on the preset probe rules, probe the fields to be probed respectively, determine the probe results, and fill in the probe results, eigenvalue probe results, fields according to the fields in the probe results The length detection result and the field dictionary code detection result determine the data quality of the field to be detected.
  • the following is an example of the data table detection device provided by the embodiment of the present application, which belongs to the same inventive concept as the data detection method of the above-mentioned embodiment.
  • the data table detection device please refer to the above data detection method. Examples of methods.
  • FIG. 2 is a schematic structural diagram of a data table detection apparatus provided in Embodiment 2 of the present application, and this embodiment can be applied to a situation in which performance testing is performed in software testing.
  • the data table inquiry device includes: a target data table determination module 210 , a field to be inspected field determination module 220 , and an inquiry result determination module 230 . in:
  • the target data table determination module 210 is configured to match at least one target data table from at least one database in the connected state according to the condition of the probe scope.
  • the to-be-explored field determination module 220 is configured to acquire the data structure of the at least one target data table, and determine the to-be-explored field in the at least one target data table according to the data structure of the at least one target data table.
  • the probe result determination module 230 is configured to probe the fields to be probed respectively based on the preset probe rules, and determine the probe results, wherein the probe rules include field fill probe rules, feature value probe rules, and field length probes At least one of a rule and a field dictionary code detection rule.
  • At least one target data table is matched from at least one database in a connected state according to the probe scope condition; the data structure of the at least one target data table is acquired, and the target data table is determined according to the data structure of the at least one target data table Describe the fields to be probed in at least one target data table; based on the preset probe rules, probe the fields to be probed respectively, determine the probe results, and fill in the probe results, eigenvalue probe results, fields according to the fields in the probe results The length detection result and the field dictionary code detection result determine the data quality of the field to be detected.
  • the data table detection device further includes:
  • the database connection unit is configured to read the configuration file, and connect the at least one database according to the connection mode of the at least one database in the configuration file.
  • the target data table determination module 210 includes:
  • a target data table determination unit configured to acquire the probe range condition and the matching mode, generate a matching instruction based on the probe range condition and the matching mode, and execute the matching instruction to determine at least one target data table in at least one database , wherein the matching mode includes any one of exact matching parameters, fuzzy matching parameters, exact exclusion parameters and fuzzy exclusion parameters.
  • the data table detection device further includes:
  • a probe data determination unit configured to determine the number of the fields to be probed, in response to the number of the fields to be probed being greater than a preset number, sampling the data of the fields to be probed, and determining the data obtained by sampling as the The data to be probed corresponding to the field to be probed.
  • the detection result determination module 230 includes:
  • a first probing result determining unit configured to probe whether the probed data corresponding to the fields to be probed is filled, determine the number of filled fields, and determine that the number of filled fields in the fields to be probed accounts for the number of fields to be probed The percentage of determining the probe fill rate
  • a second detection result determination unit configured to detect the validity of at least one feature value corresponding to the field to be detected, and to determine the percentage of the number of valid feature values in the field to be detected accounting for the number of fields to be detected, determining the eigenvalue coincidence rate;
  • a third detection result determination unit configured to detect the content length corresponding to the field to be probed, and to determine the maximum value of the field length of the field to be probed according to the content length;
  • the fourth detection result determination unit is configured to detect the description information of the field to be searched, and determine the dictionary code corresponding to the field to be searched according to the description information.
  • the second detection result determination unit includes:
  • a feature value checking unit configured to, in response to determining that the field to be checked belongs to a preset feature type, obtain a feature value of the field to be checked, and perform feature value check on the feature value.
  • the data table detection device is further configured to integrate the detection results, and generate a detection report based on the integrated detection results and the target data table.
  • the data table detection apparatus provided by the embodiment of the present application can execute the data detection method provided by any embodiment of the present application, and has functional modules corresponding to the execution method.
  • the multiple units and modules included in the above-mentioned embodiments of the data table detection apparatus are only divided according to functional logic, but are not limited to the above-mentioned division manner; in addition, the names of all functional units are only for the convenience of distinguishing from each other.
  • FIG. 3 is a schematic structural diagram of an electronic device provided in Embodiment 3 of the present application.
  • FIG. 3 shows a block diagram of an exemplary electronic device 12 suitable for use in implementing embodiments of the present application.
  • the electronic device 12 shown in FIG. 3 is only one example.
  • the electronic device 12 takes the form of a general-purpose computing electronic device.
  • the components of the electronic device 12 may include: one or more processors or processing units 16, a system memory 28, and a bus 18 connecting the various system components including the system memory 28 and the processing unit 16.
  • System memory 28 may be memory 28 .
  • the bus 18 represents at least one type of bus structure, eg, the bus 18 includes a memory bus or memory controller, a peripheral bus, a graphics acceleration port, a processor, or a local bus using any of a variety of bus structures.
  • these architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA bus, Video Electronics Standards Association (VESA) ) local bus and peripheral component interconnect (peripheral component interconnect, PCI) bus.
  • Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12, including both volatile and non-volatile media, removable and non-removable media.
  • Memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache 32 .
  • Electronic device 12 may include other removable/non-removable, volatile/non-volatile computer system storage media.
  • storage system 34 may be configured to read and write to non-removable, non-volatile magnetic media, not shown in FIG. 3, commonly referred to as hard disk drives.
  • a magnetic disk drive for reading and writing to removable non-volatile magnetic disks, such as floppy disks, and an optical disk drive for reading and writing to removable non-volatile optical disks, such as removable non-volatile optical disks may be provided For example CD-ROM, DVD-ROM or other optical media.
  • each drive may be connected to bus 18 through one or more data media interfaces.
  • the memory 28 may include at least one program product having a set of, eg, at least one program module configured to perform the functions of the embodiments of the present application.
  • a program/utility 40 having, for example, a set of at least one program module 42, which may be stored, for example, in memory 28, such program module 42 including an operating system, one or more application programs, other program modules, and program data, in these examples Each or some combination of may include an implementation of a network environment.
  • Program modules 42 generally perform the functions and/or methods of the embodiments described herein.
  • the electronic device 12 may also communicate with one or more external devices 14 (eg, a keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with the electronic device 12, and/or with Any device (eg, network card, modem, etc.) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may take place through input/output (I/O) interface 22 . Also, the electronic device 12 may communicate with one or more networks, such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet, through a network adapter 20. As shown in FIG. 3 , the network adapter 20 communicates with other modules of the electronic device 12 via the bus 18 . Although not shown in FIG.
  • I/O input/output
  • the electronic device 12 may communicate with one or more networks, such as a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the Internet, through a network adapter 20. As
  • the processing unit 16 executes a variety of functional applications and sample data acquisition operations by running the program stored in the memory 28, for example, to implement the steps of a data detection method provided by the embodiment of the present application, and the data detection method includes:
  • the fields to be detected are respectively detected, and the detection results are determined, wherein the detection rules include field filling detection rules, feature value detection rules, field length detection rules, and field dictionary code detection rules. at least one of.
  • the fourth embodiment provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the data detection method provided by the foregoing embodiments of the present application is implemented.
  • data exploration methods include:
  • the fields to be detected are respectively detected, and the detection results are determined, wherein the detection rules include field filling detection rules, feature value detection rules, field length detection rules, and field dictionary code detection rules. at least one of.
  • the computer storage medium provided by the embodiments of the present application may adopt any combination of one or more computer-readable media.
  • the computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • the computer-readable storage medium may be, for example, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • Examples (non-exhaustive list) of computer-readable storage media include: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM) ), Erasable Programmable Read-Only Memory (EPROM), memory, optical fiber, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable of the above The combination.
  • a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by, or in combination with, an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with computer-readable program code embodied in the computer-readable signal medium. Such propagated data signals may take a variety of forms, including electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
  • the program code embodied on the computer-readable medium may be transmitted by any suitable medium, including: wireless, wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
  • any suitable medium including: wireless, wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
  • Computer program code for carrying out the operations of this application may be written in one or more programming languages, including object-oriented programming languages, such as Java, Smalltalk, C++, or a combination of programming languages. , but also conventional procedural programming languages - such as C or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer, such as through the Internet using an Internet service provider Connect to an external computer.
  • LAN local area network
  • WAN wide area network
  • Internet service provider Connect to an external computer.
  • multiple modules or multiple steps of the present application can be implemented by a general-purpose computing device, and they can be centralized on a single computing device or distributed on a network composed of multiple computing devices .
  • they can be implemented with program codes executable by a computer device, they can be stored in a storage device and executed by the computing device, or they can be separately fabricated into a plurality of integrated circuit modules, or some of them can be combined.
  • Multiple modules or steps are implemented as a single integrated circuit module. In this way, the embodiments of the present application exist in various forms of combinations of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Sont divulgués un procédé et un appareil d'exploration de données, un dispositif électronique et un support de stockage. Le procédé comprend les étapes consistant à : selon une condition pour une plage d'exploration, mettre en correspondance au moins une table de données cible à partir d'une base de données dans un état connecté ; acquérir une structure de données de la ou des tables de données cibles, et déterminer des champs, à explorer, dans la ou les tables de données cibles ; et sur la base d'une règle d'exploration prédéfinie, explorer respectivement lesdits champs, et déterminer un résultat d'exploration.
PCT/CN2021/109589 2020-12-11 2021-07-30 Procédé et appareil d'exploration de données, dispositif électronique et support de stockage WO2022121337A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011462110.1A CN112559523A (zh) 2020-12-11 2020-12-11 数据探查方法、装置、电子设备以及存储介质
CN202011462110.1 2020-12-11

Publications (1)

Publication Number Publication Date
WO2022121337A1 true WO2022121337A1 (fr) 2022-06-16

Family

ID=75062769

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/109589 WO2022121337A1 (fr) 2020-12-11 2021-07-30 Procédé et appareil d'exploration de données, dispositif électronique et support de stockage

Country Status (2)

Country Link
CN (1) CN112559523A (fr)
WO (1) WO2022121337A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116131860A (zh) * 2022-12-28 2023-05-16 山东华科信息技术有限公司 分布式能源并网监测的数据压缩系统和数据压缩方法

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112559523A (zh) * 2020-12-11 2021-03-26 北京锐安科技有限公司 数据探查方法、装置、电子设备以及存储介质
CN113722325A (zh) * 2021-08-31 2021-11-30 北京锐安科技有限公司 数据库中表信息检测方法、装置、计算机设备及存储介质
CN113961571B (zh) * 2021-12-22 2022-03-22 太极计算机股份有限公司 一种基于数据探针的多模态数据感知方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106708909A (zh) * 2015-11-18 2017-05-24 阿里巴巴集团控股有限公司 数据质量的检测方法和装置
CN108389621A (zh) * 2018-02-08 2018-08-10 山东康网网络科技有限公司 病案首页数据质量检测方法及系统
CN110737650A (zh) * 2019-09-27 2020-01-31 北京明略软件系统有限公司 数据质量检测方法及装置
CN112559523A (zh) * 2020-12-11 2021-03-26 北京锐安科技有限公司 数据探查方法、装置、电子设备以及存储介质

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106708909A (zh) * 2015-11-18 2017-05-24 阿里巴巴集团控股有限公司 数据质量的检测方法和装置
CN108389621A (zh) * 2018-02-08 2018-08-10 山东康网网络科技有限公司 病案首页数据质量检测方法及系统
CN110737650A (zh) * 2019-09-27 2020-01-31 北京明略软件系统有限公司 数据质量检测方法及装置
CN112559523A (zh) * 2020-12-11 2021-03-26 北京锐安科技有限公司 数据探查方法、装置、电子设备以及存储介质

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116131860A (zh) * 2022-12-28 2023-05-16 山东华科信息技术有限公司 分布式能源并网监测的数据压缩系统和数据压缩方法
CN116131860B (zh) * 2022-12-28 2023-09-05 山东华科信息技术有限公司 分布式能源并网监测的数据压缩系统和数据压缩方法

Also Published As

Publication number Publication date
CN112559523A (zh) 2021-03-26

Similar Documents

Publication Publication Date Title
WO2022121337A1 (fr) Procédé et appareil d'exploration de données, dispositif électronique et support de stockage
CN111343161B (zh) 异常信息处理节点分析方法、装置、介质及电子设备
US9064048B2 (en) Memory leak detection
CN110968985A (zh) 集成电路修补算法确定方法及装置、存储介质、电子设备
CN110647523B (zh) 数据质量的分析方法及装置、存储介质、电子设备
CN112613584A (zh) 一种故障诊断方法、装置、设备及存储介质
WO2022068316A1 (fr) Procédé et appareil de rapprochement de données, dispositif et support de stockage
CN110569266A (zh) 一种数据查询的方法、装置、设备及存储介质
CN114595765A (zh) 数据处理方法、装置、电子设备及存储介质
CN111523764B (zh) 业务架构检测方法、装置、工具、电子设备和介质
CN117593115A (zh) 信贷风险评估模型的特征值确定方法、装置、设备和介质
WO2022062834A1 (fr) Procédé et appareil d'exploration de données, dispositif électronique et support de stockage
CN111125311A (zh) 检验信息归一处理的方法、装置、存储介质及电子设备
CN113792138B (zh) 报表生成方法、装置、电子设备及存储介质
CN116185393A (zh) 接口文档的生成方法、装置、设备、介质及产品
CN113238940B (zh) 一种接口测试结果的比对方法、装置、设备和存储介质
CN114942905A (zh) 一种迁移数据验证方法、装置、设备和存储介质
US11520831B2 (en) Accuracy metric for regular expression
CN114741291A (zh) 一种漏洞信息自动提交的方法、装置、设备及介质
CN112214469A (zh) 路测数据处理方法、装置、服务器及存储介质
CN113656391A (zh) 数据检测方法及装置、存储介质及电子设备
CN111427874A (zh) 医疗数据生产的质控方法、装置以及电子设备
CN113496365A (zh) 一种仓库合并方案确定方法、装置、设备及介质
CN110866557B (zh) 数据评价方法及装置、存储介质及电子设备
CN115292146B (zh) 一种系统容量预估方法、系统、设备及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21902047

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21902047

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 21902047

Country of ref document: EP

Kind code of ref document: A1