CN113064922A - Data continuity judging method, terminal equipment and storage medium - Google Patents

Data continuity judging method, terminal equipment and storage medium Download PDF

Info

Publication number
CN113064922A
CN113064922A CN202110375006.7A CN202110375006A CN113064922A CN 113064922 A CN113064922 A CN 113064922A CN 202110375006 A CN202110375006 A CN 202110375006A CN 113064922 A CN113064922 A CN 113064922A
Authority
CN
China
Prior art keywords
data
tested
judging
processor
computer program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110375006.7A
Other languages
Chinese (zh)
Inventor
刘晹
任勇
刘云
李炜
施良海
黄磊
周凌龙
王玉辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Dragon Information Technology Co ltd
Original Assignee
Xiamen Dragon Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Dragon Information Technology Co ltd filed Critical Xiamen Dragon Information Technology Co ltd
Priority to CN202110375006.7A priority Critical patent/CN113064922A/en
Publication of CN113064922A publication Critical patent/CN113064922A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24568Data stream processing; Continuous queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Fuzzy Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a data continuity judging method, a terminal device and a storage medium, wherein the method comprises the following steps: s1: storing the data to be tested into a storage container; s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested; s3: for each set of data, the following operations are performed: y is a-b + c, wherein y represents a calculation result, and a, b and c respectively represent first, second and third data after each group of data is arranged in the order of magnitude; s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous. The invention can quickly judge the continuity of the data.

Description

Data continuity judging method, terminal equipment and storage medium
Technical Field
The present invention relates to the field of data exploration, and in particular, to a data continuity determination method, a terminal device, and a storage medium.
Background
The equi-differential data continuity of mass data is an important exploration item for data exploration. Data exploration is a link of data investigation at the early stage of data management, data is known and known through data exploration, and data continuity exploration is an important basis for judging whether a field is a main key or not and how data is sliced and processed. The existing common mass data continuity exploration method has the defects of slow storage and large consumption of space resources.
Disclosure of Invention
In order to solve the above problem, the present invention provides a data continuity determination method, a terminal device, and a storage medium.
The specific scheme is as follows:
a data continuity judging method includes the following steps:
s1: storing the data to be tested into a storage container;
s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested;
s3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents a calculation result, and a, b and c respectively represent first data, second data and third data after each group of data is arranged according to the size sequence;
s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous.
Further, the storage container adopts a bloom filter.
A data continuity determination terminal device includes a processor, a memory, and a computer program stored in the memory and operable on the processor, and when the processor executes the computer program, the processor implements the steps of the method described above according to the embodiment of the present invention.
A computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, carries out the steps of the method as described above for an embodiment of the invention.
By adopting the technical scheme, the continuity of the data can be rapidly judged, the data loading cost can be greatly reduced by adopting the bloom filter, the operation time is reduced, and the performance is improved.
Drawings
Fig. 1 is a flowchart illustrating a first embodiment of the present invention.
Detailed Description
To further illustrate the various embodiments, the invention provides the accompanying drawings. The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the embodiments. Those skilled in the art will appreciate still other possible embodiments and advantages of the present invention with reference to these figures.
The invention will now be further described with reference to the accompanying drawings and detailed description.
The first embodiment is as follows:
an embodiment of the present invention provides a data continuity determining method, as shown in fig. 1, which is a flowchart of the data continuity determining method according to the embodiment of the present invention, and the method includes the following steps:
s1: and storing the data to be tested into a storage container.
The data to be tested is data which needs to be judged whether the data has continuity or not, and the data to be tested is usually mass data.
S2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested.
S3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents the calculation result, and a, b and c represent the first, second and third data after each group of data is arranged according to the size sequence.
Because a is more than or equal to y and more than or equal to c, the calculation result y is not higher than the maximum value of the data to be tested and is not lower than the minimum value of the data to be tested.
S4: and judging whether the calculation results y of each group of data exist in the storage container (in the data to be tested), if so, judging that the data to be tested are continuous, otherwise (as long as one group of calculation results y do not exist in the storage container), judging that the data to be tested are discontinuous.
Before the existing method for judging whether data are continuous or not, the data need to be loaded into a certain container firstly, and the storage structure of the conventional container has the defects of slow storage, large consumption of space resources and the like. In this embodiment, the storage container preferably uses a bloom filter, which is a space-saving data structure (bitmap + hash), and the space efficiency and query time of the storage container far exceed those of a general algorithm, so that the storage container has great advantages in both time and space, thereby greatly reducing the loading cost and reducing the operation time. In addition, the bloom filter does not store elements, and has great advantages for some scenes with strict confidentiality requirements.
Example two:
the present invention further provides a data continuity determination terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps in the foregoing method embodiments of the first embodiment of the present invention when executing the computer program.
Further, as an executable scheme, the data continuity determination terminal device may be a computing device such as a desktop computer, a notebook, a palm computer, and a cloud server. The data continuity determination terminal device may include, but is not limited to, a processor, a memory. It is understood by those skilled in the art that the above-mentioned composition structure of the data continuity determination terminal device is only an example of the data continuity determination terminal device, and does not constitute a limitation on the data continuity determination terminal device, and may include more or less components than the above, or combine some components, or different components, for example, the data continuity determination terminal device may further include an input-output device, a network access device, a bus, and the like, which is not limited by the embodiment of the present invention.
Further, as an executable solution, the Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, a discrete hardware component, and the like. The general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like, and the processor is a control center of the data continuity determination terminal device, and various interfaces and lines are used to connect various parts of the entire data continuity determination terminal device.
The memory may be used for storing the computer program and/or the module, and the processor may implement various functions of the data continuity determination terminal device by executing or executing the computer program and/or the module stored in the memory and calling data stored in the memory. The memory can mainly comprise a program storage area and a data storage area, wherein the program storage area can store an operating system and an application program required by at least one function; the storage data area may store data created according to the use of the mobile phone, and the like. In addition, the memory may include high speed random access memory, and may also include non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other volatile solid state storage device.
The invention also provides a computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, carries out the steps of the above-mentioned method of an embodiment of the invention.
The module/unit integrated with the data continuity determination terminal device may be stored in a computer-readable storage medium if it is implemented in the form of a software functional unit and sold or used as a separate product. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), software distribution medium, and the like.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (4)

1. A data continuity judging method, characterized by comprising the steps of:
s1: storing the data to be tested into a storage container;
s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested;
s3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents a calculation result, and a, b and c respectively represent first data, second data and third data after each group of data is arranged according to the size sequence;
s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous.
2. The data continuity determination method according to claim 1, characterized in that: the storage container employs a bloom filter.
3. A data continuity determination terminal device characterized by: comprising a processor, a memory and a computer program stored in the memory and running on the processor, the processor implementing the steps of the method according to any of claims 1-2 when executing the computer program.
4. A computer-readable storage medium storing a computer program, characterized in that: the computer program when executed by a processor implementing the steps of the method as claimed in any one of claims 1-2.
CN202110375006.7A 2021-04-08 2021-04-08 Data continuity judging method, terminal equipment and storage medium Pending CN113064922A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110375006.7A CN113064922A (en) 2021-04-08 2021-04-08 Data continuity judging method, terminal equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110375006.7A CN113064922A (en) 2021-04-08 2021-04-08 Data continuity judging method, terminal equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113064922A true CN113064922A (en) 2021-07-02

Family

ID=76566085

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110375006.7A Pending CN113064922A (en) 2021-04-08 2021-04-08 Data continuity judging method, terminal equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113064922A (en)

Similar Documents

Publication Publication Date Title
US11605087B2 (en) Method and apparatus for identifying identity information
US9329873B2 (en) Method and device, terminal and computer readable medium for accelerating startup of operating system
CN103942292A (en) Virtual machine mirror image document processing method, device and system
US10628066B2 (en) Ensuring in-storage data atomicity and consistency at low cost
US20110307471A1 (en) Accelerating Database Management System Operations
CN110750793B (en) Vulnerability scanning method and device
CN111949681A (en) Data aggregation processing device and method and storage medium
CN111209257A (en) Method and device for fragmenting file system
CN114780501A (en) Data processing method, electronic device and computer program product
CN114374392A (en) Data compression storage method and device, terminal equipment and readable storage medium
CN110688223B (en) Data processing method and related product
CN111459937A (en) Data table association method, device, server and storage medium
CN113064922A (en) Data continuity judging method, terminal equipment and storage medium
CN107729577B (en) Data searching method based on multidimensional hash table, terminal equipment and storage medium
CN113806737B (en) Malicious process risk level assessment method, terminal equipment and storage medium
CN114281766A (en) TrueCrypt hidden partition identification method, terminal device and storage medium
CN111158994B (en) Method and device for testing pressing performance
CN113986980A (en) Data sorting method and device
CN111198900B (en) Data caching method and device for industrial control network, terminal equipment and medium
CN114416655A (en) Hive file processing method and device, computer equipment and storage medium
CN108804533B (en) Heterogeneous big data information filtering method and device
CN111241099A (en) Industrial big data storage method and device
CN113419792A (en) Event processing method and device, terminal equipment and storage medium
CN113342270A (en) Volume unloading method and device and electronic equipment
EP3048541A1 (en) File access method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination