CN113064922A - Data continuity judging method, terminal equipment and storage medium - Google Patents
Data continuity judging method, terminal equipment and storage medium Download PDFInfo
- Publication number
- CN113064922A CN113064922A CN202110375006.7A CN202110375006A CN113064922A CN 113064922 A CN113064922 A CN 113064922A CN 202110375006 A CN202110375006 A CN 202110375006A CN 113064922 A CN113064922 A CN 113064922A
- Authority
- CN
- China
- Prior art keywords
- data
- tested
- judging
- processor
- computer program
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2455—Query execution
- G06F16/24568—Data stream processing; Continuous queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Fuzzy Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to a data continuity judging method, a terminal device and a storage medium, wherein the method comprises the following steps: s1: storing the data to be tested into a storage container; s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested; s3: for each set of data, the following operations are performed: y is a-b + c, wherein y represents a calculation result, and a, b and c respectively represent first, second and third data after each group of data is arranged in the order of magnitude; s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous. The invention can quickly judge the continuity of the data.
Description
Technical Field
The present invention relates to the field of data exploration, and in particular, to a data continuity determination method, a terminal device, and a storage medium.
Background
The equi-differential data continuity of mass data is an important exploration item for data exploration. Data exploration is a link of data investigation at the early stage of data management, data is known and known through data exploration, and data continuity exploration is an important basis for judging whether a field is a main key or not and how data is sliced and processed. The existing common mass data continuity exploration method has the defects of slow storage and large consumption of space resources.
Disclosure of Invention
In order to solve the above problem, the present invention provides a data continuity determination method, a terminal device, and a storage medium.
The specific scheme is as follows:
a data continuity judging method includes the following steps:
s1: storing the data to be tested into a storage container;
s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested;
s3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents a calculation result, and a, b and c respectively represent first data, second data and third data after each group of data is arranged according to the size sequence;
s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous.
Further, the storage container adopts a bloom filter.
A data continuity determination terminal device includes a processor, a memory, and a computer program stored in the memory and operable on the processor, and when the processor executes the computer program, the processor implements the steps of the method described above according to the embodiment of the present invention.
A computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, carries out the steps of the method as described above for an embodiment of the invention.
By adopting the technical scheme, the continuity of the data can be rapidly judged, the data loading cost can be greatly reduced by adopting the bloom filter, the operation time is reduced, and the performance is improved.
Drawings
Fig. 1 is a flowchart illustrating a first embodiment of the present invention.
Detailed Description
To further illustrate the various embodiments, the invention provides the accompanying drawings. The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the embodiments. Those skilled in the art will appreciate still other possible embodiments and advantages of the present invention with reference to these figures.
The invention will now be further described with reference to the accompanying drawings and detailed description.
The first embodiment is as follows:
an embodiment of the present invention provides a data continuity determining method, as shown in fig. 1, which is a flowchart of the data continuity determining method according to the embodiment of the present invention, and the method includes the following steps:
s1: and storing the data to be tested into a storage container.
The data to be tested is data which needs to be judged whether the data has continuity or not, and the data to be tested is usually mass data.
S2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested.
S3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents the calculation result, and a, b and c represent the first, second and third data after each group of data is arranged according to the size sequence.
Because a is more than or equal to y and more than or equal to c, the calculation result y is not higher than the maximum value of the data to be tested and is not lower than the minimum value of the data to be tested.
S4: and judging whether the calculation results y of each group of data exist in the storage container (in the data to be tested), if so, judging that the data to be tested are continuous, otherwise (as long as one group of calculation results y do not exist in the storage container), judging that the data to be tested are discontinuous.
Before the existing method for judging whether data are continuous or not, the data need to be loaded into a certain container firstly, and the storage structure of the conventional container has the defects of slow storage, large consumption of space resources and the like. In this embodiment, the storage container preferably uses a bloom filter, which is a space-saving data structure (bitmap + hash), and the space efficiency and query time of the storage container far exceed those of a general algorithm, so that the storage container has great advantages in both time and space, thereby greatly reducing the loading cost and reducing the operation time. In addition, the bloom filter does not store elements, and has great advantages for some scenes with strict confidentiality requirements.
Example two:
the present invention further provides a data continuity determination terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps in the foregoing method embodiments of the first embodiment of the present invention when executing the computer program.
Further, as an executable scheme, the data continuity determination terminal device may be a computing device such as a desktop computer, a notebook, a palm computer, and a cloud server. The data continuity determination terminal device may include, but is not limited to, a processor, a memory. It is understood by those skilled in the art that the above-mentioned composition structure of the data continuity determination terminal device is only an example of the data continuity determination terminal device, and does not constitute a limitation on the data continuity determination terminal device, and may include more or less components than the above, or combine some components, or different components, for example, the data continuity determination terminal device may further include an input-output device, a network access device, a bus, and the like, which is not limited by the embodiment of the present invention.
Further, as an executable solution, the Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, a discrete Gate or transistor logic device, a discrete hardware component, and the like. The general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like, and the processor is a control center of the data continuity determination terminal device, and various interfaces and lines are used to connect various parts of the entire data continuity determination terminal device.
The memory may be used for storing the computer program and/or the module, and the processor may implement various functions of the data continuity determination terminal device by executing or executing the computer program and/or the module stored in the memory and calling data stored in the memory. The memory can mainly comprise a program storage area and a data storage area, wherein the program storage area can store an operating system and an application program required by at least one function; the storage data area may store data created according to the use of the mobile phone, and the like. In addition, the memory may include high speed random access memory, and may also include non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other volatile solid state storage device.
The invention also provides a computer-readable storage medium, in which a computer program is stored, which, when being executed by a processor, carries out the steps of the above-mentioned method of an embodiment of the invention.
The module/unit integrated with the data continuity determination terminal device may be stored in a computer-readable storage medium if it is implemented in the form of a software functional unit and sold or used as a separate product. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), software distribution medium, and the like.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (4)
1. A data continuity judging method, characterized by comprising the steps of:
s1: storing the data to be tested into a storage container;
s2: extracting n groups of data from the data to be tested, wherein n is an integer greater than 1, and each group of data is three data randomly extracted from the data to be tested;
s3: for each set of data, the following operations are performed:
y=a-b+c
wherein y represents a calculation result, and a, b and c respectively represent first data, second data and third data after each group of data is arranged according to the size sequence;
s4: and judging whether the calculation results y of each group of data exist in the storage container, if so, judging that the data to be tested are continuous, and otherwise, judging that the data to be tested are discontinuous.
2. The data continuity determination method according to claim 1, characterized in that: the storage container employs a bloom filter.
3. A data continuity determination terminal device characterized by: comprising a processor, a memory and a computer program stored in the memory and running on the processor, the processor implementing the steps of the method according to any of claims 1-2 when executing the computer program.
4. A computer-readable storage medium storing a computer program, characterized in that: the computer program when executed by a processor implementing the steps of the method as claimed in any one of claims 1-2.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110375006.7A CN113064922A (en) | 2021-04-08 | 2021-04-08 | Data continuity judging method, terminal equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110375006.7A CN113064922A (en) | 2021-04-08 | 2021-04-08 | Data continuity judging method, terminal equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113064922A true CN113064922A (en) | 2021-07-02 |
Family
ID=76566085
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110375006.7A Pending CN113064922A (en) | 2021-04-08 | 2021-04-08 | Data continuity judging method, terminal equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113064922A (en) |
-
2021
- 2021-04-08 CN CN202110375006.7A patent/CN113064922A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11605087B2 (en) | Method and apparatus for identifying identity information | |
US9329873B2 (en) | Method and device, terminal and computer readable medium for accelerating startup of operating system | |
CN103942292A (en) | Virtual machine mirror image document processing method, device and system | |
US10628066B2 (en) | Ensuring in-storage data atomicity and consistency at low cost | |
US20110307471A1 (en) | Accelerating Database Management System Operations | |
CN110750793B (en) | Vulnerability scanning method and device | |
CN111949681A (en) | Data aggregation processing device and method and storage medium | |
CN111209257A (en) | Method and device for fragmenting file system | |
CN114780501A (en) | Data processing method, electronic device and computer program product | |
CN114374392A (en) | Data compression storage method and device, terminal equipment and readable storage medium | |
CN110688223B (en) | Data processing method and related product | |
CN111459937A (en) | Data table association method, device, server and storage medium | |
CN113064922A (en) | Data continuity judging method, terminal equipment and storage medium | |
CN107729577B (en) | Data searching method based on multidimensional hash table, terminal equipment and storage medium | |
CN113806737B (en) | Malicious process risk level assessment method, terminal equipment and storage medium | |
CN114281766A (en) | TrueCrypt hidden partition identification method, terminal device and storage medium | |
CN111158994B (en) | Method and device for testing pressing performance | |
CN113986980A (en) | Data sorting method and device | |
CN111198900B (en) | Data caching method and device for industrial control network, terminal equipment and medium | |
CN114416655A (en) | Hive file processing method and device, computer equipment and storage medium | |
CN108804533B (en) | Heterogeneous big data information filtering method and device | |
CN111241099A (en) | Industrial big data storage method and device | |
CN113419792A (en) | Event processing method and device, terminal equipment and storage medium | |
CN113342270A (en) | Volume unloading method and device and electronic equipment | |
EP3048541A1 (en) | File access method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |