CN117851159A - Time sequence data rapid compression method and system based on rounding transformation - Google Patents
Time sequence data rapid compression method and system based on rounding transformation Download PDFInfo
- Publication number
- CN117851159A CN117851159A CN202311703829.3A CN202311703829A CN117851159A CN 117851159 A CN117851159 A CN 117851159A CN 202311703829 A CN202311703829 A CN 202311703829A CN 117851159 A CN117851159 A CN 117851159A
- Authority
- CN
- China
- Prior art keywords
- sequence
- data
- interval
- flag
- compressed
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000009466 transformation Effects 0.000 title claims abstract description 55
- 230000006835 compression Effects 0.000 title claims abstract description 44
- 238000007906 compression Methods 0.000 title claims abstract description 43
- 238000000034 method Methods 0.000 title claims abstract description 42
- 230000008859 change Effects 0.000 claims abstract description 20
- 238000013144 data compression Methods 0.000 claims abstract description 9
- 238000000605 extraction Methods 0.000 claims description 9
- 238000005070 sampling Methods 0.000 claims description 8
- FGUUSXIOTUKUDN-IBGZPJMESA-N C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 Chemical compound C1(=CC=CC=C1)N1C2=C(NC([C@H](C1)NC=1OC(=NN=1)C1=CC=CC=C1)=O)C=CC=C2 FGUUSXIOTUKUDN-IBGZPJMESA-N 0.000 claims description 4
- 238000013500 data storage Methods 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 238000012544 monitoring process Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 239000000835 fiber Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention belongs to the technical field of data compression, and particularly relates to a time sequence data rapid compression method and system based on rounding transformation, wherein the method comprises the following steps: acquiring time sequence data to be compressed; configuring a transformation base according to the variable type and the time sequence data; traversing each numerical value in the sequence, and performing rounding transformation to obtain a transformed sequence; extracting state change key points from the transformed sequence, and dividing interval states to obtain an interval state set containing the key points; according to the interval state set containing the key points, the original time sequence data is compressed, the original time sequence is subjected to rounding transformation, the state change key points are extracted, the original time sequence data is divided into a plurality of intervals in a stable state or a fluctuation state, different compression methods are adopted according to the types of the state intervals, on the basis of ensuring rapid compression, key inflection point information is identified and reserved as much as possible, and the information loss of the compressed data is reduced.
Description
Technical Field
The invention belongs to the technical field of data compression, and particularly relates to a time sequence data rapid compression method and system based on rounding transformation.
Background
The monitoring index generally collects numerical information of the operation state of the monitoring object at fixed time intervals, and the format of the numerical information can be generally expressed as a 'time stamp, a numerical value'. Sampling the monitoring object at fixed time intervals within a period of time to obtain a time sequence arranged in time sequence.
In the field of cloud computing, as the scale of cloud services increases, a system becomes more and more complex, monitored indexes are more and more, and massive monitoring data can be generated every day. Since the system is in normal operation most of the time, abnormal fluctuations are a minority of cases. In application scenarios based on data-driven anomaly detection, root cause diagnosis, etc., anomaly sample data is critical to model training for machine learning or deep learning. Because the proportion of the abnormal sample in the whole sampling is low, the cost for permanently storing the whole original data is high and the efficiency is low. In order to extract and retain valuable data as long as possible, improve storage efficiency, save storage cost, compress the data, reduce redundant, low-value data.
The compression methods of time series can be classified into lossless compression and lossy compression, and most of the compression methods of time series belong to the lossy compression type. Among these, there are methods that are based mainly on lossy compression of piecewise linear representations or piecewise aggregated approximation representations, etc. The method is simple, quick and wide in application range, but has some defects:
after segmentation, the k data points in the segmentation are usually represented by means of an average value, and for important key points, more information is lost; the segmentation method is insensitive to inflection points of some state changes and cannot effectively characterize the state changes of some inflection points.
Disclosure of Invention
The invention aims to provide a time sequence data rapid compression method and system based on rounding transformation, which are used for extracting state change key points by rounding transformation on an original time sequence, dividing the state change key points into a plurality of stable state or fluctuation state intervals, adopting different compression methods according to the types of the state intervals, identifying and retaining key inflection point information as much as possible on the basis of ensuring rapid compression, and reducing the information loss of compressed data so as to solve the problems in the background technology.
In order to achieve the above purpose, the invention adopts the following technical scheme: a time sequence data rapid compression method based on rounding transformation comprises the following steps: acquiring time sequence data to be compressed; configuring a transformation base for the time series data according to the variable type; traversing each numerical value in the sequence, and performing rounding transformation to obtain a transformed sequence; extracting state change key points from the transformed sequence, and dividing interval states to obtain an interval state set containing the key points; compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data; and storing the compressed sequence data into a specified database.
Preferably, the time-series data includes: x= { X 1 ,…,x i ,…,x n X, where x i At time t as a variable i Is used for the sampling value of (a).
Preferably, the performing rounding transformation to obtain a transformed sequence includes:
traversing each numerical value in the sequence X, carrying out upward rounding transformation on each data value in the sequence X, and carrying out the method i =C(x i )=Ceil(x i C) to obtain the transformed sequence y= { Y 1 ,…,y i ,…,y n };
Wherein, ceil is rounded up, and C is the transform radix.
Preferably, the rounding transform further comprises a down rounding transform; the down-rounding transform includes: y is i =F(x i )=Floor(x i C) C; wherein Floor is rounded down.
Preferably, the extracting the state change key point includes: the extraction method of the state change key points comprises the following steps:
step a: initializing variables j=1, i=1;
step b: setting the position of the left mark point of the jth interval as t jL =t i Setting the current interval state markflag j =0;
Step c: judging y i+1 And y i Whether or not to be equal, e.g. y i+1 ≠y i Setting a flag j =1; otherwise, executing the step d;
step d: setting i=i+1;
step e: such as (y) i+1 ==y i and flag j ==1)or(y i+1 ≠y i and flag j Set t, set t = =0) jR =t i Obtaining the j-th state interval (t jL ,t jR ,flag j ) Setting j=j+1 and i=i+1 at the same time, and repeating steps b to d; otherwise, executing the step f;
step f: repeating steps d-e until all values in Y are traversed to obtain a set containing m state intervals
S={(t 1L ,t 1R ,flag 1 ),…,(t jL ,t jR ,flag j ),…,(t mL ,t mR ,flag m )};
Wherein (t) jL ,t jR ,flag j ) For the j-th state interval, t jL T is the start point of the interval jR As the end point, flag j The interval =0 is a stationary interval, flag j The interval is referred to as a fluctuation interval.
Preferably, when the value of i=i+1 in step d or step e is n, such as t mL If the value of (2) is not null, then t mR =t n Ending the extraction process of the key points; for example t mL If the value of (1) is null, then t mL =t mR =t n ,flag m =0, ending the extraction process of the key points.
Preferably, the compressing the original time series data includes:
step 1: sequentially extracting a state section (t jL ,t jR ,flag j );
Step 2: for example, a flag j If the value of (2) is 0, then calculate X in time interval t jL ,t jR ]The average value of each data value in the data is taken as the compressed sampling value, and the time stamp is carried outSet to t jR The method comprises the steps of carrying out a first treatment on the surface of the For example, a flag j If the value of (2) is 1, X is set to be within the time interval t jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value;
step 3: saving the compressed numerical value to a sequence Z, and setting j=j+1;
step 4: and (3) repeating the steps 1 to 3 until the step S is traversed, and obtaining the compressed sequence Z of X.
Preferably, in step 2, an interval sequence length threshold L is set as flag j Is 0 and sequence X is in time interval t jL ,t jR ]When the length of the subsequence in the sequence is less than or equal to L, X is in a time interval [ t ] jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value.
On the other hand, the invention also provides a time sequence data rapid compression system based on rounding transformation, which comprises the following steps:
the data acquisition module is used for acquiring time sequence data to be compressed;
the data compression module is used for receiving the time series data to be compressed and compressing the time series data;
and the data storage module is used for receiving the compressed data and storing the compressed data into a specified database.
Preferably, the data compression module includes:
the configuration module is used for configuring a transformation base number for the time sequence data according to the variable types;
the transformation module is used for traversing each numerical value in the sequence and carrying out rounding transformation to obtain a transformed sequence;
the set acquisition module is used for extracting state change key points from the transformed sequence and dividing interval states to obtain an interval state set containing the key points;
and the compression module is used for compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data.
The invention has the technical effects and advantages that: compared with the prior art, the time sequence data rapid compression method and system based on rounding transformation provided by the invention have the following advantages:
the method comprises the steps of obtaining time sequence data to be compressed; configuring a transformation base for the time series data according to the variable type; traversing each numerical value in the sequence, and performing rounding transformation to obtain a transformed sequence; extracting state change key points from the transformed sequence, and dividing interval states to obtain an interval state set containing the key points; compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data; storing the compressed sequence data into a specified database, extracting state change key points by rounding transformation on an original time sequence, dividing the state change key points into a plurality of intervals in a stable state or a fluctuation state, adopting different compression methods according to the types of the state intervals, identifying and retaining key inflection point information as much as possible on the basis of ensuring quick compression, and reducing the information loss of the compressed data.
Drawings
FIG. 1 is a time series line diagram of an embodiment of the present invention;
FIG. 2 is a schematic diagram of performing an up-conversion on a time sequence according to an embodiment of the present invention;
FIG. 3 is a block diagram of a transform sequence according to an embodiment of the present invention;
FIG. 4 is a diagram showing the comparison of the original time series and the compressed series in the embodiment of the present invention;
FIG. 5 is a flowchart of a method for fast compression of time series data based on rounding transformation according to an embodiment of the present invention;
fig. 6 is a block diagram of a sequential data rapid compression system based on rounding transformation in an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. The specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The invention provides a time sequence data rapid compression method based on rounding transformation as shown in fig. 5, which comprises the following steps:
step one: and obtaining time sequence data to be compressed, and configuring a transformation base for the time sequence data according to the variable type.
Specifically, time-series data x= { X to be compressed is acquired 1 ,…,x i ,…,x n X, where x i Indicating that the variable is at time t 1 Is used for the sampling value of (a). The transformation radix C is configured according to the variable type.
Step two: traversing each numerical value in the sequence, and carrying out rounding transformation to obtain a transformed sequence.
Specifically, each numerical value in the sequence X is traversed, and rounding transformation is performed to obtain a transformed sequence y= { Y 1 ,…,y i ,…,y n }。
The method for carrying out the up-rounding transformation C (-) on each data value in the sequence X is as follows:
y i =C(x i )=Ceil(x i /C)*C
wherein Ceil represents a rounded up.
Optionally, each data value in the sequence X may be subjected to a F (-) rounding transform, where the method of F (-) transform is:
y i =F(x i )=Floor(x i /C)*C
wherein Floor represents a rounding down.
Taking the time sequence of fig. 1 as an example, the transform base c=5 is configured, and the up-rounding transform is performed, so that a new sequence can be obtained, as shown in fig. 2.
Step three: and extracting state change key points from the transformed sequence, and dividing interval states to obtain an interval state set containing the key points.
Illustratively, for the transformed sequence y= { Y 1 ,…,y i ,…,y n Extracting state change key points, and dividing interval states to obtain an interval state set containing the key points.
Specifically, the method for extracting the state change key points comprises the following steps:
step 3.1, initializing variables j=1, i=1;
step 3.2, setting the position of the left marking point of the jth interval as t j =t i Setting a flag of the current interval state j =0;
Step 3.3, judging y i+1 And y i Whether or not to be equal, if y i+1 ≠y i Setting a flag j =1; otherwise, executing the step 3.4;
step 3.4, setting i=i+1;
step 3.5 if (y i+1 ==y i and flag j ==1)or(y i+1 ≠y i and flag j Set t, set t = =0) jR =t i Obtaining the j-th state interval (t jL ,t jR ,flag j ) Setting j=j+1 and i=i+1 at the same time, and repeating steps 3.2 to 3.5; otherwise, executing the step 3.6;
step 3.6, repeating steps 3.4 to 3.5 until all values in Y are traversed to obtain a set of m state intervals
S={(t 1L ,t 1R ,flag 1 ),…,(t jL ,t jR ,flag j ),…,(t mL ,t mR ,flag m )}。
Wherein (t) jL ,t jR ,flag j ) Represents the j-th state interval, t jL T is the start point of the interval jR As the end point, flag j =0 indicates that this interval is a stationary interval, flag j =1 means that this section is a fluctuation section.
In particular, when the value of i=i+1 in step 3.4 or step 3.5 is n, if t mL If the value of (2) is not null, then t mR =t n Ending the extraction process of the key points; if t mL If the value of (1) is null, then t mL =t nR =t n ,flag m =0, and then the extraction process of the key points is ended.
As shown in fig. 3, performing the above-described key point extraction operation on the transformation sequence in fig. 2 may result in the following state interval division.
Step four: and compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data.
Specifically, according to the interval state set S including the key points obtained in the third step, for the original time sequence x= { X 1 ,…,x i ,…,x n Compression to obtain a compressed sequence Z.
The sequence compression method comprises the following steps:
step 4.1, extracting a state interval from S in order (t jL ,t jR ,flag j )
Step 4.2, if flag j If the value of (2) is 0, then calculate X in time interval t jL ,t jR ]The average value of each data value in the data is taken as the compressed sampling value, and the time stamp is set as t jR The method comprises the steps of carrying out a first treatment on the surface of the If a flag is j If the value of (2) is 1, X is set to be within the time interval t jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value;
and 4.3, storing the compressed numerical value into a sequence Z, setting j=j+1, and repeating the steps 4.1 to 4.3 until the step S is traversed to obtain a compressed sequence Z of X.
Optionally, in step 4.2, provision is made forSetting interval sequence length threshold L as flag j Is 0 and sequence X is in time interval t jL ,t jR ]When the length of the subsequence in the sequence is less than or equal to L, X is in a time interval [ t ] jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value.
Taking the interval sequence length threshold l=4 as an example, according to the interval division including the key points shown in fig. 3, the original time sequence in fig. 1 is compressed, and the obtained compressed sequence is shown by a black dot mark in fig. 4, and the compression rate at this time is 59.5%. When the interval sequence length threshold l=0, the compression rate is 67.5%. The higher the plateau duty cycle in the time series, the higher the compression rate of the sample taking method.
Step five: and storing the compressed sequence data into a specified database.
The state division method based on rounding transformation is innovatively provided in the method, so that the state division can be performed on the time sequence more quickly, and the time complexity of the algorithm is only related to the length of the time sequence; in addition, by setting the parameter C, the granularity of state division can be freely adjusted according to different variable types and dimensions, and the method has better universality. The data in the stable state is averaged, and the data with larger fluctuation is taken as the original value, so that the key change characteristics of the time sequence can be better reserved. By setting the interval sequence length threshold L in the sequence compression link, the compression rate can be further adjusted, and the accuracy of compressed data is improved.
In another aspect, the present invention proposes a time series data rapid compression system based on rounding transformation, as shown in fig. 6, including: the device comprises a data acquisition module, a data compression module and a data storage module.
Illustratively, the data acquisition module is configured to acquire time-series data to be compressed;
illustratively, the data compression module is configured to receive the time-series data to be compressed, and compress the time-series data;
illustratively, the data storage module is configured to receive the compressed data and store the compressed data in a specified database.
In addition, the data compression module includes: the device comprises a configuration module, a transformation module, a collection acquisition module and a compression module.
Illustratively, the configuration module is configured to configure a transformation radix for the time series data according to a variable type;
illustratively, the transformation module is used for traversing each numerical value in the sequence, and performing rounding transformation to obtain a transformed sequence;
the set acquisition module is used for extracting state change key points from the transformed sequence and dividing interval states to obtain an interval state set containing the key points;
the compression module is used for compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data.
In addition, the invention also provides a terminal device, and the time sequence data rapid compression method based on rounding transformation in the embodiment is mainly applied to the terminal device, wherein the terminal device can be a PC, a portable computer, a mobile terminal and other devices with display and processing functions.
In particular, the terminal device may include a processor (e.g., CPU), a communication bus, a user interface, a network interface, and a memory. Wherein the communication bus is used for realizing connection communication among the components; the user interface may include a Display screen (Display), an input unit such as a Keyboard (Keyboard); the network interface may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface); the memory may be a high-speed RAM memory or a stable memory (non-volatile memory), such as a disk memory, or alternatively may be a storage device independent of the aforementioned processor.
The memory stores a readable storage medium, the time sequence data rapid compression program in the readable storage medium, and the processor can call the prediction program stored in the memory and execute the time sequence data rapid compression method based on rounding transformation.
It will be appreciated that the readable storage medium may be a tangible device that can hold and store instructions for use by the instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: portable computer disks, hard disks, random Access Memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static Random Access Memory (SRAM), portable compact disk read-only memory (CD-ROM), digital Versatile Disks (DVD), memory sticks, floppy disks, mechanical coding devices, punch cards or in-groove structures such as punch cards or grooves having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media, as used herein, are not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., optical pulses through fiber optic cables), or electrical signals transmitted through wires.
The computer readable program instructions described herein may be downloaded from a computer readable storage medium to a respective computing/processing device or to an external computer or external storage device over a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers and/or edge servers. The network interface card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in the respective computing/processing device.
Computer program instructions for performing the operations of the present disclosure may be assembly instructions, instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, c++ or the like and conventional procedural programming languages, such as the "C" language or similar programming languages. The computer readable program instructions may be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, aspects of the present disclosure are implemented by personalizing electronic circuitry, such as programmable logic circuitry, field Programmable Gate Arrays (FPGAs), or Programmable Logic Arrays (PLAs), with state information of computer readable program instructions, which can execute the computer readable program instructions.
Finally, it should be noted that: the foregoing description is only illustrative of the preferred embodiments of the present invention, and although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments described, or equivalents may be substituted for elements thereof, and any modifications, equivalents, improvements or changes may be made without departing from the spirit and principles of the present invention.
Claims (10)
1. A method for rapidly compressing time series data based on rounding transformation, comprising the steps of:
acquiring time sequence data to be compressed;
configuring a transformation base for the time series data according to the variable type;
traversing each numerical value in the sequence, and performing rounding transformation to obtain a transformed sequence;
extracting state change key points from the transformed sequence, and dividing interval states to obtain an interval state set containing the key points;
compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data;
and storing the compressed sequence data into a specified database.
2. The method for rapid compression of time series data based on rounding transformation according to claim 1, wherein the time series data comprises: x= { X 1 ,…,x i ,…,x n X, where x i At time t as a variable j Is used for the sampling value of (a).
3. The method for rapid compression of time series data based on rounding transformation according to claim 2, wherein the step of rounding transformation to obtain a transformed sequence comprises the steps of:
traversing each numerical value in the sequence X, carrying out upward rounding transformation on each data value in the sequence X, and carrying out the method i =C(x i )=Ceil(x i C) to obtain the transformed sequence y= { Y 1 ,…,y i ,…,y n };
Wherein, ceil is rounded up, and C is the transform radix.
4. The method for rapid compression of time series data based on rounding transformation according to claim 2, wherein the rounding transformation further comprises a down rounding transformation;
the down-rounding transform includes: y is i =F(x i )=Floor(x i /C)*C;
Wherein Floor is rounded down.
5. A method of fast compression of time series data based on rounding transformation according to claim 3, wherein the extracting the state change key point comprises:
the extraction method of the state change key points comprises the following steps:
step a: initializing variables j=1, i=1;
step b: setting the position of the left mark point of the jth interval as t jL =t i Setting a flag of the current interval state j =0;
Step c: judging y i+1 And y i Whether or not to be equal, e.g. y i+1 ≠y i Setting a flag j =1; otherwise, executing the step d;
step d: setting i=i+1;
step e: such as (y) i+1 ==y i and flag j ==1)or(y i+1 ≠y i and flag j Set t, set t = =0) jR =t i Obtaining the j-th state interval (t jL ,t jR ,flag j ) Setting j=j+1 and i=i+1 at the same time, and repeating steps b to d; otherwise, executing the step f;
step f: repeating steps d-e until all values in Y are traversed to obtain a set containing m state intervals
S={(t 1L ,t 1R ,flag 1 ),…,(t jL ,t jR ,flag j ),…,(t mL ,t mR ,flag m )};
Wherein (t) jL ,t jR ,flag j ) For the j-th state interval, t jL T is the start point of the interval jR As the end point, flag j The interval =0 is a stationary interval, flag j The interval is referred to as a fluctuation interval.
6. The method of claim 5, wherein when the value of i=i+1 in step d or step e is n, t is as follows mL If the value of (2) is not null, then t mR =t n Ending the extraction process of the key points; for example t mL If the value of (1) is null, then t mL =t mR =t n ,flag m =0, ending the extraction process of the key points.
7. The method for fast compressing time series data based on rounding transformation according to claim 5, wherein said compressing the original time series data comprises:
step 1: sequentially extracting a state section (t jL ,t jR ,flag j );
Step 2: for example, a flag j If the value of (2) is 0, then calculate X in time interval t jL ,t jR ]The average value of each data value in the data is taken as the compressed sampling value, and the time stamp is set as t jR The method comprises the steps of carrying out a first treatment on the surface of the For example, a flag j If the value of (2) is 1, X is set to be within the time interval t jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value;
step 3: saving the compressed numerical value to a sequence Z, and setting j=j+1;
step 4: and (3) repeating the steps 1 to 3 until the step S is traversed, and obtaining the compressed sequence Z of X.
8. The method for rapid compression of time series data based on rounding transformation as claimed in claim 7, wherein in step 2, a section sequence length threshold L is set as a flag j Is 0 and sequence X is in time interval t jL ,t jR ]When the length of the subsequence in the sequence is less than or equal to L, X is in a time interval [ t ] jL ,t jR ]Original value of each data value in the streamAs compressed sample values, while preserving the time stamp of the original value.
9. A rounding transform-based time series data rapid compression system, comprising:
the data acquisition module is used for acquiring time sequence data to be compressed;
the data compression module is used for receiving the time series data to be compressed and compressing the time series data;
and the data storage module is used for receiving the compressed data and storing the compressed data into a specified database.
10. The system of claim 9, wherein the data compression module comprises:
the configuration module is used for configuring a transformation base number for the time sequence data according to the variable types;
the transformation module is used for traversing each numerical value in the sequence and carrying out rounding transformation to obtain a transformed sequence;
the set acquisition module is used for extracting state change key points from the transformed sequence and dividing interval states to obtain an interval state set containing the key points;
and the compression module is used for compressing the original time sequence data according to the interval state set containing the key points to obtain compressed sequence data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311703829.3A CN117851159A (en) | 2023-12-12 | 2023-12-12 | Time sequence data rapid compression method and system based on rounding transformation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311703829.3A CN117851159A (en) | 2023-12-12 | 2023-12-12 | Time sequence data rapid compression method and system based on rounding transformation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117851159A true CN117851159A (en) | 2024-04-09 |
Family
ID=90530085
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311703829.3A Pending CN117851159A (en) | 2023-12-12 | 2023-12-12 | Time sequence data rapid compression method and system based on rounding transformation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117851159A (en) |
-
2023
- 2023-12-12 CN CN202311703829.3A patent/CN117851159A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110995273B (en) | Data compression method, device, equipment and medium for power database | |
CN113590645A (en) | Searching method, searching device, electronic equipment and storage medium | |
CN110335058B (en) | Sample generation method and device of user satisfaction prediction model | |
CN110162518B (en) | Data grouping method, device, electronic equipment and storage medium | |
CN113723618B (en) | SHAP optimization method, equipment and medium | |
CN116010688A (en) | User behavior label identification method, system, equipment and storage medium | |
CN114564345A (en) | Server abnormity detection method, device, equipment and storage medium | |
CN113886821A (en) | Malicious process identification method and device based on twin network, electronic equipment and storage medium | |
CN116756522B (en) | Probability forecasting method and device, storage medium and electronic equipment | |
CN112419312B (en) | Similar house source information detection method and device, electronic equipment and readable medium | |
CN113553309A (en) | Log template determination method and device, electronic equipment and storage medium | |
CN113129122A (en) | Financial risk early warning auditing method and device, electronic equipment and storage medium | |
CN117851159A (en) | Time sequence data rapid compression method and system based on rounding transformation | |
CN114882334B (en) | Method for generating pre-training model, model training method and device | |
CN112200194B (en) | Formula identification method and device, electronic equipment and storage medium | |
CN115146641A (en) | NER noisy training method and device | |
CN110032432A (en) | The decompressing method and device of the compression method and device of example, example | |
CN114358581A (en) | Method and device for determining abnormal threshold of performance index, equipment and storage medium | |
CN115186738A (en) | Model training method, device and storage medium | |
US11442650B2 (en) | Generating predicted usage of storage capacity | |
CN110852077B (en) | Method, device, medium and electronic equipment for dynamically adjusting Word2Vec model dictionary | |
CN110674839B (en) | Abnormal user identification method and device, storage medium and electronic equipment | |
CN110674020B (en) | APP intelligent recommendation method and device and computer readable storage medium | |
CN113312619A (en) | Malicious process detection method and device based on small sample learning, electronic equipment and storage medium | |
CN112686330A (en) | KPI abnormal data detection method and device, storage medium and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |