CN111680012A

CN111680012A - Data compression method for monitoring data of heating system

Info

Publication number: CN111680012A
Application number: CN202010532632.8A
Authority: CN
Inventors: 刘希闻; 黄振群; 王松寒; 王忠言; 司瑞才; 耿娜; 王春玲; 金春林; 李佳; 姚卓宏; 周驰; 夏志
Original assignee: STATE GRID JILINSHENG ELECTRIC POWER SUPPLY Co ELECTRIC POWER RESEARCH INSTITUTE; State Grid Corp of China SGCC; Electric Power Research Institute of State Grid Jilin Electric Power Co Ltd
Current assignee: STATE GRID JILINSHENG ELECTRIC POWER SUPPLY Co ELECTRIC POWER RESEARCH INSTITUTE; State Grid Corp of China SGCC; Electric Power Research Institute of State Grid Jilin Electric Power Co Ltd
Priority date: 2020-06-12
Filing date: 2020-06-12
Publication date: 2020-09-18

Abstract

A data compression method for monitoring data of a heat supply system belongs to the technical field of data compression, and adopts a two-stage compression method for mass heat supply data.

Description

Data compression method for monitoring data of heating system

Technical Field

The invention belongs to the technical field of data compression, and particularly relates to a data compression method for monitoring data of a heating system.

Background

Data compression is a technical method for reducing the data volume to reduce the storage space and improve the transmission, storage and processing efficiency of data or reorganizing data according to a certain rule and reducing the redundancy and storage space of data on the premise of ensuring that effective information is transmitted.

The heat supply trade metering system collects data of each heat supply pipeline in real time, and performs accumulated calculation on heat consumption of each user to perform trade metering. Meanwhile, the collected data contain signals of working pressure, working temperature, flow and the like of each heat supply pipeline, and operation management personnel can monitor the data in real time and know the operation condition of the system in time. The heat supply pipelines generate massive monitoring data every day, the data are important resources of heat supply enterprises, on one hand, the data are used for trade measurement and provide calculation data for heat charge measurement, on the other hand, the change rule of the operation of the whole heat supply pipeline network can be simulated by analyzing historical data of different pipelines, and visual and accurate data basis is provided for control optimization of a heat supply network. Because heat supply trade measurement system carries out data acquisition and control to a plurality of heat supply pipelines, consequently can produce huge data volume, if not compress processing to these data, need often change the hard disk to the system, be unfavorable for the steady operation and the maintenance of system. Therefore, the data needs to be compressed, and the storage space is saved.

Currently, known data compression methods are mainly divided into two major categories, namely lossy compression and lossless compression. The lossless algorithm mainly comprises an LZ series algorithm, a BWT algorithm, a Huffman algorithm and the like, and can ensure that data is not distorted in the compression process, but generally the compression ratio is very low and the compression effect is not obvious. Lossless algorithms are commonly used in industry, and are algorithms that compress data according to their characteristics, such as boxcar method, backward slope method, PLOT algorithm, and revolving gate compression algorithm. The lossless algorithm is singly adopted, the data can be guaranteed not to be distorted, but the compression ratio is low, the storage space cannot be effectively saved, only the lossy compression is adopted, the compression effect of the data compression can be effectively improved, the compression ratio of the data is improved, and the compression effect has an improved space.

Therefore, there is a need in the art for a new technical solution to solve this problem.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the data compression method for the monitoring data of the heating system is characterized in that a two-stage compression method is adopted for mass heating data, first-stage compression is carried out on the heating data, a revolving door compression algorithm is adopted as a compression algorithm, then secondary compression is carried out on the obtained compression data, a lossless compression algorithm is adopted as the secondary compression, a Hoffman algorithm is adopted for compression, the compression algorithm is simple and easy to realize, the calculation speed is high, the compression ratio and the compression precision are high, the heating data are compressed to the maximum degree, the storage space is saved, and meanwhile the precision of the compression data is guaranteed.

A data compression method for monitoring data of a heating system is characterized by comprising the following steps: comprises the following steps which are sequentially carried out,

reading heat supply data by a system, performing primary compression, initializing compression parameters, setting a maximum time interval TLM and compressing an offset E;

reading an initial data point value y1, an initial time point value t1, compressing initial data point records, a second time data point value y2 and a second time point value t2, and obtaining the initial slope of the revolving door as follows:

upper slope k1 ═ (y2- (y1-E))/(t2-t1),

lower slope k2 ═ (y2- (y1+ E))/(t2-t 1);

step three, taking the next data point yn, and obtaining the slope between the next time point tn and the last recorded compression point:

the upper slope k1n ═ (yn- (y1-E))/(tn-t1),

lower slope k2n ═ (yn- (y1+ E))/(tn-t 1);

step four, comparing the upper slope k1 obtained in the step two and the step three with the upper slope k1n and the lower slope k2 with the lower slope k2n, wherein the upper slope k1 is smaller than the upper slope k1n, the upper slope k1 is made to be the upper slope k1n, the lower slope k2 is smaller than the lower slope k2n, and the lower slope k2 is made to be the lower slope k2n, and the point is compressed;

step five, repeating the step three, wherein tn-t1> the maximum time interval TLM, or the upper slope k1> is equal to the lower slope k2, the point is not compressed and is stored and recorded, and the data point yn is used as the starting point of a new compression section;

step six, repeating the step two to the step five to complete the first-stage compression of the data;

step seven, converting the data stream obtained after the first-stage compression in the step six into a character group;

step eight, calculating the occurrence frequency of each character in the whole data stream, and sequencing the characters according to the occurrence frequency from high to low;

and step nine, constructing an optimal Huffman tree, establishing a coding table according to the Huffman tree, searching codes of the character array according to the established Huffman tree, writing the codes into an output stream, coding the character array obtained in the step seven into a bit output string, writing the bit output string into the output stream, and combining the bit output string with the codes of the character array searched by the Huffman tree to finish the second-stage compression of the data.

The maximum time interval set in the first step is 300s, and the compression offset is 0.05.

Through the design scheme, the invention can bring the following beneficial effects: a data compression method for monitoring data of a heat supply system adopts a two-stage compression method for mass heat supply data, firstly, the heat supply data is subjected to first-stage compression, a revolving door compression algorithm is adopted as a compression algorithm, then, the obtained compressed data is subjected to secondary compression, a lossless compression algorithm is adopted as the secondary compression, and a Hoffman algorithm is adopted for compression.

Drawings

The invention is further described with reference to the following figures and detailed description:

fig. 1 is a schematic flow chart of a data compression method for monitoring data of a heating system according to the present invention.

Fig. 2 is a schematic diagram of an optimal huffman tree constructed by an embodiment of the data compression method for monitoring data of a heating system according to the present invention.

Detailed Description

A data compression method for monitoring data of a heating system, as shown in figure 1,

firstly, heating data to be compressed is obtained, whether the data is subjected to primary compression or not is judged, if the data is subjected to the primary compression, compression parameters of the data are initialized, and then the data is compressed.

And obtaining initial 2 points of data to be compressed, judging whether the time interval between the data is greater than the set maximum time interval, if so, storing, and if not, performing the next step and starting the revolving door compression test.

In the revolving door compression test, the slope calculation of the data to be compressed is carried out, the upper slope k1 and the lower slope k2 are calculated, if the upper slope k1> is equal to the lower slope k2, the point is stored, if the upper slope k1< the lower slope k2, the point is compressed, and new data points are taken for recalculation until the revolving door compression is completed.

And after the first-stage revolving door compression is finished, starting the second-stage compression, counting the frequency of numerical values in the array to be compressed, and sequencing the numerical values from high to low according to the frequency of occurrence.

And constructing an optimal Hoffman tree, coding the appeared numerical values to obtain a coding table corresponding to the numerical values, then coding the band compression data by using the coding table, outputting the coded table as a character string, outputting the coding table into the previous character string, combining the coding table into a compressed code, and storing the compressed code.

A particular embodiment includes the following steps that,

the method comprises the steps of firstly, performing first-stage compression on heat supply data according to the obtained heat supply data, judging whether the heat supply data is first-time compression data, initializing compression parameters, setting a maximum time interval TLM and compressing an offset E.

Step two, reading the initial data point with the value y1 and the time t1, the point is recorded and compressed, and calculating the initial slope of the revolving door with the second data point y2 and the time t 2:

upper slope k1 ═ (y2- (y1-E))/(t2-t1)

Lower slope k2 ═ (y2- (y1+ E))/(t2-t1)

Step three, taking the next data point yn with time tn, and calculating the slope between the next data point yn and the last recorded compression point:

upper slope k1n ═ (yn- (y1-E))/(tn-t1)

Lower slope k2n ═ (yn- (y1+ E))/(tn-t1)

And step four, comparing the sizes of the upper slope k1 with the upper slope k1n, and the lower slope k2 with the lower slope k2n, if the upper slope k1 is less than the upper slope k1n, making the upper slope k1 equal to the upper slope k1n, and if the lower slope k2 is less than the lower slope k2n, making the lower slope k2 equal to the lower slope k2n, and compressing the point and not recording the point.

And step five, repeating the step three, if tn-t1> the maximum time interval TLM, or the upper slope k1> is equal to the lower slope k2, then the point is not compressed and is stored and recorded, and meanwhile, the data point yn is used as the starting point of a new compressed segment.

And step six, repeating the step two to the step five until the first-stage compression of the data is completed.

And step seven, reading in the complete data stream after the first-stage compression, and converting the complete data stream into a character array, wherein the following array is taken as an example, and assuming that the character array aaccdbbbeeeaaa exists, the memory occupies 15 bytes.

And step eight, calculating the number of times that each character appears in the whole data stream in the step seven, and sorting the characters from high to low according to the appearance frequency, so as to obtain a-5, e-4, b-3, c-2 and d-1.

And step nine, constructing an optimal Huffman tree according to the sequence obtained in the step eight, as shown in FIG. 2.

Step ten, establishing an encoding table according to the Huffman tree: a is 0, e is 10, b is 110, d is 1110, and c is 1111.

Step eleven, searching the code of the character array according to the established Huffman tree and writing the code into an output stream, wherein the code is 0011111111111011011011010101010000.

Step twelve, coding the total number of words into bit output strings and writing the bit output strings into an output stream, recording the middle node as 0, recording the leaf nodes as 1, and constructing a word list as 0101100001010110010101011000100101100100101100011. And combining with the encoding in the step eleven to obtain the data with the length of 12 bytes after compression, completing the secondary compression, wherein the compression rate is 80%.

The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A data compression method for monitoring data of a heating system is characterized by comprising the following steps: comprises the following steps which are sequentially carried out,

upper slope k1 ═ (y2- (y1-E))/(t2-t1),

lower slope k2 ═ (y2- (y1+ E))/(t2-t 1);

the upper slope k1n ═ (yn- (y1-E))/(tn-t1),

lower slope k2n ═ (yn- (y1+ E))/(tn-t 1);

2. A data compression method for heating system monitoring data according to claim 1, characterized by: the maximum time interval set in the first step is 300s, and the compression offset is 0.05.