CN102545909A - Greedy type intelligent fitting lossy data compression method - Google Patents

Greedy type intelligent fitting lossy data compression method Download PDF

Info

Publication number
CN102545909A
CN102545909A CN201110426264XA CN201110426264A CN102545909A CN 102545909 A CN102545909 A CN 102545909A CN 201110426264X A CN201110426264X A CN 201110426264XA CN 201110426264 A CN201110426264 A CN 201110426264A CN 102545909 A CN102545909 A CN 102545909A
Authority
CN
China
Prior art keywords
data
compression
point
value
make
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201110426264XA
Other languages
Chinese (zh)
Other versions
CN102545909B (en
Inventor
许峰
周文欢
张雪洁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201110426264.XA priority Critical patent/CN102545909B/en
Publication of CN102545909A publication Critical patent/CN102545909A/en
Application granted granted Critical
Publication of CN102545909B publication Critical patent/CN102545909B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a greedy type intelligent fitting lossy data compression method. If the current data point is an initial point for the first round of compression, (bU)=ak-E and (bL)=ak-E are set, and the processing to ak is finished; for any probable value of i, if (KU) is smaller than or equal to (KL)i, whether (KU)i is increased if Ui passes through ak is judged, if (KU)i is increased is increased, Ui is ensured to pass through ak; for any probable value of i, if (KU) is smaller than or equal to (KL)i, whether (KU)i is decreased if Ui passes through ak is judged, if (KU)i is decreased, Li passes is ensured to pass through ak; if for all possible values of I, (KU)i is larger than or equal to o (KL)i, ak at the moment is singly stored; and ak+1 is continuously examined, if (KU)i is larger than (KL)i is satisfied, ak+2, ak+3, ak+4 and the like are continuously examined according to steps II and III till an incongruent element exists, and the incongruent element is taken as an new initial point to start a new round of compression. The method can get rid of the influence of data noise points, and can further greatly increase the compression ratio.

Description

The match of greedy formula intelligence diminishes data compression method
Technical field
The greedy formula intelligence match that the present invention relates in a kind of field of data compression diminishes data compression method.
Background technology
The main method of real time data compression at present can be divided three classes, i.e. piecewise linearity method, vector quantization method and signal transformation method.The piecewise linearity method comprises that boxcar method (boxcar), back are to slope method (backward slope), revolving door trending method (SDT) and the online trending method of piecewise linearity (PLOT); Although these method compression ratios are high not as signal transformation method; But that its advantage is an algorithm is simple, code execution speed is fast, in process industrial, is used widely.Vector quantization method need be spent a lot of Time Calculation code books, and code book do not have versatility, and promptly the code book of a data set can not be used for other data set, because amount of calculation is too big, vector quantization method is not suitable for the process data line compression.Signal transformation method is a lot; Like Fourier transform, wavelet transformation etc., though small wave converting method also exists some actual techniques problems to need to solve at present, Fourier transform is the main means of signal processing always for a long time; But it can only picked up signal entire spectrum; And be difficult to obtain the local feature of signal, and becoming in the time of can not fully portraying, the characteristic of non-stationary signal, the lossy compression method threshold value in data is difficult to control simultaneously; The problem of adding a lot of reality has to be solved, so actual use and few.In traditional linear compression algorithm, all be only to adopt straight line to portray a certain section data in the space to be compressed generally, regardless of the concrete trend of the data in this section space; All be unalterable to approach match with straight line; And do not ignore concrete data characteristics, during the actual compression algorithm, often through investigating each data point successively; When some data point error ranges meet the compression that just continues next point when coming the error of match with straight line; Otherwise this wheel compression of terminates, the compression of beginning next round is such as traditional revolving door algorithm.This simple compress mode often causes the compression ratio of data low, and compression efficiency is low.
Summary of the invention
The present invention seeks to provides the match of a kind of greedy formula intelligence to diminish data compression method to the defective that prior art exists.
The present invention adopts following technical scheme for realizing above-mentioned purpose:
The match of the greedy formula intelligence of the present invention diminishes data compression method, it is characterized in that comprising the steps:
(1) if current data point is the starting point of a wheel compression, puts (bU)=a k-E with (bL)=a k-E finishes a kProcessing, a kBe current data point, U iAnd L iFunction expression be:
Figure BDA0000121789380000021
Figure BDA0000121789380000022
a iRepresent i element, i=0,1,2...n, t=1,2,3...n, k=1,2,3...n, wherein (KL), (KU) represent the coordinate figure of point, E representes the correction value of error amount, (bL), (bU) expression data point;
(2) to arbitrary probable value of i, if (KU)≤(KL) i, judge if let U iThrough a kWhether can increase (KU) iIf can make (KU) iIncrease and then increase (KU) iMake U iThrough a k
(3) to arbitrary probable value of i, if (KU)≤(KL) i, judge if let L iThrough a kWhether can reduce (KL) iIf can make (KL) iReduce then to reduce (KL) iMake L iThrough a k
(4) if all probable values of i are had (KU) i>(KL) i, preserve a of this moment separately kContinuation is according to investigating a K+1If meet (KU) i>(KL) iThen continue to investigate a according to (2) (3) K+2, a K+3, a K+4... until incongruent element, be the compression that starting point begins a new round with incongruent element.
After a wheel compression finishes, starting point and endpoint data that recording compressed is interval, this algorithm also need write down last a simultaneously iValue; The end point values of algorithm records neither destination county initial data, but handle U after the endpoint data iAnd L i, in the average of destination county functional value.
The present invention is in real data, and through analyzing, data to be compressed are in a certain section compression stroke; Often be not the single type that is in line variation tendency, data also have other forms of variation tendency, such as similar parabolic; And the point that often finishes certain wheel compression process is one " noise ", continues to investigate next point and can find that it also is the error that meets this wheel compression if skip this " noise ", compression algorithm that can be traditional this wheel compression that but is through with; So our algorithm is when packed data; Not only need not go the fitting data variation tendency by independent straight line, in each compression process, not only can investigate point next to be compressed simultaneously, but investigate following two points to be compressed; The influence of data noise can not only be got rid of like this, more compression ratio can be improved greatly.
Description of drawings
Fig. 1: the changing trend diagram of data;
Fig. 2: the anti-parabolic line chart of element;
Fig. 3: data compression basic flow sheet;
Fig. 4: gunzip flow chart.
Embodiment
In commercial production, the variation of data generally has different trend, thus our algorithm be exactly in each compression process, investigate below two data points to be compressed, the compression that carries out curve fitting of special processing noise, adaptive selection data variation trend.
As shown in Figure 1, after having analyzed data, roughly find in one section data segment to be compressed following 3 kinds of variation tendencies are arranged substantially: linear pattern, parabolic type, anti-parabolic type.
Wherein linear pattern is represented smooth change, and two promptly adjacent real time data differences remain constant; The parabolic type representative changes gradually and increases, and the difference between two promptly adjacent real time datas is increasing; Anti-parabolic type reduces gradually, and the difference between two promptly adjacent real time datas is more and more littler.
Traditional algorithm is just adopted in a like fashion to above three kinds of situation and is carried out match like linear pattern; If require algorithm can obtain better compression effectiveness, new algorithm must be according to variation tendency dynamic form of adjusting the expression formula of fitting function when moving of real data.In this project, we suppose that our fitting function is y=k * t a+ b (wherein t=1,2,3...n, and k, a are variable undetermined, a is value in advance)------(1)
As shown in Figure 2; From fitting function, can find out: the represented figure of function when a=1 (1) is linear pattern exactly; The represented figure of function when a=2 (1) is parabolic type exactly; The represented figure of function when a=0.5 (1) is anti-parabolic type exactly, so we might as well suppose a=1,2,0.5 3 numbers at this.
For the threshold value of controlling compression is a compressed error, still use two each compression of taking turns of fan gate control up and down in the actual algorithm:
As shown in the figure, be starting point with data point 1.Range data point 1 has two fulcrums for the place of E in vertical direction.Corresponding two straight line or the curves that connect fulcrums and process data of each element are regarded these two straight lines or curve as two fans virtual " door " that with the fulcrum be spool.Three elements are arranged among the i, have six fan doors.This six fans door was all closed when algorithm began.Six fan doors can only be opened and can not close in the algorithm implementation.With a iRepresent the individual element of i (i=0,1,2), represent two straight lines or the describing function of curve, the then U that a is corresponding with U and l iAnd L iFunction expression be:
L i = ( KL ) i × t a i + ( bL ) U i = ( KU ) i × t a i + ( bU )
Wherein t is the moment that real time data arrives.
Like Fig. 3 and shown in Figure 4, with a 1, a 2, a 3... .a k... .a nExpression real process data suppose that current data point is a k, this algorithm is receiving processing below the work after this:
(1) if current data point is the starting point of a wheel compression, puts (bU)=a k-E with (bL)=a k-E finishes a kProcessing;
(2) to arbitrary probable value of i, if (KU)≤(KL) i, judge if let U iThrough a kWhether can increase (KU) iIf can make KU iIncrease and then increase (KU) iMake U iThrough a k
(3) to arbitrary probable value of i, if KU≤KL i, judge if let L iThrough a kWhether can reduce (KL) iIf can make KL iReduce then to reduce (KL) iMake L iThrough a k
(4) if all probable values of i are had (KU) i>(KL) i, preserve a of this moment separately k, continue according to investigating a K+1If, eligible then continue a according to (2) (3) K+2, a K+3, a K+4... have only a of working as K+1Do not meet, just with a yet K+1Begin the compression of a new round for starting point.Need be after a wheel compression finish, starting point and endpoint data that recording compressed is interval, this algorithm also need write down last a simultaneously iValue.The end point values of algorithm records neither destination county initial data, but handle U after the endpoint data iAnd L i, in the average of destination county functional value.
When reduction of data; The information of the incident that we write down according to the compression that each is taken turns in compression algorithm; The time point that comprises compression point, the matched curve parameter of the match value of compression point and the value of noise, compression point, and then utilize function that original point is restored.

Claims (2)

1. greedy formula intelligence match diminishes data compression method, it is characterized in that comprising the steps:
(1) if current data point is the starting point of a wheel compression, puts (bU)=a k-E with (bL)=a k-E finishes a kProcessing, a kBe current data point, U iAnd L iFunction expression be:
Figure FDA0000121789370000011
a iRepresent i element, i=0,1,2...n, t=1,2,3...n, k=1,2,3...n, wherein (KU), (KU) represent the coordinate figure of point, E representes the correction value of error amount, (bL), (bU) expression data point;
(2) to arbitrary probable value of i, if (KU)≤(KL) i, judge if let U iThrough a kWhether can increase (KU) iIf can make (KU) iIncrease and then increase (KU) iMake U iThrough a k
(3) to arbitrary probable value of i, if (KU)≤(KL) i, judge if let L iThrough a kWhether can reduce (KL) iIf can make (KL) iReduce then to reduce (KL) iMake L iThrough a k
(4) if all probable values of i are had (KU) i>(KL) i, preserve a of this moment separately kContinuation is according to investigating a K+1If meet (KU) i>(KL) iThen continue to investigate a according to (2) (3) K+2, a K+3, a K+4... until incongruent element, be the compression that starting point begins a new round with incongruent element.
2. greedy formula intelligence according to claim 1 match diminishes data compression method, it is characterized in that
After a wheel compression finishes, starting point and endpoint data that recording compressed is interval, this algorithm also need write down last a simultaneously iValue; The end point values of algorithm records neither destination county initial data, but handle U after the endpoint data iAnd L i, in the average of destination county functional value.
CN201110426264.XA 2011-12-19 2011-12-19 Greedy type intelligent fitting lossy data compression method Expired - Fee Related CN102545909B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110426264.XA CN102545909B (en) 2011-12-19 2011-12-19 Greedy type intelligent fitting lossy data compression method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110426264.XA CN102545909B (en) 2011-12-19 2011-12-19 Greedy type intelligent fitting lossy data compression method

Publications (2)

Publication Number Publication Date
CN102545909A true CN102545909A (en) 2012-07-04
CN102545909B CN102545909B (en) 2014-07-09

Family

ID=46351979

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110426264.XA Expired - Fee Related CN102545909B (en) 2011-12-19 2011-12-19 Greedy type intelligent fitting lossy data compression method

Country Status (1)

Country Link
CN (1) CN102545909B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110262819A (en) * 2019-06-04 2019-09-20 深圳前海微众银行股份有限公司 A kind of the model parameter update method and device of federal study
CN112260694A (en) * 2020-09-21 2021-01-22 广州中望龙腾软件股份有限公司 Data compression method of simulation file

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5790131A (en) * 1996-05-15 1998-08-04 Iterated Systems, Inc. System and method for lossy compression of data with output file size control
CN101807925A (en) * 2010-02-08 2010-08-18 南京朗坤软件有限公司 Historical data compression method based on numerical ordering and linear fitting
CN101945431A (en) * 2010-08-30 2011-01-12 京信通信系统(中国)有限公司 Lossy data compression method and lossy data compression-based digital communication system
CN102298630A (en) * 2011-08-30 2011-12-28 国电南瑞科技股份有限公司 Line style-based process data lossy compression method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5790131A (en) * 1996-05-15 1998-08-04 Iterated Systems, Inc. System and method for lossy compression of data with output file size control
CN101807925A (en) * 2010-02-08 2010-08-18 南京朗坤软件有限公司 Historical data compression method based on numerical ordering and linear fitting
CN101945431A (en) * 2010-08-30 2011-01-12 京信通信系统(中国)有限公司 Lossy data compression method and lossy data compression-based digital communication system
CN102298630A (en) * 2011-08-30 2011-12-28 国电南瑞科技股份有限公司 Line style-based process data lossy compression method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110262819A (en) * 2019-06-04 2019-09-20 深圳前海微众银行股份有限公司 A kind of the model parameter update method and device of federal study
CN110262819B (en) * 2019-06-04 2021-02-26 深圳前海微众银行股份有限公司 Method and device for updating model parameters of federated learning
CN112260694A (en) * 2020-09-21 2021-01-22 广州中望龙腾软件股份有限公司 Data compression method of simulation file

Also Published As

Publication number Publication date
CN102545909B (en) 2014-07-09

Similar Documents

Publication Publication Date Title
Tang et al. Oil price shocks and their short-and long-term effects on the Chinese economy
CN106685478B (en) Frequency hopping signal parameter estimation method based on signal time-frequency image information extraction
CN110824450A (en) Radar target HRRP robust identification method in noise environment
CN106844949B (en) Training method of bidirectional LSTM model for realizing energy-saving control of locomotive
CN102594360B (en) Method and device for computer data compression
CN102592263A (en) Image reinforcement method based on frequency domain
CN100580694C (en) Rapid multi-threshold value dividing method for gray-scale image
CN105427264A (en) Image reconstruction method based on group sparsity coefficient estimation
CN102098058B (en) Method for efficiently and linearly compressing and decompressing time-series data in real time
CN103680515A (en) Proportional adaptive filter coefficient vector updating method using coefficient reusing
CN104461551A (en) Parallel data processing based big data processing system
CN113221781B (en) Carrier signal detection method based on multitask deep convolutional neural network
CN101583147B (en) Segmental online support vector regression method applied in traffic prediction
CN105718943A (en) Character selection method based on particle swarm optimization algorithm
CN102545909B (en) Greedy type intelligent fitting lossy data compression method
CN105243241B (en) The two passage biorthogonal figure filter set designing methods based on lift structure
CN103778486A (en) Power distribution network load predication method
CN102685501A (en) Fixed-point wavelet transform method for joint photographic experts group 2000 (JPEG2000) image compression
CN102298630B (en) Line style-based process data lossy compression method
Guthrie Uncertainty and the trade-off between scale and flexibility in investment
CN103281532B (en) A kind of quantization method of adjustment based on motion strength detection and system
CN107038489B (en) Multi-target unit combination optimization method based on improved NBI method
CN105572733A (en) Seismic velocity spectrum automatic pick-up method
CN101540833B (en) Anti-interference real-time tracking method for profile of object
CN103514589A (en) Filtering method in image recovery

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Xu Feng

Inventor after: Zhou Wenhuan

Inventor after: Zhang Xuejie

Inventor after: Yang Fang

Inventor before: Xu Feng

Inventor before: Zhou Wenhuan

Inventor before: Zhang Xuejie

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: XU FENG ZHOU WENHUAN ZHANG XUEJIE TO: XU FENG ZHOU WENHUAN ZHANG XUEJIE YANG FANG

C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140709

Termination date: 20161219

CF01 Termination of patent right due to non-payment of annual fee