CN110890956A

CN110890956A - Improved data blocking method for key data stream

Info

Publication number: CN110890956A
Application number: CN201911057388.8A
Authority: CN
Inventors: 高明; 罗锦; 焦海; 周慧颖; 应丽莉
Original assignee: Zhejiang Gongshang University
Current assignee: Zhejiang Gongshang University
Priority date: 2019-10-31
Filing date: 2019-10-31
Publication date: 2020-03-17
Anticipated expiration: 2039-10-31
Also published as: CN110890956B

Abstract

The invention discloses an improved data blocking method for key data streams, which is an acceleration mechanism based on a software defined wide area network. The invention comprehensively uses the fixed block algorithm, the Bloom Filter algorithm and the MD5 algorithm, and can meet the different accelerated transmission requirements of different data streams in the SD-WAN, the functions of load balancing of network flow, data stream classification and the like. The method can adopt different acceleration strategies and scheduling schemes aiming at different data flows so as to meet the requirements of users and realize the maximization of the network utilization rate.

Description

Improved data blocking method for key data stream

Technical Field

The invention belongs to the technical field of network communication, and particularly relates to an improved data blocking method for a key data stream

Background

The combination of the SDN technology and the WAN transmission improves the network transmission capability, and if the improved WAN acceleration technology is integrated into the SD-WAN, the transmission efficiency will be greatly improved for large file transmission or repeated data transmission. The traditional wide area network acceleration mode is not only unnecessary to accelerate a plurality of data streams with low QoS requirements, but also influences the transmission of some important data streams, so that the transmission quality of key data streams can not be ensured.

Disclosure of Invention

An improved data blocking method for critical data streams, comprising the steps of:

the method comprises the following steps that (1) data transmitted in the wide area network are divided into two types: critical data streams and non-critical data streams.

The key data stream class comprises a plurality of data streams of different types, namely each stream in the key data streams has certain QoS requirements, and the data streams are classified into one class only and adopt the same acceleration strategy.

The non-critical data flow is a data flow without Qos requirement.

Step (2) inputting a key data stream, carrying out size detection on the key data stream, if the data is greater than 4KB, executing step (3), otherwise, not carrying out any processing on the key data stream;

step (3) equally dividing the key data stream into data blocks with the size of 4KB by adopting a technology similar to fixed block division, and only recording the position of each block point;

step (4) using a sliding window with the size of 256 bytes to detect from the position of each block point; the implementation method is as the step (5)

Step (5) calculating the MD5 fingerprint value in the sliding window, using the MD5 fingerprint value as the input of a Bloom Filter, if the fingerprint value passes through the Bloom Filter, indicating that the data block is a high-frequency data block, executing step (6), and if the fingerprint value does not pass through the Bloom Filter, executing step (7);

step (6) detecting the MD5 fingerprint value of the data block in a repeated data base, if so, executing step (8), otherwise, executing step (9);

the repeated data base is an original data block corresponding to each MD5 value, when the MD5 value calculated by the key data stream compression module is searched in the repeated data base, if the searching is successful, the data block is the repeated data block, and the corresponding label index is found in the repeated data base for replacement.

Step (7), the sliding window is moved backwards by one byte, and the step (3) is executed until the next partitioning point is met;

step (8) indicating that the data block is a repeated data block, replacing the data block with the index value of the data block in the repeated data base, and transmitting;

step (9), the data block is not a repeated data block, but belongs to a high-frequency data block and needs to be added into a repeated data base;

and (10) repeating the step (5) until the data flow is ended.

The invention has the following beneficial effects:

the invention comprehensively uses the fixed block algorithm, the Bloom Filter algorithm and the MD5 algorithm, and can meet the different accelerated transmission requirements of different data streams in the SD-WAN, the functions of load balancing of network flow, data stream classification and the like. The method can adopt different acceleration strategies and scheduling schemes aiming at different data flows so as to meet the requirements of users and realize the maximization of the network utilization rate. The concrete embodiment is as follows:

(1) firstly, the speed of fixed blocking is far higher than that of a CDC algorithm, and no substantial blocking is carried out; (2) the algorithm adopts the MD5 algorithm to calculate the hash value of the data block, the calculation speed of the MD5 algorithm is about 227MB/S, the calculation speed of the SHA-1 algorithm is only 83MB/S, the calculation speed of the MD5 algorithm is about three times faster than that of the SHA-1 algorithm, and the method is very suitable for accelerating the wide area network; (3) the method only needs to perform substantial blocking on original data once, does not need to calculate the Rabin fingerprint value and then calculate the SHA-1 value like CDC algorithm, only needs to calculate the MD5 value, and is used as the input of a Bloom Filter and the retrieval value of a repeated database, thereby saving much time and space consumption; (4) the whole calculation process of the method is not complex, and the burden on the system is far less than that of a data coding mode requiring complex coding.

Drawings

FIG. 1 is a flow diagram of a compressed key data stream;

Detailed Description

The invention is further illustrated by the following figures and examples.

The non-critical data flow is a data flow without Qos requirement.

step (4) using a sliding window with the size of 256 bytes to detect from the position of each block point; the method is realized as the step (5);

and (10) repeating the step (5) until the data flow is ended.

Claims

1. An improved data blocking method for critical data streams, comprising the steps of:

The non-critical data flow is a data flow without Qos requirement.

step (4) using a sliding window with the size of 256 bytes to detect from the position of each block point;

and (10) repeating the step (5) until the data flow is ended.