WO2024021491A1

WO2024021491A1 - Data slicing method, apparatus and system

Info

Publication number: WO2024021491A1
Application number: PCT/CN2022/141819
Authority: WO
Inventors: 刘利; 姚栋; 赵真; 李丽; 赵龙飞; 杨思源
Original assignee: 天翼云科技有限公司
Priority date: 2022-07-29
Filing date: 2022-12-26
Publication date: 2024-02-01
Also published as: CN115470186A

Abstract

A data slicing method, apparatus and system. The data slicing method comprises: acquiring a target data stream to be sliced (S21); acquiring from the target data stream a target character at a slicing starting position, and searching a preset array for a numerical value corresponding to the target character, wherein in the preset array, the numerical values corresponding to at least some characters are different (S22); and according to the numerical value corresponding to the target character, determining a slicing length, and starting from the slicing starting position, slicing the target data stream according to the slicing length (S23). The method can improve the deduplication rate.

Description

A data slicing method, device and system

Technical field

The invention relates to the field of performance testing, and in particular to a data slicing method, device and system.

Background technique

With the maturity and widespread application of data backup technology, the amount of backup data has shown explosive growth in recent years. Research shows that backup data in large enterprises is growing at a rate of 40% to 60% every year, and the size of backup data in many companies will double or more every year. There are many identical data blocks in these backup data, which greatly increases the enterprise's cost and waste of storage space.

Currently, part of the duplicate data can be deleted through data deduplication technology to reduce storage capacity. Among them, data deduplication technology refers to slicing the data stream or file according to a certain method (also known as dicing), and performs hash calculations on data blocks to find the same data blocks for deletion. Data deduplication technology can compress duplicate data in the storage system and reduce storage capacity. This technology is currently widely used in backup systems.

technical problem

Currently, some technologies use fixed-length slicing methods to slice data streams. Among them, fixed-length slicing means defining a fixed slice length (for example, 2 bytes is a data block), and then dividing the data stream into data blocks of the same length for storage according to the defined slice length. This slicing method has a low deduplication rate for data streams in some new and deleted scenarios. For example, compared to data stream B, data stream A only adds a new character at the front end of data stream B. However, when the two data streams are sliced using the fixed-length slicing method, the data blocks cut out are quite different. , the deduplication rate is not high.

Technical solutions

In view of this, embodiments of the present invention provide a data slicing method, a data slicing device, a data slicing system and a computer-readable storage medium, which can improve the data deduplication rate.

In one aspect, the present invention provides a data slicing method, which method includes:

Get the target data stream to be sliced;

Obtain the target character at the starting position of the slice from the target data stream, and search for the numerical value corresponding to the target character from the preset array, wherein in the preset array, the numerical value corresponding to at least some characters is not identical; and

According to the numerical value corresponding to the target character, the slice length is determined, and starting from the starting position of the slice, the target data stream is sliced according to the slice length.

In some embodiments, searching for a value corresponding to the target character from a preset array includes:

Search the character number of the target character from a preset character table, where the preset character table includes a correspondence between characters and character numbers, and different characters have different character numbers;

Using the character number of the target character as an index, search for the value corresponding to the target character from the preset array.

In some embodiments, determining the slice length according to the numerical value corresponding to the target character includes:

Perform operations on the numerical values corresponding to the target characters to obtain the corresponding intermediate values;

If the obtained intermediate value is less than the slice length threshold, the following operations are performed sequentially for the characters after the slice starting position until the slice end position is determined, and the length between the slice end position and the slice start position is taken as The slice length:

Find the value corresponding to the character from the preset array;

Perform operations on the found values to obtain the corresponding intermediate values;

If the obtained intermediate value is greater than or equal to the slice length threshold, the position of the character is determined as the end position of the slice.

In some embodiments, for any value found from the preset array, the corresponding intermediate value is determined based on the following method:

Perform an XOR operation on this value and the value of the preset variable to obtain the first intermediate value;

Subtract a preset value from the first intermediate value to obtain a second intermediate value;

Perform an XOR operation on the first intermediate value and the second intermediate value to obtain the corresponding intermediate value.

In some embodiments, after each calculation of the first intermediate value, the method further includes:

Update the value of the preset variable to the first intermediate value, so that the next time the value of the preset variable is XORed with the value found from the preset array, based on the updated value The value of the preset variable is XORed with the value found in the preset array.

In some embodiments, after obtaining the target data stream to be sliced, the method further includes:

Record the position of the last character in the target data stream as the character end position;

For any character in the target data stream, when the intermediate value corresponding to the character is less than the fragment length threshold, determine whether the position of the character is the end position of the character, and if so, move the character The length between the end position and the starting position of the slice is used as the slice length.

In some embodiments, after calculating the numerical value corresponding to the target character to obtain the corresponding intermediate value, the method further includes:

If the obtained intermediate value is greater than or equal to the slice length threshold, the target character is used as a data block to be cut, and the target data stream is sliced.

On the other hand, the present invention also provides a data slicing device, which includes:

Data acquisition module, used to obtain the target data stream to be sliced;

A search module, configured to obtain the target character at the starting position of the slice from the target data stream, and search for the numerical value corresponding to the target character from the preset array, wherein in the preset array, at least part of The characters correspond to different numerical values; and

A slicing module, configured to determine the slice length according to the numerical value corresponding to the target character, and slice the target data stream according to the slice length starting from the starting position of the slice.

Another aspect of the present invention also provides a computer-readable storage medium. The computer-readable storage medium is used to store a computer program. When the computer program is executed by a processor, the method as described above is implemented.

On the other hand, the present invention also provides a data slicing system. The data slicing system includes a processor and a memory. The memory is used to store a computer program. When the computer program is executed by the processor, the above-mentioned steps are implemented. Methods.

beneficial effects

In some embodiments of the present application, the target character at the starting position of the slice is obtained from the target data stream, and the value corresponding to the target character is searched from the preset array, and then the slice length is determined based on the value corresponding to the target character, to slice the target data stream. In this way, the target data stream can be sliced into variable lengths, which can reduce the impact of newly added and deleted characters on data slicing in scenarios such as data addition and deletion, effectively improving the deduplication rate.

Description of drawings

The features and advantages of the present invention will be more clearly understood by referring to the accompanying drawings, which are schematic and should not be construed as limiting the invention in any way, in which:

Figure 1 shows a schematic diagram of data slicing in some technologies;

Figure 2 shows a schematic flowchart of a data slicing method provided by an embodiment of the present application;

Figure 3 shows a schematic diagram of a target data flow provided by an embodiment of the present application;

Figure 4 shows a schematic diagram of data slicing provided by one embodiment of the present application;

Figure 5 shows a flow chart of data slicing provided by an embodiment of the present application;

Figure 6 shows a flow chart of data slicing provided by another embodiment of the present application;

Figure 7 shows a flow chart of data slicing provided by another embodiment of the present application;

Figure 8 shows a schematic module diagram of a data slicing device provided by an embodiment of the present application;

Figure 9 shows a schematic diagram of a data slicing system provided by an embodiment of the present application.

Embodiments of the invention

In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative efforts fall within the scope of protection of the present invention.

See Figure 1 for a schematic diagram of data slicing in some technologies. In Figure 1, compared to data stream A, data stream B has new characters in the gray box. When slicing data stream B and data stream A using fixed-length slicing (every 3 characters is a data block), affected by the new characters in data stream B, the data stream B and data stream A are cut out Among the data blocks, there are fewer identical data blocks. The storage system needs to allocate storage space for data flow A and data flow B respectively. Although there are more identical characters in data stream B and data stream A, the deduplication rate is low and no compressed storage space is used.

To this end, this application provides a data slicing method that can improve the deduplication rate. The data slicing method provided in this application can be applied to electronic devices. Electronic devices include but are not limited to tablets, desktop computers, laptops, and servers. Please refer to Figure 2, which is a schematic flow chart of a data slicing method according to an embodiment of the present application. The data slicing method includes steps S21 to S23.

Step S21: Obtain the target data stream to be sliced.

In some embodiments, the target data stream includes content to be stored in the storage system, such as a file data stream and a video image data stream.

Step S22: Obtain the target character at the starting position of the slice from the target data stream, and search for the value corresponding to the target character from the preset array, where at least some of the characters in the preset array have different values.

In some embodiments, the slicing starting position refers to the starting position when slicing the target data stream. For a target data stream that has not yet started slicing, the starting position of the slice can be the position of the first character of the target data stream. For a target data stream that has started slicing, the slicing starting position may be the position of the first character in the target data stream that has not yet been sliced after the last slicing is completed. For ease of understanding, please refer to FIG. 3 , which is a schematic diagram of a target data flow provided by an embodiment of the present application. In Figure 3, it is assumed that the first character on the left is the starting position of the target data stream, and the last character on the right is the end position of the target data stream. When slicing the target data stream has not yet started, the starting position of the slicing can be the position of the first character on the left; when slicing the target data stream has already started, for example, the last slice is from the dotted line If cutting is performed, then the next time the slice is sliced, the starting position of the slice can be the position of the first character after the dotted line (that is, the character o).

In some embodiments, the preset array may be a random array generated in a random manner. Consider that in computers, a character is represented by 8 bits (8bit). 8 bits can represent 256 different characters, so the default array can include 256 values. Each numerical value corresponds to one of the above 256 characters. That is, for each character in the target data stream, a corresponding value can be found in the preset array. For example, character A corresponds to the value 20 in the preset array, and character B corresponds to the value 15 in the preset array.

In some embodiments, after the preset array is generated in a random manner, the preset array can be saved. For characters in different target data streams, the values corresponding to the characters can be found in the same preset array. That is, only one preset array can be generated, and there is no need to generate separate preset arrays for different target data streams.

In some embodiments, when searching for the numerical value corresponding to the target character from the preset array, the character number of the target character can be searched from the preset character table, where the preset character table includes the correspondence between characters and character numbers, Different characters have different character numbers. Then the character number of the target character can be used as an index to find the value corresponding to the target character from the preset array.

The preset character table may be an ASCII code table. The character number can be the ASCII code of the character in the ASCII code table. Of course, it is understandable that as long as there is a corresponding relationship between characters and character numbers in the character table, it does not necessarily have to be an ASCII code table. For example, the character table can also be pre-established by technical personnel. In the character table, technicians can assign each character a character number that is different from the ASCII code.

In some embodiments, after the character number of the target character is found from the character table, the character number can be used as the position information of the numerical value to search for the numerical value corresponding to the target character in the preset array. For example, assuming the character number is 20, the 20th value in the default array is used as the value corresponding to the target character.

Step S23: Determine the slice length according to the numerical value corresponding to the target character, and slice the target data stream according to the slice length starting from the starting position of the slice.

In some embodiments, the value corresponding to the target character can be directly used as the slice length, that is, starting from the starting position of the slice, the target data stream is sliced according to the length defined by the value corresponding to the target character. Specifically, the value corresponding to the target character can be used as the number of characters to be sliced. For ease of understanding, please refer to Figure 3. For example, assuming that the target character in Figure 3 is the first character i on the left, and the value corresponding to the target character is 5, then starting from the target character i, a data block containing 5 characters can be cut out from the target data stream ( i.e. itwea). Of course, the value corresponding to the target character can also be used as the number of bits to be sliced. For example, assuming that the target character in Figure 3 is the first character i on the left, and the value corresponding to the target character is 56, then starting from the target character i, a data block containing 7 characters can be cut out from the target data stream. (i.e. itweasc).

In some embodiments, since the values corresponding to at least some characters in the preset array are different, and under normal circumstances, the characters in the target data stream are not completely the same, the length of the cut data block is also different. It should not be exactly the same, that is, when slicing the target data stream, the variable-length slicing method is used. And since it is a variable-length slice, it is possible to solve the problem described in Figure 1. Referring to FIG. 4 , a schematic diagram of data slicing is provided for an embodiment of the present application. Data stream B in Figure 4 is similar to data stream A. The characters in the gray box are new characters in data stream B compared to data stream A. Since this application is a variable-length slice, for data stream A and data stream B, it is possible to separately cut the newly added characters in data stream B into a data block. For example, assuming that the corresponding values of characters i, a, and o in the preset array are all 3, the corresponding value of character e in the preset array is 1, and the corresponding value of character 0 in the preset array is 2, then according to this application method, data stream A and data stream B can be cut into data blocks as shown in Figure 4. As can be seen from Figure 4, data flow A and data flow B jointly include data blocks itw, asc, old, ayi, and, then during data storage, only one of these data blocks can be saved. In this way, the purpose of compressing storage space and improving deduplication rate is achieved.

Furthermore, consider that in some scenarios, the lengths of different target data streams may vary greatly. For example, some target data streams include a larger number of characters, and some target data streams include a smaller number of characters. If the value corresponding to the target character is large, it is more suitable to slice the target data stream with a longer length. However, for a target data stream with a shorter length, there may be a problem that it cannot be sliced. For example, assume that the target data stream includes 5 characters, but the value corresponding to the target character is 10 (representing 10 characters). In this case, it may not be possible to slice the target data stream with a short length. On the contrary, if the value corresponding to the target character is small, in this case, although the target data stream with a shorter length can be sliced, for the target data stream with a longer length, the cut data blocks may be too small. Too many questions. In view of the above description, in some embodiments of the present application, the slice length may be determined based on the following method.

Specifically, the numerical value corresponding to the target character can be operated to obtain the corresponding intermediate value. If the obtained intermediate value is less than the slice length threshold, perform the following operations on the characters after the slice start position in sequence until the slice end position is determined, and the length between the slice end position and the slice start position is used as the slice length:

Find the value corresponding to the character from the preset array;

In some embodiments, for any value found from the preset array, the corresponding intermediate value can be determined based on the following method:

Subtract the preset value from the first intermediate value to obtain the second intermediate value;

In some embodiments, after each calculation of the first intermediate value, the value of the preset variable can also be updated to the first intermediate value, so that the value of the preset variable can be compared with the value found from the preset array next time. When performing an XOR operation on a value, the XOR operation is performed on the value of the updated preset variable and the value found in the preset array.

In some embodiments, after obtaining the target data stream to be sliced, the position of the last character in the target data stream may also be recorded as the character end position. For any character in the target data stream, when the corresponding intermediate value of the character is less than the slice length threshold, determine whether the position of the character is the end position of the character. If so, the distance between the end position of the character and the start position of the slice is length as slice length.

In some embodiments, after calculating the numerical value corresponding to the target character and obtaining the corresponding intermediate value, if the obtained intermediate value is greater than or equal to the slice length threshold, the target character can be used as a data block to be cut, and the target data can be The stream is sliced.

For ease of understanding, refer to FIG. 5 , which is a flow chart of data slicing provided in an embodiment of the present application. In Figure 5, the first character on the left is the target character i. Assume that the preset variable is x, and the initial value of the preset variable x is one of the values in the preset array, or the initial value of the preset variable x is a randomly generated or specified value, the film length threshold is slen, and the target data The position of the last character in the stream is max, the value corresponding to character i is i', the value corresponding to character t is t', and the value corresponding to character w is w', The corresponding value of character e is e’, the position of character i is id, the position of character t is td, the position of character w is wd, and the position of character e is ed.

First, perform operations on the numerical value corresponding to the character i.

1) The value corresponding to i can be XORed with the value of the preset variable x to obtain the first intermediate value corresponding to the character i. In this embodiment, after dividing the value of the preset variable x by 2, perform an XOR operation on the divided value and the value corresponding to i to obtain the first intermediate value, and update the value of the preset variable x to First intermediate value. This process corresponds to x=x>>1)^i’ in step 1 of Figure 5.

2) The first intermediate value corresponding to character i can be subtracted from the preset value to obtain the second intermediate value. In this embodiment, after subtracting 1 from the first intermediate value corresponding to character i, the second intermediate value corresponding to character i is obtained. Then perform an XOR operation on the first intermediate value and the second intermediate value to obtain the intermediate value corresponding to character i, and determine that the intermediate value is smaller than the slice length threshold slen. This process corresponds to x^(x-1)<slen in step 1 of Figure 5.

3) Determine that the position of character i is not the position max of the last character of the target data stream. This process corresponds to id!=max in step 1 of Figure 5.

Further, perform operations on the numerical value corresponding to the character t.

1) The value corresponding to t can be XORed with the value of the updated preset variable x to obtain the first intermediate value corresponding to the character t. This process corresponds to x=x>>1)^t’ in step 2 of Figure 5.

2) The first intermediate value corresponding to character i can be subtracted from the preset value to obtain the second intermediate value. Then perform an XOR operation on the first intermediate value and the second intermediate value to obtain the intermediate value corresponding to the character t, and determine that the intermediate value is smaller than the slice length threshold slen. This process corresponds to x^(x-1)<slen in step 2 of Figure 5.

3) Determine that the position of character t is not the position max of the last character of the target data stream. This process corresponds to td!=max in step 2 of Figure 5.

Further, perform operations on the numerical value corresponding to the character w.

1) The value corresponding to w can be XORed with the value of the updated preset variable x to obtain the first intermediate value corresponding to the character w. This process corresponds to x=x>>1)^w’ in step 3 of Figure 5.

2) The first intermediate value corresponding to the character w can be subtracted from the preset value to obtain the second intermediate value. Then perform an XOR operation on the first intermediate value and the second intermediate value to obtain the intermediate value corresponding to the character w, and determine that the intermediate value is smaller than the slice length threshold slen. This process corresponds to x^(x-1)<slen in step 3 of Figure 5.

3) Determine that the position of character w is not the position max of the last character of the target data stream. This process corresponds to wd!=max in step 3 of Figure 5.

Further, the numerical value corresponding to the character e is calculated.

1) The value corresponding to e can be XORed with the value of the updated preset variable x to obtain the first intermediate value corresponding to the character e. This process corresponds to x=x>>1)^e’ in step 4 of Figure 5.

2) The first intermediate value corresponding to the character e can be subtracted from the preset value to obtain the second intermediate value. Then perform an XOR operation on the first intermediate value and the second intermediate value to obtain the intermediate value corresponding to the character e, and determine that the intermediate value is greater than the slice length threshold slen. This process corresponds to x^(x-1)>slen in step 4 of Figure 5.

3) Determine that the position of character e is not the position max of the last character of the target data stream. This process corresponds to ed!=max in step 4 of Figure 5.

After calculating the numerical value of character e, since the obtained intermediate value is greater than the slice length threshold slen, the position of character e determines the end position of the slice, and then the value between character e and character i (including character e and character i) is The length is used as the slice length to slice the target data stream. Here, since the slice length should be 4 (meaning 4 characters). In this way, itwe can be used as a data block to slice the target data stream. Then the position of character a after character e can be used as the starting position of slicing, and character a can be used as the target character, and the above steps can be performed to continue slicing the target data stream.

Continuing to refer to FIG. 6 , a flow chart of data slicing is provided for another embodiment of the present application. In Figure 6, after calculating the value corresponding to the target character i, the obtained intermediate value is greater than the slice length threshold slen. At this time, the target character i is treated as a separate data block and the target data stream is sliced.

Continuing to refer to FIG. 7 , a flow chart of data slicing is provided for another embodiment of the present application. In Figure 7, it is assumed that the value corresponding to character d is d’, and the position of character d is dd. As can be seen from Figure 7, assuming that the target data stream is sliced starting from character c, after reaching the position of character d according to a process similar to Figure 5, the value corresponding to character d is calculated, and the obtained intermediate value is less than the slice length Threshold slen, but the position of character d is the position max of the last character of the target data stream, then the character d can be used as the end position of the character, and the character cold can be used as a data block to cut the target data stream.

In some embodiments of the present application, the target character at the starting position of the slice is obtained from the target data stream, and the value corresponding to the target character is searched from the preset array, and then the slice length is determined based on the value corresponding to the target character, to slice the target data stream. In this way, the target data stream can be sliced into variable lengths, which can reduce the impact of added and deleted characters on data slicing in scenarios such as data addition and deletion, effectively improving the deduplication rate.

Please refer to FIG. 8 , which is a schematic module diagram of a data slicing device according to an embodiment of the present application. Data slicing devices include:

Data acquisition module, used to obtain the target data stream to be sliced;

The search module is used to obtain the target character at the starting position of the slice from the target data stream, and to find the value corresponding to the target character from the preset array, where at least some of the characters in the preset array have different values. ;and

The slicing module is used to determine the slice length according to the value corresponding to the target character, and slice the target data stream according to the slice length starting from the starting position of the slice.

Please refer to Figure 9, which is a schematic diagram of a data slicing system provided by an embodiment of the present application. The data slicing system includes a processor and a memory. The memory is used to store computer programs. When the computer program is executed by the processor, the above-mentioned data slicing method is implemented.

The processor may be a central processing unit (Central Processing Unit, CPU). The processor can also be other general-purpose processors, Digital Signal Processor (DSP), Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other Chips such as programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or combinations of these types of chips.

As a non-transitory computer-readable storage medium, the memory can be used to store non-transitory software programs, non-transitory computer executable programs and modules, such as program instructions/modules corresponding to the methods in the embodiments of the present invention. The processor executes various functional applications and data processing of the processor by running non-transient software programs, instructions and modules stored in the memory, that is, implementing the method in the above method implementation.

The memory may include a program storage area and a data storage area, where the program storage area may store an operating system and an application program required for at least one function; the data storage area may store data created by the processor, etc. In addition, the memory may include high-speed random access memory and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, the memory may optionally include memory located remotely from the processor, and these remote memories may be connected to the processor via a network. Examples of the above-mentioned networks include but are not limited to the Internet, intranets, local area networks, mobile communication networks and combinations thereof.

One embodiment of the present application also provides a computer-readable storage medium, which is used to store a computer program. When the computer program is executed by a processor, the above-mentioned data slicing method is implemented.

Although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art can make various modifications and variations without departing from the spirit and scope of the invention. Such modifications and variations are covered by the appended claims. within the limited scope.

Claims

A data slicing method, characterized in that the method includes:

Get the target data stream to be sliced;

Obtain the target character at the starting position of the slice from the target data stream, and search for the numerical value corresponding to the target character from the preset array, wherein in the preset array, the numerical value corresponding to at least some characters is not are the same; and determine the slice length according to the numerical value corresponding to the target character, and slice the target data stream according to the slice length starting from the starting position of the slice.
The method of claim 1, wherein searching for a value corresponding to the target character from a preset array includes:

Search the character number of the target character from a preset character table, where the preset character table includes a correspondence between characters and character numbers, and different characters have different character numbers;

Using the character number of the target character as an index, search for the value corresponding to the target character from the preset array.
The method of claim 1, wherein determining the slice length according to the numerical value corresponding to the target character includes:

Perform operations on the numerical values corresponding to the target characters to obtain the corresponding intermediate values;

If the obtained intermediate value is less than the slice length threshold, the following operations are performed sequentially for the characters after the slice starting position until the slice end position is determined, and the length between the slice end position and the slice start position is taken as The slice length:

Find the value corresponding to the character from the preset array;

Perform operations on the found values to obtain the corresponding intermediate values;

If the obtained intermediate value is greater than or equal to the slice length threshold, the position of the character is determined as the end position of the slice.
The method according to claim 3, characterized in that, for any value found from the preset array, the corresponding intermediate value is determined based on the following method:

Perform an XOR operation on this value and the value of the preset variable to obtain the first intermediate value;

Subtract a preset value from the first intermediate value to obtain a second intermediate value;

Perform an XOR operation on the first intermediate value and the second intermediate value to obtain the corresponding intermediate value.
The method of claim 4, wherein after each calculation of the first intermediate value, the method further includes:

Update the value of the preset variable to the first intermediate value, so that the next time the value of the preset variable is XORed with the value found from the preset array, based on the updated value The value of the preset variable is XORed with the value found in the preset array.
The method of claim 3, wherein after obtaining the target data stream to be sliced, the method further includes:

Record the position of the last character in the target data stream as the character end position;

For any character in the target data stream, when the intermediate value corresponding to the character is less than the fragment length threshold, determine whether the position of the character is the end position of the character, and if so, move the character The length between the end position and the starting position of the slice is used as the slice length.
The method according to claim 3, characterized in that, after calculating the numerical value corresponding to the target character and obtaining the corresponding intermediate value, the method further includes:

If the obtained intermediate value is greater than or equal to the slice length threshold, the target character is used as a data block to be cut, and the target data stream is sliced.
A data slicing device, characterized in that the device includes:

Data acquisition module, used to obtain the target data stream to be sliced;

A search module, configured to obtain the target character at the starting position of the slice from the target data stream, and search for the numerical value corresponding to the target character from the preset array, wherein in the preset array, at least part of The numerical values corresponding to the characters are different; and a slicing module is used to determine the slice length according to the numerical value corresponding to the target character, and start from the starting position of the slice and perform the processing on the target data stream according to the slice length. slice.
A computer-readable storage medium, characterized in that the computer-readable storage medium is used to store a computer program. When the computer program is executed by a processor, the method according to any one of claims 1 to 7 is implemented.
A data slicing system, characterized in that the data slicing system includes a processor and a memory, and the memory is used to store a computer program. When the computer program is executed by the processor, it implements claims 1 to 7 any of the methods described.