CN111090392A

CN111090392A - Cold and hot data separation method based on feature codes

Info

Publication number: CN111090392A
Application number: CN201911144275.1A
Authority: CN
Inventors: 付楷; 吴大畏; 李晓强
Original assignee: Yeestor Microelectronics Co ltd
Current assignee: Yeestor Microelectronics Co ltd
Priority date: 2019-11-20
Filing date: 2019-11-20
Publication date: 2020-05-01

Abstract

The invention discloses a cold and hot data separation method based on a feature code, which judges whether the written data has the feature code or not, if so, stores the data in a corresponding interval according to the feature code, if not, separates the cold and hot data according to the read-write frequency, and simultaneously evaluates and updates the feature code and adjusts the feature code in the above process. According to the cold and hot data separation method based on the feature codes, data are classified in advance, the data are stored in different regions according to classification, the data regions are divided according to the attributes of the data regions from the source, the erasing times of each block are uniform, and the service life of Nand storage equipment is prolonged.

Description

Cold and hot data separation method based on feature codes

Technical Field

The invention relates to the technical field of memories, in particular to a cold and hot data separation method based on feature codes.

Background

Currently, Nand storage devices use Nand Flash as the basic storage medium. The Nand Flash requires that blocks are erased before data is written, the writing sequence in each block has a strict sequence requirement, the minimum unit of reading and writing is a page, and when the erasing frequency (abrasion) of each block exceeds a certain value, the data is easy to lose.

In order to meet the basic characteristics of Nand and the random reading and writing requirements of upper-Layer users, a Flash Translation Layer (FTL) will implement the conversion of the process.

Based on the characteristics of Nand Flash, the FTL manages data written by a user, and when addresses are repeatedly written, an original Nand physical address is not immediately written, but a new address is selected for writing, and the old address is set as invalid. When Nand is to be written to, FTL will move some blocks of valid data away and erase the whole block to ensure enough space to store data. This process is referred to as a garbage recovery operation (GC).

The service life of the Nand storage device is affected by the wear times and wear balance, frequently written data (namely hot write data) and rarely written data are respectively stored, and the data moved away every time are less, so that the write efficiency can be improved, and the service life of the storage device can be prolonged due to the fact that the moving times are less.

In some existing separation methods, the FTL can only "predict" the cold and hot properties of the data after the written data, and such separation cannot identify the change of the "cold and hot" properties of the data, nor can determine the "cold and hot" properties of the data written for the first time.

Therefore, how to distinguish cold and hot intervals according to the cold and hot characteristics of data to realize the distinction of accurate 'cold and hot' attributes is an urgent problem to be solved.

Disclosure of Invention

The invention aims to provide a cold and hot data separation method based on a feature code, which judges whether the feature code exists in written data or not, stores the data in a corresponding interval according to the feature code if the feature code exists, separates the cold and hot data according to the read-write frequency if the feature code does not exist, and simultaneously evaluates and updates the feature code and adjusts the feature code. According to the cold and hot data separation method based on the feature codes, data are stored in a distinguishing mode, the erasing times of each block are uniform, and the service life of the Nand storage device is prolonged.

The above object of the present invention is achieved by the following technical solutions:

a cold and hot data separation method based on feature codes comprises the following steps:

s1, inputting a feature code;

s2, inputting data;

s3, judging whether the data are the data which accord with the feature codes, if so, turning to S8, and if not, entering the next step;

s4, preparing to store data;

s5, separating cold and hot data according to the read-write frequency;

s6, respectively storing cold and hot data according to the separation result;

s7, evaluating the feature code, and turning to S11;

s8, preparing to store data;

s9, separating cold and hot data according to the feature codes;

s10, respectively storing cold and hot data according to the separation result;

s11, updating the feature code rule;

and S12, ending.

The invention is further configured to: the feature codes comprise hot-write data feature codes, hot-read data feature codes and cold data feature codes.

The invention is further configured to: the feature code comprises a region start address, a region length, a data cold and hot type and a reserved field of cold data or hot data.

The invention is further configured to: in step S5, a threshold is set for the read/write frequency, and when the read/write frequency is equal to or higher than the threshold, the data is classified as hot data, and when the read/write frequency is lower than the threshold, the data is classified as cold data.

The invention is further configured to: in step S7, feature codes are extracted based on the separation result of the cold and hot data, and new feature codes are obtained.

The invention is further configured to: in step S9, the data separation according to the feature code includes the following steps:

a1, start

A2, identifying the area;

a3, judging whether the data are the data which accord with the characteristic code of the hot-written data, if so, turning to A6, and if not, entering the next step;

a4, judging whether the data is the data which accords with the characteristic code of the thermal reading data, if so, turning to A7, and if not, entering the next step;

a5, storing the data in a cold data area, and turning to A8;

a6, storing the data in a hot data area, and turning to A8;

a7, storing data in a cold data area which is checked regularly;

a8, counting the access frequency;

a9, evaluating a feature code;

and A10, ending.

The invention is further configured to: and evaluating the feature codes according to the access frequency of the data, and redefining the cold and hot features of the data.

The invention is further configured to: in step S1, the system file of the operating system is defined as hot read data.

The invention is further configured to: in step S1, the data in the frequently read/write section in the application program is defined as the hot write data.

The invention is further configured to: in step S1, the backup data is defined as cold data.

Compared with the prior art, the invention has the beneficial technical effects that:

1. according to the method, the characteristic codes are set, the cold and hot properties of the data are judged according to the characteristic codes, and the data are stored in different areas according to the data properties, so that the erasing times of each block of Nand Flash are uniform, the service life of equipment is effectively prolonged, and the reading and writing efficiency is effectively improved;

2. furthermore, according to the data property, the corresponding feature codes are given to the data, and the data property is timely modified according to the change of the situation in the storage process, so that the dynamic modification of the data property is realized;

3. furthermore, the characteristic codes are evaluated and updated in the operation process, so that the cold and hot properties of the data are updated in time according to the change of the condition, and the phenomenon that the erasing times are uneven due to the change is avoided.

Drawings

FIG. 1 is a schematic flow chart of a method for separating cold and hot data according to an embodiment of the present invention;

fig. 2 is a flowchart illustrating a method for separating cold and hot data based on feature codes according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings.

The invention relates to a cold and hot data separation method based on feature codes, which comprises the following steps as shown in figure 1:

s1, inputting a feature code;

s2, inputting data;

s4, preparing to store data;

s5, separating cold and hot data according to the read-write frequency;

s6, respectively storing cold and hot data according to the separation result;

s7, evaluating the feature code, and turning to S11;

s8, preparing to store data;

s9, separating cold and hot data according to the feature codes;

s10, respectively storing cold and hot data according to the separation result;

s11, updating the feature code rule;

and S12, ending.

Specifically, the feature code is a hot-write data feature code, a hot-read data feature code, and a cold data feature code.

The expression mode of the feature code comprises the following steps: area start address, area length, data hot and cold type, reserved field of cold data or hot data.

In one embodiment of the present application, in step S1, the system file of the operating system is defined as the hot read data.

And defining the data of the frequently read-write interval in the application program as the hot-write data.

The backup data is defined as cold data.

In one embodiment of the present application, the mobile phone installs the operating system for the first time, the operating system is installed in the interval of [0,2G ], and the operating system is installed in the interval of [0,2G ]:

SignatureCode sc；

lb = 0; // data area start Address

Len =2 × 1024 × 2; // zone length

Type = 0; // Heat data

In a specific embodiment of the present application, the device is re-partitioned, and the original space [1G,2G ] will be changed from the cold data interval to the hot data interval, using the following feature codes:

SignatureCode sc；

lba =1 × 1024 × 2; // data area start Address

Len =1 × 1024 × 2; // zone length

Type = 0; // Heat data

The original [18G,20G ] space, which will change from hot data interval to cold data interval, uses the following signature:

SignatureCode sc; // data area start Address

Lba =18 × 1024 × 2; // zone length

sc.len=2*1024*1024*2；

Type = 1; // Cold data

In one embodiment of the present application, a newly installed application will frequently write data in [25G,30G ], using the following feature codes:

SignatureCode sc；

lba =25 × 1024 × 2; // data area start Address

Len =30 × 1024 × 2; // zone length

Type = 0; // Heat data

In one embodiment of the present application, after an application exits, data will not be written to the application until the occupied data interval [24G,31G ] is restarted.

SignatureCode sc；

Lba =24 × 1024 × 2; // data area start Address

Len =7 × 1024 × 2; // zone length

Type = 1; // Cold data

In step S2, the written data is either with or without a signature.

In step S3, it is determined whether the written data has a feature code, and if so, the data is further distinguished by entering a determination mode with a feature code; if the feature code is not contained, entering a judgment mode without the feature code.

In step S5, a threshold is set for the read/write frequency, and when the read/write frequency of a certain data is equal to or higher than the threshold, the data is classified as hot data, and when the read/write frequency of a certain data is lower than the threshold, the data is classified as cold data.

In step S6, the data with the hot and cold attributes separated is stored in a partitioned manner for each attribute. And at different periods, the storage areas of the cold data and the hot data are exchanged to even the times of erasing the storage areas, and the probability of damage to the storage equipment caused by excessive erasing of a certain block is reduced.

In step S7, according to the separation result of the cold and hot data, the data is tracked again to see whether the data keeps the original cold and hot attributes, and if the cold and hot attributes change, the feature codes are extracted again to obtain new feature codes.

In step S9, the data separation according to the feature code includes the following steps:

a1, start

A2, identifying the area;

a5, storing the data in a cold data area, and turning to A8;

a6, storing the data in a hot data area, and turning to A8;

a7, storing data in a cold data area which is checked regularly;

a8, counting the access frequency;

a9, evaluating a feature code;

and A10, ending.

And for the data with the feature codes, the data is stored in a partitioned mode according to the feature codes as long as the attribute of the feature codes is judged.

After the data are stored, the feature code evaluation is carried out according to the access frequency of the data, and the cold and hot features of the data are redefined.

And updating the feature codes according to the evaluation results of the feature codes in the steps S7 and S11, so as to realize dynamic management of cold and hot data.

The embodiments of the present invention are preferred embodiments of the present invention, and the scope of the present invention is not limited by these embodiments, so: all equivalent changes made according to the structure, shape and principle of the invention are covered by the protection scope of the invention.

Claims

1. A cold and hot data separation method based on feature codes is characterized in that: the method comprises the following steps:

s1, inputting a feature code;

s2, inputting data;

s4, preparing to store data;

s5, separating cold and hot data according to the read-write frequency;

s6, respectively storing cold and hot data according to the separation result;

s7, evaluating the feature code, and turning to S11;

s8, preparing to store data;

s9, separating cold and hot data according to the feature codes;

s10, respectively storing cold and hot data according to the separation result;

s11, updating the feature code rule;

and S12, ending.

2. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: the feature codes comprise hot-write data feature codes, hot-read data feature codes and cold data feature codes.

3. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: the feature code comprises a region start address, a region length, a data cold and hot type and a reserved field of cold data or hot data.

4. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S5, a threshold is set for the read/write frequency, and when the read/write frequency is equal to or higher than the threshold, the data is classified as hot data, and when the read/write frequency is lower than the threshold, the data is classified as cold data.

5. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S7, feature codes are extracted based on the separation result of the cold and hot data, and new feature codes are obtained.

6. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S9, the data separation according to the feature code includes the following steps:

a1, start

A2, identifying the area;

a5, storing the data in a cold data area, and turning to A8;

a6, storing the data in a hot data area, and turning to A8;

a7, storing data in a cold data area which is checked regularly;

a8, counting the access frequency;

a9, evaluating a feature code;

and A10, ending.

7. A cold-hot data separation method based on feature codes according to claim 1 or 6, characterized in that: and evaluating the feature codes according to the access frequency of the data, and redefining the cold and hot features of the data.

8. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S1, the system file of the operating system is defined as hot read data.

9. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S1, the data in the frequently read/write section in the application program is defined as the hot write data.

10. A cold-hot data separation method based on feature codes according to claim 1, characterized in that: in step S1, the backup data is defined as cold data.