CN107562375A - More copy isomery storage methods and framework based on data value - Google Patents

More copy isomery storage methods and framework based on data value Download PDF

Info

Publication number
CN107562375A
CN107562375A CN201710700649.8A CN201710700649A CN107562375A CN 107562375 A CN107562375 A CN 107562375A CN 201710700649 A CN201710700649 A CN 201710700649A CN 107562375 A CN107562375 A CN 107562375A
Authority
CN
China
Prior art keywords
data
value
copy
storage
stored
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710700649.8A
Other languages
Chinese (zh)
Inventor
李晖
李惠娟
温兆聪
吴俊敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Original Assignee
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd filed Critical Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority to CN201710700649.8A priority Critical patent/CN107562375A/en
Publication of CN107562375A publication Critical patent/CN107562375A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of more copy isomery storage methods based on data value, including:The dynamic value Vd of data is obtained according to data access frequency, the static value Vs of data delimited according to data type, the subjective value Vu of data is defined according to user's request;The total value V=wd*Vd+ws*Vs+wu*Vu of data is calculated, wherein wd is the weight of dynamic value, and ws is the weight of static value, and wu is the weight of subjective value, wd+ws+wu=1;By the total value of data compared with default first threshold and Second Threshold, data are divided into high price Value Data, middle value numerical value and low price Value Data;All copies of low price Value Data are stored on disk, all copies of high price Value Data are stored on solid state hard disc, a copy of middle value data are stored on solid state hard disc, remaining copy is stored on disk.According to the suitable storage strategy of the different choice of data value, by copy storage on a different storage medium, so as to reach the optimization to systematic function and cost.

Description

More copy isomery storage methods and framework based on data value
Technical field
It is different more particularly to a kind of more copies based on data value the present invention relates to a kind of isomery storage method of data Structure storage method and framework.
Background technology
In distributed memory system, in order to ensure the reliability of data, data redundancy method, i.e. a data are employed The method for storing multiple copies, the number of copies under default situations is 3, but is selection SSD or magnetic for the storage location of copy Disk is stored, and system is randomly selected, so as to not fully take into account the characteristic of different storage mediums, it is impossible to given play to The advantages of isomery stores, therefore strong influence is caused to the storage performance of system.Although SSD performance is largely Better than disk, but it is not meant to that SSD can substitute disk, because SSD has the characteristic for writing preceding erasing, that is, is written to SSD's Data can not can be covered directly as disk, it is necessary to which first erasing is repeated and write, so as to not only reducing SSD performance And it have impact on SSD life-span.
Although being directed to the evaluation of data value in other researchs, dynamic value is only considered, dynamic value refers to Be data access frequency height, change that visiting frequency can be over time and change, its data value can also change therewith, Therefore referred to as dynamic value.Visiting frequency is more high then to represent that the use degree of data is higher, i.e., dynamic value is higher;Visiting frequency is got over Low, then data are low using degree, i.e., dynamic value is low.In general, the data access frequency that newly writes is certainly very high, therefore can be with Its magnitude of value is provided according to the visiting frequency in one day, a week or one month respectively.But if only consideration data The data value that this factor of dynamic value obtains is often inaccuracy,
The content of the invention
For above-mentioned technical problem, the present invention seeks to:More copy isomeries based on data value are provided to deposit Method for storing and framework, from the angle of data value, there is provided the isomery storage architecture of copy more than one, according to the difference of data value Suitable storage strategy is selected, by copy storage on a different storage medium, so as to reach to the excellent of systematic function and cost Change, dynamic value is both considered in the evaluation of data value, it is further contemplated that static value and subjective value so that the data valency of calculating Value is more accurate.
The technical scheme is that:
A kind of more copy isomery storage methods based on data value, comprise the following steps:
S01:The dynamic value Vd of data is obtained according to data access frequency, the static valency of data delimited according to data type Value Vs, the subjective value Vu of data is defined according to user's request;
S02:The total value V=wd*Vd+ws*Vs+wu*Vu of data is calculated, wherein wd is the weight of dynamic value, and ws is The weight of static state value, wu is the weight of subjective value, wd+ws+wu=1;
S03:By the total value of data compared with default first threshold and Second Threshold, data are divided into high value number According to, middle value numerical value and low price Value Data;
S04:All copies of low price Value Data are stored on disk, all copies of high price Value Data are stored in solid On state hard disk, a copy of middle value data is stored on solid state hard disc, remaining copy is stored on disk.
Preferably, the total value of data is calculated in real time and compared with default first threshold and Second Threshold, judge data Axiological idea whether be consistent with all copy storage locations of the data, if not being consistent, triggering transferring module change the data The storage location of copy.
The invention also discloses a kind of more copy isomery storage architectures based on data value, including:
One data memory access logging modle, the memory access information of all data in record system;
One data value computing module, the dynamic value Vd of data is obtained according to data access frequency, according to data type The static value Vs of data delimited, the subjective value Vu of data is defined according to user's request;Calculate the total value V=wd* of data Vd+ws*Vs+wu*Vu, wherein wd are the weights of dynamic value, and ws is the weight of static value, and wu is the weight of subjective value, Wd+ws+wu=1;
One monitoring and mapping block, real-time monitoring system, find all copies of data, judge the axiological ideas of data with Whether all copy storage locations of the data are consistent, if not being consistent, change data value and the mapping of copy storage strategy are closed System;
One transferring module, transferring module is triggered after monitoring have changed the storage strategy of data with mapping block, migrates mould The copy storage strategy migrating data that root tuber is changed according to migration.
Compared with prior art, it is an advantage of the invention that:
From the angle of data value, there is provided the isomery storage architecture of copy more than one, according to the different choice of data value Suitable storage strategy, by copy storage on a different storage medium, the advantage of different storage mediums is given full play to, using not The advantages of with storage medium, the shortcomings that avoiding different storage mediums, so as to reach the optimization to systematic function and cost.Data valency Dynamic value is both considered in the evaluation of value, it is further contemplated that static value and subjective value so that the data value of calculating is more accurate Really, can completely embody data to system, to the importance of user.
Brief description of the drawings
Below in conjunction with the accompanying drawings and embodiment the invention will be further described:
Fig. 1 is the structural representation of more copy isomery storage architectures based on data value;
Fig. 2 is the flow chart of more copy isomery storage architectures based on data value.
Embodiment
Such scheme is described further below in conjunction with specific embodiment.It should be understood that these embodiments are to be used to illustrate The present invention and be not limited to limit the scope of the present invention.The implementation condition used in embodiment can be done according to the condition of specific producer Further adjustment, unreceipted implementation condition is usually the condition in normal experiment.
Embodiment:
As shown in figure 1, framework mainly includes four modules:Data memory access logging modle, data value computing module, monitoring With mapping block and transferring module, wherein first three module is in the name server of distributed file system, constitutes one Individual isomery storage control, last module are in data server, are an executors.
Data memory access logging modle is mainly responsible for the memory access information of all data in record system.The module is isomery storage Inlet module, record the access type, access times and the time accessed every time of all data.Recorded by data memory access Module can obtain data access frequency, so as to which the module needs the relevant information of collection being sent to data value calculating mould Block, the information that the module provides are used for the dynamic value Vd for calculating data.
The main task of data value computing module is to calculate the total value of all data.Remembered by data memory access module The information of record draws dynamic value Vd;The static value Vs of data delimited according to data type, data are defined according to user's request Subjective value Vu.
Static state value refers to the value that the static feature of data in itself determines, such as different file types, size.One As provide a magnitude of value respectively to different types of data by system manager.
Subjective value is that user defines according to their needs, can be provided by user.
Pass through formula V=wd*Vd+ws*Vs+wu*Vu (1)
Calculate the total value of data.Different data have different values, and the height of value determines depositing for they Storage strategy.
Monitoring, it is necessary to monitor whole system in real time, finds all copies of data, judges the value of data with mapping block Whether type is consistent with all copy storage locations of the data, if not being consistent, change data value and copy storage strategy Mapping relations.The total value of data can be over time change and change, monitor data value and copy storage strategy During mismatch, it is necessary to change the copy storage strategy of the data, and match copy storage strategy corresponding to data value therewith.
The function of transferring module is corresponding to migrating data to matching storage strategy in storage medium.When monitoring with After mapping block have changed the storage strategy of data, transferring module will be triggered, transferring module will immediately be migrated and have changed The data of copy storage strategy.
The difference of the difference of data storage feature as needed, i.e. data value selects different copies to store plan respectively Slightly, the low data selection HDD strategies of data value, the high data selection SSD strategies of data value, the medium data of data value Select SSD+HDD strategies.It is the description tactful to three kinds below:
HDD strategies:All copies for needing storage file are stored on disk.
SSD strategies:All copies for needing storage file are stored on solid state hard disc.
SSD+HDD strategies:One of storage file will be needed to store onto solid state hard disc, disk is arrived in the storage of remaining copy On.This strategy is a combination of HDD strategies and SSD strategies, and the advantages of can making full use of HDD and SSD, being primarily adapted for use in needs The file the to be stored data that both repetition in need is read or repetition in need is write.
The overall workflow of adaptive more copy isomery storages based on data value is made up of four major parts, with Fig. 1 Described in module design correspond, they are respectively:The calculating of data memory access record information, data value, system monitoring With the migration of mapping mechanism and data.In order to which the working mechanism of whole system is explained in more detail, its workflow such as Fig. 2 It is shown
The later step of system operation is as follows:
The first step:Collect the memory access information of data.Including access time, access type and access times, so as to To obtain access information of the data in different time sections, such as one hour, one day, a week.
Second step:Calculate data value.Data value is calculated according to the memory access information being collected into, different data values Data are by using in different copy storage classes.
3rd step:Judge whether data value changes.Change over time, data value may change Become, the mapping relations of different pieces of information value are different.If data value is changed, the 4th step can be carried out, if turning without if To second step, continue to calculate data value.
4th step:Change the mapping relations of data.Data value is changed, then corresponding mapping relations will be fitted no longer Close, so as to need to change mapping relations, therefore the copy storage strategy of data is changed.
5th step:Migrating data.After the copy storage strategy of data changes, the current storage class of data has been not suitable for The storage of data, now just need to migrate data, in the storage class that copy is moved to corresponding storage strategy.
So far, the workflow of more copy isomery storage schemes based on data value terminates.
More copy isomery storage architectures based on data value proposed by the invention, it is most according to the difference of data value Copy may be stored in most suitable storage medium.Its main target is to give full play to the advantage of different storage mediums, The advantages of using different storage mediums, the shortcomings that avoiding different storage mediums, there is provided give user one storage compared with high performance-price ratio Equipment.The cost of storage device so can be both controlled, equally also each storage medium is brought out one's strong points, ensures to deposit Cost performance is improved as far as possible while storage system capacity.
The foregoing examples are merely illustrative of the technical concept and features of the invention, its object is to allow the person skilled in the art to be Present disclosure can be understood and implemented according to this, it is not intended to limit the scope of the present invention.It is all smart according to the present invention The equivalent transformation or modification that refreshing essence is done, should all be included within the scope of the present invention.

Claims (3)

1. a kind of more copy isomery storage methods based on data value, it is characterised in that comprise the following steps:
S01:The dynamic value Vd of data is obtained according to data access frequency, delimiting the static of data according to data type is worth Vs, the subjective value Vu of data is defined according to user's request;
S02:The total value V=wd*Vd+ws*Vs+wu*Vu of data is calculated, wherein wd is the weight of dynamic value, and ws is static The weight of value, wu are the weights of subjective value, wd+ws+wu=1;
S03:By the total value of data compared with default first threshold and Second Threshold, by data be divided into high price Value Data, Middle value numerical value and low price Value Data;
S04:All copies of low price Value Data are stored on disk, it is hard that all copies of high price Value Data are stored in into solid-state On disk, a copy of middle value data is stored on solid state hard disc, remaining copy is stored on disk.
2. more copy isomery storage methods according to claim 1 based on data value, it is characterised in that calculate in real time The total value of data and compared with default first threshold and Second Threshold, judge all of the axiological ideas of data and the data Whether copy storage location is consistent, if not being consistent, triggering transferring module changes the storage location of the data trnascription.
A kind of 3. more copy isomery storage architectures based on data value, it is characterised in that including:
One data memory access logging modle, the memory access information of all data in record system;
One data value computing module, the dynamic value Vd of data is obtained according to data access frequency, delimited according to data type The static value Vs of data, the subjective value Vu of data is defined according to user's request;Calculate the total value V=wd*Vd+ of data Ws*Vs+wu*Vu, wherein wd are the weights of dynamic value, and ws is the weight of static value, and wu is the weight of subjective value, wd+ Ws+wu=1;
One monitoring and mapping block, real-time monitoring system, all copies of data are found, judge the axiological idea and the number of data According to all copy storage locations whether be consistent, if not being consistent, the mapping relations of change data value and copy storage strategy;
One transferring module, transferring module, transferring module root are triggered after monitoring have changed the storage strategy of data with mapping block According to the copy storage strategy migrating data of migration change.
CN201710700649.8A 2017-08-16 2017-08-16 More copy isomery storage methods and framework based on data value Pending CN107562375A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710700649.8A CN107562375A (en) 2017-08-16 2017-08-16 More copy isomery storage methods and framework based on data value

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710700649.8A CN107562375A (en) 2017-08-16 2017-08-16 More copy isomery storage methods and framework based on data value

Publications (1)

Publication Number Publication Date
CN107562375A true CN107562375A (en) 2018-01-09

Family

ID=60974092

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710700649.8A Pending CN107562375A (en) 2017-08-16 2017-08-16 More copy isomery storage methods and framework based on data value

Country Status (1)

Country Link
CN (1) CN107562375A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462389A (en) * 2014-12-10 2015-03-25 上海爱数软件有限公司 Method for implementing distributed file systems on basis of hierarchical storage
CN106469029A (en) * 2011-12-31 2017-03-01 华为数字技术(成都)有限公司 Data hierarchy storage processing method, device and storage device
WO2017085792A1 (en) * 2015-11-17 2017-05-26 株式会社日立製作所 Storage system and storage system control method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106469029A (en) * 2011-12-31 2017-03-01 华为数字技术(成都)有限公司 Data hierarchy storage processing method, device and storage device
CN104462389A (en) * 2014-12-10 2015-03-25 上海爱数软件有限公司 Method for implementing distributed file systems on basis of hierarchical storage
WO2017085792A1 (en) * 2015-11-17 2017-05-26 株式会社日立製作所 Storage system and storage system control method

Similar Documents

Publication Publication Date Title
CN103927251B (en) Log management method for embedded system
CN102156738B (en) Method for processing data blocks, and data block storage equipment and system
CN104050097B (en) Selected between the Nonvolatile memery unit with different smallest addressable data unit sizes
US9141457B1 (en) System and method for predicting multiple-disk failures
US5764880A (en) Method and system for rebuilding log-structured arrays
US5794242A (en) Temporally and spatially organized database
US7653781B2 (en) Automatic RAID disk performance profiling for creating optimal RAID sets
US11204705B2 (en) Retention-aware data tiering algorithm for hybrid storage arrays
CN104583930B (en) Method, controller and the data migration device of Data Migration
CN103135940B (en) Implementing enhanced fragmented stream handling in a shingled disk drive
CN107037976A (en) Storage device and its method of work
CN104268099B (en) A kind of method and device for managing reading and writing data
CN107003937A (en) For the cache optimization technology of large-scale work data set
CN107273046B (en) A kind of data processing method and system based on Solid-state disc array
US8234464B2 (en) Hybrid storage data migration by selective data removal
CN106463145B (en) Cache systems and method for hard disk drive and hybrid drive
US20170364447A1 (en) Read cache management
CN101004936A (en) Information storage device and its control method
CN108897492A (en) A kind of method for writing data and device
KR20090129791A (en) Multiple value flash memory
CN102184080B (en) Curing band based data reading and writing method used after capacity expansion of RAID (Redundant Array of Independent Disk) system
CN107092525A (en) data cache method and device
Ma et al. Alleviating hot data write back effect for shingled magnetic recording storage systems
CN106095330A (en) A kind of storage method and apparatus of metadata
CN105913857A (en) Magnetic disk drive and rewrite processing method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180109

RJ01 Rejection of invention patent application after publication