CN115857807A - Data pre-recall method and device, electronic equipment and storage medium - Google Patents

Publication number
CN115857807A
Authority
CN
China
Prior art keywords
target
storage system
primary storage
objects
metadata information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211535815.0A
Other languages
Chinese (zh)
Inventor
黄梦龙
陈亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan Happly Sunshine Interactive Entertainment Media Co Ltd
Original Assignee
Hunan Happly Sunshine Interactive Entertainment Media Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan Happly Sunshine Interactive Entertainment Media Co Ltd filed Critical Hunan Happly Sunshine Interactive Entertainment Media Co Ltd
Priority to CN202211535815.0A priority Critical patent/CN115857807A/en
Publication of CN115857807A publication Critical patent/CN115857807A/en
Pending legal-status Critical Current

Abstract

The application discloses a data pre-recall method and device, electronic equipment and a storage medium. The method comprises the following steps: when the pre-recall service is triggered, requesting a list of target objects and the metadata information of each target object from a primary storage system; judging whether the primary storage system is currently in an idle state; if the primary storage system is in the idle state, screening a plurality of migration objects related to the target storage object from the list of target objects based on the metadata information of each target object and the metadata information of the target storage object; and requesting the secondary tape library to migrate each migration object to the primary storage system. In this way, not only is the accessed object migrated from the secondary tape library to the primary storage system, but objects related to the accessed object are also restored from the tape library to the primary storage system in advance by the pre-recall service, which improves the hit rate of user accesses to storage objects and reduces the user's waiting time.

Description

Data pre-recall method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of distributed storage technologies, and in particular, to a method and an apparatus for pre-recalling data, an electronic device, and a storage medium.
Background
With the continuous development of computer technology, storage technology has advanced accordingly and the choice of storage media keeps widening. The tape library is a relatively common storage medium. It offers large capacity, low power consumption, and the ability to complete storage backups without manual intervention, but it is relatively cumbersome to use.
In the prior art, reading or writing a data file in a tape library requires the following operations: moving the magnetic tape from the tape slot to the tape drive, loading the tape medium, positioning the tape at the recorded position of the data file, reading the data file from the tape medium and writing it into the upper-layer storage, rewinding the tape to its beginning through the tape drive, unloading the tape medium, and moving the tape medium from the tape drive back to the tape slot.
However, the tape library serves as a cold backup storage system: when the usage rate of the primary storage system becomes too high, cold data that has expired or has not been used for a long time is migrated to the secondary storage system. When a user wants to access such cold data, the tape library must first be notified to synchronize the cold data back to the primary storage system before it can be accessed. As a result, the user waits too long for the accessed object to be restored, and the user experience is poor.
Disclosure of Invention
In view of the defects of the prior art, the application provides a data pre-recall method and device, electronic equipment and a storage medium, so as to solve the problem in the prior art that a user waits too long for an accessed object to be restored.
In order to achieve the above object, the present application provides the following technical solutions:
A first aspect of the present application provides a method for pre-recalling data, including:
when the pre-recall service is triggered, requesting a list of each target object and metadata information of each target object from a primary storage system; wherein the target object refers to an object that has been migrated from the primary storage system into the secondary tape library;
judging whether the current state of the primary storage system is in an idle state or not;
if the current state of the primary storage system is judged to be in an idle state, screening a plurality of migration objects related to the target storage object from a list of the target objects based on metadata information of the target objects and metadata information of the target storage object; wherein the target storage object refers to an object stored in the primary storage system determined from access information;
requesting the secondary tape library to migrate each migration object to the primary storage system.
Optionally, in the above method for pre-recalling data, before the step of requesting, when the pre-recall service is triggered, a list of each target object and metadata information of each target object from the primary storage system, the method further includes:
when a processing request of a user for accessing a missing object sent by a primary storage system is received, requesting a secondary tape library to migrate the missing object to the primary storage system; wherein the missing object refers to an object that has been migrated from the primary storage system into the secondary tape library;
triggering a pre-recall service when detecting that the capacity utilization rate of the primary storage system is less than a capacity threshold.
Optionally, in the above method for pre-recalling data, before requesting, from the primary storage system, a list of each target object and metadata information of each target object when the pre-recall service is triggered, the method further includes:
detecting whether the capacity utilization rate of the primary storage system is smaller than a capacity threshold value every a first preset time interval;
triggering a pre-recall service whenever a capacity usage of the primary storage system is detected to be less than a capacity threshold.
Optionally, in the above method for pre-recalling data, the screening, based on the metadata information of each target object and the metadata information of a target storage object, a plurality of migration objects related to the target storage object from a list of each target object includes:
taking the missing object as the target storage object;
calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of the target storage object and the access frequency of the target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object are, the higher the total score of the target object is;
and screening N target objects from the list of the target objects as the migration objects based on the total score of the target objects.
Optionally, in the above method for pre-recalling data, the screening, based on the metadata information of each target object and the metadata information of a target storage object, a plurality of migration objects related to the target storage object from a list of each target object includes:
acquiring each storage object with the access frequency greater than the preset frequency and metadata information of each storage object from the primary storage system;
taking each storage object as each target storage object;
calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of each target storage object and the access frequency of each target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object are, the higher the total score of the target object is;
and screening M target objects from the list of the target objects as the migration objects based on the total score of the target objects.
Optionally, in the above method for pre-recalling data, the calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of each target storage object and the access frequency of each target storage object includes:
for each tag pair of each target object, accumulating into the total score of the target object the product of a preset coefficient and the access frequency of each target storage object that has the same tag pair.
Optionally, in the above method for pre-recalling data, after determining whether the current state of the primary storage system is in an idle state, the method further includes:
if the current state of the primary storage system is judged to be in a non-idle state, returning, after a second preset time interval, to the step of requesting the list of target objects and the metadata information of each target object from the primary storage system.
A second aspect of the present application provides an apparatus for pre-recalling data, including:
the first request unit is used for requesting a list of each target object and metadata information of each target object from a primary storage system when the pre-recall service is triggered; wherein the target object refers to an object that has been migrated from the primary storage system into the secondary tape library;
the judging unit is used for judging whether the current state of the primary storage system is in an idle state or not;
the first screening unit is used for screening a plurality of migration objects related to the target storage object from a list of the target objects based on metadata information of the target objects and metadata information of the target storage object if the current state of the primary storage system is judged to be in an idle state; wherein the target storage object refers to an object stored in the primary storage system determined from access information;
and the migration unit is used for requesting the secondary tape library to migrate each migration object to the primary storage system.
Optionally, the apparatus for pre-recalling data described above further includes:
the second request unit is used for requesting the secondary tape library to migrate the missing object to the primary storage system when receiving a processing request sent by the primary storage system and used for accessing the missing object by a user; wherein the missing object refers to an object that has been migrated from the primary storage system into the secondary tape library;
the first triggering unit is used for triggering the pre-recall service when detecting that the capacity utilization rate of the primary storage system is smaller than a capacity threshold value.
Optionally, the apparatus for pre-recalling data described above further includes:
the detection unit is used for detecting whether the capacity utilization rate of the primary storage system is smaller than a capacity threshold value or not every interval of a first preset time length;
and the second trigger unit is used for triggering the pre-recall service when detecting that the capacity utilization rate of the primary storage system is less than the capacity threshold value.
Optionally, in the apparatus for pre-recalling data described above, the first screening unit includes:
a first designating unit, configured to take the missing object as the target storage object;
a first calculation unit, configured to calculate a total score of each target object based on a degree of coincidence of the tag pair of each target object with the tag pair of the target storage object and an access frequency of the target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object are, the higher the total score of the target object is;
and the second screening unit is used for screening N target objects from the list of the target objects as the migration objects based on the total scores of the target objects.
Optionally, in the apparatus for pre-recalling data described above, the first screening unit includes:
the acquisition unit is used for acquiring each storage object with the access frequency greater than the preset frequency and the metadata information of each storage object from the primary storage system;
a second designating unit, configured to take each of the storage objects as a target storage object;
a second calculation unit, configured to calculate a total score of each target object based on a degree of coincidence of the tag pair of each target object with the tag pair of each target storage object and an access frequency of each target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object are, the higher the total score of the target object is;
and the third screening unit is used for screening M target objects from the list of the target objects as the migration objects based on the total score of the target objects.
Optionally, in the apparatus for pre-recalling data described above, the second calculation unit includes:
an accumulation unit, configured to accumulate, for each tag pair of each target object, the product of a preset coefficient and the access frequency of each target storage object that has the same tag pair into the total score of the target object.
Optionally, the apparatus for pre-recalling data described above further includes:
the execution unit is used for returning, after a second preset time interval, to requesting the list of target objects and the metadata information of each target object from the primary storage system if the current state of the primary storage system is judged to be in a non-idle state.
A third aspect of the present application provides an electronic device comprising:
a memory and a processor;
wherein the memory is used for storing programs;
the processor is configured to execute the program, and when the program is executed, the program is specifically configured to implement a method of pre-recalling data as described in any of the above.
A fourth aspect of the present application provides a computer storage medium storing a computer program which, when executed, implements the method of pre-recalling data as described in any of the above.
In the method for pre-recalling data provided by the application, when the pre-recall service is triggered, a list of target objects and the metadata information of each target object are requested from the primary storage system, and it is then judged whether the primary storage system is currently in an idle state. If it is judged to be in the idle state, a plurality of migration objects related to the target storage object are screened from the list of target objects based on the metadata information of each target object and the metadata information of the target storage object, where the target storage object refers to an object stored in the primary storage system that is determined according to access information. Finally, the secondary tape library is requested to migrate each migration object to the primary storage system. In this way, not only is the accessed object migrated from the secondary tape library to the primary storage system, but objects related to the accessed object are also restored from the tape library to the primary storage system in advance by the pre-recall service, which effectively improves the hit rate of user accesses to storage objects and reduces the user's waiting time.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the embodiments or the prior art descriptions will be briefly described below, it is obvious that the drawings in the following description are only the embodiments of the present application, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
FIG. 1 is a flowchart of a method for pre-recalling data according to an embodiment of the present application;
FIG. 2 is a flowchart of a method for triggering the pre-recall service according to an embodiment of the present application;
FIG. 3 is a flowchart of a method for detecting whether to trigger the pre-recall service according to an embodiment of the present application;
FIG. 4 is a flowchart of a method for screening migration objects according to an embodiment of the present application;
FIG. 5 is a flowchart of a method for acquiring migration objects according to an embodiment of the present application;
FIG. 6 is a schematic structural diagram of an apparatus for pre-recalling data according to another embodiment of the present application;
FIG. 7 is a schematic structural diagram of an electronic device according to another embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be described clearly and completely with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only some embodiments of the present application, and not all embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application without making any creative effort belong to the protection scope of the present application.
In this application, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.
An embodiment of the present application provides a data pre-recall method which, as shown in FIG. 1, specifically includes the following steps:
S101, when the pre-recall service is triggered, requesting a list of target objects and the metadata information of each target object from the primary storage system.
Here, a target object refers to an object that has been migrated from the primary storage system to the secondary tape library. It should be noted that the list of target objects is requested from the primary storage system in order to determine which objects are no longer in the primary storage system, and to avoid erroneously migrating objects that do not belong to the primary storage system into it.
Optionally, when a user attempts to access an object that is missing from the primary storage system, the service that recalls the missing object is triggered. Therefore, in another embodiment of the present application, before step S101 is executed, as shown in FIG. 2, the method may further include:
S201, when a processing request sent by the primary storage system for a user's access to a missing object is received, requesting the secondary tape library to migrate the missing object to the primary storage system.
Wherein a missing object refers to an object that has been migrated from the primary storage system to the secondary tape library.
Specifically, when a user accesses a missing object of the primary storage system, the primary storage system sends a request for accessing the missing object to the recall service, and then the recall service requests the secondary tape library to migrate the missing object back to the primary storage system for the user to access.
Optionally, access to a missing object of the primary storage system is not limited to a user; it may also come from a front end, a tool, or an API.
S202, detecting whether the capacity utilization rate of the primary storage system is smaller than a capacity threshold value.
It should be noted that migrating missing objects into the primary storage system when its capacity utilization rate is already high could cause data overflow and, in turn, data leakage. To prevent this, it is necessary to detect in advance whether the capacity utilization rate of the primary storage system is smaller than a capacity threshold. If the capacity utilization rate is detected to be smaller than the capacity threshold, step S203 is executed. Optionally, if the capacity utilization rate is detected to be greater than the capacity threshold, an access-failure prompt may be fed back to the user so that the user's time is not wasted.
Optionally, the capacity threshold may be 70%, and of course, other thresholds may also be used, and may be specifically set according to requirements.
S203, triggering the pre-recall service.
Specifically, when the capacity utilization rate of the primary storage system is detected to be smaller than the capacity threshold value, a service mechanism for recalling the missing object is triggered.
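As an illustration only, steps S201 to S203 can be sketched in Python as follows. The `PrimaryStorage` class, the dictionary used as a stand-in for the tape library, and the function name `handle_missing_object` are all hypothetical and do not come from the application; the 70% threshold is the example value given above. The sketch follows the step numbering (recall first, then check capacity), although an implementation could equally check capacity before recalling.

```python
from dataclasses import dataclass, field

CAPACITY_THRESHOLD = 0.70  # example value from the text; adjustable as required


@dataclass
class PrimaryStorage:
    """Hypothetical stand-in for the primary storage system."""
    capacity_bytes: int
    used_bytes: int = 0
    objects: dict = field(default_factory=dict)  # object id -> size in bytes

    def usage(self) -> float:
        """Capacity utilization rate as a fraction between 0 and 1."""
        return self.used_bytes / self.capacity_bytes

    def store(self, object_id: str, size: int) -> None:
        self.objects[object_id] = size
        self.used_bytes += size


def handle_missing_object(primary: PrimaryStorage, tape_library: dict, object_id: str) -> bool:
    """Recall one missing object and report whether pre-recall should be triggered."""
    # S201: ask the (simulated) tape library to migrate the missing object back.
    size = tape_library.pop(object_id)
    primary.store(object_id, size)
    # S202/S203: trigger the pre-recall service only below the capacity threshold.
    return primary.usage() < CAPACITY_THRESHOLD
```

The boolean return value stands for the decision in S203; a real service would start the pre-recall flow instead of returning a flag.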
Optionally, the capacity utilization rate of the primary storage system may be low, in which case objects in the secondary tape library can be migrated back to the primary storage system in advance to reduce waiting time and further improve the access hit rate of objects. Therefore, in another embodiment of the present application, before step S101 is executed, as shown in FIG. 3, the method may further include:
S301, detecting, at intervals of a first preset time length, whether the capacity utilization rate of the primary storage system is smaller than a capacity threshold.
It should be noted that, in this embodiment of the present application, the capacity utilization rate of the primary storage system needs to be detected periodically, because storage capacity is wasted if the capacity utilization rate stays too low. Specifically, if the capacity utilization rate of the primary storage system is detected to be smaller than the capacity threshold, step S302 is executed.
S302, triggering the pre-recall service.
In particular, whenever the capacity utilization of the primary storage system is detected to be less than a capacity threshold, a service mechanism is triggered that recalls objects that have been migrated from the primary storage system to the secondary tape library.
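The periodic detection in S301 and S302 could look roughly like the following Python sketch. The function name, the callables `get_usage` and `trigger_pre_recall`, and the `max_checks` and `sleep` parameters (added so the loop can be bounded and tested) are assumptions made for illustration, not part of the application.

```python
import time
from typing import Callable, Optional


def periodic_capacity_check(
    get_usage: Callable[[], float],
    trigger_pre_recall: Callable[[], None],
    interval_seconds: float,
    capacity_threshold: float = 0.70,
    max_checks: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Check capacity utilization every interval and trigger pre-recall when it is low.

    Returns how many times the pre-recall service was triggered.
    """
    checks = 0
    triggered = 0
    while max_checks is None or checks < max_checks:
        # S301: compare the current capacity utilization rate with the threshold.
        if get_usage() < capacity_threshold:
            # S302: trigger the pre-recall service.
            trigger_pre_recall()
            triggered += 1
        checks += 1
        sleep(interval_seconds)  # wait for the first preset time length
    return triggered
```

With `max_checks=None` the loop runs indefinitely, which matches a long-running service; passing a finite value is only a convenience for experiments.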
S102, judging whether the current state of the primary storage system is in an idle state or not.
It should be noted that the stability of the primary storage system cannot be guaranteed if the pre-recall runs while the system is busy, so it is necessary to determine in advance whether the primary storage system is currently in an idle state. If it is judged to be in the idle state, the pre-recall service can be executed, and step S103 is performed. For example, whether the primary storage system is idle may be determined by checking whether its queries per second (QPS) exceed a query threshold. The query threshold may be set to 1000 or to another value, as required.
Optionally, in this embodiment of the present application, after step S102, the method further includes:
if the current state of the primary storage system is judged to be in a non-idle state, returning, after a second preset time interval, to step S101 of requesting the list of target objects and the metadata information of each target object from the primary storage system.
It should be noted that, when the primary storage system is judged to be in a non-idle state, the system is currently busy and the pre-recall service cannot be executed. It is therefore necessary to wait for a period of time before again requesting the list of target objects and their metadata information from the primary storage system, so that the request is not repeated while the system is still busy.
Optionally, the second preset time length may be 1 hour or another duration, as required.
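A minimal Python sketch of the idle check in S102 and the retry after the second preset time interval is given below. The names `is_idle` and `wait_until_idle`, the `get_qps` callable, and the `max_attempts` and `sleep` parameters are hypothetical; the values 1000 and 1 hour are the examples from the text. For brevity the sketch only re-reads the QPS on each retry, whereas the described method returns to S101 and requests the list of target objects again.

```python
import time
from typing import Callable, Optional

QUERY_THRESHOLD = 1000         # example QPS threshold from the text
RETRY_INTERVAL_SECONDS = 3600  # example second preset time length (1 hour)


def is_idle(current_qps: float, query_threshold: float = QUERY_THRESHOLD) -> bool:
    """The primary storage system counts as idle when QPS does not exceed the threshold."""
    return current_qps <= query_threshold


def wait_until_idle(
    get_qps: Callable[[], float],
    retry_interval: float = RETRY_INTERVAL_SECONDS,
    max_attempts: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Retry the idle check after each interval; return True once the system is idle."""
    attempts = 0
    while max_attempts is None or attempts < max_attempts:
        if is_idle(get_qps()):
            return True
        attempts += 1
        sleep(retry_interval)  # non-idle: wait before checking again
    return False
```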
S103, screening a plurality of migration objects related to the target storage object from the list of each target object based on the metadata information of each target object and the metadata information of the target storage object.
Wherein the target storage object refers to an object stored in the primary storage system determined according to the access information. Specifically, access information may be requested from the primary storage system, so as to obtain a target storage object with a large number of access times from the access information, and the target storage object is used to determine a pre-recalled object.
Optionally, in another embodiment of the present application, when the target storage object is a missing object, as shown in FIG. 4, a specific implementation of step S103 includes the following steps:
S401, taking the missing object as the target storage object.
S402, calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of the target storage object and the access frequency of the target storage object.
Wherein the tag pairs are stored in the metadata information. The higher the degree of coincidence of the tag pair of the target object with the tag pair of the target storage object and the access frequency of the target storage object, the higher the total score of the target object.
It should be noted that step S402 and step S503 differ only in the number of target storage objects: step S402 involves a single target storage object, whereas step S503 involves several. For the specific implementation of step S402, reference may therefore be made to step S503 in the following method embodiment, and details are not repeated here.
S403, screening N target objects from the list of the target objects based on the total score of the target objects to serve as migration objects.
Specifically, the target objects may be sorted by total score and the top N target objects selected from the list of target objects, provided that the total capacity of the N target objects does not exceed the capacity usage threshold of the primary storage system.
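The selection in S403 can be illustrated with the following Python sketch. The function name `select_top_n`, the `free_budget_bytes` parameter, and the greedy rule of skipping an object that does not fit (rather than stopping) are assumptions made here; the application only states that the objects are sorted by total score and that their capacity must stay within the threshold.

```python
from typing import Dict, List


def select_top_n(
    scores: Dict[str, float],
    sizes: Dict[str, int],
    n: int,
    free_budget_bytes: int,
) -> List[str]:
    """Pick up to n highest-scoring target objects whose total size fits the budget."""
    # Sort object ids by total score, highest first.
    ranked = sorted(scores, key=lambda obj_id: scores[obj_id], reverse=True)
    selected: List[str] = []
    remaining = free_budget_bytes
    for obj_id in ranked:
        if len(selected) == n:
            break
        # Assumption: an object that would exceed the budget is skipped.
        if sizes[obj_id] <= remaining:
            selected.append(obj_id)
            remaining -= sizes[obj_id]
    return selected
```

Here `free_budget_bytes` would be derived from the capacity threshold, for example the threshold capacity minus the space currently in use.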
Optionally, in another embodiment of the present application, when the target storage objects are frequently accessed storage objects, a specific implementation of step S103, as shown in FIG. 5, includes the following steps:
S501, obtaining, from the primary storage system, each storage object whose access frequency is greater than a preset frequency, together with the metadata information of each such storage object.
Specifically, in order to improve the access hit rate of objects in the primary storage system and reduce waiting time, this embodiment of the present application may obtain from the primary storage system each storage object whose access frequency over the past month or several weeks is greater than the preset frequency, together with its metadata information.
Optionally, the preset frequency may be 1k (1,000) or 1w (10,000), or another value, as required.
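As a small illustration of S501, the filter over access frequencies might be written as below. The function name `hot_storage_objects` and the two dictionaries (access counts per object and metadata per object) are hypothetical data shapes chosen for this sketch.

```python
from typing import Dict


def hot_storage_objects(
    access_counts: Dict[str, int],
    metadata: Dict[str, dict],
    preset_frequency: int = 1000,
) -> Dict[str, dict]:
    """Return the metadata of storage objects accessed more often than the preset frequency."""
    return {
        obj_id: metadata[obj_id]
        for obj_id, count in access_counts.items()
        if count > preset_frequency  # strictly greater, as in the text
    }
```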
S502, taking each storage object as each target storage object.
S503, calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of each target storage object and the access frequency of each target storage object.
Wherein the tag pairs are stored in the metadata information. The higher the degree of coincidence of the tag pair of the target object with the tag pair of the target storage object and the access frequency of the target storage object, the higher the total score of the target object.
Optionally, the preset coefficient used in the score calculation may be 0.1 or another value, as required.
Optionally, in another embodiment of the present application, a specific implementation of step S503 includes the following step:
for each tag pair of each target object, accumulating into the total score of that target object the product of a preset coefficient and the access frequency of each target storage object that has the same tag pair.
Specifically, the initial score of each target object is preset to 0. The score of a target object is denoted score(obj-i), and the access frequency of a target storage object is denoted QPS(obj-m). The calculation may proceed as follows:
(Loop 1) Assuming there are 10 target objects, the target objects are traversed starting from the first one, and obj-i denotes the i-th target object currently being accessed (1 <= i <= 10).
(Loop 2) For obj-i, each of its tag pairs is traversed. For example, if the first target object has two tag pairs, {"video classification": "movie", "other classification": "action"}, they are denoted obj-i-tag-j (1 <= j <= 2).
(Loop 3) Each target storage object is traversed. Assuming there are 10 target storage objects, obj-m denotes the m-th target storage object (1 <= m <= 10).
(Loop 4) For obj-m, each of its tag pairs is traversed. For example, if the first target storage object has two tag pairs, {"video classification": "movie", "other classification": "action"}, they are denoted obj-m-tag-n (1 <= n <= 2).
If obj-i-tag-j and obj-m-tag-n are exactly equal, that is, the tag pairs are the same, then score(obj-i) = score(obj-i) + QPS(obj-m) × 0.1; otherwise, the traversal continues with the next tag pair, until all target objects and all target storage objects have been traversed.
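The four nested loops above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the function name total_scores, the dictionary-based representation of tag pairs, and the sample object names are hypothetical and do not come from the application; only the accumulation rule score(obj-i) += QPS(obj-m) × 0.1 is taken from the description.

```python
from typing import Dict

Tags = Dict[str, str]  # assumed representation: tag name -> tag value


def total_scores(
    target_objects: Dict[str, Tags],
    target_storage_objects: Dict[str, Tags],
    qps: Dict[str, float],
    coefficient: float = 0.1,  # example preset coefficient from the text
) -> Dict[str, float]:
    """Accumulate QPS(obj-m) * coefficient for every identical tag pair."""
    scores: Dict[str, float] = {}
    for obj_i, tags_i in target_objects.items():                   # loop 1
        score = 0.0                                                # initial score is 0
        for tag_j in tags_i.items():                               # loop 2
            for obj_m, tags_m in target_storage_objects.items():   # loop 3
                for tag_n in tags_m.items():                       # loop 4
                    if tag_j == tag_n:                             # same (name, value) pair
                        score += qps[obj_m] * coefficient
        scores[obj_i] = score
    return scores


if __name__ == "__main__":
    # Hypothetical sample data for illustration only.
    targets = {
        "a": {"video classification": "movie", "other classification": "action"},
        "b": {"video classification": "movie", "other classification": "comedy"},
        "c": {"video classification": "news"},
    }
    hot = {
        "hot1": {"video classification": "movie", "other classification": "action"},
        "hot2": {"video classification": "movie"},
    }
    print(total_scores(targets, hot, {"hot1": 2000, "hot2": 1000}))
```

In this sketch a target object that shares more tag pairs with frequently accessed storage objects receives a higher score, which matches the monotonic relationship stated for step S503.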
S504, based on the total score of each target object, screening M target objects from the list of each target object to be used as migration objects.
It should be noted that, in the specific implementation of step S504, reference may be made to step S403 accordingly, which is not described herein again. Optionally, the M target objects screened in step S504 may be the same as or different from the N target objects screened in step S403, and the specific screening condition needs to be determined according to the total score of the target objects.
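As an illustration of step S504, the screening can be sketched as a ranking by total score. The sketch assumes that "screening M target objects" means taking the M highest-scoring ones, that ties are broken by object name, and that objects with a score of 0 are treated as unrelated and skipped; none of these details is stated in the application, and the function name select_migration_objects is hypothetical.

```python
from typing import Dict, List


def select_migration_objects(scores: Dict[str, float], m: int) -> List[str]:
    """Return up to m object names with the highest positive total score."""
    # Sort by descending score; break ties by name so the result is deterministic.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    # Assumption: a score of 0 means no shared tag pair, so the object is skipped.
    return [name for name, score in ranked[:m] if score > 0]
```

Because the ranking depends only on the total scores, the M objects chosen here may or may not coincide with the N objects chosen in step S403, as noted above.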
S104, requesting the secondary tape library to migrate each migration object to the primary storage system.
Optionally, the migration objects determined in step S103 may be sent to the secondary tape library, which is then requested to migrate each of them to the primary storage system.
In the data pre-recall method provided by this embodiment, when the pre-recall service is triggered, a list of the target objects and the metadata information of each target object are requested from the primary storage system, and it is judged whether the primary storage system is currently in an idle state. If it is, a plurality of migration objects related to the target storage object are screened from the list of target objects based on the metadata information of each target object and the metadata information of the target storage object, where the target storage object refers to an object stored in the primary storage system that is determined according to access information. Finally, the secondary tape library is requested to migrate each migration object to the primary storage system. In this way, rather than migrating only the accessed object from the secondary tape library to the primary storage system, the pre-recall service also restores objects related to the accessed object from the tape library to the primary storage system in advance, which can effectively improve the hit rate when users access storage objects and reduce user waiting time.
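The overall flow of steps S101 to S104 can be sketched as follows. The classes FakePrimaryStorage and FakeTapeLibrary are hypothetical in-memory stand-ins written only for this illustration; the application does not define any programming interface for the primary storage system or the secondary tape library, and the screening step is passed in as a callable so that either scoring variant could be used.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Tags = Dict[str, str]


@dataclass
class FakePrimaryStorage:
    """Hypothetical stand-in for the primary storage system."""
    migrated_out: Dict[str, Tags]          # target objects now held on tape
    idle: bool = True
    resident: List[str] = field(default_factory=list)

    def list_target_objects(self) -> Dict[str, Tags]:
        return dict(self.migrated_out)

    def is_idle(self) -> bool:
        return self.idle


@dataclass
class FakeTapeLibrary:
    """Hypothetical stand-in for the secondary tape library."""
    primary: FakePrimaryStorage

    def migrate_to_primary(self, names: List[str]) -> None:
        self.primary.resident.extend(names)


def pre_recall(
    primary: FakePrimaryStorage,
    tape: FakeTapeLibrary,
    screen: Callable[[Dict[str, Tags]], List[str]],
) -> List[str]:
    targets = primary.list_target_objects()   # S101: request list and metadata
    if not primary.is_idle():                 # S102: judge idle state
        return []                             # nothing is migrated while busy
    migration_objects = screen(targets)       # S103: screen related objects
    tape.migrate_to_primary(migration_objects)  # S104: request migration
    return migration_objects
```

Passing the screening logic as a parameter keeps the sketch independent of how the total score is calculated.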
Another embodiment of the present application provides a data pre-recall apparatus, as shown in fig. 6, including the following units:
a first requesting unit 601, configured to request a list of each target object and metadata information of each target object from the primary storage system when the pre-recall service is triggered.
Wherein the target object refers to an object that has been migrated from the primary storage system to the secondary tape library.
A judging unit 602, configured to judge whether the current state of the primary storage system is an idle state.
A first screening unit 603, configured to, if it is judged that the current state of the primary storage system is an idle state, screen a plurality of migration objects related to the target storage object from the list of target objects based on the metadata information of each target object and the metadata information of the target storage object.
Wherein the target storage object refers to an object stored in the primary storage system that is determined according to access information.
A migration unit 604, configured to request the secondary tape library to migrate each migration object to the primary storage system.
It should be noted that, for the specific working process of the foregoing unit in the embodiment of the present application, reference may be made to step S101 to step S104 in the foregoing method embodiment, which is not described herein again.
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the apparatus further includes:
a second requesting unit, configured to request the secondary tape library to migrate a missing object to the primary storage system upon receiving a processing request, sent by the primary storage system, for a user's access to the missing object.
Wherein the missing object refers to an object that has been migrated from the primary storage system to the secondary tape library.
A first triggering unit, configured to trigger the pre-recall service when detecting that the capacity utilization rate of the primary storage system is less than a capacity threshold.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be correspondingly referred to, and details are not described here again.
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the apparatus further includes:
a detection unit, configured to detect, at intervals of a first preset duration, whether the capacity utilization rate of the primary storage system is less than a capacity threshold.
A second triggering unit, configured to trigger the pre-recall service whenever it is detected that the capacity utilization rate of the primary storage system is less than the capacity threshold.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be referred to accordingly, and are not described herein again.
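The cooperation of the detection unit and the second triggering unit can be sketched as a simple polling loop. The function name watch_capacity, its parameters, and the bounded number of checks are assumptions made for illustration; the application only states that the capacity utilization rate is checked at intervals of a first preset duration and that the pre-recall service is triggered when the rate is below the capacity threshold.

```python
import time
from typing import Callable


def watch_capacity(
    get_usage: Callable[[], float],     # hypothetical probe returning utilization in [0, 1]
    trigger: Callable[[], None],        # hypothetical hook that starts the pre-recall service
    threshold: float,
    interval_s: float,                  # the "first preset duration"
    max_checks: int,                    # assumption: bounded so the sketch terminates
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Poll the capacity utilization and trigger pre-recall when it is low."""
    fired = 0
    for _ in range(max_checks):
        if get_usage() < threshold:     # spare capacity is available
            trigger()
            fired += 1
        sleep(interval_s)               # wait before the next detection
    return fired
```

The sleep function is injectable so that the loop can be exercised without real waiting, for example by passing a function that does nothing.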
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the first screening unit 603 includes:
a first designating unit, configured to take the missing object as the target storage object.
A first calculating unit, configured to calculate a total score of each target object based on the degree of coincidence between the tag pairs of each target object and the tag pairs of the target storage object, and on the access frequency of the target storage object.
Wherein the tag pairs are stored in the metadata information. The higher the degree of coincidence between the tag pairs of the target object and the tag pairs of the target storage object, and the higher the access frequency of the target storage object, the higher the total score of the target object.
A second screening unit, configured to screen N target objects from the list of target objects as migration objects based on the total score of each target object.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be referred to accordingly, and are not described herein again.
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the first screening unit 603 includes:
an acquisition unit, configured to acquire, from the primary storage system, each storage object whose access frequency is greater than a preset frequency, together with the metadata information of each such storage object.
A second designating unit, configured to take each of the storage objects as a target storage object.
A second calculating unit, configured to calculate a total score of each target object based on the degree of coincidence between the tag pairs of each target object and the tag pairs of each target storage object, and on the access frequency of each target storage object.
Wherein the tag pairs are stored in the metadata information. The higher the degree of coincidence between the tag pairs of the target object and the tag pairs of the target storage object, and the higher the access frequency of the target storage object, the higher the total score of the target object.
A third screening unit, configured to screen M target objects from the list of target objects as migration objects based on the total score of each target object.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be referred to accordingly, and are not described herein again.
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the second calculating unit includes:
an accumulation unit, configured to, for each tag pair of each target object, accumulate into the total score of that target object the product of a preset coefficient and the access frequency of each target storage object that has the same tag pair.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be referred to accordingly, and are not described herein again.
Optionally, in a data pre-recall apparatus provided in another embodiment of the present application, the apparatus further includes:
an execution unit, configured to, if it is judged that the current state of the primary storage system is a non-idle state, return, after a second preset time interval, to the step of requesting the list of each target object and the metadata information of each target object from the primary storage system.
It should be noted that, for the specific working processes of each unit provided in the foregoing embodiments of the present application, corresponding steps in the foregoing method embodiments may be referred to accordingly, and are not described herein again.
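The behaviour of the execution unit can be sketched as a retry loop around the request step. The function name request_when_idle, the callables it takes, and the attempt limit are hypothetical; the application does not say how many times the request is repeated, so the limit is an assumption added only to keep the sketch bounded.

```python
import time
from typing import Callable, List, Optional


def request_when_idle(
    request_targets: Callable[[], List[str]],  # hypothetical: request list and metadata
    is_idle: Callable[[], bool],                # hypothetical: idle-state check
    retry_interval_s: float,                    # the "second preset time interval"
    max_attempts: int,                          # assumption: not specified in the text
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[List[str]]:
    """Repeat the request step until the primary storage system is idle."""
    for _ in range(max_attempts):
        targets = request_targets()             # request the target object list
        if is_idle():
            return targets                      # proceed to screening
        sleep(retry_interval_s)                 # non-idle: wait, then request again
    return None                                 # still busy after all attempts
```

Requesting the list again on every attempt, rather than reusing the first result, follows the wording that the flow returns to the request step after the interval.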
Another embodiment of the present application provides an electronic device, as shown in fig. 7, including:
a memory 701 and a processor 702.
The memory 701 is used for storing programs.
The processor 702 is configured to execute the program, and the program, when executed, is specifically used to implement the data pre-recall method provided in any of the embodiments above.
Another embodiment of the present application provides a computer storage medium for storing a computer program which, when executed, implements the data pre-recall method provided in any of the embodiments above.
Computer storage media include persistent and non-persistent, removable and non-removable media, and may implement information storage by any method or technology. The information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disc read-only memory (CD-ROM), digital versatile discs (DVD) or other optical storage, magnetic cassettes, magnetic tape or magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media such as modulated data signals and carrier waves.
Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the technical solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A method for pre-recalling data, comprising:
when the pre-recall service is triggered, requesting a list of each target object and metadata information of each target object from a primary storage system; wherein the target object refers to an object that has been migrated from the primary storage system into the secondary tape library;
judging whether the current state of the primary storage system is in an idle state or not;
if the current state of the primary storage system is judged to be in an idle state, screening a plurality of migration objects related to the target storage object from a list of the target objects based on the metadata information of the target objects and the metadata information of the target storage object; wherein the target storage object refers to an object stored in the primary storage system determined from access information;
requesting the secondary tape library to migrate each migration object to the primary storage system.
2. The method of claim 1, wherein before requesting the list of each target object and the metadata information of each target object from the primary storage system when the pre-recall service is triggered, the method further comprises:
when a processing request sent by a primary storage system for accessing a missing object by a user is received, requesting a secondary tape library to migrate the missing object to the primary storage system; wherein the missing object refers to an object that has been migrated from the primary storage system into the secondary tape library;
and triggering a pre-recall service when detecting that the capacity utilization rate of the primary storage system is less than a capacity threshold value.
3. The method of claim 1, wherein before requesting the list of each target object and the metadata information of each target object from the primary storage system when the pre-recall service is triggered, the method further comprises:
detecting, at intervals of a first preset duration, whether the capacity utilization rate of the primary storage system is less than a capacity threshold;
triggering the pre-recall service whenever the capacity utilization rate of the primary storage system is detected to be less than the capacity threshold.
4. The method of claim 2, wherein the screening a plurality of migration objects related to the target storage object from the list of each target object based on the metadata information of each target object and the metadata information of the target storage object comprises:
taking the missing object as the target storage object;
calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of the target storage object and the access frequency of the target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object, the higher the total score of the target object is;
and screening N target objects from the list of the target objects as the migration objects based on the total score of the target objects.
5. The method of claim 3, wherein the screening out a plurality of migration objects related to the target storage object from the list of the target objects based on the metadata information of the target objects and the metadata information of the target storage objects comprises:
acquiring each storage object with the access frequency greater than the preset frequency and metadata information of each storage object from the primary storage system;
taking each storage object as each target storage object;
calculating a total score of each target object based on the coincidence degree of the tag pair of each target object and the tag pair of each target storage object and the access frequency of each target storage object; wherein the tag pair is stored in the metadata information; the higher the coincidence degree of the tag pair of the target object and the tag pair of the target storage object and the access frequency of the target storage object are, the higher the total score of the target object is;
and screening M target objects from the list of the target objects as the migration objects based on the total score of the target objects.
6. The method of claim 5, wherein calculating the total score for each of the target objects based on the degree of coincidence of the tag pair of each of the target objects with the tag pair of each of the target storage objects and the access frequency of each of the target storage objects comprises:
for each tag pair of each target object, accumulating into the total score of the target object the product of a preset coefficient and the access frequency of each target storage object that has the same tag pair.
7. The method of claim 1, wherein after determining whether the current state of the primary storage system is in an idle state, further comprising:
if the current state of the primary storage system is judged to be in a non-idle state, returning, after a second preset time interval, to the step of requesting the list of each target object and the metadata information of each target object from the primary storage system.
8. An apparatus for pre-recalling data, comprising:
the first request unit is used for requesting a list of each target object and metadata information of each target object from a primary storage system when the pre-recall service is triggered; wherein the target object refers to an object that has been migrated from the primary storage system into the secondary tape library;
the judging unit is used for judging whether the current state of the primary storage system is in an idle state or not;
the first screening unit is used for screening a plurality of migration objects related to the target storage object from a list of the target objects based on metadata information of the target objects and metadata information of the target storage object if the current state of the primary storage system is judged to be in an idle state; wherein the target storage object refers to an object stored in the primary storage system determined according to access information;
and the migration unit is used for requesting the secondary tape library to migrate each migration object to the primary storage system.
9. An electronic device, comprising:
a memory and a processor;
wherein the memory is used for storing programs;
the processor is configured to execute the program, which when executed is particularly configured to implement a method of pre-recalling data as claimed in any of claims 1 to 7.
10. A computer storage medium storing a computer program which, when executed, implements a method of pre-recalling data as claimed in any one of claims 1 to 7.
CN202211535815.0A 2022-12-02 2022-12-02 Data pre-recall method and device, electronic equipment and storage medium Pending CN115857807A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211535815.0A CN115857807A (en) 2022-12-02 2022-12-02 Data pre-recall method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211535815.0A CN115857807A (en) 2022-12-02 2022-12-02 Data pre-recall method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115857807A true CN115857807A (en) 2023-03-28

Family

ID=85669166

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211535815.0A Pending CN115857807A (en) 2022-12-02 2022-12-02 Data pre-recall method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115857807A (en)

Similar Documents

Publication Publication Date Title
EP3223132B1 (en) Method and apparatus for tiered storage processing of data, and storage device
CN111506251B (en) Data processing method and device, SMR storage system and storage medium
CN107506266B (en) Data recovery method and system
WO2023029971A1 (en) Heterogeneous memory-based data migration method
WO2023185767A1 (en) Slow disk drive detection method and apparatus, and electronic device and storage medium
CN107402870B (en) Method and device for processing log segment in metadata server
US9152490B2 (en) Detection of user behavior using time series modeling
US9207877B1 (en) Detection and avoidance of stalled filesystems to prevent stalling of virtual tape drives during tape mounts
CN102981944A (en) Log storage method based on file system
CN111737265B (en) Block data access method, block data storage method and device
CN111026333A (en) Access request processing method, processing device, electronic equipment and storage medium
JP2006092070A (en) Disk array device, its control method and control program
CN115857807A (en) Data pre-recall method and device, electronic equipment and storage medium
CN111625456A (en) Stuck positioning method and device
CN103176753A (en) Storage device and data management method of storage device
CN112882659B (en) Information obtaining method and device, electronic equipment and storage medium
CN104102554B (en) Data backup method and data backup device
CN110837428A (en) Storage device management method and device
CN114237489B (en) Method and device for writing logic resources into SMR disk, electronic equipment and storage medium
CN112181277B (en) Data storage space management method and device, storage medium and electronic equipment
CN109976663B (en) Distributed storage response method and system
CN113268437A (en) Method and equipment for actively triggering memory sorting
CN110019233B (en) Data storage method and system
CN111625506A (en) Distributed data deleting method, device and equipment based on deleting queue
CN106959888B (en) Task processing method and device in cloud storage system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination