CN116996530A - Processing method and system for enhancing reliability of cloud hard disk based on cloud platform

Info

Publication number
CN116996530A
Authority
CN
China
Prior art keywords
hard disk
storage
cloud
cloud hard
utilization rate
Legal status
Pending
Application number
CN202310948674.3A
Other languages
Chinese (zh)
Inventor
路小敏
Current Assignee
Jinan Inspur Data Technology Co Ltd
Original Assignee
Jinan Inspur Data Technology Co Ltd
Application filed by Jinan Inspur Data Technology Co Ltd
Priority to CN202310948674.3A
Publication of CN116996530A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1097 Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/16 Threshold monitoring
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H04L67/1095 Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application provides a processing method and a processing system for enhancing the reliability of a cloud hard disk based on a cloud platform. The system comprises a tag setting module, a monitoring alarm module, a judgment execution module and a calculation storage module. The tag setting module is used for setting the priority of the cloud host, setting a storage utilization rate threshold value and a buffer threshold value, and setting a cloud hard disk performance reduction percentage threshold value; the monitoring alarm module is used for monitoring performance data of the cloud host, the cloud hard disk and the storage resources; the judgment execution module is used for processing the cloud hard disk on the storage according to the monitored storage utilization rate; the calculation storage module is used for managing and scheduling the whole life cycle of the cloud host and the cloud hard disk. The application addresses three problems of existing schemes: two sets of storage are needed to protect data, the standby storage is wasted during normal use, and nothing can be done before a storage failure occurs. It thereby saves cost, reduces the impact of abnormalities and enhances usability.

Description

Processing method and system for enhancing reliability of cloud hard disk based on cloud platform
Technical Field
The application relates to the technical field of cloud platforms, in particular to a processing method and a processing system for enhancing reliability of a cloud hard disk based on a cloud platform.
Background
Cloud platforms provide basic services such as computing, networking and storage by pooling resources throughout a data center. Cloud host technology enables a plurality of operating systems to run on the same physical server, provides reliable isolation, and greatly improves the reusability of physical resources. With the widespread application of cloud computing and the continuous development of storage technology, more and more types of storage back ends, with differing performance, are used in public and private clouds. When the read-write performance required by an upper-layer service does not match the storage configuration, a cloud host needs to adjust the performance of its storage service, so data migration between different storage back ends has become a necessary function.
However, current storage usage by cloud hosts has several problems: 1. Different storage back ends differ in performance, and the utilization level at which cloud hard disks are affected as the storage approaches its upper limit differs from one back end to another; because the usage of a cloud host's cloud hard disks cannot be identified in advance, a cloud hard disk may become unusable once the storage is full, causing loss to the user. 2. Cloud hard disks are generally migrated manually and lack high availability. 3. The storage performance allocated to system disks and data disks is not clearly divided, so storage cannot be allocated more efficiently.
At present, cloud platforms cope with storage faults either by disaster recovery technology (dual-active volume storage, dual-center dual-active, and dual-center primary-standby) or by timed snapshots or backups of cloud hard disks. In dual-active volume storage, a single cloud platform resource pool connects to two storage devices configured as one dual-active storage cluster; when the primary storage goes down unexpectedly, the cluster continues to provide storage IO through internal primary-standby failover, so that the cloud host service using the cloud hard disk is uninterrupted, the switch is imperceptible, and no data is lost. Dual-center dual-active uses storage dual-active technology: a dual-active cluster is deployed across the two sets of storage in the two centers so that data is active in both centers; when either center fails, the underlying data fails over automatically through the dual-active storage cluster, guaranteeing zero data loss and uninterrupted IO. In dual-center primary-standby, the primary center carries all services and the standby center carries backups of the core services; services run in the primary center, and when the primary center fails, the services under disaster recovery protection are restored in the standby center to continue serving externally. A cloud hard disk snapshot records the state of the cloud hard disk at a certain point in time and supports restoring the cloud hard disk from that snapshot.
Dual-active volume storage, dual-center dual-active and dual-center primary-standby all require two sets of storage and come into play only when a storage failure such as a power outage occurs. A timed snapshot or backup of a cloud hard disk merely backs up and preserves the cloud hard disk data so that it can be restored later. Relying on storage disaster recovery technology, or on cloud hard disk snapshots and backups, to guarantee reliability therefore has the following problems: 1. Every disaster recovery scheme needs two sets of storage, one of which does no work while there is no fault, which increases the customer's cost. 2. Disaster recovery is triggered only when a fault occurs; data cannot be reliably protected when performance degrades but the storage has not yet failed or gone down. 3. Snapshots and backups only preserve copies of the data on the cloud hard disk; they cannot guarantee zero data loss or uninterrupted IO when the storage fails.
Disclosure of Invention
The application aims to solve the problem that existing cloud platforms lack a way to safeguard the use of a cloud host's cloud hard disks while the storage is still normal but tending toward abnormal. It provides a processing method and a processing system for enhancing the reliability of a cloud hard disk based on a cloud platform, which address the problems that two sets of storage are needed to protect data, that the standby storage is wasted during normal use, and that nothing can be done before a storage failure. Because many current storage back ends report read-write errors or stop reading and writing once their utilization rate reaches a certain value, the scheme is proposed to avoid the resulting risks to storage and cloud hard disk use.
Based on the above object, in a first aspect, the present application provides a processing system for enhancing reliability of a cloud hard disk based on a cloud platform, where the system includes a tag setting module, a monitoring alarm module, a judgment execution module, and a calculation storage module;
the tag setting module is used for setting the priority of the cloud host, setting a storage utilization rate threshold value and a buffer threshold value, and setting a cloud hard disk performance reduction percentage threshold value;
the monitoring alarm module is used for monitoring performance data of the cloud host, the cloud hard disk and the storage resources;
the judgment execution module is used for processing the cloud hard disk on the storage according to the monitored storage utilization rate;
the calculation storage module is used for managing and scheduling the whole life cycle of the cloud host and the cloud hard disk.
As a further aspect of the present application, the judgment execution module processes the cloud hard disk on the storage according to the monitored storage utilization rate as follows:
when the storage utilization rate is greater than the set utilization rate threshold, the usage of the cloud hard disks on the storage is checked each time the storage utilization rate rises by a further x%;
when the cloud host to which a cloud hard disk belongs has low priority and has not been used for a long time, the cloud hard disk is migrated online to the storage with the lowest storage utilization rate;
when the storage utilization rate has not reached the threshold, it is checked whether the storage utilization rate has reached the buffer threshold, and when it exceeds the buffer threshold, the performance of the cloud hard disk is monitored;
when the read-write performance reduction of the cloud hard disk is greater than the read-write performance reduction percentage threshold, the usage of the cloud hard disks on the storage is checked to judge whether online migration is needed; and when the storage utilization rate is below the buffer threshold, monitoring continues.
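For illustration only, the branching described above can be sketched as a small decision function. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `Thresholds`, `Action` and `decide` are hypothetical, the buffer threshold is assumed to lie below the utilization rate threshold (as the text implies by checking the buffer only when the threshold has not been reached), and the sketch only returns which step comes next without performing any migration.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    CONTINUE_MONITORING = "continue_monitoring"
    MONITOR_DISK_PERFORMANCE = "monitor_disk_performance"
    CHECK_DISKS_FOR_MIGRATION = "check_disks_for_migration"


@dataclass(frozen=True)
class Thresholds:
    usage_pct: float      # storage utilization rate threshold
    buffer_pct: float     # buffer threshold (assumed lower than usage_pct)
    perf_drop_pct: float  # cloud hard disk performance reduction percentage threshold


def decide(storage_usage_pct: float, worst_perf_drop_pct: float, t: Thresholds) -> Action:
    """Return the next step for one storage, following the branches in the text."""
    if storage_usage_pct > t.usage_pct:
        # Above the threshold: inspect the cloud hard disks on this storage.
        return Action.CHECK_DISKS_FOR_MIGRATION
    if storage_usage_pct > t.buffer_pct:
        # In the buffer zone: watch disk performance, escalate on a large drop.
        if worst_perf_drop_pct > t.perf_drop_pct:
            return Action.CHECK_DISKS_FOR_MIGRATION
        return Action.MONITOR_DISK_PERFORMANCE
    # Below the buffer threshold: nothing to do but keep monitoring.
    return Action.CONTINUE_MONITORING
```

With a threshold of 80%, a buffer of 70% and a performance reduction threshold of 30%, a storage at 75% whose worst disk has lost 40% of its read-write performance is escalated to the migration check, while the same storage with a 10% loss stays under performance monitoring.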
When the judgment execution module processes the cloud hard disk on the storage according to the monitored storage utilization rate and detects a low-priority cloud host that has not been used for a long time, it judges whether each cloud hard disk on the storage is a data disk or a system disk, processes data disks preferentially, and migrates the cloud hard disk with the highest performance reduction percentage to the storage with the lowest storage utilization rate.
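The selection rule in the preceding paragraph (data disks first, then the highest performance reduction, with the least-utilized storage as target) can be sketched as follows. This is a hedged illustration: `Disk`, `pick_disk` and `pick_target` are hypothetical names, and excluding the source storage from the candidate targets is an assumption not stated in the text.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Disk:
    name: str
    is_data_disk: bool    # False means system disk
    perf_drop_pct: float  # measured read-write performance reduction


def pick_disk(disks: List[Disk]) -> Optional[Disk]:
    """Prefer data disks; among the preferred group take the largest performance drop."""
    if not disks:
        return None
    data_disks = [d for d in disks if d.is_data_disk]
    pool = data_disks or disks  # fall back to system disks only if no data disk exists
    return max(pool, key=lambda d: d.perf_drop_pct)


def pick_target(usage_by_storage: Dict[str, float], source: str) -> Optional[str]:
    """Return the storage with the lowest utilization rate, excluding the source (assumption)."""
    others = {name: pct for name, pct in usage_by_storage.items() if name != source}
    if not others:
        return None
    return min(others, key=lambda name: others[name])
```

Under this sketch a system disk is chosen only when the storage holds no data disk at all, which is one possible reading of "processes data disks preferentially".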
As a further scheme of the application, when the judgment execution module processes the cloud hard disk on the storage according to the monitored storage utilization rate and the utilization rates of all storage exceed the threshold, cloud hosts are fused in the following order: low-priority cloud hosts that have not been used for a long time, then data disk cloud hosts, then system disk cloud hosts.
As a further scheme of the application, when the tag setting module sets the storage utilization rate threshold, a different threshold can be custom-set for each storage, and the threshold is automatically adjusted according to the performance monitoring data of the storage; the settings also include a cloud hard disk performance reduction percentage threshold and a buffer threshold.
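The fusing order can be illustrated with a short ordering sketch. The text does not define what fusing a cloud host does operationally, so the code below only computes the order; `Host`, `fuse_tier` and `fuse_order` are hypothetical names, and treating a "data disk cloud host" as a host with at least one data disk attached is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Host:
    name: str
    low_priority: bool
    long_unused: bool
    has_data_disk: bool


def fuse_tier(h: Host) -> int:
    """0: low-priority and long-unused; 1: data disk host; 2: system disk host."""
    if h.low_priority and h.long_unused:
        return 0
    if h.has_data_disk:
        return 1
    return 2


def fuse_order(hosts: List[Host],
               usage_pct: Dict[str, float],
               threshold_pct: Dict[str, float]) -> List[Host]:
    """Return hosts in fusing order, or an empty list unless every storage is over its threshold."""
    all_over = bool(usage_pct) and all(
        usage_pct[s] > threshold_pct[s] for s in usage_pct
    )
    if not all_over:
        return []
    # sorted() is stable, so hosts within the same tier keep their input order.
    return sorted(hosts, key=fuse_tier)
```

Fusing is gated on every storage exceeding its own threshold because, while any storage still has headroom, migration to that storage remains available.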
As a further scheme of the application, the processing system for enhancing the reliability of the cloud hard disk based on the cloud platform further comprises a notification module, wherein the notification module is used for sending notification and alarm information to related parties.
As a further scheme of the present application, the tag setting module is further configured to set the operation mode of the cloud host, the cache policy of the cloud hard disk, and the redundancy configuration of the storage.
As a further scheme of the application, the monitoring alarm module is also used for monitoring, in real time, the read-write performance of resources such as the cloud host, the cloud hard disk and the storage, the storage usage, and the performance reduction of the cloud hard disk.
In a second aspect, the present application provides a processing method for enhancing reliability of a cloud hard disk based on a cloud platform, the method comprising the following steps:
a) Monitoring the storage usage and the read-write usage of the cloud hard disk in real time;
b) Setting a storage threshold value and a buffer value, and triggering processing measures before storage usage reaches the critical point;
c) Performing multidimensional comprehensive calculation according to the read-write performance and the usage of the cloud hard disk, so as to guarantee the normal use of the cloud hard disk;
d) Applying a protection mechanism and a fusing mechanism during exception handling, so that the handling affects the availability of the cloud hard disk as little as possible.
As a further scheme of the present application, the processing method for enhancing the reliability of the cloud hard disk based on the cloud platform further comprises the following steps:
1) Setting the priority of a cloud host to determine the processing priority of a cloud hard disk;
2) Setting a storage utilization rate threshold value and a buffer threshold value, and judging whether the storage utilization condition reaches a processing condition or not;
3) Setting a performance reduction percentage threshold of the cloud hard disk, and judging whether the performance condition of the cloud hard disk needs to be processed.
As a further scheme of the application, the real-time monitoring comprises monitoring performance data of resources such as a cloud host, a cloud hard disk and storage, so as to acquire real-time storage service conditions and read-write performance conditions of the cloud hard disk.
As a further scheme of the application, the multidimensional comprehensive calculation comprises the step of considering the read-write performance of the cloud hard disk, the use condition and the priority of the cloud host so as to determine the processing mode of the cloud hard disk.
As a further scheme of the application, the processing measures comprise online migration of the cloud hard disk to the storage with the lowest storage utilization rate so as to ensure the balanced use of the storage.
As a further scheme of the application, the processing measures further comprise performance monitoring of the cloud hard disk, and judging whether the cloud hard disk needs to be migrated online when the reduction in its read-write performance exceeds a set threshold value.
As a further scheme of the application, the processing measure also comprises judging whether the cloud hard disk on the storage is a data disk or a system disk, preferentially processing the data disk, and migrating the cloud hard disk with the highest performance reduction to the storage with the lowest storage utilization rate.
In yet another aspect of the present application, there is also provided a computer device, including a memory and a processor, where the memory stores a computer program, and the computer program is executed by the processor to perform any one of the above processing methods for enhancing reliability of a cloud hard disk based on a cloud platform according to the present application.
In still another aspect of the present application, there is further provided a computer readable storage medium storing computer program instructions that when executed implement any one of the above-mentioned processing methods for enhancing reliability of a cloud hard disk based on a cloud platform according to the present application.
Compared with the prior art, the processing method and the processing system for enhancing the reliability of the cloud hard disk based on the cloud platform have the following beneficial technical effects:
1. The reliability of the cloud hard disk is improved: by monitoring the storage usage and the cloud hard disk read-write usage in real time and processing according to the set thresholds, problems such as abnormal storage and cloud hard disk performance reduction can be found in time and corresponding measures taken, so that the reliability and stability of the cloud hard disk are improved.
2. Optimizing storage resource utilization: by setting the storage threshold and the buffer threshold, the system can process before the storage usage approaches the critical point to avoid overloading the storage resources. Through multidimensional comprehensive calculation, including factors such as cloud hard disk read-write performance, service conditions, cloud host priority and the like, the system can intelligently decide a processing mode of the cloud hard disk, and reasonable utilization and balanced distribution of storage resources are realized.
3. Exception handling protection mechanism: the system provides a protection mechanism and a fusing mechanism for handling abnormal conditions. When the storage utilization rate exceeds a threshold value or the performance of the cloud hard disk is reduced by more than a set threshold value, the system can preferentially migrate low-priority cloud hosts that have not been used for a long time, then data disk cloud hosts, then system disk cloud hosts, so as to ensure the normal use of the cloud hard disk.
4. Automated management scheduling: the system manages and schedules the whole life cycle of the cloud host and the cloud hard disk through the calculation storage module. Automated management and scheduling reduces the burden of manual intervention, improves management efficiency, and ensures that the system can make timely processing decisions according to real-time resource conditions.
5. User experience and service continuity are improved: by enhancing the reliability and stability of the cloud hard disk, the system can effectively reduce the failure rate and the data loss risk of the cloud hard disk and improve the user's experience of the cloud platform. Meanwhile, the processing strategy and the protection mechanism of the system ensure the normal operation of the cloud host and the cloud hard disk, and thus the continuity and stability of the service.
In summary, the application provides a processing method and a processing system for enhancing the reliability of a cloud hard disk based on a cloud platform. It addresses the problem that current protection against storage failure requires a second set of storage resources that sits idle, and it allows user service data to be handled early, while the storage has not yet failed, so that service stoppage and IO interruption are avoided. Normal use of most cloud hard disks is guaranteed by processing a few non-critical cloud hard disks, and processing is carried out along multiple dimensions according to business requirements so as to affect use as little as possible. This saves cost, reduces the impact of abnormalities and enhances usability.
These and other aspects of the application will be more readily apparent from the following description of the embodiments. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application as claimed.
Drawings
In order to more clearly illustrate the embodiments of the application or the technical solutions in the prior art, the drawings that are necessary for the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the application and that other embodiments may be obtained according to these drawings without inventive effort for a person skilled in the art.
In the drawings:
Fig. 1 is a flow chart of a processing method for enhancing reliability of a cloud hard disk based on a cloud platform according to an embodiment of the present application;
Fig. 2 is a schematic hardware structure diagram of an embodiment of a computer device implementing a processing method for enhancing reliability of a cloud hard disk based on a cloud platform according to an embodiment of the present application;
Fig. 3 is a schematic diagram of an embodiment of a computer readable storage medium implementing a processing method for enhancing reliability of a cloud hard disk based on a cloud platform according to an embodiment of the present application.
Detailed Description
The present application will be further described with reference to the accompanying drawings and detailed description, wherein it is to be understood that, on the premise of no conflict, the embodiments described below or technical features may be arbitrarily combined to form new embodiments.
In order to make the objects, technical solutions and advantages of the present application more apparent, the following embodiments of the present application will be described in further detail with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
It should be noted that, in the embodiments of the present application, the expressions "first" and "second" are used to distinguish two non-identical entities with the same name or two non-identical parameters; they are used only for convenience of expression and should not be construed as limiting the embodiments of the present application. Furthermore, the terms "comprise" and "have," and any variations thereof, are intended to cover a non-exclusive inclusion; for example, a process, method, system or article that comprises a list of steps or units may also include other steps or units.
The following description of the embodiments of the present application will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are some, but not all embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
The flow diagrams depicted in the figures are merely illustrative and not necessarily all of the elements and operations/steps are included or performed in the order described. For example, some operations/steps may be further divided, combined, or partially combined, so that the order of actual execution may be changed according to actual situations.
Some embodiments of the present application are described in detail below with reference to the accompanying drawings. The following embodiments and features of the embodiments may be combined with each other without conflict.
Existing schemes need two sets of storage to protect data, waste the standby storage during normal use, and cannot act before a storage failure occurs. The application therefore provides an improved processing method and system for enhancing the reliability of a cloud hard disk based on a cloud platform, which aims to solve these problems. Because many current storage back ends report read-write errors or stop reading and writing once their utilization rate reaches a certain value, the scheme is proposed to avoid the resulting risks to storage and cloud hard disk use.
The embodiment of the application provides a processing system for enhancing the reliability of a cloud hard disk based on a cloud platform.
In this embodiment, the tag setting module: sets the priority of the cloud host, sets a storage utilization rate threshold and a buffer threshold, and sets a cloud hard disk performance reduction percentage threshold.
The monitoring alarm module: monitors performance data of resources such as the cloud host, the cloud hard disk and the storage.
The judgment execution module: processes the cloud hard disk on the storage according to the monitored storage utilization rate. When the storage utilization rate is greater than the set utilization rate threshold, the usage of the cloud hard disks on the storage is checked each time the storage utilization rate rises by a further x%, and when the cloud host to which a cloud hard disk belongs has low priority and has not been used for a long time, the cloud hard disk is migrated online to the storage with the lowest storage utilization rate. When the storage utilization rate has not reached the threshold, it is checked whether the storage utilization rate has reached the buffer threshold; when it exceeds the buffer threshold, the performance of the cloud hard disk is monitored, and when the read-write performance reduction of the cloud hard disk is greater than the reduction percentage threshold, the usage of the cloud hard disks on the storage is checked to judge whether online migration is needed; when the storage utilization rate is below the buffer threshold, monitoring continues. When a cloud host that is not low in priority and has not been used for a long time is detected, it is judged whether each cloud hard disk on the storage is a data disk or a system disk, data disks are processed preferentially, and the cloud hard disk with the highest performance reduction percentage is migrated to the storage with the lowest storage utilization rate. When the utilization rates of all storage exceed the threshold, cloud hosts are fused in the following order: low-priority cloud hosts that have not been used for a long time, then data disk cloud hosts, then system disk cloud hosts.
The calculation storage module: manages and schedules the whole life cycle of the cloud host and the cloud hard disk.
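The judgment execution module's "check every x%" rule can be read as a step trigger: once the storage utilization rate is above the threshold, a new check of the cloud hard disks is due each time the rate crosses another x% step. The sketch below illustrates that reading only; `step_index` and `check_due` are hypothetical names, and the interpretation of x% as a step size measured from the threshold is an assumption.

```python
import math


def step_index(usage_pct: float, threshold_pct: float, step_pct: float) -> int:
    """Index of the x% band above the threshold that usage falls in; -1 if not above it."""
    if step_pct <= 0:
        raise ValueError("step_pct must be positive")
    if usage_pct <= threshold_pct:
        return -1
    return math.floor((usage_pct - threshold_pct) / step_pct)


def check_due(prev_usage_pct: float, curr_usage_pct: float,
              threshold_pct: float, step_pct: float) -> bool:
    """True when usage has moved into a higher band since the previous sample."""
    return (step_index(curr_usage_pct, threshold_pct, step_pct)
            > step_index(prev_usage_pct, threshold_pct, step_pct))
```

With a threshold of 80% and x = 5, a rise from 79% to 81% triggers a check, a rise from 81% to 84% does not, and a rise from 84% to 86% triggers the next one.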
In this embodiment, the processing system for enhancing reliability of a cloud hard disk based on a cloud platform further includes a notification module, where the notification module is configured to send notification and alarm information to a relevant party.
In this embodiment, the tag setting module is further configured to set the operation mode of the cloud host, the cache policy of the cloud hard disk, and the redundancy configuration of the storage.
In this embodiment, the monitoring alarm module is further configured to monitor, in real time, the read-write performance of the cloud host, the cloud hard disk and the storage resources, the storage usage, and the performance reduction of the cloud hard disk.
The processing system for enhancing the reliability of the cloud hard disk based on the cloud platform monitors the storage usage and the cloud hard disk read-write usage in real time. By setting a storage threshold value and a buffer value, it acts before storage usage reaches the critical point, proactively ensuring the normal use of the cloud hard disk through multidimensional comprehensive calculation over the cloud hard disk's read-write performance and usage. A protection mechanism and a fusing mechanism are also provided for abnormal conditions, so that as few cloud hard disks as possible are processed and as many as possible remain available.
Referring to fig. 1, an embodiment of the present application provides a processing method for enhancing reliability of a cloud hard disk based on a cloud platform, the method includes the following steps:
a) Monitoring the storage usage and the read-write usage of the cloud hard disk in real time;
b) Setting a storage threshold value and a buffer value, and triggering processing measures before storage usage reaches the critical point;
c) Performing multidimensional comprehensive calculation according to the read-write performance and the usage of the cloud hard disk, so as to guarantee the normal use of the cloud hard disk;
d) Applying a protection mechanism and a fusing mechanism during exception handling, so that the handling affects the availability of the cloud hard disk as little as possible.
The processing method for enhancing the reliability of the cloud hard disk based on the cloud platform further comprises the following steps:
1) Setting the priority of a cloud host to determine the processing priority of a cloud hard disk;
2) Setting a storage utilization rate threshold value and a buffer threshold value, and judging whether the storage utilization condition reaches a processing condition or not;
3) Setting a cloud hard disk performance degradation percentage threshold, and judging whether the performance condition of the cloud hard disk needs to be processed.
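The three settings above can be pictured as a small configuration record. The following is a minimal sketch, not the applicant's implementation: the class name `ReliabilitySettings`, its field names and the numeric ranges are all assumptions chosen for illustration. It also assumes that the buffer threshold lies below the storage utilization rate threshold, which is how the later steps use it.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class ReliabilitySettings:
    """Hypothetical container for the settings named in steps 1) to 3)."""
    usage_threshold_pct: float      # storage utilization that triggers processing
    buffer_threshold_pct: float     # earlier warning level below the threshold
    perf_drop_threshold_pct: float  # tolerated read-write performance degradation
    # Assumed convention: a lower number means a less critical cloud host.
    host_priority: Dict[str, int] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Assumption: the buffer value lies below the usage threshold.
        if not 0 <= self.buffer_threshold_pct < self.usage_threshold_pct <= 100:
            raise ValueError("expected 0 <= buffer < usage threshold <= 100")
        if not 0 < self.perf_drop_threshold_pct <= 100:
            raise ValueError("performance drop threshold must be in (0, 100]")

    def storage_state(self, usage_pct: float) -> str:
        """Classify a storage as 'normal', 'buffer' or 'over_threshold'."""
        if usage_pct > self.usage_threshold_pct:
            return "over_threshold"
        if usage_pct > self.buffer_threshold_pct:
            return "buffer"
        return "normal"
```

With a threshold of 80% and a buffer of 70%, for example, a storage at 75% falls in the buffer band, where cloud hard disk performance starts to be watched.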
In this embodiment, the real-time monitoring includes monitoring performance data of resources such as a cloud host, a cloud hard disk, and storage, so as to obtain real-time storage usage conditions and read-write performance conditions of the cloud hard disk. The multi-dimensional comprehensive calculation comprises the step of considering the read-write performance, the service condition and the priority of the cloud host computer of the cloud hard disk so as to determine the processing mode of the cloud hard disk.
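The embodiment does not state how the read-write performance degradation of a cloud hard disk is quantified. One plausible reading, sketched here purely as an assumption, compares the current throughput with a baseline recorded earlier for the same cloud hard disk. The function names, the use of IOPS as the metric and the existence of a baseline are hypothetical.

```python
def perf_drop_pct(baseline_iops: float, current_iops: float) -> float:
    """Percentage by which throughput has fallen relative to the baseline."""
    if baseline_iops <= 0:
        raise ValueError("baseline_iops must be positive")
    drop = (baseline_iops - current_iops) * 100.0 / baseline_iops
    # An improvement over the baseline counts as no degradation.
    return max(0.0, drop)


def needs_migration_check(baseline_iops: float, current_iops: float,
                          drop_threshold_pct: float) -> bool:
    """True when the drop is strictly greater than the configured threshold."""
    return perf_drop_pct(baseline_iops, current_iops) > drop_threshold_pct
```

Under this reading, a cloud hard disk that delivered 1000 IOPS at baseline and now delivers 600 has degraded by 40%, which would exceed a 30% threshold.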
In this embodiment, the processing measures include migrating the cloud hard disk online to the storage with the lowest storage utilization rate, so that the storages are used in a balanced way. The processing measures further include monitoring the performance of the cloud hard disk and, when its read-write performance degradation exceeds the set threshold, judging whether the cloud hard disk needs to be migrated online. The processing measures also include judging whether a cloud hard disk on the storage is a data disk or a system disk, processing data disks preferentially, and migrating the cloud hard disk with the greatest performance degradation to the storage with the lowest storage utilization rate.
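The selection rules in this paragraph (data disks before system disks, greatest degradation first, target is the least-used storage) can be sketched as two small functions. This is an illustrative sketch only; the `Disk` and `Storage` records and the function names are hypothetical, and excluding the source storage from the candidate targets is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Disk:
    name: str
    is_system_disk: bool
    perf_drop_pct: float  # read-write performance degradation, in percent


@dataclass
class Storage:
    name: str
    usage_pct: float  # storage utilization rate, in percent


def pick_disk(disks: List[Disk]) -> Optional[Disk]:
    """Prefer data disks; among equals take the largest performance drop."""
    if not disks:
        return None
    # Tuple key: data disks (True) rank above system disks, then by drop.
    return max(disks, key=lambda d: (not d.is_system_disk, d.perf_drop_pct))


def pick_target(storages: List[Storage], source: str) -> Optional[Storage]:
    """Return the storage with the lowest utilization, excluding the source."""
    others = [s for s in storages if s.name != source]
    return min(others, key=lambda s: s.usage_pct) if others else None
```

A system disk is chosen only when no data disk is present, which mirrors the "data disk first" rule described above.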
In summary, the processing method of the processing system for enhancing the reliability of the cloud hard disk based on the cloud platform comprises the following steps:
Step one: the setting module sets a storage utilization rate threshold. Different thresholds can be customized for different storages and, preferably, the threshold can be adjusted automatically according to the performance monitoring data of the storage. A cloud hard disk performance degradation percentage threshold and a buffer threshold can also be set.
Step two: the monitoring module monitors the utilization rates of the multiple storages attached to the cloud platform, the read-write performance of the cloud hard disks, and other usage conditions. When the utilization rate of a storage is greater than the set utilization rate threshold, the usage of the cloud hard disks on that storage is checked at every x% of the storage utilization rate; if the cloud host to which a cloud hard disk belongs has low priority and has not been used for a long time, the cloud hard disk is migrated online to the storage with the lowest storage utilization rate. When the storage utilization rate has not reached the threshold, it is checked against the buffer threshold: if it exceeds the buffer threshold, the performance of the cloud hard disks is monitored, and when the read-write performance degradation of a cloud hard disk is greater than the degradation percentage threshold, the usage of the cloud hard disks on the storage is checked to judge whether online migration is needed; if the storage utilization rate is below the buffer threshold, monitoring continues.
Step three: when no low-priority cloud host that has not been used for a long time is detected, it is judged whether each cloud hard disk on the storage is a data disk or a system disk, data disks are processed preferentially, and the cloud hard disk with the highest performance degradation percentage is migrated to the storage with the lowest storage utilization rate.
Step four: when the utilization rates of all storages exceed the threshold, cloud hosts are fused in the following order: low-priority cloud hosts that have not been used for a long time, then data disk cloud hosts, then system disk cloud hosts.
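The branching in steps two to four can be sketched as a single decision function. This is a hedged illustration of one way to read the steps, not the applicant's code: the `Action` names, the `decide` signature and the `FUSE_ORDER` labels are hypothetical, and the sketch leaves out the "every x%" re-check interval and the host priority lookup.

```python
from enum import Enum
from typing import List


class Action(Enum):
    CONTINUE_MONITORING = "continue_monitoring"
    MONITOR_DISK_PERFORMANCE = "monitor_disk_performance"
    CHECK_DISKS_FOR_MIGRATION = "check_disks_for_migration"
    FUSE = "fuse"


# Step four: order in which cloud hosts are fused (labels are illustrative).
FUSE_ORDER = ["low_priority_idle_host", "data_disk_host", "system_disk_host"]


def decide(usage_pct: float, all_usages: List[float],
           threshold: float, buffer: float,
           worst_drop_pct: float, drop_threshold: float) -> Action:
    """Map one storage's monitored state to the next action."""
    # Step four: every storage is over the threshold, so no migration
    # target remains and fusing is the only option left.
    if all_usages and all(u > threshold for u in all_usages):
        return Action.FUSE
    # Step two, first branch: this storage is over the threshold.
    if usage_pct > threshold:
        return Action.CHECK_DISKS_FOR_MIGRATION
    # Step two, second branch: inside the buffer band.
    if usage_pct > buffer:
        if worst_drop_pct > drop_threshold:
            return Action.CHECK_DISKS_FOR_MIGRATION
        return Action.MONITOR_DISK_PERFORMANCE
    # Below the buffer threshold: nothing to do yet.
    return Action.CONTINUE_MONITORING
```

Checking the "all storages over threshold" case first reflects that migration needs a target with spare capacity; when none exists, the flow falls through to fusing in the order given by step four.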
The application provides a processing method for enhancing the reliability of a cloud hard disk based on a cloud platform. It solves the current problem that guarding against storage failure requires two sets of storage resources, one of which sits idle and is wasted, and it allows user service data to be handled early, while the storage is still fault-free, so that service stoppage and IO interruption are avoided. Normal use of most cloud hard disks is guaranteed by processing a few non-critical cloud hard disks, and the processing follows business requirements along multiple dimensions so that use is affected as little as possible, which greatly saves cost, reduces the impact of abnormalities, and enhances availability.
It is noted that the above-described figures are only schematic illustrations of processes involved in a method according to an exemplary embodiment of the application, and are not intended to be limiting. It will be readily appreciated that the processes shown in the above figures do not indicate or limit the temporal order of these processes. In addition, it is also readily understood that these processes may be performed synchronously or asynchronously, for example, among a plurality of modules.
It should be understood that although the steps are described in a certain order, they are not necessarily performed sequentially in that order. Unless explicitly stated herein, the steps are not strictly limited to the order of execution and may be executed in other orders. Moreover, some steps of the present embodiment may include a plurality of sub-steps or stages, which are not necessarily performed at the same time but may be performed at different times, and which are not necessarily performed sequentially but may be performed in turn or alternately with other steps or with at least part of the sub-steps or stages of other steps.
In a third aspect of the embodiment of the present application, there is also provided a computer device 1000, including a memory 1001 and a processor 1002, the memory storing a computer program which, when executed by the processor, implements the method of any of the embodiments described above.
Fig. 2 is a schematic hardware structure diagram of an embodiment of a computer device for executing the processing method for enhancing reliability of a cloud hard disk based on a cloud platform according to the present application. Taking the computer device 1000 shown in fig. 2 as an example, the computer device includes a processor 1002 and a memory 1001, and may further include an input device 430 and an output device 440. The processor 1002, memory 1001, input device 430, and output device 440 may be connected by a bus or in another manner; fig. 2 takes a bus connection as the example. The input device 430 may receive input numeric or character information and generate signal inputs related to the cloud platform-based processing for enhancing the reliability of the cloud hard disk. The output device 440 may include a display device such as a display screen.
The memory 1001 is used as a non-volatile computer readable storage medium, and may be used to store a non-volatile software program, a non-volatile computer executable program, and a module, such as a program instruction/module corresponding to a processing method for enhancing reliability of a cloud hard disk based on a cloud platform in an embodiment of the present application. The memory 1001 may include a storage program area that may store an operating system, at least one application program required for functions, and a storage data area; the storage data area may store data created by use of a processing method of enhancing reliability of a cloud hard disk based on a cloud platform, and the like. In addition, memory 1001 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device. In some embodiments, memory 1001 optionally includes memory remotely located with respect to processor 1002, which may be connected to the local module via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The processor 1002 may, in some embodiments, be a central processing unit (CPU), controller, microcontroller, microprocessor, or other data processing chip. The processor 1002 is typically used to control the overall operation of the computer device 1000. In this embodiment, the processor 1002 executes program code or processes data stored in the memory 1001. By running the non-volatile software programs, instructions and modules stored in the memory 1001, the processor 1002 of the computer device 1000 executes the various functional applications and data processing of the server, that is, implements the processing method for enhancing the reliability of the cloud hard disk based on the cloud platform in the method embodiment, comprising steps one to four as set out above.
In a fourth aspect of the embodiment of the present application, a computer-readable storage medium is provided. Fig. 3 is a schematic diagram of a computer-readable storage medium for the processing method for enhancing reliability of a cloud hard disk based on a cloud platform according to an embodiment of the present application. As shown in fig. 3, the computer-readable storage medium 2000 stores computer program instructions 2001, which can be executed by a processor. When executed, the computer program instructions 2001 implement the method of any one of the above embodiments, that is, the processing method for enhancing reliability of a cloud hard disk based on a cloud platform of the method embodiment, comprising steps one to four as set out above.
It should be appreciated that all of the embodiments, features and advantages set forth above for the cloud platform based processing method of enhancing cloud hard disk reliability according to the present application apply equally, without conflict, to the cloud platform based processing system and storage medium of enhancing cloud hard disk reliability according to the present application.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.
Finally, it should be noted that the computer-readable storage media (e.g., memory) herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of example, and not limitation, nonvolatile memory can include read-only memory (ROM), programmable ROM (PROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM), which can act as external cache memory. By way of example, and not limitation, RAM is available in a variety of forms such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), and Direct Rambus RAM (DRRAM). The storage devices of the disclosed aspects are intended to comprise, without being limited to, these and other suitable types of memory.
The various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein may be implemented or performed with the following components designed to perform the functions herein: a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP and/or any other such configuration.
In summary, the application provides a processing method and a processing system for enhancing the reliability of a cloud hard disk based on a cloud platform. They solve the current problem that guarding against storage failure requires two sets of storage resources, one of which sits idle and is wasted, and they allow user service data to be handled early, while the storage is still fault-free, so that service stoppage and IO interruption are avoided. Normal use of most cloud hard disks is guaranteed by processing a few non-critical cloud hard disks, and the processing follows business requirements along multiple dimensions so that use is affected as little as possible, which greatly saves cost, reduces the impact of abnormalities, and enhances availability.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that as used herein, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items. The serial numbers of the foregoing embodiments of the present application are for description only and do not represent the advantages or disadvantages of the embodiments.
Those of ordinary skill in the art will appreciate that: the above discussion of any embodiment is merely exemplary and is not intended to imply that the scope of the disclosure of embodiments of the application, including the claims, is limited to such examples; combinations of features of the above embodiments or in different embodiments are also possible within the idea of an embodiment of the application, and many other variations of the different aspects of the embodiments of the application as described above exist, which are not provided in detail for the sake of brevity. Therefore, any omission, modification, equivalent replacement, improvement, etc. of the embodiments should be included in the protection scope of the embodiments of the present application.

Claims (10)

1. A processing system for enhancing the reliability of a cloud hard disk based on a cloud platform, characterized by comprising a label setting module, a monitoring alarm module, a judging and executing module, and a computing storage module;
the label setting module is used for setting the priority of the cloud host, setting a storage utilization rate threshold and a buffer threshold, and setting a cloud hard disk performance degradation percentage threshold;
the monitoring alarm module is used for monitoring performance data of the cloud host, the cloud hard disk and the storage resources;
the judging and executing module is used for processing the cloud hard disk on the storage according to the monitored storage utilization rate;
the computing storage module is used for managing and scheduling the whole life cycle of the cloud host and the cloud hard disk.
2. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 1, wherein the judging and executing module, when processing the cloud hard disk on the storage according to the monitored storage utilization rate, performs the following:
when the storage utilization rate is greater than the set utilization rate threshold, checking the usage of the cloud hard disks on the storage at every x% of the storage utilization rate;
when the cloud host to which a cloud hard disk belongs has low priority and has not been used for a long time, migrating the cloud hard disk online to the storage with the lowest storage utilization rate;
when the storage utilization rate has not reached the threshold, checking whether it has reached the buffer threshold, and when it exceeds the buffer threshold, monitoring the performance of the cloud hard disk;
when the read-write performance degradation of the cloud hard disk is greater than the read-write performance degradation percentage threshold, checking the usage of the cloud hard disks on the storage to judge whether online migration is needed; and when the storage utilization rate is below the buffer threshold, continuing monitoring.
3. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 1, wherein the judging and executing module, when processing the cloud hard disk on the storage according to the monitored storage utilization rate and detecting that there is no low-priority cloud host that has not been used for a long time, judges whether each cloud hard disk on the storage is a data disk or a system disk, processes data disks preferentially, and migrates the cloud hard disk with the highest performance degradation percentage to the storage with the lowest storage utilization rate.
4. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 3, wherein the judging and executing module, when processing the cloud hard disk on the storage according to the monitored storage utilization rate, fuses cloud hosts when the utilization rates of all storages exceed the threshold, fusing in sequence the low-priority cloud hosts that have not been used for a long time, the data disk cloud hosts, and the system disk cloud hosts.
5. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 4, wherein, when the label setting module sets the storage utilization rate threshold, different storage utilization rate thresholds are customized for different storages, and the storage utilization rate threshold is automatically adjusted according to the performance monitoring data of the storage; setting the storage utilization rate threshold further includes setting a cloud hard disk performance degradation percentage threshold and a buffer threshold.
6. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 2, further comprising a notification module for sending notification and alarm information to interested parties.
7. The processing system for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 5, wherein the label setting module is further configured to set the operation mode of the cloud host, the cache policy of the cloud hard disk, and the redundancy configuration of the storage.
8. A processing method for enhancing the reliability of a cloud hard disk based on a cloud platform, characterized by comprising the following steps:
a) Monitoring storage usage and read-write usage of the cloud hard disk in real time;
b) Setting a storage threshold value and a buffer value, and triggering a processing measure before storage usage reaches a critical point;
c) Performing a multi-dimensional comprehensive calculation according to the read-write performance and the usage of the cloud hard disk, to guarantee the normal use of the cloud hard disk;
d) Applying a protection mechanism and a fusing mechanism during exception handling, so that the handling affects the availability of cloud hard disks as little as possible.
9. The processing method for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 8, further comprising the following steps:
1) Setting the priority of a cloud host to determine the processing priority of a cloud hard disk;
2) Setting a storage utilization rate threshold and a buffer threshold, and judging whether the storage usage reaches a processing condition;
3) Setting a cloud hard disk performance degradation percentage threshold, and judging whether the performance condition of the cloud hard disk needs to be processed.
10. The processing method for enhancing the reliability of a cloud hard disk based on a cloud platform according to claim 9, wherein the real-time monitoring comprises monitoring performance data of resources such as the cloud host, the cloud hard disk and the storage, to obtain the real-time storage usage and the read-write performance of the cloud hard disk; and the multi-dimensional comprehensive calculation comprises considering the read-write performance and usage of the cloud hard disk and the priority of the cloud host, so as to determine how the cloud hard disk is processed.
CN202310948674.3A 2023-07-31 2023-07-31 Processing method and system for enhancing reliability of cloud hard disk based on cloud platform Pending CN116996530A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310948674.3A CN116996530A (en) 2023-07-31 2023-07-31 Processing method and system for enhancing reliability of cloud hard disk based on cloud platform

Publications (1)

Publication Number Publication Date
CN116996530A true CN116996530A (en) 2023-11-03

Family

ID=88526010

Country Status (1)

Country Link
CN (1) CN116996530A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination