CN116820822A - Read-write link control method, device and medium - Google Patents


Info

Publication number
CN116820822A
Authority
CN
China
Prior art keywords
read
link
write
write link
unstable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310792979.XA
Other languages
Chinese (zh)
Inventor
王皓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202310792979.XA
Publication of CN116820822A
Legal status: Pending


Classifications

    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The application relates to the field of communication and discloses a read-write link control method, device and medium. The method comprises: acquiring performance information of each read-write link and judging, according to the performance information, whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, sending an alarm to a manager, so that the manager can promptly find and handle the faulty read-write link in the storage platform; if the read-write link is not in the abnormal state, judging whether the read-write speed of the read-write link meets a preset condition; and if the preset condition is not met, acquiring the corresponding link information, so that unstable links in the storage platform are found and can be handled. According to the application, read-write links in an abnormal state are identified from their performance information, and the read-write speed of the links that are not in the abnormal state is then checked, so that abnormal and unstable read-write links in the multipath storage platform are identified, managers can handle them in time, and the user experience is improved.

Description

Read-write link control method, device and medium
Technical Field
The present application relates to the field of communications, and in particular, to a method, an apparatus, and a medium for controlling a read-write link.
Background
With the development of cloud computing technology, more and more enterprises choose to store data in a cloud computing platform, and the data volume in the cloud computing platform also increases rapidly. How to guarantee the use experience of cloud platform users under high data volume becomes a problem to be solved by cloud computing service providers.
To achieve this goal, cloud computing service providers typically choose Linux multipath storage technology to ensure smooth operation and high availability of the storage devices of the servers on which cloud computing platforms run. Linux multipath storage technology refers to multipath software used under the Linux operating system to provide multipath access to storage devices. It lets the computer system access disks over more than one path, which gives higher availability and stronger fault tolerance. When a storage device or path fails, the software can continue to provide storage services to the system and so prevent system paralysis. However, if an input/output (IO) link in the multipath storage becomes abnormal and causes IO fluctuation or interruption, IO reads and writes may be affected, which in turn affects the normal operation of the cloud computing platform.
Therefore, how to provide an IO read-write link control method to prevent the normal operation of the cloud computing platform from being affected by the IO link fault is a problem that needs to be solved by those skilled in the art.
Disclosure of Invention
The application aims to provide a read-write link control method, a read-write link control device and a read-write link control medium, so as to prevent the normal operation of a cloud computing platform from being influenced by IO link faults and improve the stability of a system and the use experience of a user.
In order to solve the above technical problems, the present application provides a read-write link control method, which is applied to a multipath storage platform, and includes:
acquiring performance information of each read-write link;
judging whether each read-write link is in an abnormal state according to the performance information;
if the read-write link is in the abnormal state, an alarm is sent to a manager;
if the read-write links are not in the abnormal state, judging whether the read-write speed of each read-write link meets a preset condition or not;
if the preset condition is not met, determining that the read-write link is an unstable link, and acquiring link information of the unstable link so as to process the read-write link.
In some embodiments, the obtaining the performance information of each read-write link includes:
acquiring log information of each read-write link in a working state and state information of all the read-write links;
and determining performance information of each link according to the state information and the log information.
In some embodiments, the determining whether the read-write speed of each read-write link meets the preset condition includes:
judging whether the read-write speed of the read-write link is greater than a read-write speed threshold;
if the read-write speed is greater than the read-write speed threshold, judging whether the read-write speed fluctuation of the read-write link is greater than a fluctuation threshold;
and if the read-write speed fluctuation is greater than the fluctuation threshold, determining that the read-write link does not meet the preset condition.
In some embodiments, the sending an alert to the manager comprises:
writing the abnormal state information into a log file, and sending an alarm to the manager by email.
In some embodiments, after obtaining the link information of the unstable link, the method further includes:
and reducing the signal transmission level of the unstable link to reduce the utilization rate of the unstable link.
In some embodiments, after the step of reducing the signal transmission level of the unstable link, the method further includes:
judging whether the fluctuation of the read-write speed of the unstable link in the detection period is larger than a fluctuation threshold value or not;
and if the fluctuation is not greater than the fluctuation threshold, restoring the signal transmission level of the unstable link.
In some embodiments, after the step of sending an alert to the manager, the method further comprises:
acquiring fault information of the read-write link in an abnormal state, and determining a fault reason according to the fault information;
and repairing the read-write link or updating the driver according to the fault reason.
In order to solve the technical problem, the present application further provides a read-write link control device, which is characterized by comprising:
the acquisition module is used for acquiring the performance information of each read-write link;
the first judging module is used for judging whether each read-write link is in an abnormal state or not according to the performance information;
the alarm module is used for sending an alarm to a manager if the read-write link is in the abnormal state;
the second judging module is used for judging whether the read-write speed of each read-write link meets a preset condition or not if the read-write link is not in the abnormal state;
and the determining module is used for determining that the read-write link is an unstable link and acquiring link information of the unstable link if the preset condition is not met so as to process the read-write link.
In order to solve the technical problem, the application also provides a read-write link control device, which is characterized by comprising a memory for storing a computer program;
and the processor is used for realizing the steps of the read-write link control method when executing the computer program.
In order to solve the above technical problem, the present application further provides a computer readable storage medium, wherein the computer readable storage medium stores a computer program, and the computer program when executed by a processor implements the steps of the read-write link control method.
The application provides a read-write link control method, which is applied to a multipath storage platform and comprises the following steps: acquiring performance information of each read-write link and judging, according to the performance information, whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, sending an alarm to a manager, so that the manager can promptly find and handle the faulty read-write link in the storage platform; if the read-write link is not in the abnormal state, judging whether the read-write speed of the read-write link meets a preset condition; and if the preset condition is not met, acquiring the corresponding link information, so that unstable links in the storage platform are found and can be handled. Therefore, according to the technical scheme provided by the application, read-write links in an abnormal state are identified from their performance information, and the read-write speed of the links that are not in the abnormal state is then checked, so that abnormal and unstable read-write links in the multipath storage platform are identified and managers can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
In some embodiments, determining whether the read-write speed of each read-write link meets the preset condition includes: judging whether the read-write speed of the read-write link is greater than a read-write speed threshold; if the read-write speed is greater than the read-write speed threshold, judging whether the read-write speed fluctuation of the read-write link is greater than a fluctuation threshold; and if the fluctuation is greater than the fluctuation threshold, determining that the read-write link does not meet the preset condition. By checking both the read-write speed and the degree of fluctuation, it is judged whether the currently detected read-write link can meet the demands placed on it; if not, the link is handled in time, which prevents the data security and stability problems caused by link fluctuation.
In some embodiments, sending an alarm to the manager includes: writing the abnormal state information into a log file, and sending an alarm to the manager by email. The manager can therefore see the fault condition of the read-write link directly by checking the log file and can maintain the system quickly.
In some embodiments, after obtaining the link information of the unstable link, the method further includes: reducing the signal transmission level of the unstable link to reduce the usage of the unstable link. This prevents the unstable link from bearing excessive service pressure and causing a system failure, which would affect data security and the user experience.
In some embodiments, after the step of reducing the signal transmission level of the unstable link, the method further includes: judging whether the fluctuation of the read-write speed of the unstable link in the detection period is greater than the fluctuation threshold; and if the fluctuation is not greater than the fluctuation threshold, restoring the signal transmission level of the unstable link. An unstable link that shows no abnormal fluctuation in the detection period is promptly restored to a normal link, which improves the data processing capacity of the storage system.
In some embodiments, after the step of sending the alarm to the manager, the method further includes: acquiring fault information of the read-write link in the abnormal state, and determining a fault reason according to the fault information; and repairing the read-write link or updating the driver according to the fault reason. Quickly recovering from simpler anomalies further improves the performance of the storage system and the cloud computing system, and thus the user experience.
In addition, the application also provides a read-write link control device and a medium, which are applied to the multipath storage platform and correspond to the above method: performance information of each read-write link is acquired and used to judge whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, an alarm is sent to a manager, so that the manager can promptly find and handle the faulty read-write link in the storage platform; if the read-write link is not in the abnormal state, it is judged whether the read-write speed of the read-write link meets a preset condition; and if the preset condition is not met, the corresponding link information is acquired, so that unstable links in the storage platform are found and can be handled. Therefore, according to the technical scheme provided by the application, read-write links in an abnormal state are identified from their performance information, and the read-write speed of the links that are not in the abnormal state is then checked, so that abnormal and unstable read-write links in the multipath storage platform are identified and managers can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
Drawings
For a clearer description of embodiments of the present application, the drawings that are required to be used in the embodiments will be briefly described, it being apparent that the drawings in the following description are only some embodiments of the present application, and other drawings may be obtained according to the drawings without inventive effort for those skilled in the art.
FIG. 1 is a schematic diagram of a multi-path storage platform;
FIG. 2 is a flowchart of a read-write link control method according to an embodiment of the present application;
FIG. 3 is a block diagram of a read-write link control device according to an embodiment of the present application;
FIG. 4 is a block diagram of another read-write link control device according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. Based on the embodiments of the present application, all other embodiments obtained by a person of ordinary skill in the art without making any inventive effort are within the scope of the present application.
The core of the application is to provide a read-write link control method, a device and a medium, so as to prevent the normal operation of a cloud computing platform from being influenced by IO link faults and improve the stability of a system and the use experience of a user.
In order to improve the performance of the cloud computing platform and ensure its smooth operation and high availability, cloud computing service providers mostly choose to store the data of the cloud computing platform through a multi-link storage platform. FIG. 1 is a schematic diagram of a multipath storage platform. As shown in FIG. 1, a plurality of IO read-write links exist between each storage unit of the storage device and the host. When any IO read-write link fails, the host can communicate with the storage unit through the other IO read-write links, so that normal data reading and writing are ensured. However, even in a multi-link storage platform, IO reads and writes may be affected when too many communication links are faulty, which affects the normal operation of the cloud computing platform. In order to solve this technical problem, the application provides a read-write link control method, which acquires the performance information of each read-write link and judges, according to the performance information, whether each read-write link is in an abnormal state. If a read-write link is in the abnormal state, an alarm is sent to a manager, so that the manager can promptly find and handle the faulty read-write link in the storage platform. If the read-write link is not in the abnormal state, it is judged whether the read-write speed of the read-write link meets a preset condition. If the preset condition is not met, the corresponding link information is acquired, so that unstable links in the storage platform are found and can be handled.
Therefore, according to the technical scheme provided by the application, read-write links in an abnormal state are identified from their performance information, and the read-write speed of the links that are not in the abnormal state is then checked, so that abnormal and unstable read-write links in the multipath storage platform are identified and managers can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
In order to better understand the aspects of the present application, the present application will be described in further detail with reference to the accompanying drawings and detailed description.
Fig. 2 is a flowchart of a read-write link control method according to an embodiment of the present application, where the method is applied to a multi-path storage platform, as shown in fig. 2, and the method includes:
S10: acquiring performance information of each read-write link;
S11: judging whether each read-write link is in an abnormal state according to the performance information;
S12: if the read-write link is in an abnormal state, sending an alarm to a manager.
The IO link (IEC 61131-9) is an open-standard serial communication protocol that allows bidirectional data exchange between a host and the sensors and devices that support the IO link and are connected to it. Currently, Linux multipath storage technology is generally used to manage a multi-link storage system. Linux multipath storage technology refers to multipath software used under the Linux operating system to provide multipath access to storage devices. It lets the computer system access disks over more than one path, which gives higher availability and stronger fault tolerance. When a storage device or path fails, the software can continue to provide storage services to the system and so prevent system paralysis. Linux multipath storage can be implemented in various ways, such as hardware virtualization, soft binding and bit copying. Hardware virtualization deploys multiple virtual nodes on each physical storage device, each node consisting of multiple virtual tracks, and virtualizes the real physical paths to provide multipath access. Soft binding achieves multipath transmission, and thus high availability, by mapping the same member of one physical device onto multiple target devices. Bit copying copies data from one source storage device to multiple target storage devices to ensure system performance. By implementing Linux multipath storage technology, a highly available storage service can be obtained: the system can identify and recover from single-path faults in real time and provide a safe, reliable, efficient and highly available storage service.
In a specific implementation, the read-write link control process monitors the state of each read-write link according to the acquired performance information, so as to determine whether the read-write link is in an abnormal state. If a read-write link is in an abnormal state, alarm information is sent to the manager in time, so that the manager can maintain the storage system.
It will be appreciated that the processing results and detailed information of the fault may be sent to the manager, for example by short message or mail, to ensure that the fault is handled properly. The specific form includes, but is not limited to, a chart or curve that helps the manager observe the fault information more intuitively.
It should be noted that the read-write links mentioned in the scheme of the present application include working read-write links, which are in a working state, and idle read-write links, which are not. The performance information of both the working and the idle read-write links needs to be acquired. The working read-write links are monitored in real time to judge whether any information indicating a link error exists; at the same time, the current multipath device state information is checked periodically so that the monitoring also covers the idle read-write links.
S13: if the read-write links are not in an abnormal state, judging whether the read-write speed of each read-write link meets a preset condition or not;
S14: if the preset condition is not met, determining the read-write link as an unstable link, and acquiring link information of the unstable link so as to process the read-write link.
In specific implementation, due to the large data volume involved in the cloud computing platform, under a large service pressure, the problem caused by fluctuation of the read-write link has a great influence on the data transmission operation. Therefore, in order to further improve the stability of the system, the real-time transmission speed of the read-write link needs to be monitored, when the transmission speed of the read-write link is too slow or the fluctuation of the read-write speed is too large, the current link is unstable, and when the service pressure is too large, the operation of the storage system may be affected due to the instability. Therefore, when an unstable read-write link is detected, link information of the unstable link needs to be acquired in time so as to process the read-write link.
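The decision order of steps S10 to S14 can be sketched as follows. This is a minimal illustration rather than the implementation described in the application: the `LinkPerformance` record, its field names and the `LinkVerdict` values are hypothetical, and the two boolean fields stand in for the results of the S11 and S13 checks.

```python
from dataclasses import dataclass
from enum import Enum


class LinkVerdict(Enum):
    ABNORMAL = "abnormal"   # S12: an alarm is sent to the manager
    UNSTABLE = "unstable"   # S14: link information is collected
    HEALTHY = "healthy"


@dataclass
class LinkPerformance:
    """Hypothetical per-link performance record gathered in S10."""
    name: str
    abnormal: bool   # outcome of the S11 abnormal-state check
    speed_ok: bool   # outcome of the S13 preset-condition check


def classify(perf: LinkPerformance) -> LinkVerdict:
    # S11/S12: an abnormal link is reported and not examined further.
    if perf.abnormal:
        return LinkVerdict.ABNORMAL
    # S13/S14: only links that are not abnormal get the speed check.
    if not perf.speed_ok:
        return LinkVerdict.UNSTABLE
    return LinkVerdict.HEALTHY


links = [
    LinkPerformance("path-a", abnormal=True, speed_ok=True),
    LinkPerformance("path-b", abnormal=False, speed_ok=False),
    LinkPerformance("path-c", abnormal=False, speed_ok=True),
]
verdicts = {p.name: classify(p) for p in links}
```

The speed check is deliberately placed after the abnormal-state check, which mirrors the wording of S13: it applies only to links that are not already abnormal.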
It will be appreciated that the processing of the unstable read-write link may be to maintain it, or the link may be discarded directly, or discarded after maintenance, which is not limited herein. In specific implementation, when the service pressure of the system is too high (greater than the service pressure threshold), the unstable link can be maintained; when the traffic pressure is small (not greater than the traffic pressure threshold), the unstable link is discarded.
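The pressure-based choice described above amounts to a single threshold comparison. The sketch below assumes that service pressure is available as one number; the function name and the returned labels are hypothetical.

```python
def handle_unstable_link(service_pressure: float, pressure_threshold: float) -> str:
    """Pick how to treat an unstable link from the current service pressure."""
    if service_pressure > pressure_threshold:
        # High pressure: keep the link and maintain it.
        return "maintain"
    # Pressure not greater than the threshold: the link can be discarded.
    return "discard"
```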
The application provides a read-write link control method, which is applied to a multipath storage platform and comprises the following steps: acquiring performance information of each read-write link and judging, according to the performance information, whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, sending an alarm to a manager, so that the manager can promptly find and handle the faulty read-write link in the storage platform; if the read-write link is not in the abnormal state, judging whether the read-write speed of the read-write link meets a preset condition; and if the preset condition is not met, acquiring the corresponding link information, so that unstable links in the storage platform are found and can be handled. Therefore, according to the technical scheme provided by the application, read-write links in an abnormal state are identified from their performance information, and the read-write speed of the links that are not in the abnormal state is then checked, so that abnormal and unstable read-write links in the multipath storage platform are identified and managers can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
In some embodiments, obtaining performance information of each read-write link includes: acquiring log information of each read-write link in a working state and state information of all read-write links; and determining the performance information of each link according to the state information and the log information.
In specific implementation, real-time monitoring is carried out on a plurality of multi-path read-write links to obtain corresponding multi-path read-write link real-time information; judging whether the real-time information of the multi-path read-write link contains information for indicating error events of the multi-path read-write link; meanwhile, the current multipath equipment state information is periodically checked, the length of the checking period can be set by operation and maintenance personnel, and can also be set according to the service pressure, and the checking period is not limited herein.
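One way to combine the two information sources is sketched below. The log line layout (`<link name> <event text>`), the error keywords and the state strings are all assumptions made for illustration; they are not the actual messages or states of any multipath driver.

```python
import re
from typing import Dict, Iterable, Set

# Hypothetical error markers; real driver messages will differ.
ERROR_PATTERN = re.compile(r"\b(error|failed|timeout)\b", re.IGNORECASE)


def links_with_error_events(log_lines: Iterable[str]) -> Set[str]:
    """Return names of working links whose log lines report an error event."""
    failed = set()
    for line in log_lines:
        name, _, event = line.partition(" ")
        if name and ERROR_PATTERN.search(event):
            failed.add(name)
    return failed


def build_performance_info(log_lines: Iterable[str],
                           device_states: Dict[str, str]) -> Dict[str, bool]:
    """Map each link name to True when it should be treated as abnormal.

    device_states comes from the periodic status check and also covers idle
    links; any state other than "active" is treated as abnormal here.
    """
    from_logs = links_with_error_events(log_lines)
    return {
        name: (name in from_logs) or (state != "active")
        for name, state in device_states.items()
    }


info = build_performance_info(
    ["path-a io error on read", "path-b write completed"],
    {"path-a": "active", "path-b": "active", "path-c": "faulty"},
)
```

Here `path-a` is flagged from its log line and `path-c` from the periodic state check, which is how an idle link with no log traffic can still be caught.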
The data volume related to the cloud computing platform is large, and under the condition of large service pressure, the problem caused by fluctuation of a read-write link has great influence on data transmission operation. Therefore, in order to further improve the stability of the system, the real-time transmission speed of the read-write link needs to be monitored.
In a specific implementation, determining whether the read-write speed of each read-write link meets a preset condition includes: judging whether the read-write speed of the read-write link is greater than a read-write speed threshold; if the read-write speed is greater than the read-write speed threshold, judging whether the read-write speed fluctuation of the read-write link is greater than a fluctuation threshold; and if the fluctuation is greater than the fluctuation threshold, determining that the read-write link does not meet the preset condition.
Whether the read-write speed of a read-write link fluctuates is detected in real time. A fluctuation threshold A, a fluctuation interval time T1 and a verification period T2 for checking the fluctuation are defined. If the read-write speed of any read-write link of the multipath device fluctuates by more than the fluctuation threshold A within the interval time T1, the link is marked as being in a "sub-health" state to facilitate subsequent processing.
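The text does not say how the fluctuation is measured. The sketch below assumes one possible reading: the fluctuation is the spread between the highest and lowest speed sample inside any window of length T1, and the link is marked sub-healthy when that spread exceeds A. The sample format and function names are hypothetical.

```python
from typing import List, Tuple

Sample = Tuple[float, float]  # (timestamp in seconds, speed in MB/s)


def max_fluctuation(samples: List[Sample], t1: float) -> float:
    """Largest (max - min) speed spread seen in any window of length t1."""
    ordered = sorted(samples)
    worst = 0.0
    start = 0
    for end in range(len(ordered)):
        # Advance the window start until the window spans at most t1 seconds.
        while ordered[end][0] - ordered[start][0] > t1:
            start += 1
        speeds = [speed for _, speed in ordered[start:end + 1]]
        worst = max(worst, max(speeds) - min(speeds))
    return worst


def is_sub_healthy(samples: List[Sample], a: float, t1: float) -> bool:
    """Mark the link sub-healthy when the fluctuation exceeds threshold A."""
    return max_fluctuation(samples, t1) > a


steady = [(0, 100.0), (1, 102.0), (2, 99.0), (3, 101.0)]
jittery = [(0, 100.0), (1, 40.0), (2, 98.0), (3, 101.0)]
```

Other definitions, such as the standard deviation over the window, would fit the description equally well; the max-min spread is used here only because it is the simplest to follow.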
When the service pressure is small, a read-write link in the "sub-health" state can be downgraded and placed among the alternative links to reduce the usage of the unstable link. If the unstable link does not fluctuate again within the verification period T2, it is restored to the "healthy" state and its read-write operation is resumed. When judging whether a read-write link in the "sub-health" state has returned to the "healthy" state, either test data or real service data may be used, which will not be described in detail herein. However, any real service data used to test a link in the "sub-health" state should have been backed up in advance, so as to ensure the security of the data. In addition, after a link is determined to be in the "sub-health" state, if idle read-write links exist in the system, a corresponding number of idle read-write links need to be started so as to improve the user experience.
When the service pressure of the storage system is high (for example, when all read-write links are in a working state), the abnormal links are only marked and their actual working state is not changed. This avoids reducing the number of available read-write links and thus avoids reducing the data read-write efficiency.
In implementations, sending the alert to the manager includes: the abnormal state information is written into the log file, and an alarm is sent to the manager in an email mode. The management personnel can check the working condition of the link in the storage system according to the log file, so that corresponding measures are taken to ensure the normal operation of the storage system.
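A minimal sketch of the log-and-email alert, using Python's standard `email.message` and `smtplib` modules. The log line layout, subject text, addresses and SMTP host are placeholders chosen for illustration; the application does not specify them.

```python
import smtplib
from datetime import datetime
from email.message import EmailMessage


def write_log(log_path: str, link_name: str, detail: str) -> None:
    """Append one abnormal-state record to the log file."""
    stamp = datetime.now().isoformat(timespec="seconds")
    with open(log_path, "a", encoding="utf-8") as fh:
        fh.write(f"{stamp} ABNORMAL {link_name} {detail}\n")


def build_alert(link_name: str, detail: str,
                sender: str, recipient: str) -> EmailMessage:
    """Build the alert email; sending is kept separate so it can be tested."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = f"Read-write link abnormal: {link_name}"
    msg.set_content(f"Link {link_name} is in an abnormal state: {detail}")
    return msg


def send_alert(msg: EmailMessage, smtp_host: str, smtp_port: int = 25) -> None:
    """Send the alert through an SMTP relay (host and port are placeholders)."""
    with smtplib.SMTP(smtp_host, smtp_port) as smtp:
        smtp.send_message(msg)
```

Writing the log entry before sending the mail means the record survives even if the mail relay is unreachable, so the manager can still find the fault in the log file.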
It can be understood that, to prevent occurrence of data loss, the method further includes, after obtaining link information of the unstable link: the signal transmission level of the unstable link is reduced to reduce the usage of the unstable link.
Correspondingly, after the step of reducing the signal transmission level of the unstable link, the method further comprises: judging whether the fluctuation of the read-write speed of the unstable link in the detection period is greater than the fluctuation threshold; and if the fluctuation is not greater than the fluctuation threshold, restoring the signal transmission level of the unstable link.
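The lowering and later restoring of an unstable link can be sketched as a small state change. The `Link` record is hypothetical, and the integer `level` field merely stands in for the "signal transmission level", whose concrete form the text does not define. The `high_pressure` flag reflects the earlier remark that under high service pressure links are only marked.

```python
from dataclasses import dataclass


@dataclass
class Link:
    name: str
    level: int          # stands in for the "signal transmission level"
    normal_level: int   # level to restore once the link is stable again
    state: str = "healthy"


def handle_unstable(link: Link, high_pressure: bool) -> None:
    """Mark the link sub-healthy; lower its level only under low pressure."""
    link.state = "sub-healthy"
    if not high_pressure:
        link.level = max(0, link.level - 1)


def review(link: Link, fluctuation_in_t2: float, a: float) -> None:
    """Restore the level if the fluctuation over T2 is not greater than A."""
    if link.state == "sub-healthy" and fluctuation_in_t2 <= a:
        link.level = link.normal_level
        link.state = "healthy"


link = Link("path-b", level=2, normal_level=2)
handle_unstable(link, high_pressure=False)
after_demotion = (link.state, link.level)
review(link, fluctuation_in_t2=5.0, a=20.0)
after_review = (link.state, link.level)
```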
In one embodiment, after a failed read-write link is detected, an alarm is sent to the manager and self-repair should also be attempted, so as to keep the system operating normally as far as possible. In a specific implementation, after the step of sending the alarm to the manager, the method further includes: acquiring fault information of the read-write link in the abnormal state, and determining a fault reason according to the fault information; and repairing the read-write link or updating the driver according to the fault reason.
Specifically, the self-repair process includes: collecting fault information of the multipath storage system; analyzing the location and reason of the fault according to the fault information; and repairing the path or resetting the driver, depending on the reason for the fault. It should be noted that, when the multipath storage fails while reading or writing data, the fault information may be collected by, for example, a hook function installed in the multipath storage driver.
In a specific implementation, since the location of collecting the fault information is in the kernel, and the subsequent processing occurs in the user mode, PCIE fault information needs to be transmitted from the kernel to the user mode, and a specific transmission manner includes, but is not limited to, using a Netlink channel.
Specifically, according to a rule base of 'a fault or a processing mode corresponding to some faults', if a path fault is reported by a multipath, a program can correspondingly recover or isolate the path, and the position and the reason of the multipath storage fault can be classified and counted to obtain an analysis result.
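A rule base of this kind can be sketched as a simple lookup table. The fault names, the action names and the choice of a driver reset as the fallback for unlisted faults are assumptions made for illustration and are not taken from the application.

```python
from collections import Counter
from enum import Enum
from typing import Dict, Iterable, Tuple


class Action(Enum):
    RECOVER_PATH = "recover_path"
    ISOLATE_PATH = "isolate_path"
    RESET_DRIVER = "reset_driver"


# Hypothetical rule base: fault kind -> handling mode.
RULE_BASE: Dict[str, Action] = {
    "path_down": Action.RECOVER_PATH,
    "path_flapping": Action.ISOLATE_PATH,
    "driver_hang": Action.RESET_DRIVER,
}


def choose_action(fault_kind: str, default: Action = Action.RESET_DRIVER) -> Action:
    """Look up the handling mode; unlisted faults fall back to the default (an assumption)."""
    return RULE_BASE.get(fault_kind, default)


def summarize(faults: Iterable[Dict[str, str]]) -> Counter:
    """Count faults by (location, kind) to produce a simple analysis result."""
    pairs: Iterable[Tuple[str, str]] = ((f["location"], f["kind"]) for f in faults)
    return Counter(pairs)
```

Keeping the rules in data rather than in branching code means a new fault class can be handled by adding one table entry.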
It should be noted that a faulty path is repaired when it is repairable; when a path cannot be repaired, the problem can be solved by resetting the driver, which ensures reliable and stable operation of the multipath storage. This increases the number of read-write links in a normal working state, ensures high availability and smooth operation of the storage system, and improves the user experience.
The above embodiments describe the read-write link control method in detail; the present application further provides corresponding embodiments of the read-write link control device. It should be noted that the present application describes the device embodiments from two perspectives: one based on functional modules and the other based on hardware.
Fig. 3 is a block diagram of a read-write link control device according to an embodiment of the present application. As shown in Fig. 3, the device includes:
an acquisition module 10, configured to acquire performance information of each read-write link;
a first judging module 11, configured to judge whether each read-write link is in an abnormal state according to the performance information;
an alarm module 12, configured to send an alarm to a manager if the read-write link is in an abnormal state;
a second judging module 13, configured to judge whether the read-write speed of each read-write link meets a preset condition if the read-write link is not in an abnormal state;
a determining module 14, configured to determine that the read-write link is an unstable link if the preset condition is not met, and to acquire link information of the unstable link so that the read-write link can be processed.
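The way these modules cooperate can be sketched as a single pass over the acquired performance information. The function and parameter names are hypothetical, and the two checks are passed in as callables because their exact criteria are implementation details.

```python
from typing import Callable, Dict, List, Tuple


def control_links(
    performance: Dict[str, dict],
    is_abnormal: Callable[[dict], bool],
    meets_preset_condition: Callable[[dict], bool],
) -> Tuple[List[str], List[str]]:
    """Return (links needing an alarm, links determined to be unstable)."""
    alarms: List[str] = []
    unstable: List[str] = []
    for link_id, info in performance.items():
        if is_abnormal(info):
            # First judging module + alarm module: abnormal links raise an alarm.
            alarms.append(link_id)
        elif not meets_preset_condition(info):
            # Second judging module + determining module: speed check failed.
            unstable.append(link_id)
    return alarms, unstable
```

Only links that pass the abnormal-state check reach the read-write speed check, which mirrors the order of the modules listed above.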
In addition, the read-write link control device provided by the embodiment of the present application further includes a signal transmission level reducing module, a recovery module and a maintenance module. The signal transmission level reducing module is configured to reduce the signal transmission level of the unstable link after the link information of the unstable link is acquired, so as to reduce the usage of the unstable link. The recovery module is configured to judge, after the step of reducing the signal transmission level of the unstable link, whether the fluctuation of the read-write speed of the unstable link within the detection period is greater than a fluctuation threshold, and to restore the signal transmission level of the unstable link if the fluctuation is not greater than the fluctuation threshold. The maintenance module is configured to acquire, after the step of sending an alarm to the manager, fault information of the read-write link in the abnormal state, to determine the fault cause according to the fault information, and to repair the read-write link or update the driver according to the fault cause.
Since the device embodiments correspond to the method embodiments, refer to the description of the method embodiments for details of the device embodiments, which are not repeated here.
The read-write link control device provided by the present application is applied to a multipath storage platform. It acquires the performance information of each read-write link and judges from that information whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, an alarm is sent to the manager, so that the faulty read-write link in the storage platform is found and handled promptly. If a read-write link is not in the abnormal state, whether its read-write speed meets a preset condition is judged; if the preset condition is not met, the corresponding link information is acquired, so that unstable links in the storage platform are found and can be handled. In this technical solution, read-write links in the abnormal state are therefore identified from the performance information, and the read-write speed of the links that are not in the abnormal state is then evaluated, so that the abnormal and unstable read-write links in the multipath storage platform are identified and the manager can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
Fig. 4 is a block diagram of another read-write link control device according to an embodiment of the present application, where, as shown in fig. 4, the read-write link control device includes: a memory 20 for storing a computer program;
a processor 21 for implementing the steps of the read-write link control method according to the above embodiment when executing a computer program.
The read-write link control device provided in this embodiment may include, but is not limited to, a smart phone, a tablet computer, a notebook computer, a desktop computer, or the like.
The processor 21 may include one or more processing cores, such as a 4-core processor or an 8-core processor. The processor 21 may be implemented in hardware in at least one of a digital signal processor (DSP), a field-programmable gate array (FPGA) and a programmable logic array (PLA). The processor 21 may also include a main processor and a coprocessor: the main processor, also called the central processing unit (CPU), processes data in the awake state; the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor 21 may be integrated with a graphics processing unit (GPU), which is responsible for rendering and drawing the content that the display screen needs to display. In some embodiments, the processor 21 may also include an artificial intelligence (AI) processor for handling computing operations related to machine learning.
The memory 20 may include one or more computer-readable storage media, which may be non-transitory. The memory 20 may also include high-speed random access memory as well as non-volatile memory, such as one or more magnetic disk storage devices or flash memory storage devices. In this embodiment, the memory 20 is used at least for storing a computer program 201 which, when loaded and executed by the processor 21, implements the relevant steps of the read-write link control method disclosed in any of the foregoing embodiments. In addition, the resources stored in the memory 20 may further include an operating system 202, data 203 and the like, and the storage may be transient or permanent. The operating system 202 may include Windows, Unix, Linux and others. The data 203 may include, but is not limited to, the performance information and state information of the read-write links.
In some embodiments, the read-write link control device may further include a display 22, an input-output interface 23, a communication interface 24, a power supply 25, and a communication bus 26.
Those skilled in the art will appreciate that the configuration shown in fig. 4 is not limiting of the read-write link control device and may include more or fewer components than shown.
The read-write link control device provided by the embodiment of the application comprises a memory and a processor, wherein the processor can realize the following method when executing a program stored in the memory: acquiring performance information of each read-write link; judging whether each read-write link is in an abnormal state according to the performance information; if the read-write link is in an abnormal state, an alarm is sent to a manager; if the read-write links are not in an abnormal state, judging whether the read-write speed of each read-write link meets a preset condition or not; if the preset condition is not met, determining the read-write link as an unstable link, and acquiring link information of the unstable link so as to process the read-write link.
The read-write link control device provided by this embodiment is likewise applied to a multipath storage platform. It acquires the performance information of each read-write link and judges from that information whether each read-write link is in an abnormal state; if a read-write link is in the abnormal state, an alarm is sent to the manager, so that the faulty read-write link in the storage platform is found and handled promptly. If a read-write link is not in the abnormal state, whether its read-write speed meets a preset condition is judged; if the preset condition is not met, the corresponding link information is acquired, so that unstable links in the storage platform are found and can be handled. Read-write links in the abnormal state are therefore identified from the performance information, and the read-write speed of the links that are not in the abnormal state is then evaluated, so that the abnormal and unstable read-write links in the multipath storage platform are identified and the manager can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
Finally, the present application also provides a corresponding embodiment of a computer-readable storage medium. The computer-readable storage medium stores a computer program which, when executed by a processor, performs the steps described in the method embodiments above. Read-write links in the abnormal state are identified from the performance information of the read-write links, and the read-write speed of the links that are not in the abnormal state is evaluated, so that the abnormal and unstable read-write links in the multipath storage platform are identified and the manager can handle them in time. This prevents such links from affecting the normal operation of the cloud computing platform and improves the user experience.
It will be appreciated that the methods of the above embodiments, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored on a computer-readable storage medium. Based on this understanding, the technical solution of the present application, in essence or in whole or in part, may be embodied in the form of a software product that is stored in a storage medium and performs all or part of the steps of the methods according to the embodiments of the present application. The aforementioned storage medium includes: a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, an optical disk, or any other medium capable of storing program code.
The read-write link control method, device and medium provided by the present application have been described in detail above. The embodiments in the description are presented progressively: each embodiment focuses on its differences from the other embodiments, and the parts that are the same or similar among the embodiments may be cross-referenced. Since the device disclosed in an embodiment corresponds to the method disclosed in that embodiment, its description is relatively brief; refer to the description of the method for the relevant points. It should be noted that those skilled in the art can make various modifications and adaptations to the application without departing from its principles, and these modifications and adaptations are intended to fall within the scope of the application as defined in the following claims.
It should also be noted that, in this specification, relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Moreover, the terms "comprises", "comprising" and any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other like elements in the process, method, article or apparatus that comprises the element.

Claims (10)

1. A read-write link control method, characterized in that it is applied to a multipath storage platform and comprises the following steps:
acquiring performance information of each read-write link;
judging whether each read-write link is in an abnormal state according to the performance information;
if the read-write link is in the abnormal state, an alarm is sent to a manager;
if the read-write links are not in the abnormal state, judging whether the read-write speed of each read-write link meets a preset condition or not;
if the preset condition is not met, determining that the read-write link is an unstable link, and acquiring link information of the unstable link so as to process the read-write link.
2. The method for controlling a read-write link according to claim 1, wherein the obtaining performance information of each read-write link includes:
acquiring log information of each read-write link in a working state and state information of all the read-write links;
and determining performance information of each link according to the state information and the log information.
3. The method of claim 1, wherein determining whether the read-write speed of each of the read-write links satisfies a preset condition comprises:
judging whether the read-write speed of the read-write link is greater than a read-write speed threshold;
if the read-write speed is greater than the read-write speed threshold, judging whether the fluctuation of the read-write speed of the read-write link is greater than a fluctuation threshold;
and if the fluctuation is greater than the fluctuation threshold, determining that the read-write link does not meet the preset condition.
4. The read-write link control method according to claim 1, wherein said sending an alarm to a manager includes:
writing the abnormal state information into a log file, and sending the alarm to the manager by email.
5. The method for controlling a read-write link according to claim 1, wherein after obtaining the link information of the unstable link, further comprising:
and reducing the signal transmission level of the unstable link to reduce the utilization rate of the unstable link.
6. The method for controlling a read-write link according to claim 5, further comprising, after said step of reducing the signal transmission level of said unstable link:
judging whether the fluctuation of the read-write speed of the unstable link in the detection period is larger than a fluctuation threshold value or not;
and if the fluctuation is not greater than the fluctuation threshold, restoring the signal transmission level of the unstable link.
7. The method according to any one of claims 1 to 6, wherein after the step of sending an alarm to the manager, further comprising:
acquiring fault information of the read-write link in an abnormal state, and determining a fault reason according to the fault information;
and repairing the read-write link or updating the driver according to the fault reason.
8. A read-write link control device, applied to a multi-path storage platform, comprising:
the acquisition module is used for acquiring the performance information of each read-write link;
the first judging module is used for judging whether each read-write link is in an abnormal state or not according to the performance information;
the alarm module is used for sending an alarm to a manager if the read-write link is in the abnormal state;
the second judging module is used for judging whether the read-write speed of each read-write link meets a preset condition or not if the read-write link is not in the abnormal state;
and the determining module is used for determining that the read-write link is an unstable link and acquiring link information of the unstable link if the preset condition is not met so as to process the read-write link.
9. A read-write link control device comprising a memory for storing a computer program;
a processor for implementing the steps of the read-write link control method according to any one of claims 1 to 7 when executing the computer program.
10. A computer readable storage medium, characterized in that the computer readable storage medium has stored thereon a computer program which, when executed by a processor, implements the steps of the read-write link control method according to any one of claims 1 to 7.
CN202310792979.XA 2023-06-30 2023-06-30 Read-write link control method, device and medium Pending CN116820822A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310792979.XA CN116820822A (en) 2023-06-30 2023-06-30 Read-write link control method, device and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310792979.XA CN116820822A (en) 2023-06-30 2023-06-30 Read-write link control method, device and medium

Publications (1)

Publication Number Publication Date
CN116820822A true CN116820822A (en) 2023-09-29

Family

ID=88121781

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310792979.XA Pending CN116820822A (en) 2023-06-30 2023-06-30 Read-write link control method, device and medium

Country Status (1)

Country Link
CN (1) CN116820822A (en)


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination