CN115981982A - Equipment management method, device, equipment and storage medium - Google Patents

Info

Publication number
CN115981982A
Authority
CN
China
Prior art keywords
performance
equipment
configuration information
configuration
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211721975.4A
Other languages
Chinese (zh)
Inventor
杨业刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing QIYI Century Science and Technology Co Ltd filed Critical Beijing QIYI Century Science and Technology Co Ltd
Priority to CN202211721975.4A priority Critical patent/CN115981982A/en
Publication of CN115981982A publication Critical patent/CN115981982A/en
Pending legal-status Critical Current

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

Embodiments of the invention provide a device management method, apparatus, device, and storage medium, relating to the field of computer technologies. The specific implementation is as follows: collecting, in real time, performance data corresponding to each performance index of a target device that provides a business service; if it is determined, based on the collected performance data, that the target device has a performance abnormality, determining a standby device and migrating the business service to the standby device; collecting configuration information of the target device for software and/or hardware; if it is determined, based on the collected configuration information, that the target device has a configuration abnormality, setting the device state of the target device to an unavailable state; and, in response to the configuration information of the target device being updated, setting the device state of the target device to an available state. The scheme provided by the embodiments of the invention can ensure the normal operation of the business service.

Description

Equipment management method, device, equipment and storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a device management method, apparatus, device, and storage medium.
Background
Information platforms, such as audio and video platforms, can provide users with business services such as advertisement recall and advertisement recommendation. If a device deployed on such a platform to provide a business service has a performance problem, the quality of the service may suffer; for example, service data may be transmitted with delay.
Therefore, the devices of an information platform need performance checks so that devices with performance problems can be identified and handled. However, devices deployed on a service platform usually exist as a device cluster, and when a cluster contains thousands of devices, checking them manually one by one is impractical. In current solutions, a user of the service typically notices a performance problem and reports it to an operation and maintenance engineer, who then manually identifies the device with the problem and manually resolves it. With this approach, the information platform cannot actively discover devices with performance problems; such devices are discovered and handled only after the business service received by users has already been affected, so normal operation of the business service cannot be guaranteed.
Disclosure of Invention
The embodiment of the invention aims to provide a device management method, a device, equipment and a storage medium, so as to ensure that business services run normally. The specific technical scheme is as follows:
In a first aspect of the present invention, there is provided a device management method, including:
acquiring performance data corresponding to each performance index of target equipment providing business service in real time;
if the target equipment is determined to have abnormal performance based on the acquired performance data, determining standby equipment and migrating the service to the standby equipment.
In a second aspect of the present invention, there is also provided an apparatus for device management, the apparatus including:
the performance data acquisition module is used for acquiring performance data corresponding to each performance index of target equipment providing business service in real time;
the business service migration module is used for determining standby equipment and migrating the business service to the standby equipment if the target equipment is determined to have abnormal performance based on the acquired performance data;
the configuration information acquisition module is used for acquiring the configuration information of the target equipment aiming at the software and/or the hardware;
the first state setting module is used for setting the equipment state of the target equipment to be an unavailable state if the target equipment is determined to have abnormal configuration based on the acquired configuration information;
and the second state setting module is used for setting the equipment state of the target equipment to be an available state in response to the updated configuration information of the target equipment.
In a third aspect of the present invention, there is also provided an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete communication with each other through the communication bus;
a memory for storing a computer program;
and the processor is used for realizing the steps of the equipment management method when executing the program stored in the memory.
In a fourth aspect of the present invention, there is also provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements the above-mentioned device management method.
In yet another aspect of the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the above-mentioned device management method.
Therefore, in the scheme provided by the embodiment of the invention, a performance abnormality of the target device can be determined from the performance data, and when the target device is abnormal, the business service can be migrated in time, which prevents the abnormal target device from affecting the normal operation of the business service.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below.
Fig. 1a is a schematic flowchart of a first device management method according to an embodiment of the present invention.
Fig. 1b is a flowchart illustrating a second device management method according to an embodiment of the present invention.
Fig. 2 is a flowchart illustrating a third device management method according to an embodiment of the present invention.
Fig. 3 is a schematic structural diagram of an apparatus management device according to an embodiment of the present invention.
Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention.
In order to improve efficiency of processing performance problems, embodiments of the present invention provide a device management method, apparatus, device, and storage medium, which are described below separately.
In an embodiment of the present invention, a device management method is provided, including:
acquiring performance data corresponding to each performance index of target equipment providing business service in real time;
if the target equipment is determined to have abnormal performance based on the acquired performance data, determining standby equipment and migrating the service to the standby equipment.
Therefore, in the scheme provided by the embodiment of the invention, a performance abnormality of the target device can be determined from the performance data, and when the target device is abnormal, the business service can be migrated in time, which prevents the abnormal target device from affecting the normal operation of the business service.
In an embodiment of the present invention, referring to fig. 1a, a flowchart of a first device management method is provided, where the method includes the following steps S101 to S102.
Step S101: acquire, in real time, performance data corresponding to each performance index of the target device providing the business service.
The business service is a service required for realizing a business process. The service to be realized may be advertisement recall, video recommendation, and the like, which is not limited in the embodiments of the present invention.
The specific service may be, for example, processing the service data of the business to be implemented. In one embodiment of the invention, the business service is a data service that performs data processing based on a neural network model.
The specific data processing is determined by the type of the service data. For example, when the service data is a voice segment, the data processing may be speech recognition; when the service data is a text sentence, the data processing may be semantic recognition, and the like.
Different types of neural network models can be deployed for different service types. For example, to implement a video recommendation service, a TensorFlow model may be deployed; to implement an image recognition service, a MobileNet model or the like may be deployed. In this way, different business services can be provided by performing data processing on different data based on the neural network model.
The target device is a device for providing service to the user.
The performance index is an evaluation item for measuring the performance of the equipment in different aspects. In one embodiment of the invention, the performance indicator comprises at least one of the following information: performance indicators representing the degree to which computing resources are occupied, performance indicators representing the degree to which network resources are occupied, and performance indicators representing the rate of queries per second for services of the target device.
The performance indices representing the degree to which computing resources are occupied may include CPU utilization, memory utilization, and the like; the performance indices representing the degree to which network resources are occupied may include delay times, such as latency-999 (the 99.9% quantile delay time), max latency (the maximum delay time), and the like.
Queries per second (QPS) is an index of the concurrency capability of the target device and is determined by the number of queries to which the target device responds per unit time.
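The delay and query-rate indices described above can be illustrated with a minimal sketch. The function names and the nearest-rank quantile rule are assumptions made for illustration; the embodiment does not specify how the quantile is computed.

```python
import math
from typing import Sequence


def quantile_latency(latencies_ms: Sequence[float], q: float) -> float:
    """Nearest-rank quantile (an assumed rule): the smallest sample such that
    at least a fraction q of all samples are at or below it."""
    if not latencies_ms:
        raise ValueError("no latency samples")
    if not 0.0 < q <= 1.0:
        raise ValueError("q must be in (0, 1]")
    ordered = sorted(latencies_ms)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def queries_per_second(responded_queries: int, window_seconds: float) -> float:
    """QPS: number of queries the device responded to, divided by the window length."""
    if window_seconds <= 0:
        raise ValueError("window must be positive")
    return responded_queries / window_seconds
```

With `q=0.999` the first function corresponds to latency-999, and with `q=1.0` it returns the maximum delay time.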
Therefore, the scheme provided by the embodiment of the invention provides various performance indexes to monitor the performance of the target equipment, so that maintenance personnel can know the equipment performance more comprehensively.
The performance data may record parameter values representing the performance of the target device under each performance index. For example, for a performance index, i.e., CPU utilization, a corresponding percentage may be recorded in the performance data.
The following describes an implementation of collecting performance data.
The target device may record its own performance data. In this case, whenever performance data needs to be acquired, a call request may be sent to the target device in real time, and the performance data is obtained from the response returned by the target device. Alternatively, a preset monitoring program may be used to obtain the performance data from the target device, for example, the native monitoring program and the bvar monitoring program integrated in the Jarvis platform.
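As a rough sketch of the call-request approach, the collector below asks a device for its metrics and timestamps the response. `PerformanceSample`, `collect_sample`, and the `fetch_metrics` callable are hypothetical names; `fetch_metrics` stands in for whatever call request or monitoring program is actually used.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, Mapping


@dataclass
class PerformanceSample:
    device_id: str
    collected_at: float          # collection time, seconds since the epoch
    metrics: Dict[str, float]    # performance index name -> parameter value


def collect_sample(
    device_id: str,
    fetch_metrics: Callable[[str], Mapping[str, float]],
    clock: Callable[[], float] = time.time,
) -> PerformanceSample:
    """Send one (assumed) call request and wrap the response as a sample."""
    metrics = dict(fetch_metrics(device_id))
    return PerformanceSample(device_id, clock(), metrics)
```

Passing the clock as a parameter is only a convenience for testing; a real collector would typically run this on a short polling interval.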
Step S102: if it is determined, based on the collected performance data, that the target device has a performance abnormality, determine a standby device and migrate the business service to the standby device.
The standby device is a device that provides the same service as the target device.
In one embodiment of the present invention, the standby device is a device whose performance data conforms to the performance indices.
When the device cluster where the target device and the standby device are located starts to work, the devices in the device cluster may be debugged first to make the devices meet the performance index. In this case, any device in the device cluster that does not detect the performance abnormality may be used as a standby device.
Thus, the business service can be migrated to the standby equipment with normal performance, so that the user can stably acquire the required service from the standby equipment.
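A minimal sketch of standby selection follows, assuming the cluster is a list of device identifiers and that the set of devices with a detected performance abnormality is already known. The function name and the first-match policy are illustrative assumptions.

```python
from typing import Iterable, Optional, Set


def choose_standby(
    cluster: Iterable[str], target_id: str, abnormal_ids: Set[str]
) -> Optional[str]:
    """Return the first device, other than the target, with no detected
    performance abnormality; None if the cluster has no such device."""
    for device_id in cluster:
        if device_id != target_id and device_id not in abnormal_ids:
            return device_id
    return None
```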
In one embodiment of the present invention, whether the target device has a performance abnormality may be determined by:
for each performance index, judging whether the performance data corresponding to the performance index falls within the performance data range corresponding to that index, and if not, determining that the target device has a performance abnormality.
For example, if a highest threshold or a lowest threshold is set for a performance index, the performance data range is the range below the highest threshold or above the lowest threshold. For each performance index, performance data within this range indicates that the target device has no performance abnormality for that index; otherwise, a performance abnormality exists. Checking each performance index in this way identifies the problem causing the performance abnormality, which facilitates subsequent repair.
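The range check can be sketched as follows. The index names and threshold values in the usage note are invented for illustration, and treating a value exactly equal to a threshold as in range is an assumption, since the embodiment does not state how boundary values are handled.

```python
from typing import Dict, List, Mapping, Optional, Tuple

# (lowest threshold, highest threshold); None means that bound is not set
Range = Tuple[Optional[float], Optional[float]]


def abnormal_indices(
    performance_data: Mapping[str, float], ranges: Mapping[str, Range]
) -> List[str]:
    """Return the performance indices whose value lies outside its range."""
    abnormal = []
    for name, (low, high) in ranges.items():
        value = performance_data.get(name)
        if value is None:
            continue  # no data collected for this index; skipped in this sketch
        below = low is not None and value < low
        above = high is not None and value > high
        if below or above:
            abnormal.append(name)
    return abnormal
```

For instance, with a highest threshold of 0.85 on CPU utilization and a lowest threshold of 100 on QPS, a device reporting 0.93 and 250 would be flagged for CPU utilization only. Returning the offending indices rather than a single flag keeps the information needed for the later repair step.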
In one embodiment of the invention, the business services may be migrated as follows.
When providing a business service to a user, the target device may obtain an address of the service. In this case, the address obtained by the target device may be acquired and sent to the standby device, so that the standby device continues to provide the service to the user according to the received address.
In addition, the service data used by the target device while providing the business service can be written, through measures such as persisting the data to disk, into a storage unit that is independent of both the target device and the standby device. When the target device no longer provides the service, the standby device can then obtain the service data from the storage unit and use it normally while providing the business service to the user.
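The two migration measures above, handing over the address and keeping service data in an independent storage unit, might be sketched as below. `SharedStore`, `Device`, and `migrate` are hypothetical names, the store is an in-memory stand-in for the independent storage unit, and the address format is invented.

```python
from typing import Dict


class SharedStore:
    """In-memory stand-in for a storage unit independent of both devices."""

    def __init__(self) -> None:
        self._data: Dict[str, dict] = {}

    def write(self, service_id: str, service_data: dict) -> None:
        self._data[service_id] = dict(service_data)

    def read(self, service_id: str) -> dict:
        return dict(self._data[service_id])


class Device:
    def __init__(self, device_id: str, store: SharedStore) -> None:
        self.device_id = device_id
        self.store = store
        self.serving: Dict[str, str] = {}  # service id -> service address

    def serve(self, service_id: str, address: str, service_data: dict) -> None:
        self.serving[service_id] = address
        self.store.write(service_id, service_data)  # persist outside the device

    def take_over(self, service_id: str, address: str) -> dict:
        self.serving[service_id] = address
        return self.store.read(service_id)


def migrate(service_id: str, target: Device, standby: Device) -> dict:
    """Hand the address to the standby device, which reloads the service data."""
    address = target.serving.pop(service_id)
    return standby.take_over(service_id, address)
```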
Therefore, in the scheme provided by the embodiment of the invention, a performance abnormality of the target device can be determined from the performance data, and when the target device is abnormal, the business service can be migrated in time, which prevents the abnormal target device from affecting the normal operation of the business service.
In an embodiment of the present invention, referring to fig. 1b, after performing service migration, the target device with performance problem may also be repaired according to the following steps S103 to S105.
Steps S101-S102 are the same as in the embodiment of FIG. 1a and will not be described in detail here.
Step S103: configuration information of the target device for software and/or hardware is collected.
The configuration information is collected in a manner similar to the collection of performance data in step S101, with the configuration information taking the place of the performance data, so the details are not repeated here.
The configuration information is configuration data stored in the memory, and for the configured device, the configuration information is used to indicate a behavior manner of software and/or hardware installed in the operating system of the target device, and accordingly, the system behavior of the operating system can be correspondingly adjusted by modifying the configuration information. For example, the configuration information may include a maximum frequency of the CPU, and after the configuration information is modified, for example, the maximum frequency of the CPU is modified from 4GHz to 3.5GHz, and accordingly, when the target device runs the program, the operation speed of the CPU may change before and after the modification.
The configuration information may include configuration information of software or hardware. For example, in one embodiment of the invention, the configuration information includes at least one of the following: configuration information indicating a hyper-threading configuration, and configuration information indicating a performance-related configuration.
The configuration information of software can be changed by directly modifying parameter values in the configuration file associated with that configuration information on the target device. If the configuration information of hardware needs to be updated, an alarm instruction can be sent to maintenance personnel; after the maintenance personnel replace or repair the hardware in the target device, the configuration information is modified to reflect the adjusted hardware configuration.
The hyper-threading configuration may include the number of cores, and the performance-related configuration may include the number of CPUs, the memory size, and the like. Collecting multiple kinds of configuration information allows possible abnormalities to be detected more comprehensively.
The target device runs its programs and provides the service according to the configuration information; that is, modifying the configuration information adjusts how the target device runs.
Step S104: if it is determined, based on the collected configuration information, that the target device has a configuration abnormality, set the device state of the target device to an unavailable state.
The device status of the target device indicates the availability of the target device, i.e. whether it can be used to provide a business service to the user. The device state of the target device may be represented by a preset state identifier, for example, 0 represents an unavailable state, 1 represents an available state, and the like; alternatively, different character information may be used to indicate different device states, which is not limited in the embodiment of the present invention.
In an embodiment of the present invention, a user may obtain the business service in the following manner: a management device is set up, and the management device selects, from the candidate device cluster, a device whose device state is the available state as the target device to provide the service. Accordingly, when the management device finds that a device is in the unavailable state, it does not select that device to provide the service, so the device in the unavailable state can go offline for maintenance and adjustment without providing the business service.
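The selection performed by the management device can be sketched with the 0/1 state identifiers from the example above. The function name and the first-available policy are assumptions; the embodiment does not say how the management device chooses among several available devices.

```python
from typing import Mapping, Optional

UNAVAILABLE = 0  # device is offline for maintenance and must not be selected
AVAILABLE = 1    # device may be selected to provide the business service


def select_target(device_states: Mapping[str, int]) -> Optional[str]:
    """Return the first device whose state is AVAILABLE, or None if none is."""
    for device_id, state in device_states.items():
        if state == AVAILABLE:
            return device_id
    return None
```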
In one embodiment of the present invention, determining whether the target device has a configuration exception may include:
judging whether each configuration item of the configuration information is consistent with the corresponding configuration item of the set configuration information, and if not, determining that the target device has a configuration abnormality.
The configuration information may include settings for a plurality of configuration items; for example, the maximum main frequency of the CPU, the number of cores, and the like may each be a configuration item. The set configuration information corresponding to each configuration item can be determined from the experience of staff or from the configuration items of existing devices in the device cluster that operate normally.
When the device cluster is used to provide services for users and the devices in the cluster are initialized, the devices may be set up according to the set configuration information. However, while actually providing the service, the configuration information of a device may be changed by a failure the device encounters, or the target device may be moved to another device cluster that has different set configuration information. In either case the configuration information of the target device differs from the set configuration information, so a configuration abnormality exists. By comparing the configuration items of the configuration information with those of the set configuration information, a target device whose configuration is inconsistent with the set configuration information can be found in time, which identifies the cause of the abnormality and makes adjustment easier.
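The item-by-item comparison can be sketched as a dictionary diff. The configuration item names in the usage note are invented, and treating an item missing from the device as a mismatch is an assumption.

```python
from typing import List, Mapping


def mismatched_items(
    actual: Mapping[str, object], set_config: Mapping[str, object]
) -> List[str]:
    """Return the configuration items whose collected value differs from the
    set configuration information (items missing on the device also count)."""
    return sorted(
        item for item, expected in set_config.items() if actual.get(item) != expected
    )


def has_configuration_abnormality(
    actual: Mapping[str, object], set_config: Mapping[str, object]
) -> bool:
    return bool(mismatched_items(actual, set_config))
```

For example, a device reporting a maximum CPU frequency of 3.5 GHz against a set value of 4 GHz would yield that single item as mismatched, which is what the later update step needs.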
Step S105: setting a device state of the target device to an available state in response to the configuration information of the target device being updated.
The configuration information may be updated according to a performance index indicating that there is a performance abnormality, or may be updated according to set configuration information, which may be specifically referred to in the following embodiments, and is not described in detail here.
Therefore, in the scheme provided by the embodiment of the invention, the performance abnormality of the target equipment can be determined according to the performance data, and the configuration abnormality of the target equipment can also be determined according to the configuration information, that is, more comprehensive information about the target equipment is collected, so that the equipment abnormality can be more comprehensively and sufficiently detected, the abnormality can be conveniently processed, and the performance problem of the target equipment can be solved.
When resolving a performance problem of the target device, the target device can be adjusted directly by updating the configuration information as soon as an abnormality is detected, without waiting for feedback after the business side has been affected by the performance problem. This automates the handling, avoids the drawbacks of delayed response and manual intervention, and can greatly improve service stability and processing efficiency.
In addition, under the condition that the target equipment has abnormity, the scheme provided by the embodiment of the invention can timely transfer the service, and set the target equipment to be in an unavailable state until the configuration information is updated, so that the influence of the abnormal target equipment with performance problems on the use experience of a user is avoided.
A specific implementation of updating the configuration information based on the performance indicators indicating the presence of a performance anomaly is described below.
In an embodiment of the present invention, the configuration information of the target device may be updated in the following manner:
determining a first configuration item that has a preset correspondence with a first performance index, where the first performance index is the performance index reflecting that the target device has a performance abnormality; and updating the configuration information corresponding to the first configuration item of the target device according to the set configuration information corresponding to the first configuration item.
The correspondence may be set according to the association between the configuration and the performance index found when the performance test is performed on the device.
For example, if it is found in the performance test that the fan of the target device affects the temperature of the target device, a preset corresponding relationship between the first performance index "temperature" and the first configuration item "fan speed" may be set.
From the above, when the first performance index is determined, the first configuration item is determined, so that the setting of the first configuration item recorded in the configuration information can be obtained. This allows the configuration item of the target device to be adjusted according to the obtained settings, thereby enabling configuration information update. The configuration information of the target equipment can be updated according to the configuration items which have the corresponding relation with the performance indexes, and the updated first configuration item is related to the performance indexes with abnormal performance, so that the setting of the first configuration item is modified, the target equipment runs according to the new configuration information, the performance abnormity represented by the performance indexes can be correspondingly solved, and the equipment performance problem can be accurately processed.
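A minimal sketch of this update path follows, using the temperature and fan speed example as the preset correspondence. The table contents, key names, and values are illustrative assumptions, not values from the embodiment.

```python
from typing import Dict, Iterable, Mapping

# Assumed preset correspondence: performance index -> configuration item
INDEX_TO_CONFIG_ITEM = {"temperature": "fan_speed_rpm"}


def update_for_abnormal_indices(
    config: Mapping[str, object],
    set_config: Mapping[str, object],
    abnormal_indices: Iterable[str],
    mapping: Mapping[str, str] = INDEX_TO_CONFIG_ITEM,
) -> Dict[str, object]:
    """Return a copy of config in which each first configuration item (the one
    mapped from an abnormal performance index) takes its set value."""
    updated = dict(config)
    for index in abnormal_indices:
        item = mapping.get(index)
        if item is not None and item in set_config:
            updated[item] = set_config[item]
    return updated
```

Only the items tied to an abnormal index are touched, so unrelated settings on the device are left as they were.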
In one embodiment of the present invention, the configuration information of the target device is updated in the following manner:
updating the configuration information corresponding to a second configuration item of the target device according to the set configuration information corresponding to the second configuration item, where the second configuration item is the configuration item reflecting that the target device has a configuration abnormality.
Because the target device and the setting configuration information both have the same configuration item, after the second configuration item is determined, the second configuration item of the target device is set according to the setting parameters under the configuration item which is the same as the second configuration item in the setting configuration information, that is, the updating is completed.
Therefore, the configuration information of the target equipment is updated according to the set configuration information, so that the optimal configuration can be preset manually, the target equipment is adjusted to operate according to the optimal configuration, the operation performance of the target equipment is improved, and the performance problem is solved.
The overall flow of the scheme provided by the embodiment of the present invention is described below according to the flow diagram of the device management method shown in fig. 2.
As shown in fig. 2, service deployment is performed first, so that the target device provides the business service to the user. While the target device is running, a monitor is started so that it collects the performance indices, that is, obtains the performance data corresponding to each performance index. Specifically, this collection can be performed by the monitor of an existing software tool, for example, Prometheus.
In addition, plug-in deployment is performed and the plug-in is started, so that configuration information is collected in real time while the target device runs; this is the step in which the plug-in acquires the machine configuration. For example, an automation plug-in issued by hub may be used to collect the configuration information of the target device periodically.
After the performance data and the configuration information are obtained, the collected data is stored, that is, the obtained performance data and configuration information are recorded. The performance data and configuration information can be stored in the Jarvis database.
The preset thresholds and configuration are acquired as preset data, and the collected data is compared with the preset data. On one hand, the performance data is compared with the preset thresholds to determine whether the performance data corresponding to each performance index lies within the data range specified by the preset thresholds; on the other hand, the configuration information is compared with the configuration shown in the figure, that is, the set configuration information.
And if the comparison result shows that the performance abnormality and/or the configuration abnormality exist, updating the machine configuration. For a specific updating manner, reference is made to a specific implementation manner of updating the configuration information in the foregoing embodiment, and details are not described here. In addition, when the configuration is updated, the standby equipment which provides the same service in the equipment cluster can be used for continuing working to provide the business service, for a user, the target equipment and the standby equipment provide the same business service, and the standby equipment does not have the performance problem of the target equipment. Therefore, through the processes of information acquisition and comparison with preset data, performance problems can be found in advance before user perception, the availability of the cluster is improved, and the business service requirements of the user can be timely responded through cooperation of a plurality of devices in the cluster.
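The overall flow can be summarized in one sketch that strings the steps together: compare against preset data, migrate on a performance abnormality, mark the device unavailable on a configuration abnormality, update, and mark it available again. All names, the in-memory `Device` record, and the immediate software-style update are assumptions; a hardware update would instead wait for maintenance personnel.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Mapping, Optional, Tuple

UNAVAILABLE, AVAILABLE = 0, 1
Range = Tuple[Optional[float], Optional[float]]  # (lowest, highest) threshold


@dataclass
class Device:
    device_id: str
    metrics: Dict[str, float]
    config: Dict[str, object]
    state: int = AVAILABLE
    services: List[str] = field(default_factory=list)


def has_performance_abnormality(metrics: Mapping[str, float],
                                ranges: Mapping[str, Range]) -> bool:
    """True if any collected value lies outside its preset range."""
    for name, (low, high) in ranges.items():
        value = metrics.get(name)
        if value is None:
            continue
        if (low is not None and value < low) or (high is not None and value > high):
            return True
    return False


def manage(target: Device, cluster: List[Device],
           ranges: Mapping[str, Range], set_config: Mapping[str, object]) -> List[str]:
    events: List[str] = []
    # Performance check and migration to a standby device with normal performance
    if has_performance_abnormality(target.metrics, ranges):
        standby = next(
            (d for d in cluster
             if d is not target and d.state == AVAILABLE
             and not has_performance_abnormality(d.metrics, ranges)),
            None,
        )
        if standby is not None:
            standby.services.extend(target.services)
            target.services.clear()
            events.append(f"migrated to {standby.device_id}")
    # Configuration check against the set configuration information
    mismatched = [k for k, v in set_config.items() if target.config.get(k) != v]
    if mismatched:
        target.state = UNAVAILABLE
        events.append("unavailable")
        for item in mismatched:  # update the machine configuration
            target.config[item] = set_config[item]
        target.state = AVAILABLE  # configuration updated, device usable again
        events.append("available")
    return events
```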
Corresponding to the method embodiment, the embodiment of the invention also provides a device management device.
Referring to fig. 3, an embodiment of the present invention provides a schematic structural diagram of an apparatus management device, where the apparatus includes:
the performance data acquisition module 301 is configured to acquire, in real time, performance data corresponding to each performance index of a target device that provides a service;
a service migration module 302, configured to determine a standby device and migrate the service to the standby device if it is determined that the target device has performance abnormality based on the collected performance data.
Therefore, in the scheme provided by the embodiment of the invention, a performance abnormality of the target device can be determined from the performance data, and when the target device is abnormal, the business service can be migrated in time, which prevents the abnormal target device from affecting the normal operation of the business service.
In one embodiment of the present invention, the apparatus further comprises:
the configuration information acquisition module is used for acquiring the configuration information of the target equipment aiming at the software and/or the hardware;
the first state setting module is used for setting the equipment state of the target equipment to be an unavailable state if the target equipment is determined to have abnormal configuration based on the acquired configuration information;
a second state setting module, configured to set a device state of the target device to an available state in response to configuration information of the target device being updated.
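The two state setting modules can be pictured as a small state machine. The sketch below is only an illustration under assumptions: the class `TargetDevice`, the enum `DeviceState`, the method names and the `hyper_threading` configuration item are hypothetical and do not come from the patent text.

```python
from enum import Enum
from typing import Dict


class DeviceState(Enum):
    AVAILABLE = "available"
    UNAVAILABLE = "unavailable"


class TargetDevice:
    """Hypothetical device record holding configuration and state."""

    def __init__(self, configuration: Dict[str, str]) -> None:
        self.configuration = dict(configuration)
        self.state = DeviceState.AVAILABLE

    def check_configuration(self, set_configuration: Dict[str, str]) -> DeviceState:
        # First state setting: any item that differs from the set
        # configuration marks the device as unavailable.
        if any(self.configuration.get(k) != v for k, v in set_configuration.items()):
            self.state = DeviceState.UNAVAILABLE
        return self.state

    def apply_update(self, new_configuration: Dict[str, str]) -> DeviceState:
        # Second state setting: once the configuration has been
        # updated, the device becomes available again.
        self.configuration.update(new_configuration)
        self.state = DeviceState.AVAILABLE
        return self.state


expected = {"hyper_threading": "on"}
device = TargetDevice({"hyper_threading": "off"})
state_after_check = device.check_configuration(expected)
state_after_update = device.apply_update(expected)
```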
In an embodiment of the present invention, the business service migration module 302 determines whether the target device has a performance abnormality in the following manner:
for each performance index, judging whether the performance data corresponding to the performance index is within the performance data range corresponding to that performance index, and if not, determining that the target device has a performance abnormality;
and/or
the first state setting module determines whether the target device has a configuration abnormality in the following manner:
judging whether the configuration items of the configuration information are consistent with the configuration items of the set configuration information, and if not, determining that the target device has a configuration abnormality.
Through the detection of the performance indexes, the problem causing performance abnormity is determined, and the subsequent repair is facilitated.
By comparing the configuration items of the configuration information with the configuration items of the set configuration information, the target equipment with the configuration information inconsistent with the set configuration information can be found in time, so that the reason for generating the abnormity is determined, and the adjustment is facilitated.
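The two checks above can be sketched as functions that also report which performance index or configuration item caused the abnormality, which is what makes later repair easier. The function names, index names, ranges and configuration items below are assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def abnormal_indexes(
    performance: Dict[str, float],
    ranges: Dict[str, Tuple[float, float]],
) -> List[str]:
    """Return the performance indexes whose data falls outside its range.

    Assumes every index listed in `ranges` has been collected.
    """
    return [
        name
        for name, (low, high) in ranges.items()
        if not (low <= performance[name] <= high)
    ]


def mismatched_items(
    configuration: Dict[str, str],
    set_configuration: Dict[str, str],
) -> List[str]:
    """Return the configuration items that differ from the set configuration."""
    return [
        item
        for item, expected in set_configuration.items()
        if configuration.get(item) != expected
    ]


# Hypothetical collected data and preset data.
bad_indexes = abnormal_indexes(
    {"cpu_usage": 95.0, "network_usage": 40.0, "qps": 120.0},
    {"cpu_usage": (0.0, 90.0), "network_usage": (0.0, 80.0), "qps": (100.0, 10000.0)},
)
bad_items = mismatched_items(
    {"hyper_threading": "off", "power_profile": "performance"},
    {"hyper_threading": "on", "power_profile": "performance"},
)
```

A non-empty `bad_indexes` corresponds to a performance abnormality, and a non-empty `bad_items` to a configuration abnormality.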
In one embodiment of the present invention, the apparatus further comprises:
a first configuration update module, configured to update the configuration information of the target device by:
determining a first configuration item that has a preset correspondence with a first performance index, wherein the first performance index is a performance index reflecting that the target device has a performance abnormality; and updating the configuration information corresponding to the first configuration item of the target device according to the set configuration information corresponding to the first configuration item.
The configuration information of the target equipment can be updated according to the configuration items which have the corresponding relation with the performance indexes, and the updated first configuration item is related to the performance indexes with abnormal performance, so that the setting of the first configuration item is modified, the target equipment runs according to the new configuration information, the performance abnormity represented by the performance indexes can be correspondingly solved, and the equipment performance problem can be accurately processed.
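The preset correspondence between performance indexes and configuration items can be sketched as a lookup table. The table contents, the function `update_first_config_items` and the item names are hypothetical; the patent does not specify which index maps to which item.

```python
from typing import Dict, List

# Assumed preset correspondence: performance index -> configuration item.
INDEX_TO_CONFIG_ITEM: Dict[str, str] = {
    "cpu_usage": "hyper_threading",
    "network_usage": "nic_queue_count",
}


def update_first_config_items(
    configuration: Dict[str, object],
    set_configuration: Dict[str, object],
    abnormal: List[str],
    mapping: Dict[str, str],
) -> Dict[str, object]:
    """Update only the items tied to the abnormal performance indexes."""
    updated = dict(configuration)
    for index in abnormal:
        item = mapping.get(index)
        # Skip indexes with no mapped item or no set value for that item.
        if item is not None and item in set_configuration:
            updated[item] = set_configuration[item]
    return updated


new_configuration = update_first_config_items(
    configuration={"hyper_threading": "off", "nic_queue_count": 4},
    set_configuration={"hyper_threading": "on", "nic_queue_count": 8},
    abnormal=["cpu_usage"],
    mapping=INDEX_TO_CONFIG_ITEM,
)
```

Only `hyper_threading` changes here, because `cpu_usage` is the only abnormal index; the network-related item is left untouched.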
In one embodiment of the present invention, the apparatus further comprises:
a second configuration updating module, configured to update the configuration information of the target device by:
and updating the configuration information corresponding to the second configuration item of the target equipment according to the set configuration information corresponding to the second configuration item reflecting that the target equipment has abnormal configuration.
Therefore, the configuration information of the target equipment is updated according to the set configuration information, so that the optimal configuration can be preset manually, the target equipment is adjusted to operate according to the optimal configuration, the operation performance of the target equipment is improved, and the performance problem is solved.
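Updating the second configuration items amounts to overwriting every item that deviates from the set configuration. The following is a minimal sketch under assumptions; `update_second_config_items` and the item names are hypothetical, and returning a record of the changes is an addition for illustration only.

```python
from typing import Dict, Tuple


def update_second_config_items(
    configuration: Dict[str, str],
    set_configuration: Dict[str, str],
) -> Tuple[Dict[str, str], Dict[str, Tuple[object, str]]]:
    """Overwrite deviating items and report (old, new) for each change."""
    updated = dict(configuration)
    changed: Dict[str, Tuple[object, str]] = {}
    for item, expected in set_configuration.items():
        current = updated.get(item)
        if current != expected:
            changed[item] = (current, expected)
            updated[item] = expected
    return updated, changed


repaired, changes = update_second_config_items(
    {"hyper_threading": "off", "power_profile": "performance"},
    {"hyper_threading": "on", "power_profile": "performance"},
)
```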
In one embodiment of the invention, the performance indicator comprises at least one of the following information: the performance index representing the occupation degree of the computing resources, the performance index representing the occupation degree of the network resources and the performance index representing the query rate per second for querying the service of the target equipment;
and/or
The configuration information includes at least one of the following information: configuration information indicating a hyper-thread configuration, configuration information indicating a performance-related configuration.
By collecting various configuration information, possible abnormalities can be detected more comprehensively.
In an embodiment of the present invention, the business service is a data service for performing data processing based on a neural network model.
In this way, different business services can be provided by performing data processing on different data based on the neural network model.
In an embodiment of the present invention, the standby device is a device whose performance data conforms to the performance indexes.
Thus, the business service can be migrated to the standby equipment with normal performance, so that the user can stably acquire the required service from the standby equipment.
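Selecting a standby device whose performance data conforms to the performance indexes can be sketched as below. The function `select_standby`, the device names and the range values are assumptions, and picking the first conforming candidate is just one possible policy.

```python
from typing import Dict, Optional, Tuple


def select_standby(
    candidates: Dict[str, Dict[str, float]],
    ranges: Dict[str, Tuple[float, float]],
) -> Optional[str]:
    """Return the first candidate whose data lies within every range."""
    for name, performance in candidates.items():
        if all(
            low <= performance[index] <= high
            for index, (low, high) in ranges.items()
        ):
            return name
    # No candidate currently has normal performance.
    return None


standby = select_standby(
    candidates={
        "node-b": {"cpu_usage": 93.0, "network_usage": 30.0},
        "node-c": {"cpu_usage": 35.0, "network_usage": 20.0},
    },
    ranges={"cpu_usage": (0.0, 90.0), "network_usage": (0.0, 80.0)},
)
```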
An embodiment of the present invention further provides an electronic device, as shown in fig. 4, including a processor 401, a communication interface 402, a memory 403, and a communication bus 404, where the processor 401, the communication interface 402, and the memory 403 complete mutual communication through the communication bus 404,
a memory 403 for storing a computer program;
the processor 401 is configured to implement the steps of the device management method according to any of the foregoing embodiments when executing the program stored in the memory 403.
The communication bus mentioned for the above electronic device may be a Peripheral Component Interconnect (PCI) bus, an Extended Industry Standard Architecture (EISA) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, and so on. For ease of illustration, only one thick line is shown in the figure, but this does not mean that there is only one bus or one type of bus.
The communication interface is used for communication between the electronic device and other devices.
The memory may include a Random Access Memory (RAM), and may also include a Non-Volatile Memory (NVM), such as at least one disk memory. Optionally, the memory may also be at least one storage device located remotely from the processor.
The processor may be a general-purpose processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; it may also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or another programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
In still another embodiment of the present invention, a computer-readable storage medium is further provided, in which a computer program is stored, and the computer program, when executed by a processor, implements the device management method in any of the above embodiments.
In yet another embodiment provided by the present invention, there is also provided a computer program product containing instructions which, when run on a computer, cause the computer to perform the device management method of any of the above embodiments.
In the above embodiments, the implementation may be realized wholly or partially by software, hardware, firmware, or any combination thereof. When implemented in software, it may be realized wholly or partially in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the present invention are produced in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable device. The computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another; for example, they may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired manner (e.g., coaxial cable, optical fiber, Digital Subscriber Line (DSL)) or a wireless manner (e.g., infrared, radio, microwave). The computer-readable storage medium may be any available medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium may be a magnetic medium (e.g., floppy disk, hard disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)).
It is noted that, herein, relational terms such as first and second are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on differences from other embodiments. In particular, as for the apparatus, device and storage medium embodiments, since they are substantially similar to the method embodiments, the description is relatively simple, and reference may be made to some descriptions of the method embodiments for relevant points.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (11)

1. A method for device management, the method comprising:
acquiring performance data corresponding to each performance index of target equipment providing business service in real time;
and if the target equipment is determined to have abnormal performance based on the acquired performance data, determining standby equipment and transferring the service to the standby equipment.
2. The method of claim 1, further comprising:
acquiring configuration information of the target equipment aiming at software and/or hardware;
if the target equipment is determined to have abnormal configuration based on the acquired configuration information, setting the equipment state of the target equipment to be an unavailable state;
setting a device state of a target device to an available state in response to configuration information of the target device being updated.
3. The method of claim 1,
determining whether a performance anomaly exists for the target device by:
judging whether the performance data corresponding to the performance index is in the performance data range corresponding to the performance index or not aiming at each performance index, and if not, determining that the target equipment has abnormal performance;
and/or
Determining whether the target device has a configuration exception by:
judging whether the configuration items of the configuration information are consistent with the configuration items of the set configuration information, and if not, determining that the target device has a configuration abnormality.
4. The method of claim 2, wherein the configuration information of the target device is updated by:
determining a first configuration item that has a preset correspondence with a first performance index, wherein the first performance index is a performance index reflecting that the target device has a performance abnormality;
and updating the configuration information corresponding to the first configuration item of the target equipment according to the set configuration information corresponding to the first configuration item.
5. The method of claim 2, wherein the configuration information of the target device is updated by:
and updating the configuration information corresponding to the second configuration item of the target equipment according to the set configuration information corresponding to the second configuration item reflecting that the target equipment has abnormal configuration.
6. The method of claim 2,
the performance indicator includes at least one of the following information: the performance index representing the occupation degree of the computing resources, the performance index representing the occupation degree of the network resources and the performance index representing the query rate per second for querying the service of the target equipment;
and/or
The configuration information includes at least one of: configuration information indicating a hyper-thread configuration, configuration information indicating a performance-related configuration.
7. The method according to any of claims 1-6, characterized in that the business service is a data service for data processing based on a neural network model.
8. The method according to any one of claims 1-6, wherein the standby device is a device whose performance data conforms to the performance indexes.
9. An apparatus for device management, the apparatus comprising:
the performance data acquisition module is used for acquiring performance data corresponding to each performance index of target equipment providing business service in real time;
and the business service migration module is used for determining standby equipment and migrating the business service to the standby equipment if the target equipment is determined to have abnormal performance based on the acquired performance data.
10. An electronic device, characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with each other through the communication bus;
a memory for storing a computer program;
a processor for implementing the method steps of any one of claims 1 to 8 when executing a program stored in a memory.
11. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of the claims 1-8.
CN202211721975.4A 2022-12-30 2022-12-30 Equipment management method, device, equipment and storage medium Pending CN115981982A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211721975.4A CN115981982A (en) 2022-12-30 2022-12-30 Equipment management method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115981982A true CN115981982A (en) 2023-04-18

Family

ID=85969778

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211721975.4A Pending CN115981982A (en) 2022-12-30 2022-12-30 Equipment management method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115981982A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination