CN116955093B - CPU state monitoring method, device, system, equipment and storage medium - Google Patents

CPU state monitoring method, device, system, equipment and storage medium Download PDF

Info

Publication number
CN116955093B
CN116955093B CN202311219947.7A CN202311219947A CN116955093B CN 116955093 B CN116955093 B CN 116955093B CN 202311219947 A CN202311219947 A CN 202311219947A CN 116955093 B CN116955093 B CN 116955093B
Authority
CN
China
Prior art keywords
service terminal
target
register
configuration file
monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311219947.7A
Other languages
Chinese (zh)
Other versions
CN116955093A (en
Inventor
伍桓民
杨文娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ziguang Hengyue Technology Co ltd
Original Assignee
Ziguang Hengyue Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ziguang Hengyue Technology Co ltd filed Critical Ziguang Hengyue Technology Co ltd
Priority to CN202311219947.7A priority Critical patent/CN116955093B/en
Publication of CN116955093A publication Critical patent/CN116955093A/en
Application granted granted Critical
Publication of CN116955093B publication Critical patent/CN116955093B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3024Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • H04L69/161Implementation details of TCP/IP or UDP/IP stack architecture; Specification of modified or new header fields
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/22Parsing or analysis of headers
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mathematical Physics (AREA)
  • Computer Security & Cryptography (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The embodiment of the invention provides a method, a device, a system, equipment and a storage medium for monitoring a CPU state, and relates to the technical field of computers. The CPU state monitoring method is suitable for the operation and maintenance terminal, and comprises the following steps: screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the preselection service terminal is provided with a BMC supporting the monitoring of the CPU state; based on the information list of the preselected service terminal equipment, at least one preselected service terminal is selected as a target service terminal, each target service terminal is triggered to monitor the CPU state according to the configuration file of the target service terminal, and the CPU state monitoring result of all the target service terminals is obtained. The embodiment of the invention can realize the technical effects of automatically monitoring the CPU states of a large number of service terminals and improving the operation and maintenance efficiency of the service terminals.

Description

CPU state monitoring method, device, system, equipment and storage medium
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method, an apparatus, a system, a device, and a storage medium for monitoring a CPU state.
Background
The CPU (Central Processing Unit/Processor) is one of the core devices of the computer, if the CPU fails in the use process, the software or the system is abnormally operated, and if the CPU is heavy, the equipment is in downtime, and the equipment cannot be started normally, and other abnormal conditions are caused.
The CPU of the domestic autonomous platform and the relevant domestic BMC (Baseboard Management Controller ) and BIOS (Basic Input Output System, basic input output system) firmware versions used in a matched manner do not support to check the state and abnormal information of the CPU on a BMC management page, and if the CPU is wrong and causes the breakdown of a server, no clear information and log about the CPU can be obtained from the BMC, so that the CPU is very unfavorable for daily maintenance and problem positioning. In order to locate the problem related to the CPU, at present, whether the CPU is normal or not, whether PCIe (Peripheral Component Interconnect express, a high-speed serial computer expansion bus standard) exists or not, and errors in hardware related aspects such as internal memory, FIT (Fast Interconnect Transport, a direct connection port between chips is used by interconnection between two paths of CPUs) and the like are mainly determined by manually reading CPU registers of all servers one by one.
With the advancement of domestic automation in the IT (internet Technology ) field, the S2500 CPU is basically a large-scale internet data center applied in the industry field, and generally has a number of hundreds of servers. Under such a scenario, if a manual reading is required during routine maintenance or problem positioning, and whether the return value is correct is determined, the operation is too cumbersome and not convenient.
Disclosure of Invention
The embodiment of the invention aims to provide a CPU state monitoring method, device, system, equipment and storage medium, which are used for realizing the technical effects of automatically monitoring the CPU states of a large number of service terminals and improving the operation and maintenance efficiency of the service terminals.
In a first aspect, an embodiment of the present invention provides a method for monitoring a CPU state, which is applicable to an operation and maintenance terminal, where the method includes:
screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
and selecting at least one pre-selected service terminal as a target service terminal based on the pre-selected service terminal equipment information list, and triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals.
In the implementation process, configuration files are stored in each pre-selected service terminal in advance, at least one target service terminal is selected from all pre-selected service terminals by the operation and maintenance terminal, each target service terminal is triggered to monitor the CPU state according to the configuration files of the target service terminal, and the CPU state monitoring result of all target service terminals is obtained, so that the CPU states of a large number of service terminals can be monitored automatically, and the operation and maintenance efficiency of the service terminals is improved.
Further, the method includes the steps of selecting all the preselected service terminals from all the service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal, wherein the method specifically includes the steps of:
sending a wakeup request message to all the service terminals, so that all the preselected service terminals analyze the wakeup request message and return a wakeup response message;
correspondingly extracting the equipment information of each pre-selected service terminal from the wake-up response message returned by each pre-selected service terminal;
and generating the pre-selected service terminal equipment information list according to the equipment information of all the pre-selected service terminals.
In the implementation process, the operation and maintenance terminal sends the awakening request message which can be analyzed by the preselection service terminal only to all the service terminals, and the equipment information of each preselection service terminal is correspondingly extracted from the awakening response message returned by each preselection service terminal to generate a preselection service terminal equipment information list, so that all the preselection service terminals can be rapidly and accurately screened and the equipment information of each preselection service terminal can be acquired, and further the operation and maintenance efficiency of the service terminal can be improved.
Further, before the selecting at least one pre-selected service terminal as a target service terminal based on the pre-selected service terminal device information list and triggering each target service terminal to monitor the CPU state according to its own configuration file, the method further includes:
responding to a configuration file customization request initiated by a user, and modifying an initial configuration file based on a predefined configuration file writing rule to obtain an updated configuration file;
and selecting at least one pre-selected service terminal as a customized service terminal based on the pre-selected service terminal equipment information list, and triggering each customized service terminal to download the updated configuration file as a configuration file of the customized service terminal.
In the implementation process, the operation and maintenance terminal responds to the configuration file customization request initiated by the user to write the update configuration file, and the selected customization service terminals are triggered to download the update configuration file as the configuration file of the operation and maintenance terminal, so that the operation and maintenance requirements of the user can be better met, and the operation and maintenance efficiency of the service terminal is further improved.
Further, the triggering each customized service terminal to download the updated configuration file as the configuration file thereof specifically includes:
And for each customized service terminal, sending a configuration file update trigger message to the customized service terminal, so that the customized service terminal downloads the update configuration file and reports the update configuration file downloading result based on a TFTP (transport stream transmission protocol) after a preset interval time.
In the implementation process, after the operation and maintenance terminal triggers the customized service terminal to download the updated configuration file and report the updated configuration file downloading result based on the TFTP protocol at a preset interval time, the configuration files of each customized service terminal can be ensured to be effectively updated, and the operation and maintenance efficiency of the service terminal is further improved.
Further, the triggering each target service terminal monitors the CPU state according to its own configuration file to obtain the CPU state monitoring results of all the target service terminals, which specifically includes:
for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute:
according to a first configuration file, taking all registers enabling monitoring tasks in the target service terminal CPU as target registers;
according to a second configuration file, for each target register, when the target register starts an initialization operation, initializing the target register;
Executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register;
judging whether the target register runs abnormally or not according to the information of the target register based on a register monitoring rule of the target register, and obtaining a monitoring result of the target register;
combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal;
the configuration file of the target service terminal comprises a first configuration file and a second configuration file, the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task enabling state and a monitoring task executing period, and the second configuration file comprises an initialization operation enabling state, a register information reading instruction and a register monitoring rule of all registers in the CPU of the target service terminal.
In the implementation process, the operation and maintenance terminals trigger each target service terminal to monitor the CPU state according to the configuration file of each target service terminal, so that each target service terminal can rapidly and accurately monitor the information of the designated register in the configuration file to assist the operation and maintenance terminals to monitor the CPU state, the CPU states of a large number of service terminals can be monitored automatically, and the operation and maintenance efficiency of the service terminals is improved.
Further, the triggering each target service terminal to monitor the CPU state according to its own configuration file, so as to obtain the CPU state monitoring results of all the target service terminals, and specifically further includes:
when the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by the target service terminal; the target register exception prompt information comprises an ID of the target register;
and sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send information of each register of the CPU.
In the implementation process, when the operation and maintenance terminal receives the target register abnormality prompt information sent by the target service terminal, the operation and maintenance terminal sends a register information collecting and triggering message to the target service terminal to trigger the target service terminal to send the information of each register of the CPU, so that the information of each register of the CPU of the target service terminal can be timely obtained, and a user can conveniently conduct abnormality analysis according to the information of each register of the CPU of the target service terminal.
In a second aspect, an embodiment of the present invention provides a CPU state monitoring device, which is applicable to an operation and maintenance terminal, and the device includes:
The service terminal screening module is used for screening all preselected service terminals from all service terminals and generating a preselected service terminal equipment information list according to the equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
and the CPU state monitoring module is used for selecting at least one preselected service terminal as a target service terminal based on the preselected service terminal equipment information list, triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal, and obtaining the CPU state monitoring results of all the target service terminals.
In a third aspect, an embodiment of the present invention provides a CPU state monitoring system, including an operation and maintenance terminal and at least one service terminal; each service terminal is in communication connection with the operation and maintenance terminal;
the operation and maintenance terminal is used for:
screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
And selecting at least one pre-selected service terminal as a target service terminal based on the pre-selected service terminal equipment information list, and triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals.
In a fourth aspect, embodiments of the present invention provide an electronic device comprising a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor; the memory is coupled to the processor and the processor, when executing the computer program, implements the CPU state monitoring method as described above.
In a fifth aspect, embodiments of the present invention provide a computer-readable storage medium including a stored computer program; wherein the computer readable storage medium is controlled to execute the CPU state monitoring method as described above when the computer program is running.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the embodiments of the present invention will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and should not be considered as limiting the scope, and other related drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a CPU status monitoring method according to a first embodiment of the present invention;
fig. 2 is a schematic structural diagram of a two-layer interactive message according to an example of the first embodiment of the present invention;
fig. 3 is a data flow diagram of an operation and maintenance terminal generating a pre-selected service terminal device information list according to an example of the first embodiment of the present invention;
fig. 4 is a schematic structural diagram of a three-layer interactive message according to an example of the first embodiment of the present invention;
FIG. 5 is a data flow diagram of an operation and maintenance terminal triggering each customized service terminal to download a data flow diagram of an update configuration file according to an example of the first embodiment of the present invention;
fig. 6 is a data flow diagram of an operation and maintenance terminal according to an example of the first embodiment of the present invention acquiring information of each register of a CPU of a target service terminal;
FIG. 7 is a schematic diagram of a CPU status monitor according to a second embodiment of the present invention;
FIG. 8 is a schematic diagram of a CPU status monitoring system according to a third embodiment of the present invention;
fig. 9 is a schematic structural diagram of an electronic device according to a fourth embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described below with reference to the accompanying drawings in the embodiments of the present invention.
It should be noted that: in the description of the present invention, the terms "first," "second," and the like are used merely to distinguish between descriptions and are not to be construed as indicating or implying relative importance. Meanwhile, step numbers herein are only for convenience of explanation of the embodiments of the present invention, and are not used as limiting the order of execution of the steps. The method provided by the embodiment of the invention can be executed by the related terminal equipment, and the operation and maintenance terminal is taken as an execution main body for illustration.
Referring to fig. 1, fig. 1 is a flowchart of a method for monitoring a CPU status according to a first embodiment of the present invention. The first embodiment of the invention provides a CPU state monitoring method which is suitable for operation and maintenance terminals, and comprises the following steps of S101-S102:
s101, screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the preselection service terminal is provided with a BMC supporting the monitoring of the CPU state;
s102, selecting at least one preselected service terminal as a target service terminal based on the information list of the preselected service terminal equipment, and triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals.
By way of example, the operation and maintenance terminal includes a mobile phone, a computer, a tablet, etc. running operation and maintenance software, and an intelligent terminal device capable of communicating with the service terminal. The service terminal is typically provided with a CPU (central processing unit ) and a BMC (Baseboard Management Controller, baseboard management controller) inside.
The CPU is an operation Core (Core) and a Control Core (Control Unit) of the service terminal. The CPU can be divided into an arithmetic unit, a register and a controller according to basic functions. The registers are used primarily to cache instructions, data, and addresses of the CPU. The register is used for receiving the instruction of the controller, transmitting the instruction of the controller to the arithmetic unit and recording the processed or to-be-processed data by the arithmetic unit. Registers can be generally classified into general purpose registers, special purpose registers, and control registers, wherein some of the registers are generally used to indicate the execution state of a device or to store important information during operation of a CPU, and can be used to monitor the state of the CPU.
BMC is also one of the core components of the service terminal. The BMC is mainly responsible for monitoring the hardware state of the service terminal and managing the service terminal, and records the behavior of related hardware through a log, and is used for prompting a user and positioning the problem. BMC is a separate, small operating system with separate IP, i.e. independent of other hardware on the service terminal, such as CPU, memory, etc., and independent of BIOS (Basic Input Output System ), operating system, etc. The BMC may interact with other components, such as the BIOS, the CPU, through different hardware interfaces, such as LPC (Low pin count), I2C (Inter-integrated circuit bus), SMBUS (System Management Bus ), serial (Serial communication interface).
A large number of servers, i.e. service terminals, are stored in the machine room. After the operation and maintenance terminal is accessed to the computer room network, screening service terminals provided with BMCs supporting the monitoring CPU state from all service terminals on site, taking all screened service terminals as preselected service terminals, acquiring equipment information of all preselected service terminals, such as IP addresses, SN bar codes and the like, and generating a preselected service terminal equipment information table according to the equipment information of all preselected service terminals. A list of preselected service terminal device information is maintained for subsequent operations based on the list of preselected service terminal device information.
When the user obtains the information list of the preselected service terminal equipment through the operation and maintenance terminal, the preselected service terminals which need to monitor the CPU state in batches can be arbitrarily selected from all the preselected service terminals according to the actual operation and maintenance requirements, and a target service terminal selection instruction is input to the operation and maintenance terminal.
The operation and maintenance terminal selects at least one preselected service terminal from all preselected service terminals as a target service terminal according to a target service terminal selection instruction input by a user based on a preselected service terminal equipment information list, and for each target service terminal, the operation and maintenance terminal communicates with the target service terminal according to equipment information of the target service terminal, such as an IP address, an SN bar code and the like, and triggers the target service terminal to monitor the CPU state according to the configuration file of the operation and maintenance terminal to obtain a CPU state monitoring result of the target service terminal, thereby obtaining the CPU state monitoring result of all the target service terminals.
According to the embodiment of the invention, the configuration files are stored in each pre-selected service terminal in advance, the operation and maintenance terminal selects at least one target service terminal from all pre-selected service terminals, and each target service terminal is triggered to monitor the CPU state according to the configuration files, so that the CPU state monitoring result of all target service terminals is obtained, the CPU states of a large number of service terminals can be automatically monitored, and the operation and maintenance efficiency of the service terminals is improved.
In an alternative embodiment, the selecting all the pre-selected service terminals from all the service terminals, and generating a pre-selected service terminal device information list according to the device information of each pre-selected service terminal specifically includes: sending a wakeup request message to all service terminals, so that all preselected service terminals analyze the wakeup request message and return a wakeup response message; correspondingly extracting the equipment information of each preselected service terminal from the wake-up response message returned by each preselected service terminal; and generating a pre-selected service terminal equipment information list according to the equipment information of all pre-selected service terminals.
As an example, the operation and maintenance terminal broadcasts and transmits a wake-up request message to all service terminals.
For example, the wake-up request message may be a two-layer interaction message. The two-layer interactive message consists of an Ethernet header and an upper layer encapsulation message, and the structural schematic diagram of the two-layer interactive message is shown in fig. 2.
In the wake-up request message shown in fig. 2, the ethernet header is the destination MAC address, i.e. the broadcast address, and the ethernet type is 0x0811; the pre-selected service terminal is distinguished from the common message according to the custom Ethernet type, and the custom protocol message comprises: message type: the message type of the self-defined message is shown, the length is 1byte, and the meaning of the message type is shown in table 1; message length: recording the total length of the message of the whole 'custom protocol' part, excluding the Ethernet header, and the length is 2byte; and (3) checksum: the checksum after CRC operation is carried out on the message of the whole custom protocol part is recorded; IP address: the length is 4 bytes, which is used for recording the IP address of the sender, and the operation and maintenance terminal and the preselected service terminal are used for exchanging addresses; SN barcode: the length is 20 bytes, and only 0x02 type wake-up response message needs to fill in SN bar code of BMC, and wake-up request message is directly filled with 0x 00.
TABLE 1
When a service terminal provided with a BMC supporting the monitoring CPU state receives the wake-up request message, the wake-up request message can be analyzed, the IP address of the operation and maintenance terminal is obtained from the wake-up request message and stored, the self equipment information such as the IP address, the SN bar code and the like is filled into the wake-up response message, and the wake-up response message is returned to the operation and maintenance terminal after a random time such as t (t is less than or equal to 10) seconds is delayed.
When the service terminal which is not provided with the BMC supporting the monitoring CPU state receives the wake-up request message, the wake-up request message cannot be analyzed, only the wake-up request message can be discarded, and the wake-up response message cannot be returned to the operation and maintenance terminal.
When the operation and maintenance terminal receives the awakening response message returned by any service terminal, the service terminal is considered to be internally provided with a BMC supporting the monitoring of the CPU state, the service terminal is used as a preselection service terminal, the equipment information of the preselection service terminal, such as an IP address, an SN bar code and the like, is extracted from the awakening response message returned by the preselection service terminal, and when the equipment information of all preselection service terminals is obtained, a preselection service terminal equipment information list is generated according to the equipment information of all preselection service terminals.
For example, assuming that all service terminals in the field are the service terminal 1, the service terminal 2 and the service terminal 3 respectively, the service terminal 1 and the service terminal 2 are both internally provided with BMCs supporting the monitoring CPU state, and the service terminal 3 is not internally provided with a BMC supporting the monitoring CPU state, the operation and maintenance terminal generates a data flow diagram of the pre-selected service terminal device information list as shown in fig. 3.
According to the embodiment of the invention, the operation and maintenance terminal sends the awakening request message which can be analyzed by the preselection service terminal only to all the service terminals, and the equipment information of each preselection service terminal is correspondingly extracted from the awakening response message returned by each preselection service terminal to generate the preselection service terminal equipment information list, so that all the preselection service terminals can be rapidly and accurately screened and the equipment information of each preselection service terminal can be acquired, and the operation and maintenance efficiency of the service terminal can be further improved.
In an optional embodiment, before the selecting at least one pre-selected service terminal as the target service terminal based on the pre-selected service terminal device information list and triggering each target service terminal to monitor the CPU state according to its configuration file, the method further includes: responding to a configuration file customization request initiated by a user, and modifying an initial configuration file based on a predefined configuration file writing rule to obtain an updated configuration file; based on the pre-selected service terminal equipment information list, at least one pre-selected service terminal is selected as a customized service terminal, and each customized service terminal is triggered to download an update configuration file as a configuration file of the customized service terminal.
As an example, when the user obtains the information list of the preselected service terminal device through the operation and maintenance terminal, the user can customize and write the configuration file according to the actual operation and maintenance requirement, and initiate a configuration file customization request to the operation and maintenance terminal.
When receiving a configuration file customizing request initiated by a user, the operation and maintenance terminal responds to the configuration file customizing request initiated by the user to acquire an initial configuration file, modifies the initial configuration file based on a predefined configuration file writing rule to acquire an updated configuration file, and stores the updated configuration file.
Assuming that the initial configuration file includes two configuration files, one configuration file is used for defining attribute information of each register of the preselected service terminal CPU, including ID, type, monitoring task enabling state, monitoring task execution period, etc. of each register, and the other configuration file is used for defining monitoring operation information of each register of the preselected service terminal CPU, including initialization operation enabling state, register information reading instruction, and register monitoring rule of each register, the predefined configuration file writing rule may include: the names and the numbers of the register objects in the two configuration files are consistent and must be in one-to-one correspondence; the integrity of the commands in the two configuration files meets the requirements of the I2C commands, etc. When the updated configuration files are obtained, whether two configuration files in the updated configuration files accord with configuration file writing rules is checked, and if not, a user can be prompted to fail to modify so as to ensure that the updated configuration files are available effectively.
When the user obtains the information list of the preselected service terminal equipment through the operation and maintenance terminal, the user can also randomly select a batch of preselected service terminals needing to update the configuration file from all the preselected service terminals according to actual operation and maintenance requirements, and input a customized service terminal selection instruction to the operation and maintenance terminal.
The operation and maintenance terminal selects at least one pre-selected service terminal from all pre-selected service terminals as a customized service terminal according to a customized service terminal selection instruction input by a user based on the pre-selected service terminal equipment information list, and for each customized service terminal, the customized service terminal is triggered to download an updated configuration file from the operation and maintenance terminal, and the configuration file of the operation and maintenance terminal is updated into an updated configuration file.
According to the embodiment of the invention, the operation and maintenance terminal responds to the configuration file customization request initiated by the user to compile the update configuration file, and the selected customization service terminals are triggered to download the update configuration file as the configuration file of the operation and maintenance terminal, so that the operation and maintenance requirements of the user can be better met, and the operation and maintenance efficiency of the service terminal is further improved.
In an alternative embodiment, the triggering each customized service terminal to download the update configuration file as the configuration file thereof specifically includes: and for each customized service terminal, sending a configuration file update trigger message to the customized service terminal, so that the customized service terminal downloads an update configuration file and reports an update configuration file downloading result based on a TFTP (transport stream transmission protocol) after a preset interval time.
As an example, the operation and maintenance terminal transmits a profile update trigger message to the customized service terminal for each customized service terminal.
For example, the profile update trigger message may be a three-layer interactive message. The three-layer interactive message is composed of UDP (User Datagram Protocol ) and an upper layer encapsulation message, and the structural schematic diagram of the three-layer interactive message is shown in figure 4. The three-layer interactive message is used for adopting IP address communication after the operation and maintenance terminal interacts IP addresses with each preselected service terminal to generate a preselected service terminal equipment information list.
The special protocol message is to encapsulate the data information content on the basis of the UDP data message, and its format is shown in fig. 4. The operation and maintenance terminal is different from other IP data messages through UDP port number 16555, and monitors with UDP port. The custom protocol message includes: message type: the message type of the self-defined message is shown, the length is 1byte, and the specific meaning is shown in table 2; message length: recording the total length of the message of the whole 'custom protocol' part, excluding the Ethernet header, and the length is 2byte; and (3) checksum: the checksum after CRC operation is carried out on the message of the whole custom protocol part is recorded; load: in general, no load is applied to the message, the message is filled to the minimum message length by 0x00, and only 0x06 error feedback message is provided with the ID (one byte corresponds to one ID) of the error register in the load.
TABLE 2
And when receiving the configuration file update trigger message, the customized service terminal downloads an update configuration file from the operation and maintenance terminal based on the TFTP protocol after a preset interval time, and updates the configuration file into an update configuration file. After the TFTP transmission flow is completed, the customized service terminal synchronously sends a transmission completion message to the operation and maintenance terminal.
TFTP (Trivial File Transfer Protocol, simple file transfer protocol) is a protocol in the TCP/IP family of protocols for simple file transfer between a client and a server, providing a less complex, less expensive file transfer service.
For example, assuming that all the customized service terminals are the service terminal 1 and the service terminal 2, respectively, the operation and maintenance terminal triggers the respective customized service terminals to download the data flow diagram of the update configuration file as shown in fig. 5.
According to the embodiment of the invention, after the operation and maintenance terminal triggers the customized service terminal to download the updated configuration file based on the TFTP protocol and report the download result of the updated configuration file after the preset interval time, each customized service terminal can be ensured to effectively update the configuration file of the customized service terminal, and the operation and maintenance efficiency of the service terminal is further improved.
In an alternative embodiment, the triggering each target service terminal to monitor the CPU state according to its own configuration file to obtain the CPU state monitoring results of all the target service terminals specifically includes: for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute: according to the first configuration file, taking all registers enabling monitoring tasks in a target service terminal CPU as target registers; according to the second configuration file, for each target register, when the target register starts the initializing operation, initializing the target register; executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register; judging whether the target register operates abnormally or not according to the information of the target register based on the register monitoring rule of the target register, and obtaining a monitoring result of the target register; combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal; the configuration files of the target service terminal comprise a first configuration file and a second configuration file, wherein the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task starting state and a monitoring task executing period, and the second configuration file comprises an initialization operation starting state of all registers in the CPU of the target service terminal, a register information reading instruction and a register monitoring rule.
As an example, for each target service terminal, the configuration file of the target service terminal itself includes a first configuration file for defining attribute information of each register of the target service terminal CPU, including an ID, a type, a monitoring task enabling state, a monitoring task execution period, and the like of each register, and a second configuration file for defining monitoring operation information of each register of the target service terminal CPU, including an initialization operation enabling state, a register information reading instruction, a register monitoring rule of each register.
For example, assume that the first configuration file and the second configuration file are both JSON format files, the first configuration file is cpuconfigure.json, and the second configuration file is register monitor task.json.
In the first configuration file cpuconfigure.json, the array JSON key pair of "CPU register" contains a plurality of different register objects named ClockSynchronization, PCIEBifModeReg, PCIELaneReg, each register object representing a register, the array key pair name is not modifiable, and the register object name can be customized with reference to the CPU register manual. The "ID" field in each register object indicates the ID of the register, the "register type" field indicates the type of the register, the support is set to a status register (value is status machine), a Counter register (value is Counter), the initialization operation enabling state and the register monitoring rule of the status register and the Counter register are different, the "register status" field indicates the monitoring task enabling state of the register, the support is set to an enabling state (value is enable) and the disabling state (value is disable). The key value of "IntervalTime" indicates the execution period of the monitoring task for each register of the target service terminal CPU, for example, 15 minutes.
In the second configuration file register task, JSON, the array type JSON key value pair of the register monitor task contains a plurality of different register objects named ClockSynchronization, PCIEBifModeReg, PCIELaneReg, the names and the numbers of the register objects in the second configuration file register monitor task are consistent with those of the register objects in the first configuration file CPU configuration, an Initialization field in each register object represents an Initialization operation enabling state of a register, a register information reading instruction of the register is supported to be set to an enabling state (the value is an Initialization operation instruction) and a disabling state (the value is null), the register information reading instruction of the register is a periodically executed monitoring instruction, the register monitoring rule of the register is recorded, and the register monitoring rule records judgment logic for a return value of the monitoring instruction and is an abstract summary rule of the register judgment logic provided for the CPU register. For example, the register monitoring rule may be to directly determine whether the monitoring command return value is consistent with the expected value, calculate an increment (for the count register) of the current monitoring command return value compared with the last monitoring command return value, or perform offset calculation on the monitoring command return value, and determine whether the data value on a bit is consistent with the expected value.
And the operation and maintenance terminal sends a CPU state monitoring trigger message to the target service terminal. For example, the CPU state monitoring trigger message may be a three-layer interactive message.
When receiving a CPU state monitoring trigger message, the target service terminal starts to monitor the CPU state according to the configuration file of the target service terminal, and specifically executes the following steps: according to the first configuration file, determining which registers in the target service terminal CPU are in an enabling state in the monitoring task enabling state, and taking the registers as target registers; according to the second configuration file, judging whether the initialization operation starting state of each target register is a starting state or not, and when the initialization operation is started by the target registers, initializing the target registers according to the initialization operation instruction; acquiring a monitoring task execution period of a target register according to a first configuration file, acquiring a register information reading instruction of the target register according to a second configuration file, and executing the register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register; according to the second configuration file, acquiring a register monitoring rule of the target register, judging whether the target register operates abnormally or not according to information of the target register based on the register monitoring rule of the target register, obtaining a monitoring result of the target register, and returning the monitoring result of the target register to the operation and maintenance terminal.
And the operation and maintenance terminal combines the monitoring results of all the target registers to obtain the CPU state monitoring result of the target service terminal.
According to the embodiment of the invention, the operation and maintenance terminals trigger each target service terminal to monitor the CPU state according to the configuration file thereof, so that each target service terminal can rapidly and accurately monitor the information of the designated register in the configuration file to assist the operation and maintenance terminals to monitor the CPU state, the CPU states of a large number of service terminals can be automatically monitored, and the operation and maintenance efficiency of the service terminals is improved.
In an optional embodiment, the triggering each target service terminal to monitor the CPU state according to its own configuration file to obtain the CPU state monitoring results of all the target service terminals, and specifically further includes: when the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by a target service terminal; the target register abnormality prompt information comprises an ID of a target register; and sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send the information of each register of the CPU.
As an example, when determining that the target register runs abnormally, the target service terminal obtains the ID of the target register according to the first configuration file, generates target register abnormality prompting information according to the ID of the target register, and sends the target register abnormality prompting information to the operation and maintenance terminal.
When the user obtains the target register abnormality prompt information through the operation and maintenance terminal, the user can select whether to acquire the information of each register of the target service terminal CPU according to the actual operation and maintenance requirement for abnormality analysis, and when the user selects to acquire the information of each register of the target service terminal CPU, the user inputs a register information collection instruction to the operation and maintenance terminal.
And when receiving a register information collection instruction input by a user, the operation and maintenance terminal sends a register information collection trigger message to the target service terminal. For example, the register information collection trigger message may be a three-layer interaction message.
When receiving a register information collection trigger message, the target service terminal collects information of each register of the target service terminal CPU, stores the information of each register of the target service terminal CPU in a register information file, such as a register information file resultfeed back. After the TFTP transmission flow is completed, the target service terminal may send a transmission completion message to the operation and maintenance terminal synchronously.
For example, assuming that the target service terminal is the service terminal 1, the service terminal 1CPU has a register abnormality, and the operation and maintenance terminal obtains the data flow chart of the information of each register of the target service terminal CPU as shown in fig. 6.
According to the embodiment of the invention, when the operation and maintenance terminal receives the target register abnormality prompt information sent by the target service terminal, the register information collection trigger message is sent to the target service terminal, the target service terminal is triggered to send the information of each register of the CPU, the information of each register of the CPU of the target service terminal can be timely obtained, and a user can conveniently conduct abnormality analysis according to the information of each register of the CPU of the target service terminal.
Referring to fig. 7, fig. 7 is a schematic structural diagram of a CPU status monitor according to a second embodiment of the present invention. A second embodiment of the present invention provides a CPU state monitoring device, adapted to an operation and maintenance terminal, the device including: the service terminal screening module 201 is configured to screen all pre-selected service terminals from all service terminals, and generate a pre-selected service terminal device information list according to device information of each pre-selected service terminal; the preselection service terminal is provided with a BMC supporting the monitoring of the CPU state; the CPU state monitoring module 202 is configured to select at least one preselected service terminal as a target service terminal based on the list of preselected service terminal device information, and trigger each target service terminal to monitor the CPU state according to its own configuration file, so as to obtain the CPU state monitoring results of all the target service terminals.
In an alternative embodiment, the selecting all the pre-selected service terminals from all the service terminals, and generating a pre-selected service terminal device information list according to the device information of each pre-selected service terminal specifically includes: sending a wakeup request message to all service terminals, so that all preselected service terminals analyze the wakeup request message and return a wakeup response message; correspondingly extracting the equipment information of each preselected service terminal from the wake-up response message returned by each preselected service terminal; and generating a pre-selected service terminal equipment information list according to the equipment information of all pre-selected service terminals.
In an alternative embodiment, the CPU state monitoring module 202 is further configured to, before the selecting at least one preselected service terminal as a target service terminal based on the list of preselected service terminal device information, trigger each target service terminal to monitor the CPU state according to its own configuration file, obtain the CPU state monitoring result of all the target service terminals, respond to a configuration file customization request initiated by a user, and modify an initial configuration file based on a predefined configuration file writing rule, so as to obtain an updated configuration file; based on the pre-selected service terminal equipment information list, at least one pre-selected service terminal is selected as a customized service terminal, and each customized service terminal is triggered to download an update configuration file as a configuration file of the customized service terminal.
In an alternative embodiment, the triggering each customized service terminal to download the update configuration file as the configuration file thereof specifically includes: and for each customized service terminal, sending a configuration file update trigger message to the customized service terminal, so that the customized service terminal downloads an update configuration file and reports an update configuration file downloading result based on a TFTP (transport stream transmission protocol) after a preset interval time.
In an alternative embodiment, the triggering each target service terminal to monitor the CPU state according to its own configuration file to obtain the CPU state monitoring results of all the target service terminals specifically includes: for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute: according to the first configuration file, taking all registers enabling monitoring tasks in a target service terminal CPU as target registers; according to the second configuration file, for each target register, when the target register starts the initializing operation, initializing the target register; executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register; judging whether the target register operates abnormally or not according to the information of the target register based on the register monitoring rule of the target register, and obtaining a monitoring result of the target register; combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal; the configuration files of the target service terminal comprise a first configuration file and a second configuration file, wherein the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task starting state and a monitoring task executing period, and the second configuration file comprises an initialization operation starting state of all registers in the CPU of the target service terminal, a register information reading instruction and a register monitoring rule.
In an optional embodiment, the triggering each target service terminal to monitor the CPU state according to its own configuration file to obtain the CPU state monitoring results of all the target service terminals, and specifically further includes: when the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by a target service terminal; the target register abnormality prompt information comprises an ID of a target register; and sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send the information of each register of the CPU.
The implementation process of the functions and roles of each module in the above device is specifically shown in the implementation process of the corresponding steps in the above method, and will not be described herein again.
Referring to fig. 8, fig. 8 is a schematic structural diagram of a CPU state monitoring system according to a third embodiment of the present invention. The third embodiment of the present invention provides a CPU state monitoring system, including an operation and maintenance terminal 301 and at least one service terminal 302; each service terminal 302 is communicatively connected to the operation and maintenance terminal 301; an operation and maintenance terminal 301 for: screening all preselected service terminals from all service terminals 302, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the pre-selected service terminal is a service terminal 302 provided with a BMC supporting the monitoring of the CPU state; based on the information list of the preselected service terminal equipment, at least one preselected service terminal is selected as a target service terminal, each target service terminal is triggered to monitor the CPU state according to the configuration file of the target service terminal, and the CPU state monitoring result of all the target service terminals is obtained.
The implementation process of the functions and roles of the operation and maintenance terminal 301 in the above system is specifically described in the implementation process of the corresponding steps in the above method, and will not be repeated here.
Referring to fig. 9, fig. 9 is a schematic structural diagram of an electronic device according to a fourth embodiment of the present invention. A fourth embodiment of the invention provides an electronic device 40 comprising a processor 401, a memory 402 and a computer program stored in the memory 402 and configured to be executed by the processor 401; the memory 402 is coupled to the processor 401, and the processor 401 implements the method for monitoring the state of a CPU according to the first embodiment of the present invention when executing a computer program, and can achieve the same advantages as the method.
Wherein the processor 401 may implement the method of any embodiment comprised by the method for monitoring the status of a CPU according to the first embodiment of the present invention when reading a computer program from the memory 402 via the bus 403 and executing said computer program.
The processor 401 may process the digital signals and may include various computing structures. Such as a complex instruction set computer architecture, a reduced instruction set computer architecture, or an architecture that implements a combination of instruction sets. In some examples, the processor 401 may be a microprocessor.
Memory 402 can be used for storing instructions to be executed by processor 401 or data related to execution of the instructions. Such instructions and/or data may include code to implement some or all of the functions of one or more of the modules described in embodiments of the present invention. The processor 401 of the presently disclosed embodiment may be used to execute instructions in the memory 402 to implement the CPU state monitoring method according to the first embodiment of the present invention. Memory 402 includes dynamic random access memory, static random access memory, flash memory, optical memory, or other memory known to those skilled in the art.
A fifth embodiment of the present invention provides a computer-readable storage medium including a stored computer program; the device where the computer readable storage medium is controlled to execute the method for monitoring the state of the CPU according to the first embodiment of the present invention when the computer program runs can achieve the same advantages as the method.
In summary, an embodiment of the present invention provides a method for monitoring a CPU state, which is applicable to an operation and maintenance terminal, and the method includes: screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the preselection service terminal is provided with a BMC supporting the monitoring of the CPU state; based on the information list of the preselected service terminal equipment, at least one preselected service terminal is selected as a target service terminal, each target service terminal is triggered to monitor the CPU state according to the configuration file of the target service terminal, and the CPU state monitoring result of all the target service terminals is obtained. According to the embodiment of the invention, the configuration files are stored in each pre-selected service terminal in advance, the operation and maintenance terminal selects at least one target service terminal from all pre-selected service terminals, and each target service terminal is triggered to monitor the CPU state according to the configuration files, so that the CPU state monitoring result of all target service terminals is obtained, the CPU states of a large number of service terminals can be automatically monitored, and the operation and maintenance efficiency of the service terminals is improved.
In the several embodiments provided in the present invention, it should be understood that the disclosed apparatus and method may be implemented in other manners. The apparatus embodiments described above are merely illustrative, for example, flow diagrams and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, functional modules in the embodiments of the present invention may be integrated together to form a single part, or each module may exist alone, or two or more modules may be integrated to form a single part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
The foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any person skilled in the art will readily appreciate variations or alternatives within the scope of the present invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.

Claims (8)

1. A method for monitoring a CPU state, the method being adapted for an operation and maintenance terminal, the method comprising:
screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
based on the pre-selected service terminal equipment information list, selecting at least one pre-selected service terminal as a target service terminal, and triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals;
the triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and specifically comprises the following steps:
for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute:
according to a first configuration file, taking all registers enabling monitoring tasks in the target service terminal CPU as target registers;
According to a second configuration file, for each target register, when the target register starts an initialization operation, initializing the target register;
executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register;
judging whether the target register runs abnormally or not according to the information of the target register based on a register monitoring rule of the target register, and obtaining a monitoring result of the target register;
combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal;
the configuration file of the target service terminal comprises a first configuration file and a second configuration file, wherein the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task starting state and a monitoring task executing period, and the second configuration file comprises an initialization operation starting state of all registers in the CPU of the target service terminal, a register information reading instruction and a register monitoring rule;
the triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and the method specifically comprises the following steps:
When the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by the target service terminal; the target register exception prompt information comprises an ID of the target register;
and sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send information of each register of the CPU.
2. The CPU state monitoring method of claim 1, wherein the step of screening all the preselected service terminals from all the service terminals and generating a preselected service terminal device information list based on device information of each of the preselected service terminals specifically includes:
sending a wakeup request message to all the service terminals, so that all the preselected service terminals analyze the wakeup request message and return a wakeup response message;
correspondingly extracting the equipment information of each pre-selected service terminal from the wake-up response message returned by each pre-selected service terminal;
and generating the pre-selected service terminal equipment information list according to the equipment information of all the pre-selected service terminals.
3. The method for monitoring the CPU state according to claim 1, wherein before said selecting at least one of said preselected service terminals as a target service terminal based on said list of preselected service terminal device information, triggering each of said target service terminals to monitor the CPU state according to its own configuration file, obtaining the CPU state monitoring results of all of said target service terminals, further comprising:
Responding to a configuration file customization request initiated by a user, and modifying an initial configuration file based on a predefined configuration file writing rule to obtain an updated configuration file;
and selecting at least one pre-selected service terminal as a customized service terminal based on the pre-selected service terminal equipment information list, and triggering each customized service terminal to download the updated configuration file as a configuration file of the customized service terminal.
4. The method for monitoring the CPU state according to claim 3, wherein the triggering each customized service terminal to download the updated configuration file as the configuration file thereof specifically includes:
and for each customized service terminal, sending a configuration file update trigger message to the customized service terminal, so that the customized service terminal downloads the update configuration file and reports the update configuration file downloading result based on a TFTP (transport stream transmission protocol) after a preset interval time.
5. A CPU state monitoring device, adapted for use with an operation and maintenance terminal, said device comprising:
the service terminal screening module is used for screening all preselected service terminals from all service terminals and generating a preselected service terminal equipment information list according to the equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
The CPU state monitoring module is used for selecting at least one preselected service terminal as a target service terminal based on the preselected service terminal equipment information list, triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal, and obtaining the CPU state monitoring results of all the target service terminals;
the triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and specifically comprises the following steps:
for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute:
according to a first configuration file, taking all registers enabling monitoring tasks in the target service terminal CPU as target registers;
according to a second configuration file, for each target register, when the target register starts an initialization operation, initializing the target register;
executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register;
Judging whether the target register runs abnormally or not according to the information of the target register based on a register monitoring rule of the target register, and obtaining a monitoring result of the target register;
combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal;
the configuration file of the target service terminal comprises a first configuration file and a second configuration file, wherein the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task starting state and a monitoring task executing period, and the second configuration file comprises an initialization operation starting state of all registers in the CPU of the target service terminal, a register information reading instruction and a register monitoring rule;
the triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and the method specifically comprises the following steps:
when the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by the target service terminal; the target register exception prompt information comprises an ID of the target register;
And sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send information of each register of the CPU.
6. The CPU state monitoring system is characterized by comprising an operation and maintenance terminal and at least one service terminal; each service terminal is in communication connection with the operation and maintenance terminal;
the operation and maintenance terminal is used for:
screening all preselected service terminals from all service terminals, and generating a preselected service terminal equipment information list according to equipment information of each preselected service terminal; the pre-selected service terminal is provided with a BMC supporting the monitoring of the CPU state;
based on the pre-selected service terminal equipment information list, selecting at least one pre-selected service terminal as a target service terminal, and triggering each target service terminal to monitor the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals;
the triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and specifically comprises the following steps:
for each target service terminal, sending a CPU state monitoring trigger message to the target service terminal, and triggering the target service terminal to execute:
According to a first configuration file, taking all registers enabling monitoring tasks in the target service terminal CPU as target registers;
according to a second configuration file, for each target register, when the target register starts an initialization operation, initializing the target register;
executing a register information reading instruction of the target register according to the monitoring task execution period of the target register to obtain information of the target register;
judging whether the target register runs abnormally or not according to the information of the target register based on a register monitoring rule of the target register, and obtaining a monitoring result of the target register;
combining the monitoring results of all the target registers to obtain a CPU state monitoring result of the target service terminal;
the configuration file of the target service terminal comprises a first configuration file and a second configuration file, wherein the first configuration file comprises IDs (identity) of all registers in a CPU (central processing unit) of the target service terminal, a monitoring task starting state and a monitoring task executing period, and the second configuration file comprises an initialization operation starting state of all registers in the CPU of the target service terminal, a register information reading instruction and a register monitoring rule;
The triggering of each target service terminal monitors the CPU state according to the configuration file of the target service terminal to obtain the CPU state monitoring results of all the target service terminals, and the method specifically comprises the following steps:
when the monitoring result of the target register is that the target register operates abnormally, receiving target register abnormal prompt information sent by the target service terminal; the target register exception prompt information comprises an ID of the target register;
and sending a register information collection trigger message to the target service terminal, and triggering the target service terminal to send information of each register of the CPU.
7. An electronic device comprising a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor; the memory is coupled to the processor and the processor, when executing the computer program, implements the CPU state monitoring method according to any one of claims 1 to 4.
8. A computer readable storage medium, wherein the computer readable storage medium comprises a stored computer program; wherein the computer-readable storage medium is controlled to execute the CPU state monitoring method according to any one of claims 1 to 4 when the computer program is run.
CN202311219947.7A 2023-09-21 2023-09-21 CPU state monitoring method, device, system, equipment and storage medium Active CN116955093B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311219947.7A CN116955093B (en) 2023-09-21 2023-09-21 CPU state monitoring method, device, system, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311219947.7A CN116955093B (en) 2023-09-21 2023-09-21 CPU state monitoring method, device, system, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN116955093A CN116955093A (en) 2023-10-27
CN116955093B true CN116955093B (en) 2024-01-30

Family

ID=88455107

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311219947.7A Active CN116955093B (en) 2023-09-21 2023-09-21 CPU state monitoring method, device, system, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116955093B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101271415A (en) * 2008-05-07 2008-09-24 深圳国人通信有限公司 Monitoring watchdog implementing method of built-in equipment
CN108459944A (en) * 2018-03-29 2018-08-28 中科创能实业有限公司 System operation monitoring method, device and server
CN114416508A (en) * 2022-01-11 2022-04-29 青岛海尔科技有限公司 Application performance monitoring method and device, storage medium and electronic equipment
CN115913872A (en) * 2022-11-30 2023-04-04 浪潮(山东)计算机科技有限公司 Server monitoring operation and maintenance method, device, equipment and medium
CN116028094A (en) * 2023-02-10 2023-04-28 苏州浪潮智能科技有限公司 BMC upgrading method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112422750A (en) * 2019-08-22 2021-02-26 中兴通讯股份有限公司 Method, terminal, system and computer readable storage medium for acquiring terminal state

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101271415A (en) * 2008-05-07 2008-09-24 深圳国人通信有限公司 Monitoring watchdog implementing method of built-in equipment
CN108459944A (en) * 2018-03-29 2018-08-28 中科创能实业有限公司 System operation monitoring method, device and server
CN114416508A (en) * 2022-01-11 2022-04-29 青岛海尔科技有限公司 Application performance monitoring method and device, storage medium and electronic equipment
CN115913872A (en) * 2022-11-30 2023-04-04 浪潮(山东)计算机科技有限公司 Server monitoring operation and maintenance method, device, equipment and medium
CN116028094A (en) * 2023-02-10 2023-04-28 苏州浪潮智能科技有限公司 BMC upgrading method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
网络服务运行状态监控技术研究;刘桂峰;陶漪;;计算机与现代化(02);全文 *

Also Published As

Publication number Publication date
CN116955093A (en) 2023-10-27

Similar Documents

Publication Publication Date Title
US7788520B2 (en) Administering a system dump on a redundant node controller in a computer system
CN106980565B (en) Upgrading process monitoring method and device
US10924350B1 (en) Software sensor for reporting controller metrics
CN108920103B (en) Server management method and device, computer equipment and storage medium
CN111104139A (en) Firmware upgrading method, device, equipment and storage medium
WO2016197737A1 (en) Self-check processing method, apparatus and system
US20110029650A1 (en) Method and system for host independent platform diagnostics
US10754722B1 (en) Method for remotely clearing abnormal status of racks applied in data center
TWI507889B (en) Management node , method , machine 一accessible medium and system for reservation and execution image writing of native computing devices
CN116955093B (en) CPU state monitoring method, device, system, equipment and storage medium
CN115599617B (en) Bus detection method and device, server and electronic equipment
CN116886490A (en) Server inspection method and device and computer readable storage medium
CN111611124A (en) Monitoring equipment analysis method and device, computer device and storage medium
US20210334153A1 (en) Remote error detection method adapted for a remote computer device to detect errors that occur in a service computer device
CN115168146A (en) Anomaly detection method and device
CN113656378A (en) Server management method, device and medium
JP2007068090A (en) Network device monitor system and monitoring method, and program therefor
JP4208494B2 (en) Load test system for network server
US20230176986A1 (en) USB Terminal Server
CN105446833B (en) Control method of server and control device of server
CN112241283B (en) Software upgrading method, device, computer equipment and storage medium
TWI709037B (en) Bios setting method
CN111061597B (en) Method for testing KCS communication stability
TWI715005B (en) Monitor method for demand of a bmc
CN116820516A (en) Firmware upgrading method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant