WO2020107205A1 - Computing device maintenance method and apparatus, storage medium, and program product - Google Patents

Computing device maintenance method and apparatus, storage medium, and program product

Info

Publication number
WO2020107205A1
Authority
WO
WIPO (PCT)
Prior art keywords
computing device
computing
data processing
power
abnormal
Prior art date
Application number
PCT/CN2018/117651
Other languages
English (en)
French (fr)
Inventor
刘馥祎
彭逸豪
郭立春
贺文
Original Assignee
刘馥祎
比特大陆科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 刘馥祎, 比特大陆科技有限公司 filed Critical 刘馥祎
Priority to PCT/CN2018/117651 priority Critical patent/WO2020107205A1/zh
Priority to CN201880100621.3A priority patent/CN113396561A/zh
Publication of WO2020107205A1 publication Critical patent/WO2020107205A1/zh

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1441Resetting or repowering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/81Threshold

Definitions

  • This application relates to the field of data processing technology, for example, to a method and device for maintaining computing equipment, storage media, and program products.
  • Embodiments of the present disclosure provide a method and device for maintaining computing equipment, storage media, and program products, with a view to reducing human resource costs and improving the operating stability and safety of computing equipment.
  • an embodiment of the present disclosure also provides a method for maintaining computing equipment, including:
  • determining, according to the status report information, whether the computing device operates abnormally
  • the determining whether the computing device operates abnormally according to the status report information includes:
  • if the status report information carries an operation abnormality identifier, it is determined that the computing device operates abnormally, wherein the operation abnormality identifier is used to indicate that the computing device operates abnormally.
  • the status report message includes the operating status information of the computing device
  • the operating state information includes: the current data processing computing power of the computing device.
  • the determining whether the computing device operates abnormally according to the status report information includes:
  • if the current data processing computing power is less than the abnormal computing power threshold, it is determined that the computing device is operating abnormally; or,
  • if the duration for which the current data processing computing power is less than the abnormal computing power threshold reaches the duration threshold, it is determined that the computing device is operating abnormally.
  • the method further includes:
  • the obtaining the abnormal computing power threshold includes:
  • the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  • the acquiring the standard data processing power of the computing device includes:
  • the factory data processing computing power recorded by the data processing chip of the computing device is acquired as the standard data processing computing power.
  • controlling the computing device to restart includes:
  • the sending a restart instruction to the computing device includes:
  • the identification information includes: an internet protocol IP address.
  • controlling the computing device to restart includes:
  • an embodiment of the present disclosure also provides a computing device maintenance device, including:
  • the receiving module is configured to receive the status report information sent by the computing device
  • the determining module is configured to determine whether the computing device operates abnormally according to the status reporting information
  • the control module is configured to control the computing device to restart if the computing device operates abnormally.
  • the determination module is configured as:
  • if the status report information carries an abnormal operation identifier, it is determined that the computing device operates abnormally, wherein the abnormal operation identifier is configured to indicate that the computing device operates abnormally.
  • the status report message includes the operating status information of the computing device
  • the operating state information includes: the current data processing computing power of the computing device.
  • the determination module is specifically configured as:
  • if the current data processing computing power is less than the abnormal computing power threshold, it is determined that the computing device is operating abnormally; or,
  • if the duration for which the current data processing computing power is less than the abnormal computing power threshold reaches the duration threshold, it is determined that the computing device is operating abnormally.
  • the determination module is further configured to:
  • the determination module is specifically configured as:
  • the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  • the determination module is specifically configured as:
  • the factory data processing computing power recorded by the data processing chip of the computing device is acquired as the standard data processing computing power.
  • control module is specifically configured as:
  • control module is specifically configured as:
  • the identification information includes: an internet protocol IP address.
  • control module is specifically configured as:
  • an embodiment of the present disclosure also provides an electronic device, including: a memory, a transceiver, and a processor, where the memory, the transceiver, and the processor are connected by a bus;
  • the memory is configured to store a computer program
  • the transceiver is configured to communicate with other devices
  • the processor is configured to execute the computer program to implement the method according to any one of the first aspects.
  • the transceiver is configured to receive status report information sent by the computing device;
  • the processor is configured to determine whether the computing device operates abnormally according to the status report information; and, configured to control the computing device to restart if the computing device operates abnormally.
  • the processor is specifically configured as:
  • if the status report information carries an abnormal operation identifier, it is determined that the computing device operates abnormally, wherein the abnormal operation identifier is configured to indicate that the computing device operates abnormally.
  • the status report message includes the operating status information of the computing device
  • the operating state information includes: the current data processing computing power of the computing device.
  • the processor is specifically configured as:
  • if the current data processing computing power is less than the abnormal computing power threshold, it is determined that the computing device is operating abnormally; or,
  • if the duration for which the current data processing computing power is less than the abnormal computing power threshold reaches the duration threshold, it is determined that the computing device is operating abnormally.
  • the processor is further configured as:
  • the processor is further specifically configured as:
  • the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  • the processor is further specifically configured as:
  • the factory data processing computing power recorded by the data processing chip of the computing device is acquired as the standard data processing computing power.
  • the transceiver is specifically configured as:
  • the processor is further specifically configured to establish a remote access connection with each computing device based on the identification information of each computing device, wherein the identification information uniquely identifies one computing device;
  • the transceiver is further specifically configured to send the restart instruction to the computing device through the remote access connection.
  • the identification information includes: an internet protocol IP address.
  • the processor is specifically configured as:
  • an embodiment of the present disclosure also provides a computer-readable storage medium that stores computer-executable instructions that are configured to perform the above-described computing device maintenance method.
  • an embodiment of the present disclosure also provides a computer program product.
  • the computer program product includes a computer program stored on a computer-readable storage medium.
  • the computer program includes program instructions. During execution, the computer is caused to perform the above-mentioned computing device maintenance method.
  • in the embodiments of the present disclosure, when the status report information sent by the computing device is received, whether the computing device operates abnormally is detected, and the computing device is directly controlled to restart when it operates abnormally. There is no need to manually observe or monitor the operation of the computing device, and no need for users to manually perform restart processing, which saves human resource costs. The automatic monitoring and maintenance method can also find and resolve anomalies in a more timely manner, which improves the operating stability and safety of the computing device to a certain extent.
  • FIG. 1 is a schematic flowchart of a method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of another method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 3 is a schematic flowchart of another method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 4 is a schematic flowchart of another method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 5 is a schematic flowchart of another method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 6 is a schematic flowchart of another method for maintaining a computing device according to an embodiment of the present disclosure
  • FIG. 7 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 8 is a schematic structural diagram of a computing device maintenance device according to an embodiment of the present disclosure.
  • the computing device maintenance method, the electronic device, the computer-readable storage medium, and the computer program product provided by the embodiments of the present disclosure aim to solve the problems of wasted human resources and low operating stability and safety in existing computing device maintenance methods, and propose the following solution: according to the information reported by the computing device, determine whether the computing device is operating abnormally, so that it can be restarted in time when it is abnormal.
  • the embodiments of the present disclosure provide a method for maintaining computing equipment.
  • FIG. 1 shows a schematic flowchart of a method for maintaining a computing device according to an embodiment of the present disclosure. The method includes the following steps:
  • S102 Receive status report information sent by the computing device.
  • S104 Determine whether the computing device operates abnormally according to the status report information.
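The flow of FIG. 1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the `StatusReport` fields, the `is_abnormal` predicate, and the `restart` callback are hypothetical names introduced here, and the restart step corresponds to S106, which the disclosure describes further on.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class StatusReport:
    """Hypothetical shape of the status report information (an assumption)."""
    device_id: str
    abnormal_flag: bool = False
    current_computing_power: Optional[float] = None


def maintain(report: StatusReport,
             is_abnormal: Callable[[StatusReport], bool],
             restart: Callable[[str], None]) -> bool:
    """Handle one received report (S102) and return True if a restart was requested."""
    if is_abnormal(report):          # S104: judge the report
        restart(report.device_id)    # S106: control the device to restart
        return True
    return False
```

Passing the judgment and the restart action as callables keeps the sketch independent of how either is actually realized.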
  • the computing device involved in the embodiments of the present disclosure is a device with computing processing capability.
  • the computing device involved in the embodiments of the present disclosure may be specifically applied to an application scenario where the computing device is used for data processing to obtain digital credentials.
  • the execution device of the computing device maintenance method provided by the embodiments of the present disclosure may be a maintenance device corresponding to the computing device, or may be a processor or a processing module in the maintenance device.
  • One maintenance device may correspond to one or more computing devices. Therefore, the number of computing devices is not particularly limited in the embodiments of the present disclosure, and this solution can be executed for the status report information sent by any computing device.
  • the implementation manner of determining whether the computing device operates abnormally described in S104 may include, but is not limited to, the following ways:
  • S104 may include the following steps:
  • S1042A: determine whether the status report information carries an abnormal operation identifier; if it does, determine that the computing device operates abnormally and execute S106; if not, execute S1044A.
  • the abnormal operation identifier is used to indicate abnormal operation of the computing device.
  • S1044A: determine whether the computing device operates abnormally according to the operating status information carried in the status report information.
  • the computing device may report operating status information describing its own state to the execution device; then, if the status report information received in S102 does not carry the abnormal operation identifier, the execution device uses the carried operating status information to determine whether the computing device operates abnormally.
  • the running state information involved in the embodiments of the present disclosure may include, but is not limited to: the current data processing computing power of the computing device.
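The two-stage judgment of S1042A and S1044A might look like the sketch below. The dictionary keys `abnormal_flag` and `current_computing_power` are hypothetical field names, and treating a report without usable information as "not abnormal" is an assumption the disclosure does not state.

```python
from typing import Any, Mapping


def operates_abnormally(report: Mapping[str, Any], threshold: float) -> bool:
    """Return True if the report indicates abnormal operation."""
    # S1042A: an explicit abnormal operation identifier decides immediately.
    if report.get("abnormal_flag"):
        return True
    # S1044A: otherwise fall back to the operating status information.
    power = report.get("current_computing_power")
    if power is None:
        # Assumption: no usable status information is treated as "not abnormal".
        return False
    return float(power) < threshold
```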
  • FIG. 3 shows a method for judging whether a computing device operates abnormally. The method specifically includes the following steps:
  • S1042 Obtain the current data processing computing power carried in the status report information.
  • S1044 Determine whether the current data processing computing power is less than the abnormal computing power threshold.
  • FIG. 4 shows another way to determine whether the computing device is operating abnormally.
  • the method specifically includes the following steps:
  • S1042 Obtain the current data processing computing power carried in the status report information.
  • S1044 Determine whether the current data processing power is less than the abnormal computing power threshold; if yes, execute S1046; if not, execute S1042.
  • the starting time of the timing may be the time when it is first determined that the data processing computing power is less than the abnormal computing power threshold.
  • steps S1042 and S1044 are executed continuously. After the timing starts, if the current data processing computing power becomes greater than or equal to the abnormal computing power threshold, the time at which this occurs is used as the end of the timing to obtain the duration; conversely, if the current data processing computing power remains less than the abnormal computing power threshold after the timing starts, the current time is used as the end of the timing to obtain the duration.
  • the aforementioned duration threshold may be preset as needed, for example, it may be preset to 15 minutes.
  • in this way, abnormal monitoring results caused by an occasional drop in data processing computing power can be avoided to a certain extent, and the monitoring accuracy can be improved to a certain extent.
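The duration-based judgment of FIG. 4 can be illustrated with a small timer object. This is a sketch under assumptions: the class name, the use of seconds, and the caller-supplied `now` timestamps are choices made here for illustration; only the 15-minute default comes from the example above.

```python
from typing import Optional


class BelowThresholdTimer:
    """Tracks how long the computing power has stayed below the threshold."""

    def __init__(self, threshold: float, duration_threshold_s: float = 15 * 60):
        self.threshold = threshold
        self.duration_threshold_s = duration_threshold_s
        self._started_at: Optional[float] = None  # time of the first low sample

    def update(self, power: float, now: float) -> bool:
        """Feed one sample; return True once the duration threshold is reached."""
        if power >= self.threshold:
            self._started_at = None   # recovery ends the timing
            return False
        if self._started_at is None:
            self._started_at = now    # timing starts at the first low sample
        return now - self._started_at >= self.duration_threshold_s
```

A sample at or above the threshold resets the timer, so only an uninterrupted low period counts toward the duration.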
  • the method may include the following steps:
  • the execution order of S1041 and S1042 is not particularly limited, and the manner shown in FIG. 3 or FIG. 4 is only one feasible implementation and is not intended to limit the embodiments of the present disclosure.
  • S1041 and S1042 may be executed at the same time, or S1042 may be executed before S1041.
  • the implementation of this step is related to the setting method of the abnormal computing power threshold.
  • the setting method of the abnormal computing power threshold may include, but is not limited to: a preset value, or obtained through a preset algorithm.
  • if the abnormal computing power threshold is a preset value, then when performing this step, the abnormal computing power threshold can be obtained by directly reading the preset data.
  • when the abnormal computing power threshold is preset, it may be preset to a fixed value.
  • if the abnormal computing power threshold is obtained through a preset algorithm, the preset algorithm may be run directly.
  • an embodiment of the present disclosure provides a preferred implementation manner of S1041, please refer to FIG. 5, and this step specifically includes:
  • S10412 Acquire standard data processing computing power of the computing device.
  • S10414 Compute the product of the standard data processing computing power and a preset abnormal ratio to obtain the abnormal computing power threshold.
  • the standard data processing computing power may include: factory data processing computing power or initial data processing computing power.
  • the factory data processing power can be the standard computing power value when the computing device is shipped.
  • the factory data processing computing power recorded by the data processing chip of the computing device can be obtained as the standard data processing computing power.
  • because the factory data processing computing power is a value written into the firmware when the device is shipped, it may deviate from the actual standard computing power of the data processing chip of the computing device. Therefore, when performing the aforementioned step S10412, the chip status information recorded by the data processing chip of the computing device (which may include, but is not limited to, the chip operating frequency) can further be obtained and used to calculate a reference data processing computing power. If the difference between the factory data processing computing power and the reference data processing computing power is less than 5% (of either the factory data processing computing power or the reference data processing computing power, which is not particularly limited), the factory data processing computing power is taken as the standard data processing computing power.
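The 5% consistency check can be sketched as below. The function name is hypothetical, the tolerance is taken relative to the factory value (the text allows either base), and returning `None` when the check fails is an assumption, since the disclosure does not say what happens in that case.

```python
from typing import Optional


def select_standard_power(factory_power: float,
                          reference_power: float,
                          tolerance: float = 0.05) -> Optional[float]:
    """Return the factory value if it agrees with the reference value, else None."""
    if factory_power <= 0:
        raise ValueError("factory_power must be positive")
    # The difference is measured relative to the factory value (one of the
    # two bases the text permits).
    if abs(factory_power - reference_power) < tolerance * factory_power:
        return factory_power
    # Assumption: the fallback behaviour is left to the caller.
    return None
```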
  • the initial data processing computing power is the data processing computing power when the computing device initially starts to operate.
  • the initial data processing power can be recorded on the background of the computing device or on the data processing chip of the computing device.
  • the communication method may include, but is not limited to, at least one of wired communication and wireless communication.
  • the aforementioned abnormal ratio can be set as needed, and the specific value of the abnormal ratio is not limited in the embodiments of the present disclosure. For example, it can be set to 80%. That is, when the current data processing computing power of the computing device is less than 80% of the standard data processing computing power, or when the duration for which it remains below 80% of the standard data processing computing power reaches the duration threshold, it can be determined that the computing device operates abnormally.
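As a worked illustration of S10414 with the 80% example, the threshold is simply the product of the standard computing power and the abnormal ratio. The function name and the range check on the ratio are assumptions added for this sketch.

```python
def abnormal_threshold(standard_power: float, abnormal_ratio: float = 0.8) -> float:
    """Abnormal computing power threshold = standard computing power x abnormal ratio."""
    # Assumption: a ratio outside (0, 1] is treated as a configuration error.
    if not 0 < abnormal_ratio <= 1:
        raise ValueError("abnormal_ratio is expected to be in (0, 1]")
    return standard_power * abnormal_ratio


def below_threshold(current_power: float, standard_power: float,
                    abnormal_ratio: float = 0.8) -> bool:
    """True when the current computing power is below the abnormal threshold."""
    return current_power < abnormal_threshold(standard_power, abnormal_ratio)
```

With a standard computing power of 100 units and the 80% ratio, a current value of 75 units falls below the threshold while 85 units does not.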
  • the abnormal computing device can be determined based on the monitored data.
  • when step S106 is executed, the restart process can be executed only for the abnormal computing device, which avoids the restart of the abnormal device affecting computing devices that are operating normally.
  • the computing device generally has its own restart function or authority. Therefore, it can be implemented by sending an instruction to cause the abnormal computing device to restart itself.
  • step S106 may specifically be: sending a restart instruction to the abnormal computing device, so that the abnormal computing device restarts according to the restart command.
  • S106 may be implemented with reference to the manner shown in FIG. 6:
  • S1062 Establish a remote access connection with the computing device based on the identification information of the computing device.
  • an embodiment of the present disclosure provides a solution: remotely access the computing device, and obtain the operating status information of the computing device through remote access.
  • remote access to the computing device, that is, establishing a remote access connection with the computing device, can be achieved through the identification information of the computing device.
  • the identification information of the computing device involved in the embodiments of the present disclosure may include but is not limited to: Internet Protocol (IP) address.
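The restart path of FIG. 6 (connect by identification information, then send a restart instruction) could be sketched as follows. The transport is deliberately left abstract: `connect` is a hypothetical factory supplied by the caller, the `"restart"` string is a placeholder for whatever instruction the device actually accepts, and the IP addresses in any usage are illustrative only.

```python
from typing import Callable, Dict, Iterable, List

# Hypothetical: a callable that sends one instruction over an open connection.
Sender = Callable[[str], None]


def restart_abnormal(abnormal_ids: Iterable[str],
                     ip_by_device: Dict[str, str],
                     connect: Callable[[str], Sender]) -> List[str]:
    """Send a restart instruction to each abnormal device; return the IPs contacted."""
    contacted = []
    for device_id in abnormal_ids:
        ip = ip_by_device[device_id]   # identification information (IP address)
        send = connect(ip)             # S1062: establish the remote access connection
        send("restart")                # placeholder restart instruction
        contacted.append(ip)
    return contacted
```

Only the devices listed as abnormal are contacted, which matches the point that normally operating devices are not affected by the restart.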
  • the embodiments of the present disclosure do not particularly limit the manner of controlling the restart of the computing device. It can be implemented by a cold restart, that is, by cutting off the power supply and restarting; or, with the power supply uninterrupted, by controlling the computing device to perform a restart operation in the background.
  • a prompt signal may be output so that the user can learn the restart processing progress of the abnormal computing device according to the prompt signal.
  • the prompt signals involved in the embodiments of the present disclosure may include, but are not limited to, at least one of sound signals, vibration signals, flashing signals, and text prompt information.
  • the execution device is integrated into the maintenance device, it can also output at least any one of the aforementioned prompt signals.
  • if monitoring of multiple computing devices is involved, relevant information that identifies the computing device can further be output, for example, the number or IP address of the computing device.
  • an embodiment of the present disclosure further provides an electronic device.
  • the electronic device 700 includes: a memory 710, a transceiver 720, and a processor 730, and the memory 710, the transceiver 720, and the processor 730 are connected by a bus;
  • the memory 710 is configured to store a computer program
  • the transceiver 720 is configured to communicate with other devices
  • the processor 730 is configured to execute the computer program to implement the computing device maintenance method in any of the foregoing implementation manners.
  • the transceiver 720 is configured to receive status report information sent by the computing device;
  • the processor 730 is configured to determine whether the computing device operates abnormally according to the status report information; and, is configured to control the computing device to restart if the computing device operates abnormally.
  • the processor 730 is specifically configured as:
  • if the status report information carries an operation abnormality identifier, it is determined that the computing device operates abnormally, wherein the operation abnormality identifier is configured to indicate that the computing device operates abnormally.
  • the status report message includes the operating status information of the computing device
  • the running status information includes: the current data processing power of the computing device.
  • the processor 730 is specifically configured as:
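  • if the current data processing computing power is less than the abnormal computing power threshold, it is determined that the computing device is operating abnormally; or,
  • if the duration for which the current data processing computing power is less than the abnormal computing power threshold reaches the duration threshold, it is determined that the computing device is operating abnormally.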
  • the processor 730 is also configured as:
  • the processor 730 is further specifically configured as:
  • the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  • the processor 730 is also specifically configured as:
  • the factory data processing computing power recorded by the data processing chip of the computing device is acquired as the standard data processing computing power.
  • the transceiver 720 is specifically configured as:
  • the processor 730 is further specifically configured to establish a remote access connection with each computing device based on the identification information of each computing device, where the identification information uniquely identifies one computing device;
  • the transceiver 720 is specifically configured to send a restart instruction to the computing device through a remote access connection.
  • the identification information includes: an internet protocol IP address.
  • the processor 730 is specifically configured as:
  • the above logic instructions in the memory 710 may be implemented in the form of software functional units and sold or used as independent products, and may be stored in a computer-readable storage medium.
  • the memory 710 as a computer-readable storage medium may be configured to store software programs and computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure.
  • the processor 730 executes functional applications and data processing by running software programs, instructions, and modules stored in the memory 710, that is, implementing the computing device maintenance method in the foregoing method embodiments.
  • the memory 710 may include a storage program area and a storage data area, where the storage program area may store an operating system and application programs required for at least one function; the storage data area may store data created according to the use of a terminal device and the like.
  • the memory 710 may include a high-speed random access memory, and may also include a non-volatile memory.
  • the number of the processor 730 may be one or more, and the processor 730 may also be called a processing unit, which may implement a certain control function.
  • the processor 730 may be a general-purpose processor or a dedicated processor or the like.
  • the processor 730 may also store instructions, and the instructions may be executed by the processor, so that the electronic device 700 executes the computing device maintenance method described in the foregoing method embodiments.
  • the electronic device 700 may include a circuit that can implement the function of sending or receiving or communicating in the foregoing method embodiments.
  • the number of transceivers 720 may be one or more.
  • the transceiver 720 may also be called a transceiver unit, a transceiver circuit, or the like, and is configured to implement the transceiving function of the electronic device 700.
  • the processor 730 and the transceiver 720 described in the embodiments of the present disclosure may be implemented in an integrated circuit (IC), an analog IC, a radio frequency integrated circuit (RFIC), a mixed-signal IC, an application-specific integrated circuit (ASIC), a printed circuit board (PCB), electronic equipment, etc.
  • the processor and transceiver can also be manufactured using various IC process technologies, such as complementary metal oxide semiconductor (CMOS), N-type metal oxide semiconductor (NMOS), P-type metal oxide semiconductor (PMOS), bipolar junction transistor (BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (GaAs), etc.
  • the electronic device 700 may be an independent device or may be part of a larger device.
  • the electronic device 700 may be integrated into one computing device.
  • the embodiments of the present disclosure provide a computing device maintenance device.
  • the computing device maintenance device 800 includes:
  • the receiving module 810 is configured to receive status report information sent by the computing device
  • the determining module 820 is configured to determine whether the computing device operates abnormally according to the status report information
  • the control module 830 is configured to control the computing device to restart if the computing device operates abnormally.
  • the determination module 820 is configured as:
  • if the status report information carries an operation abnormality identifier, it is determined that the computing device operates abnormally, wherein the operation abnormality identifier is configured to indicate that the computing device operates abnormally.
  • the status report message includes the operating status information of the computing device
  • the running status information includes: the current data processing power of the computing device.
  • the determination module 820 is specifically configured as:
  • the current data processing power is less than the abnormal computing power threshold, it is determined that the computing device is operating abnormally; or,
  • the duration threshold it is determined that the computing device is operating abnormally.
  • the determination module 820 is also configured as:
  • the determination module 820 is specifically configured as:
  • the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  • the determination module 820 is specifically configured as:
  • control module 830 is specifically configured as:
  • control module 830 is specifically configured as:
  • each computing device establish a remote access connection with each computing device, where the identification information uniquely identifies a computing device;
  • the identification information includes: an internet protocol IP address.
  • control module 830 is specifically configured as:
  • the data processing described in the embodiments of the present disclosure may include at least one of setting, calculating, judging, transmitting, storing, and managing, performed on or based on data.
  • the data processing may be data processing related to digital vouchers performed by a data processing device
  • the digital vouchers may be obtained through the data processing
  • the data processing device may be a digital voucher processing device.
  • the digital voucher processing device may be a digital currency data processor, and the digital currency may be a cryptocurrency such as Bitcoin.
  • an embodiment of the present disclosure also provides a computer-readable storage medium that stores computer-executable instructions, the computer-executable instructions being configured to perform the above-described computing device maintenance method.
  • An embodiment of the present disclosure also provides a computer program product.
  • the computer program product includes a computer program stored on a computer-readable storage medium.
  • the computer program includes program instructions. When the program instructions are executed by a computer, the computer executes the above-mentioned computing device maintenance method.
  • the aforementioned computer-readable storage medium may be a transient computer-readable storage medium or a non-transitory computer-readable storage medium.
  • the technical solutions of the embodiments of the present disclosure may be embodied in the form of software products, which are stored in a storage medium and include one or more instructions to make a computer device (which may be a personal computer, a server, a network device, etc.) perform all or part of the steps of the method described in the embodiments of the present disclosure.
  • the aforementioned storage medium may be a non-transitory storage medium, including: a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disk, or any other medium that can store program code.
  • the storage medium may also be a transient storage medium.
  • although the terms "first", "second", etc. may be used in the embodiments of the present disclosure to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another.
  • the first element can be called the second element, and likewise, the second element can be called the first element, as long as all occurrences of the "first element" are renamed consistently and all occurrences of the "second element" are renamed consistently.
  • the first element and the second element are both elements, but they may not be the same element.
  • the term "comprise" and its variations "comprises" and/or "comprising" refer to the presence of the stated features, wholes, steps, operations, elements, and/or components, but do not exclude the presence or addition of one or more other features, wholes, steps, operations, elements, components, and/or groups of these.
  • the various aspects, implementations, implementations or features in the described embodiments can be used alone or in any combination.
  • Various aspects in the described embodiments may be implemented by software, hardware, or a combination of software and hardware.
  • the described embodiments may also be embodied by a computer-readable medium that stores computer-readable code including instructions executable by at least one computing device.
  • the computer-readable medium can be associated with any data storage device capable of storing data, which can be read by a computer system.
  • Computer-readable media used for examples may include read-only memory, random access memory, CD-ROM, HDD, DVD, magnetic tape, optical data storage devices, and the like.
  • the computer-readable medium may also be distributed in computer systems connected through a network, so that computer-readable codes can be stored and executed in a distributed manner.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Retry When Errors Occur (AREA)
  • Power Sources (AREA)

Abstract

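A computing device maintenance method and apparatus, a storage medium, and a program product. The method includes: receiving status report information sent by a computing device; determining, according to the status report information, whether the computing device is operating abnormally; and, if the computing device is operating abnormally, controlling the computing device to restart. The method reduces labor costs and improves the operating stability and security of the computing device.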

Description

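Computing device maintenance method and apparatus, storage medium, and program product
Technical Field
The present application relates to the technical field of data processing, for example to a computing device maintenance method and apparatus, a storage medium, and a program product.
Background
With the development of data technology and the progress of society, processing data on computing devices in order to obtain digital vouchers has attracted growing attention. Because computing devices run at high speed, improving their security has become an important research topic in this field.
At present, abnormal conditions on computing devices are generally handled manually by the user. The user must actively watch for or discover the operating state of the computing device and, on finding that the device is operating abnormally, restart it by hand in order to maintain it.
Existing computing device maintenance methods depend on manual operation by the user, which wastes labor and means that abnormalities are not discovered in time, so the operating stability and security of computing devices are low.
Summary
Embodiments of the present disclosure provide a computing device maintenance method and apparatus, a storage medium, and a program product, with the aim of reducing labor costs and improving the operating stability and security of computing devices.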
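In a first aspect, an embodiment of the present disclosure provides a computing device maintenance method, including:
receiving status report information sent by a computing device;
determining, according to the status report information, whether the computing device is operating abnormally;
if the computing device is operating abnormally, controlling the computing device to restart.
In one possible design, determining, according to the status report information, whether the computing device is operating abnormally includes:
if the status report information carries an operation abnormality identifier, determining that the computing device is operating abnormally, where the operation abnormality identifier is used to indicate that the computing device is operating abnormally.
In another possible design, the status report information includes operating status information of the computing device;
the operating status information includes: the current data processing computing power of the computing device.
In another possible design, determining, according to the status report information, whether the computing device is operating abnormally includes:
if the current data processing computing power is less than an abnormal computing power threshold, determining that the computing device is operating abnormally; or,
if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches a duration threshold, determining that the computing device is operating abnormally.
In another possible design, the method further includes:
obtaining the abnormal computing power threshold.
In another possible design, obtaining the abnormal computing power threshold includes:
obtaining the standard data processing computing power of the computing device;
obtaining the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
In another possible design, the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
In another possible design, obtaining the standard data processing computing power of the computing device includes:
obtaining the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
In another possible design, controlling the computing device to restart includes:
sending a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
In another possible design, sending the restart instruction to the computing device includes:
establishing a remote access connection with each computing device according to the identification information of each computing device, where the identification information uniquely identifies one computing device;
sending the restart instruction to the computing device through the remote access connection.
In another possible design, the identification information includes: an Internet Protocol (IP) address.
In another possible design, controlling the computing device to restart includes:
controlling the computing device to power off and restart; or,
controlling the computing device to restart without powering off.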
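In a second aspect, an embodiment of the present disclosure further provides a computing device maintenance apparatus, including:
a receiving module, configured to receive status report information sent by a computing device;
a determining module, configured to determine, according to the status report information, whether the computing device is operating abnormally;
a control module, configured to control the computing device to restart if the computing device is operating abnormally.
In one possible design, the determining module is configured to:
if the status report information carries an operation abnormality identifier, determine that the computing device is operating abnormally, where the operation abnormality identifier is configured to indicate that the computing device is operating abnormally.
In another possible design, the status report information includes operating status information of the computing device;
the operating status information includes: the current data processing computing power of the computing device.
In another possible design, the determining module is specifically configured to:
if the current data processing computing power is less than an abnormal computing power threshold, determine that the computing device is operating abnormally; or,
if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches a duration threshold, determine that the computing device is operating abnormally.
In another possible design, the determining module is further configured to:
obtain the abnormal computing power threshold.
In another possible design, the determining module is specifically configured to:
obtain the standard data processing computing power of the computing device;
obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
In another possible design, the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
In another possible design, the determining module is specifically configured to:
obtain the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
In another possible design, the control module is specifically configured to:
send a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
In another possible design, the control module is specifically configured to:
establish a remote access connection with each computing device according to the identification information of each computing device, where the identification information uniquely identifies one computing device;
send the restart instruction to the computing device through the remote access connection.
In another possible design, the identification information includes: an Internet Protocol (IP) address.
In another possible design, the control module is specifically configured to:
control the computing device to power off and restart; or,
control the computing device to restart without powering off.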
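In a third aspect, an embodiment of the present disclosure further provides an electronic device, including: a memory, a transceiver, and a processor, where the memory, the transceiver, and the processor are connected through a bus;
the memory is configured to store a computer program;
the transceiver is configured to communicate with other devices;
the processor is configured to execute the computer program to implement the method according to any design of the first aspect.
Specifically, in one possible design, the transceiver is configured to receive status report information sent by a computing device;
the processor is configured to determine, according to the status report information, whether the computing device is operating abnormally, and to control the computing device to restart if it is operating abnormally.
In one possible design, the processor is specifically configured to:
if the status report information carries an operation abnormality identifier, determine that the computing device is operating abnormally, where the operation abnormality identifier is configured to indicate that the computing device is operating abnormally.
In another possible design, the status report information includes operating status information of the computing device;
the operating status information includes: the current data processing computing power of the computing device.
In another possible design, the processor is specifically configured to:
if the current data processing computing power is less than an abnormal computing power threshold, determine that the computing device is operating abnormally; or,
if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches a duration threshold, determine that the computing device is operating abnormally.
In another possible design, the processor is further configured to:
obtain the abnormal computing power threshold.
In another possible design, the processor is further specifically configured to:
obtain the standard data processing computing power of the computing device;
obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
In another possible design, the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
In another possible design, the processor is further specifically configured to:
obtain the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
In another possible design, the transceiver is specifically configured to:
send a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
In another possible design, the processor is further specifically configured to establish a remote access connection with each computing device according to the identification information of each computing device, where the identification information uniquely identifies one computing device;
the transceiver is further specifically configured to send the restart instruction to the computing device through the remote access connection.
In another possible design, the identification information includes: an Internet Protocol (IP) address.
In another possible design, the processor is specifically configured to:
control the abnormal computing device to power off and restart; or,
control the abnormal computing device to restart without powering off.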
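In a fourth aspect, an embodiment of the present disclosure further provides a computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being configured to perform the computing device maintenance method described above.
In a fifth aspect, an embodiment of the present disclosure further provides a computer program product. The computer program product includes a computer program stored on a computer-readable storage medium, and the computer program includes program instructions that, when executed by a computer, cause the computer to perform the computing device maintenance method described above.
With the technical solutions provided by the embodiments of the present disclosure, when status report information sent by a computing device is received, it is checked whether the computing device is operating abnormally, and the device is restarted directly if it is. The user does not need to observe or monitor the operation of the computing device, nor restart it by hand, which saves labor costs. In addition, automatic monitoring and maintenance can detect and resolve abnormalities more promptly, which improves the operating stability and security of the computing device to a certain extent.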
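Brief Description of the Drawings
One or more embodiments are illustrated by way of example in the corresponding drawings. These illustrations and drawings do not limit the embodiments. Elements with the same reference numerals in the drawings denote similar elements, the drawings are not to scale, and in the drawings:
FIG. 1 is a schematic flowchart of a computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 2 is a schematic flowchart of another computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 3 is a schematic flowchart of another computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 4 is a schematic flowchart of another computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 5 is a schematic flowchart of another computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 6 is a schematic flowchart of another computing device maintenance method provided by an embodiment of the present disclosure;
FIG. 7 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure;
FIG. 8 is a schematic structural diagram of a computing device maintenance apparatus provided by an embodiment of the present disclosure.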
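Detailed Description
To provide a more thorough understanding of the features and technical content of the embodiments of the present disclosure, their implementation is described in detail below with reference to the accompanying drawings. The drawings are for reference and illustration only and are not intended to limit the embodiments. In the following technical description, numerous details are set out, for ease of explanation, to provide a full understanding of the disclosed embodiments. One or more embodiments may nevertheless be practiced without these details. In other cases, well-known structures and devices may be shown in simplified form to simplify the drawings.
The computing device maintenance method, electronic device, computer-readable storage medium, and computer program product provided by the embodiments of the present disclosure aim to solve the problems of existing computing device maintenance methods, namely wasted labor and low operating stability and security of the devices. The proposed approach is as follows: determine, from the information reported by a computing device, whether the device is operating abnormally, so that it can be restarted promptly when it is.
An embodiment of the present disclosure provides a computing device maintenance method.
FIG. 1 is a schematic flowchart of a computing device maintenance method provided by an embodiment of the present disclosure. The method includes the following steps:
S102: receive status report information sent by a computing device.
S104: determine, according to the status report information, whether the computing device is operating abnormally.
S106: if the computing device is operating abnormally, control the computing device to restart.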
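As an illustration only, steps S102 to S106 can be sketched as a single handler on the maintenance side. The names `StatusReport`, `handle_report`, `is_abnormal`, and `restart` are hypothetical and do not appear in the disclosure; the abnormality check and the restart action are passed in as callables because the description gives several alternatives for each.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class StatusReport:
    """Hypothetical shape of the status report information received in S102."""
    device_id: str
    abnormal_flag: bool = False            # operation abnormality identifier, if present
    compute_power: Optional[float] = None  # current data processing computing power


def handle_report(
    report: StatusReport,
    is_abnormal: Callable[[StatusReport], bool],
    restart: Callable[[str], None],
) -> bool:
    """Run S104 on a received report and, if the device is abnormal, S106.

    Returns True when a restart was triggered.
    """
    if is_abnormal(report):        # S104: decide from the report
        restart(report.device_id)  # S106: control the device to restart
        return True
    return False
```

Keeping the check and the restart pluggable mirrors the text: either one can be swapped for any of the variants described below without changing the overall flow.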
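The computing device in the embodiments of the present disclosure is a device with computational processing capability. In one specific implementation scenario, the computing device may be one that is applied to data processing in order to obtain digital vouchers.
The method may be executed by a maintenance device corresponding to the computing device, or by a processor or processing module within the maintenance device. One maintenance device may correspond to one or more computing devices. The embodiments therefore place no particular limit on the number of computing devices, and this solution can be applied to the status report information sent by any of them.
The specific implementation of each of the foregoing steps is described below.
Depending on the status report information sent by the computing device, the determination in S104 of whether the computing device is operating abnormally may be implemented in ways including, but not limited to, the following.
In the first way, the information reported by the computing device to the executing device carries an identifier or information indicating that the computing device is operating abnormally. Because the computing device has already judged for itself whether it is operating abnormally, the executing device does not need to repeat that judgment.
In this case, a computing device whose status report information carries the operation abnormality identifier simply needs to be identified as an abnormal computing device. Referring to FIG. 2, S104 may include the following steps:
S1042A: determine whether the status report information carries an operation abnormality identifier; if so, determine that the computing device is operating abnormally and perform S106; if not, perform S1044A.
The operation abnormality identifier is used to indicate that the computing device is operating abnormally.
S1044A: determine, according to the operating status information carried in the status report information, whether the computing device is operating abnormally.
The computing device may report operating status information that characterizes its own state to the executing device. If the status report information received in S102 does not carry an operation abnormality identifier, the operating status information it carries is used to determine whether the computing device is operating abnormally.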
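The two-stage decision of FIG. 2 might look like the sketch below. The dictionary keys `abnormal_flag` and `compute_power` are assumed field names chosen for illustration, and the fallback check is the simple threshold comparison described for FIG. 3; treating a report with neither field as "not abnormal" is also an assumption, since the disclosure does not cover that case.

```python
def is_abnormal(report: dict, threshold: float) -> bool:
    """Decide whether a device is abnormal from one status report.

    `report` is a hypothetical dict form of the status report information.
    """
    # S1042A: the device has already diagnosed itself, so trust its flag.
    if report.get("abnormal_flag"):
        return True
    # S1044A: otherwise fall back to the reported operating status information.
    compute_power = report.get("compute_power")
    if compute_power is None:
        return False  # assumption: nothing to judge on, treat as not abnormal
    return compute_power < threshold
```

For example, `is_abnormal({"compute_power": 70.0}, threshold=80.0)` takes the S1044A branch and evaluates the threshold comparison.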
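The operating status information in the embodiments of the present disclosure may include, but is not limited to: the current data processing computing power of the computing device.
Accordingly, when determining from the current data processing computing power carried in the status report information whether the computing device is operating abnormally, the approaches shown in FIG. 3 and FIG. 4 may be used.
FIG. 3 shows one way of judging whether a computing device is operating abnormally, which includes the following steps:
S1042: obtain the current data processing computing power carried in the status report information.
S1044: determine whether the current data processing computing power is less than the abnormal computing power threshold.
If so, determine that the computing device is operating abnormally, and perform S106.
If not, determine that the computing device is operating normally, and return to S1042.
With this approach, each newly obtained value of the current data processing computing power is judged in real time, so detection is timely.
FIG. 4 shows another way of judging whether a computing device is operating abnormally, which includes the following steps:
S1042: obtain the current data processing computing power carried in the status report information.
S1044: determine whether the current data processing computing power is less than the abnormal computing power threshold; if so, perform S1046; if not, return to S1042.
S1046: obtain the duration for which the current data processing computing power has been less than the abnormal computing power threshold.
Note that in this step, timing may start at the moment the data processing computing power is first found to be less than the abnormal computing power threshold. While this step runs, S1042 and S1044 continue to be performed. If, after timing starts, the current data processing computing power becomes greater than or equal to the abnormal computing power threshold, the moment at which that happens is taken as the end of timing, which gives the duration. Conversely, if the current data processing computing power obtained in the preceding steps stays below the abnormal computing power threshold after timing starts, the current moment is taken as the end of timing, which gives the duration.
S1048: determine whether the duration reaches the duration threshold; if so, perform S106; if not, return to S1042.
The duration threshold can be preset as needed, for example to 15 minutes.
The implementation shown in FIG. 4 avoids, to a certain extent, false monitoring results caused by occasional dips in data processing computing power, and can therefore improve monitoring accuracy to a certain extent.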
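The duration-based check of FIG. 4 can be sketched as a small stateful tracker. This is one possible reading of S1042 to S1048, not the patented implementation: the class name `LowPowerTimer`, the use of seconds, and the explicit `now` argument (passed in so the logic can be exercised without a real clock) are all assumptions.

```python
from typing import Optional


class LowPowerTimer:
    """Tracks how long the reported computing power has stayed below a threshold."""

    def __init__(self, threshold: float, max_duration_s: float = 15 * 60):
        self.threshold = threshold
        self.max_duration_s = max_duration_s      # duration threshold, e.g. 15 minutes
        self._low_since: Optional[float] = None   # start of the current low period

    def update(self, compute_power: float, now: float) -> bool:
        """Feed one sample (S1042); return True when a restart is due (S1048)."""
        if compute_power >= self.threshold:       # S1044: recovered, stop timing
            self._low_since = None
            return False
        if self._low_since is None:               # first sample below the threshold
            self._low_since = now
        duration = now - self._low_since          # S1046: duration so far
        return duration >= self.max_duration_s    # S1048: reached the threshold?
```

Resetting the start time whenever a sample is at or above the threshold is what filters out short, occasional dips: only an uninterrupted low period can accumulate enough duration to trigger S106.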
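In addition, as shown in FIG. 3 and FIG. 4, before S1044 is performed, the method may include the following step:
S1041: obtain the abnormal computing power threshold.
Note that the embodiments of the present disclosure place no particular limit on the order of S1041 and S1042. The order shown in FIG. 3 or FIG. 4 is only one feasible implementation and does not limit the embodiments. For example, S1041 and S1042 may be performed at the same time, or S1042 may be performed before S1041.
How this step is implemented depends on how the abnormal computing power threshold is set. The threshold may be set in ways including, but not limited to: as a preset value, or by a preset algorithm.
If the abnormal computing power threshold is set as a preset value, performing this step simply means reading the preset data to obtain the threshold. Specifically, the threshold may be preset to a fixed number.
If the abnormal computing power threshold is set by a preset algorithm, performing this step simply means running that algorithm.
For ease of understanding, a preferred implementation of S1041 is given here. Referring to FIG. 5, the step includes:
S10412: obtain the standard data processing computing power of the computing device.
S10414: obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
The standard data processing computing power may include: factory data processing computing power or initial data processing computing power.
The factory data processing computing power may be the computing power value assigned to the computing device when it leaves the factory. In this case, S10412 may be implemented by obtaining the factory data processing computing power recorded by the data processing chip of the computing device and using it as the standard data processing computing power.
In addition, because the factory data processing computing power is a value written into firmware at the factory, it may deviate somewhat from the actual standard computing power of the device's data processing chip. When performing S10412, it is therefore also possible to obtain the chip status information recorded by the data processing chip (which may include, but is not limited to, the chip operating frequency) and use it to calculate a reference data processing computing power. If the difference between the factory data processing computing power and this reference value is less than 5% (of either the factory value or the reference value, with no particular restriction), the factory data processing computing power is used as the standard data processing computing power.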
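The 5% plausibility check on the factory value could be written as below. The function name and the choice to measure the 5% against the factory value are assumptions (the text allows either base). The disclosure does not say what happens when the two values differ by 5% or more, so this sketch returns `None` in that case and leaves the fallback to the caller.

```python
from typing import Optional


def pick_standard_power(
    factory_power: float,
    reference_power: float,
    tolerance: float = 0.05,
) -> Optional[float]:
    """Return the factory value if it agrees with the chip-derived reference.

    `reference_power` is assumed to have been computed elsewhere from chip
    status information such as the operating frequency.
    """
    # Difference measured relative to the factory value (one of the two
    # bases the description permits).
    if abs(factory_power - reference_power) < tolerance * factory_power:
        return factory_power
    return None  # unspecified in the source: caller decides what to use instead
```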
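The initial data processing computing power is the data processing computing power of the computing device when it first starts running. It may be recorded in the backend of the computing device or on the device's data processing chip.
Because the foregoing data is recorded in the backend of the computing device or on its data processing chip, it can be obtained by establishing communication with the device, where the communication may include, but is not limited to, at least one of wired communication and wireless communication.
The abnormal ratio can be set as needed, and the embodiments of the present disclosure do not limit its specific value. For example, it may be set to 80%. That is, when the current data processing computing power of the computing device is less than 80% of the standard data processing computing power, or when the duration for which it stays below 80% of the standard data processing computing power reaches the duration threshold, the computing device is determined to be operating abnormally.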
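Steps S10412 and S10414 reduce to one multiplication, sketched here with the 80% example from the text as the default. The function name and the input validation are illustrative additions.

```python
def abnormal_threshold(standard_power: float, abnormal_ratio: float = 0.8) -> float:
    """S10414: abnormal threshold = standard computing power x preset abnormal ratio."""
    if standard_power <= 0:
        raise ValueError("standard_power must be positive")
    if not 0 < abnormal_ratio <= 1:
        raise ValueError("abnormal_ratio must be in (0, 1]")
    return standard_power * abnormal_ratio
```

With a standard computing power of 100 units and the 80% ratio, the threshold works out to 80 units, so a device reporting 75 units would count as below the threshold and one reporting 85 units would not.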
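Through the foregoing process, abnormal computing devices can be identified from the monitored data.
Once the abnormal computing devices have been identified, S106 may restart only those devices, which avoids affecting normally operating computing devices. Alternatively, all monitored computing devices may be restarted together, which applies one uniform operation and reduces processing complexity.
The following describes restart control for an abnormal computing device only.
A computing device generally has the ability or permission to restart itself, so the restart can be achieved by sending an instruction that makes the abnormal computing device restart itself. In this case, S106 may be: send a restart instruction to the abnormal computing device, so that the abnormal computing device restarts according to the restart instruction.
In one possible design, S106 may be implemented as shown in FIG. 6:
S1062: establish a remote access connection with the computing device according to its identification information.
S1064: send the restart instruction to the computing device over the remote access connection.
This takes into account that the executing device may need to monitor multiple computing devices and may be unable to exchange data with them through an interface (where an interface is available, S106 can also be implemented in this way). The embodiments therefore offer a solution: access the computing device remotely, and obtain its operating status information through remote access. Accessing a computing device remotely means establishing a remote access connection with it, which can be done using the device's identification information.
The identification information of the computing device in the embodiments of the present disclosure may include, but is not limited to: an Internet Protocol (IP) address.
This approach minimizes the effect of geographic location on monitoring computing devices, and offers high flexibility and room for expansion.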
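S1062 and S1064 can be sketched over a plain TCP connection using Python's standard `socket` module. The disclosure does not specify the remote access protocol, the port, or the wire format of the restart instruction, so the port number, the `b"RESTART\n"` payload, and the function name here are placeholders, not a real device API.

```python
import socket
from typing import Callable


def send_restart(
    ip: str,
    port: int = 4028,                      # hypothetical management port
    command: bytes = b"RESTART\n",         # hypothetical restart instruction
    timeout: float = 5.0,
    connect: Callable = socket.create_connection,
) -> None:
    """Open a connection to the device identified by `ip` and send the command."""
    # S1062: establish the remote access connection from the identification info.
    with connect((ip, port), timeout=timeout) as conn:
        # S1064: send the restart instruction over that connection.
        conn.sendall(command)
```

The `connect` parameter exists only so the function can be exercised with a stand-in connection; in normal use the default `socket.create_connection` is used and the call is simply `send_restart("192.0.2.10")`, where the address is a documentation example.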
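Specifically, the embodiments of the present disclosure place no particular limit on how the computing device is restarted. A cold restart may be used, in which power is cut and the device is started again. Alternatively, a hot restart may be used, in which the backend of the computing device performs the restart without power being cut.
In addition, in the embodiments of the present disclosure, a prompt signal may be output after S106 is performed, so that the user can learn the progress of the restart of the abnormal computing device from the signal. The prompt signal may include, but is not limited to, at least one of: a sound signal, a vibration signal, a flashing signal, and a text prompt. If the executing device is integrated in a maintenance device, any one or more of these prompt signals may likewise be output. Furthermore, if multiple computing devices are being monitored, information identifying the computing device concerned may also be output, for example the device's number or IP address.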
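Based on the foregoing computing device maintenance method, an embodiment of the present disclosure further provides an electronic device.
Referring to FIG. 7, the electronic device 700 includes: a memory 710, a transceiver 720, and a processor 730, which are connected through a bus;
the memory 710 is configured to store a computer program;
the transceiver 720 is configured to communicate with other devices;
the processor 730 is configured to execute the computer program to implement the computing device maintenance method of any of the foregoing implementations.
In one possible design, the transceiver 720 is configured to receive status report information sent by a computing device;
the processor 730 is configured to determine, according to the status report information, whether the computing device is operating abnormally, and to control the computing device to restart if it is operating abnormally.
In one possible design, the processor 730 is specifically configured to:
if the status report information carries an operation abnormality identifier, determine that the computing device is operating abnormally, where the operation abnormality identifier is configured to indicate that the computing device is operating abnormally.
In another possible design, the status report information includes operating status information of the computing device;
the operating status information includes: the current data processing computing power of the computing device.
In this case, the processor 730 is specifically configured to:
if the current data processing computing power is less than the abnormal computing power threshold, determine that the computing device is operating abnormally; or,
if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches the duration threshold, determine that the computing device is operating abnormally.
In one possible design, the processor 730 is further configured to:
obtain the abnormal computing power threshold.
In this case, in one possible implementation, the processor 730 is further specifically configured to:
obtain the standard data processing computing power of the computing device;
obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
The standard data processing computing power is: factory data processing computing power or initial data processing computing power.
In one possible design, the processor 730 is further specifically configured to:
obtain the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
In another implementation scenario, the transceiver 720 is specifically configured to:
send a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
Specifically, the processor 730 is further specifically configured to: establish a remote access connection with each computing device according to the identification information of each computing device, where the identification information uniquely identifies one computing device;
the transceiver 720 is specifically configured to send the restart instruction to the computing device through the remote access connection.
In the embodiments of the present disclosure, the identification information includes: an Internet Protocol (IP) address.
In the embodiments of the present disclosure, the processor 730 is specifically configured to:
control the abnormal computing device to power off and restart; or,
control the abnormal computing device to restart without powering off.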
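In the embodiments of the present disclosure, when the logic instructions in the memory 710 are implemented in the form of software functional units and sold or used as a stand-alone product, they may be stored in a computer-readable storage medium.
As a computer-readable storage medium, the memory 710 may be configured to store software programs and computer-executable programs, such as the program instructions or modules corresponding to the methods in the embodiments of the present disclosure. By running the software programs, instructions, and modules stored in the memory 710, the processor 730 executes functional applications and data processing, that is, implements the computing device maintenance method of the foregoing method embodiments.
The memory 710 may include a program storage area and a data storage area. The program storage area may store an operating system and the application programs required by at least one function. The data storage area may store data created through use of the terminal device, and the like. In addition, the memory 710 may include high-speed random access memory and may also include non-volatile memory.
In the embodiments of the present disclosure, there may be one or more processors 730. The processor 730 may also be called a processing unit and can implement certain control functions. The processor 730 may be a general-purpose processor, a special-purpose processor, or the like.
In an optional design, the processor 730 may also store instructions that can be run by the processor, so that the electronic device 700 performs the computing device maintenance method described in the foregoing method embodiments.
In yet another possible design, the electronic device 700 may include a circuit that implements the sending, receiving, or communication functions of the foregoing method embodiments.
In the embodiments of the present disclosure, there may be one or more transceivers 720. The transceiver 720 may be called a transceiver unit, a transceiver circuit, or the like, and is configured to implement the transmitting and receiving functions of the electronic device 700.
For the specific processing performed by each component, refer to the descriptions in the foregoing embodiments.
The processor 730 and the transceiver 720 described in the embodiments of the present disclosure may be implemented on an integrated circuit (IC), an analog IC, a radio frequency integrated circuit (RFIC), a mixed-signal IC, an application-specific integrated circuit (ASIC), a printed circuit board (PCB), an electronic device, and so on. The processor and transceiver may also be manufactured using various IC process technologies, such as complementary metal oxide semiconductor (CMOS), N-type metal oxide semiconductor (NMOS), P-type metal oxide semiconductor (positive channel metal oxide semiconductor, PMOS), bipolar junction transistor (BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (GaAs), and so on.
Optionally, the electronic device 700 may be a stand-alone device or may be part of a larger device. For example, the electronic device 700 may be integrated in a computing device.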
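In addition, an embodiment of the present disclosure provides a computing device maintenance apparatus. Referring to FIG. 8, the computing device maintenance apparatus 800 includes:
a receiving module 810, configured to receive status report information sent by a computing device;
a determining module 820, configured to determine, according to the status report information, whether the computing device is operating abnormally;
a control module 830, configured to control the computing device to restart if the computing device is operating abnormally.
In one possible design, the determining module 820 is configured to:
if the status report information carries an operation abnormality identifier, determine that the computing device is operating abnormally, where the operation abnormality identifier is configured to indicate that the computing device is operating abnormally.
In another possible design, the status report information includes operating status information of the computing device;
the operating status information includes: the current data processing computing power of the computing device.
In another possible design, the determining module 820 is specifically configured to:
if the current data processing computing power is less than the abnormal computing power threshold, determine that the computing device is operating abnormally; or,
if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches the duration threshold, determine that the computing device is operating abnormally.
In another possible design, the determining module 820 is further configured to:
obtain the abnormal computing power threshold.
In another possible design, the determining module 820 is specifically configured to:
obtain the standard data processing computing power of the computing device;
obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
In another possible design, the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
In another possible design, the determining module 820 is specifically configured to:
obtain the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
In another possible design, the control module 830 is specifically configured to:
send a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
In another possible design, the control module 830 is specifically configured to:
establish a remote access connection with each computing device according to the identification information of each computing device, where the identification information uniquely identifies one computing device;
send the restart instruction to the computing device through the remote access connection.
In another possible design, the identification information includes: an Internet Protocol (IP) address.
In another possible design, the control module 830 is specifically configured to:
control the computing device to power off and restart; or,
control the computing device to restart without powering off.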
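Note that in the embodiments of the present disclosure, in every aspect (method, apparatus, and so on), the data processing described may include at least one of setting, calculating, judging, transmitting, storing, and managing, performed on or based on data.
As one embodiment, the data processing may be data processing related to digital vouchers that is performed by a data processing device. The digital vouchers may be obtained through the data processing, and the data processing device may be a digital voucher processing device.
When the digital voucher relates to or takes the form of a digital currency, the digital voucher processing device may be a digital currency data processor, and the digital currency may be a cryptocurrency such as Bitcoin.
In addition, an embodiment of the present disclosure further provides a computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being configured to perform the foregoing computing device maintenance method.
An embodiment of the present disclosure further provides a computer program product. The computer program product includes a computer program stored on a computer-readable storage medium, and the computer program includes program instructions that, when executed by a computer, cause the computer to perform the foregoing computing device maintenance method.
The computer-readable storage medium may be a transient computer-readable storage medium or a non-transitory computer-readable storage medium.
The technical solutions of the embodiments of the present disclosure may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes one or more instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present disclosure. The storage medium may be a non-transitory storage medium, including a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disk, or any other medium that can store program code. It may also be a transient storage medium.
As used in the embodiments of the present disclosure, although the terms "first", "second", and so on may be used to describe various elements, these elements should not be limited by these terms. The terms are only used to distinguish one element from another. For example, without changing the meaning of the description, a first element could be called a second element, and likewise a second element could be called a first element, as long as all occurrences of "first element" are renamed consistently and all occurrences of "second element" are renamed consistently. The first element and the second element are both elements, but they may not be the same element.
The wording used in the embodiments of the present disclosure serves only to describe the embodiments and is not intended to limit the claims. As used in the description of the embodiments and in the claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. Similarly, the term "and/or" as used in the embodiments of the present disclosure covers any and all possible combinations of one or more of the associated listed items. In addition, as used in the embodiments of the present disclosure, the term "comprise" and its variations "comprises" and/or "comprising" refer to the presence of the stated features, wholes, steps, operations, elements, and/or components, but do not exclude the presence or addition of one or more other features, wholes, steps, operations, elements, components, and/or groups of these.
The aspects, implementations, and features of the described embodiments can be used alone or in any combination. The aspects of the described embodiments may be implemented in software, hardware, or a combination of the two. The described embodiments may also be embodied as a computer-readable medium storing computer-readable code, where the code includes instructions executable by at least one computing device. The computer-readable medium may be associated with any data storage device capable of storing data that can be read by a computer system. Examples of computer-readable media include read-only memory, random access memory, CD-ROM, HDD, DVD, magnetic tape, optical data storage devices, and the like. The computer-readable medium may also be distributed across computer systems connected through a network, so that the computer-readable code is stored and executed in a distributed manner.
The foregoing technical description refers to the accompanying drawings, which form part of the embodiments of the present disclosure and show, by way of description, implementations in accordance with the described embodiments. Although these embodiments are described in enough detail for a person skilled in the art to implement them, they are non-limiting. Other embodiments can be used, and changes can be made without departing from the scope of the described embodiments. For example, the order of operations described in a flowchart is non-limiting, so the order of two or more operations illustrated in and described with reference to a flowchart may be changed according to several embodiments. As another example, in several embodiments, one or more operations illustrated in and described with reference to a flowchart are optional or may be omitted. In addition, certain steps or functions may be added to the disclosed embodiments, or the order of two or more steps may be swapped. All such changes are considered to be included in the disclosed embodiments and the claims.
In addition, terminology is used in the foregoing technical description to provide a thorough understanding of the described embodiments. Overly specific details are not, however, required to implement the described embodiments. The foregoing description of the embodiments is therefore presented for purposes of illustration and description. The embodiments presented above, and the examples disclosed in accordance with them, are provided individually to add context and aid understanding of the described embodiments. The foregoing specification is not intended to be exhaustive or to limit the described embodiments to the precise form of the present disclosure. In light of the above teachings, several modifications, alternative applications, and variations are possible. In some cases, well-known processing steps have not been described in detail, to avoid unnecessarily obscuring the described embodiments.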

Claims (27)

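  1. A computing device maintenance method, characterized by comprising:
    receiving status report information sent by a computing device;
    determining, according to the status report information, whether the computing device is operating abnormally;
    if the computing device is operating abnormally, controlling the computing device to restart.
  2. The method according to claim 1, characterized in that determining, according to the status report information, whether the computing device is operating abnormally comprises:
    if the status report information carries an operation abnormality identifier, determining that the computing device is operating abnormally, wherein the operation abnormality identifier is used to indicate that the computing device is operating abnormally.
  3. The method according to claim 1, characterized in that the status report information includes operating status information of the computing device;
    the operating status information includes: the current data processing computing power of the computing device.
  4. The method according to claim 3, characterized in that determining, according to the status report information, whether the computing device is operating abnormally comprises:
    if the current data processing computing power is less than an abnormal computing power threshold, determining that the computing device is operating abnormally; or,
    if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches a duration threshold, determining that the computing device is operating abnormally.
  5. The method according to claim 4, characterized in that the method further comprises:
    obtaining the abnormal computing power threshold.
  6. The method according to claim 5, characterized in that obtaining the abnormal computing power threshold comprises:
    obtaining the standard data processing computing power of the computing device;
    obtaining the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
  7. The method according to claim 6, characterized in that the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  8. The method according to claim 6 or 7, characterized in that obtaining the standard data processing computing power of the computing device comprises:
    obtaining the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
  9. The method according to claim 1, characterized in that controlling the computing device to restart comprises:
    sending a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
  10. The method according to claim 9, characterized in that sending the restart instruction to the computing device comprises:
    establishing a remote access connection with each computing device according to the identification information of each computing device, wherein the identification information uniquely identifies one computing device;
    sending the restart instruction to the computing device through the remote access connection.
  11. The method according to claim 10, characterized in that the identification information includes: an Internet Protocol (IP) address.
  12. The method according to claim 1 or 9, characterized in that controlling the computing device to restart comprises:
    controlling the computing device to power off and restart; or,
    controlling the computing device to restart without powering off.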
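  13. A computing device maintenance apparatus, characterized by comprising:
    a receiving module, configured to receive status report information sent by a computing device;
    a determining module, configured to determine, according to the status report information, whether the computing device is operating abnormally;
    a control module, configured to control the computing device to restart if the computing device is operating abnormally.
  14. The apparatus according to claim 13, characterized in that the determining module is configured to:
    if the status report information carries an operation abnormality identifier, determine that the computing device is operating abnormally, wherein the operation abnormality identifier is configured to indicate that the computing device is operating abnormally.
  15. The apparatus according to claim 13, characterized in that the status report information includes operating status information of the computing device;
    the operating status information includes: the current data processing computing power of the computing device.
  16. The apparatus according to claim 15, characterized in that the determining module is specifically configured to:
    if the current data processing computing power is less than an abnormal computing power threshold, determine that the computing device is operating abnormally; or,
    if the duration for which the current data processing computing power stays below the abnormal computing power threshold reaches a duration threshold, determine that the computing device is operating abnormally.
  17. The apparatus according to claim 16, characterized in that the determining module is further configured to:
    obtain the abnormal computing power threshold.
  18. The apparatus according to claim 17, characterized in that the determining module is specifically configured to:
    obtain the standard data processing computing power of the computing device;
    obtain the product of the standard data processing computing power and a preset abnormal ratio to get the abnormal computing power threshold.
  19. The apparatus according to claim 18, characterized in that the standard data processing computing power is: factory data processing computing power or initial data processing computing power.
  20. The apparatus according to claim 18 or 19, characterized in that the determining module is specifically configured to:
    obtain the factory data processing computing power recorded by the data processing chip of the computing device as the standard data processing computing power.
  21. The apparatus according to claim 13, characterized in that the control module is specifically configured to:
    send a restart instruction to the computing device, so that the computing device restarts according to the restart instruction.
  22. The apparatus according to claim 21, characterized in that the control module is specifically configured to:
    establish a remote access connection with each computing device according to the identification information of each computing device, wherein the identification information uniquely identifies one computing device;
    send the restart instruction to the computing device through the remote access connection.
  23. The apparatus according to claim 22, characterized in that the identification information includes: an Internet Protocol (IP) address.
  24. The apparatus according to claim 13 or 22, characterized in that the control module is specifically configured to:
    control the computing device to power off and restart; or,
    control the computing device to restart without powering off.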
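  25. An electronic device, characterized by comprising: a memory, a transceiver, and a processor, wherein the memory, the transceiver, and the processor are connected through a bus;
    the memory is configured to store a computer program;
    the transceiver is configured to communicate with other devices;
    the processor is configured to execute the computer program to implement the method according to any one of claims 1-12.
  26. A computer-readable storage medium, characterized by storing computer-executable instructions, the computer-executable instructions being configured to perform the method according to any one of claims 1-12.
  27. A computer program product, characterized in that the computer program product includes a computer program stored on a computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer, the computer is caused to perform the method according to any one of claims 1-12.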
PCT/CN2018/117651 2018-11-27 2018-11-27 运算设备维护方法及装置、存储介质和程序产品 WO2020107205A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2018/117651 WO2020107205A1 (zh) 2018-11-27 2018-11-27 运算设备维护方法及装置、存储介质和程序产品
CN201880100621.3A CN113396561A (zh) 2018-11-27 2018-11-27 运算设备维护方法及装置、存储介质和程序产品

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/117651 WO2020107205A1 (zh) 2018-11-27 2018-11-27 运算设备维护方法及装置、存储介质和程序产品

Publications (1)

Publication Number Publication Date
WO2020107205A1 true WO2020107205A1 (zh) 2020-06-04

Family

ID=70852208

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/117651 WO2020107205A1 (zh) 2018-11-27 2018-11-27 运算设备维护方法及装置、存储介质和程序产品

Country Status (2)

Country Link
CN (1) CN113396561A (zh)
WO (1) WO2020107205A1 (zh)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112261141A (zh) * 2020-10-23 2021-01-22 移康智能科技(上海)股份有限公司 一种基于区块链技术的本地物联网算力检测系统
CN113824787A (zh) * 2021-09-22 2021-12-21 深圳维盟网络技术有限公司 一种控制终端重启的方法
CN115168427A (zh) * 2022-09-08 2022-10-11 中科声龙科技发展(北京)有限公司 设备查找方法、装置、设备及存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102394791A (zh) * 2011-10-26 2012-03-28 浪潮(北京)电子信息产业有限公司 宕机恢复方法和系统
CN102412998A (zh) * 2011-12-21 2012-04-11 上海会畅通讯科技发展有限公司 运营服务系统及其维护方法和装置
CN104243216A (zh) * 2014-09-28 2014-12-24 北京国双科技有限公司 集群服务器的维护方法及装置
US20150381460A1 (en) * 2014-06-26 2015-12-31 Fujitsu Limited Network monitoring system and method
CN108647130A (zh) * 2018-05-28 2018-10-12 比特大陆科技有限公司 一种故障矿机的定位方法、报警方法以及相关设备和系统

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108881413A (zh) * 2018-05-31 2018-11-23 北京金风科创风电设备有限公司 风力发电机组的通讯控制方法、装置、设备及介质

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102394791A (zh) * 2011-10-26 2012-03-28 浪潮(北京)电子信息产业有限公司 宕机恢复方法和系统
CN102412998A (zh) * 2011-12-21 2012-04-11 上海会畅通讯科技发展有限公司 运营服务系统及其维护方法和装置
US20150381460A1 (en) * 2014-06-26 2015-12-31 Fujitsu Limited Network monitoring system and method
CN104243216A (zh) * 2014-09-28 2014-12-24 北京国双科技有限公司 集群服务器的维护方法及装置
CN108647130A (zh) * 2018-05-28 2018-10-12 比特大陆科技有限公司 一种故障矿机的定位方法、报警方法以及相关设备和系统

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112261141A (zh) * 2020-10-23 2021-01-22 移康智能科技(上海)股份有限公司 一种基于区块链技术的本地物联网算力检测系统
CN113824787A (zh) * 2021-09-22 2021-12-21 深圳维盟网络技术有限公司 一种控制终端重启的方法
CN113824787B (zh) * 2021-09-22 2024-03-29 深圳维盟网络技术有限公司 一种控制终端重启的方法
CN115168427A (zh) * 2022-09-08 2022-10-11 中科声龙科技发展(北京)有限公司 设备查找方法、装置、设备及存储介质

Also Published As

Publication number Publication date
CN113396561A (zh) 2021-09-14

Similar Documents

Publication Publication Date Title
WO2020107205A1 (zh) 运算设备维护方法及装置、存储介质和程序产品
EP3013086A1 (en) Method, apparatus and electronic device for connection management
WO2020107198A1 (zh) 运算设备维护方法及装置、存储介质和程序产品
JP2010528375A (ja) モバイルオペレーティング環境のための、イベント制御された連続的なロギングを提供すること
CN104869609A (zh) 信息提供方法和装置
CN110012527B (zh) 唤醒方法及电子设备
EP3220586A1 (en) Authority management method and device for a router, and a router
US9762477B2 (en) Network apparatus with loop detection and port shutdown capabilities
EP2811690B1 (en) Method and apparatus for managing wireless docking network
US11983539B2 (en) Method for computing device maintenance, apparatus, storage medium and program product
CN105516937B (zh) 一种基于Android手机系统的手机丢失后的远程控制方法
US20160020974A1 (en) Network device, communication method, program, and recording medium
CN110908881B (zh) 埋点数据的发送方法、装置、电子设备及计算机可读存储介质
US11533228B2 (en) Method for information configuration, apparatus, electronic device, storage medium and program product
US11314670B2 (en) Method, apparatus, and device for transmitting file based on BMC, and medium
WO2020107211A1 (zh) 运算设备维护方法及装置、存储介质和程序产品
WO2020107203A1 (zh) 运算设备维护方法及装置、存储介质和程序产品
CN108965382B (zh) 一种基于bmc的文件传输方法、装置、设备及介质
CN108605054A (zh) 实现增值服务的方法、装置与云服务器
JPWO2006090647A1 (ja) 処理装置
WO2020037607A1 (zh) 一种传输数据的方法和装置
WO2020147415A1 (zh) 抓拍服务进程管理方法、装置、电子设备及可读存储介质
US10063386B2 (en) Control method, controller, and recording medium
US10419363B2 (en) Network device, communication method, and recording medium
TWI791316B (zh) 實現程式間通訊的方法及系統

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18941476

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18941476

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16/02/2022)
