CN116701055A - Fault isolation method, device, equipment and medium of server - Google Patents

Fault isolation method, device, equipment and medium of server Download PDF

Info

Publication number
CN116701055A
CN116701055A CN202310639717.XA CN202310639717A CN116701055A CN 116701055 A CN116701055 A CN 116701055A CN 202310639717 A CN202310639717 A CN 202310639717A CN 116701055 A CN116701055 A CN 116701055A
Authority
CN
China
Prior art keywords
isolation
server
transmission interface
hardware transmission
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310639717.XA
Other languages
Chinese (zh)
Inventor
唐华宇
付迪
唐文涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Great Wall Chaoyun Beijing Technology Co ltd
Original Assignee
Great Wall Chaoyun Beijing Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Great Wall Chaoyun Beijing Technology Co ltd filed Critical Great Wall Chaoyun Beijing Technology Co ltd
Priority to CN202310639717.XA priority Critical patent/CN116701055A/en
Publication of CN116701055A publication Critical patent/CN116701055A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1415Saving, restoring, recovering or retrying at system level
    • G06F11/1441Resetting or repowering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2284Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing by power-on test, e.g. power-on self test [POST]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Retry When Errors Occur (AREA)

Abstract

The application provides a fault isolation method, device, equipment and medium of a server, wherein the server comprises a plurality of hardware transmission interfaces, and the method comprises the following steps: when the server is powered on or restarted, the basic input/output system acquires fault information of each hardware transmission interface where the server fails; the basic input/output system analyzes the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller and judges whether each hardware transmission interface with faults is allowed to execute isolation processing or not; and if judging that each hardware transmission interface with faults is not allowed to execute isolation processing, allowing the hardware transmission interface to perform data transmission so as to enable the server to be normally powered on or restarted. The effect of normal startup and use can be achieved under the condition that the hardware part of the server accords with the preset condition.

Description

Fault isolation method, device, equipment and medium of server
Technical Field
The present application relates to the field of server technologies, and in particular, to a method, an apparatus, a device, and a medium for fault isolation of a server.
Background
A basic input output system (BIOS, basic Input Output System) is a system that mainly manages basic input and output of a server, and can detect a hardware portion of the server, boot an operating system, and the like.
After detecting the hardware part and recognizing the hardware risk, the basic input/output system automatically executes a shutdown program, so that the server cannot be started normally, a worker cannot modify the detection process, and the detection condition cannot be adjusted, and the startup and the use of the server are affected when necessary.
Disclosure of Invention
Therefore, the present application aims to provide a fault isolation method, apparatus and medium for a server, which can perform isolation processing on a hardware transmission interface by a basic input/output system according to a fault isolation mechanism set by a baseboard control manager when the server is powered on and restarted, so that the server is powered on or restarted normally, and the problems that in the prior art, a worker cannot modify a detection process, cannot adjust a detection condition, and affects the startup and use of the server when necessary are solved, and the effect of normal startup and use when a hardware part of the server meets preset conditions is achieved.
In a first aspect, an embodiment of the present application provides a fault isolation method for a server, where the server includes a plurality of hardware transmission interfaces, and the method includes:
when the server is powered on or restarted, the basic input/output system acquires fault information of each hardware transmission interface of the server, wherein the fault information is generated by the server; the basic input/output system analyzes the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller and judges whether each hardware transmission interface with faults is allowed to execute isolation processing or not; and if judging that each hardware transmission interface with faults is not allowed to execute isolation processing, allowing the hardware transmission interface to perform data transmission so as to enable the server to be normally powered on or restarted.
Optionally, it is determined whether each hardware transmission interface that has failed is allowed to perform the isolation processing by: the basic input/output system determines a fault isolation mechanism corresponding to the interface identifier of each hardware transmission interface according to the interface identifier of each hardware transmission interface with faults; judging whether the power-on or restarting of the server is affected in each hardware transmission interface according to a fault isolation mechanism corresponding to the interface identification of each hardware transmission interface; if not, allowing the hardware transmission interface to be subjected to isolation processing; if yes, the hardware transmission interface is not allowed to be subjected to isolation processing.
Optionally, the method further comprises: judging whether the server is powered on or restarted; if the server is powered on or restarted, the basic input/output system determines the isolation level of each hardware transmission interface with faults; and according to the isolation level of each hardware transmission interface, carrying out isolation processing on each hardware transmission interface with faults.
Optionally, the isolation level includes a boot isolation level, an operation isolation level, and an isolation level to be corrected, where the boot isolation level is used to indicate that, after the server is booted, data transmission isolation is performed on the hardware transmission interface at the boot isolation level, the operation isolation level is used to indicate that, when the server is running, the hardware transmission interface at the operation isolation level is allowed to transmit data of the first target address or the first target program, and the isolation level to be corrected is used to indicate that, when the server is running, the hardware transmission interface at the isolation level to be corrected is allowed to transmit data of the second target address or the second target program.
Optionally, the step of isolating each hardware transmission interface that fails according to the isolation level of each hardware transmission interface includes: if the isolation level of the hardware transmission interface is a starting isolation level, after the server is started or restarted, the hardware transmission interface is limited to execute data transmission isolation;
if the isolation level of the hardware transmission interface is an operation isolation level, only allowing the hardware transmission interface to transmit the data of the first target address or the first target program when the hardware transmission interface performs data transmission; and if the isolation level of the transmission interface is to be corrected, allowing the hardware transmission interface to transmit the data of the second target address or the second target program when the hardware transmission interface transmits the data.
Optionally, the fault isolation mechanism is set by: displaying a plurality of hardware transmission interfaces in the server in a fault isolation setting interface; in response to selection of each hardware transmission interface, displaying a data type which can be transmitted by the interface and a connectable address of the interface in a first area of a display interface; in response to a selection of a type of data that the interface can transfer and a selection of a connectable address of the interface, a fault isolation mechanism of the interface is formed.
In a second aspect, an embodiment of the present application further provides a fault isolation apparatus of a server, where the apparatus includes:
the system comprises a fault information acquisition module, a control module and a control module, wherein the fault information acquisition module is used for acquiring fault information of each hardware transmission interface of a server, which is faulty, when the server is powered on or restarted;
the isolation processing judging module is used for analyzing the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller by the basic input/output system and judging whether each hardware transmission interface with faults is allowed to execute isolation processing or not;
and the isolation processing execution module is used for allowing the hardware transmission interfaces to perform data transmission if judging that each hardware transmission interface with faults is not allowed to perform isolation processing, so that the server is powered on or restarted normally.
Optionally, the apparatus further comprises: the power-on or restarting completion judging module is used for judging whether the server is powered on or restarted;
the isolation level determining module is used for determining the isolation level of each hardware transmission interface with faults if the server is powered on or restarted;
and the interface isolation module is used for carrying out isolation processing on each hardware transmission interface with faults according to the isolation level of each hardware transmission interface.
In a third aspect, an embodiment of the present application further provides an electronic device, including: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory in communication via the bus when the electronic device is running, the machine-readable instructions when executed by the processor performing the steps of the fault isolation method of a server as described above.
In a fourth aspect, embodiments of the present application also provide a computer readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the fault isolation method of a server as described above.
According to the fault isolation method, the fault isolation device, the fault isolation equipment and the fault isolation medium for the server, provided by the embodiment of the application, the basic input/output system can execute isolation processing on the hardware transmission interface according to the fault isolation mechanism set by the substrate control manager when the server is powered on and restarted, so that the server is powered on or restarted normally, the problems that a worker cannot modify a detection process and cannot adjust detection conditions in the prior art, and the startup and use of the server when necessary are affected are solved, and the effect that the server can be started up normally under the condition that the hardware part of the server meets preset conditions is achieved.
In order to make the above objects, features and advantages of the present application more comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a method for isolating a server from a failure according to an embodiment of the present application;
FIG. 2 is a flowchart of another method for isolating a failure of a server according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of a fault isolation apparatus for a server according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present application more apparent, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. The components of the embodiments of the present application generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the application, as presented in the figures, is not intended to limit the scope of the application, as claimed, but is merely representative of selected embodiments of the application. Based on the embodiments of the present application, every other embodiment obtained by a person skilled in the art without making any inventive effort falls within the scope of protection of the present application.
First, an application scenario to which the present application is applicable will be described. The application can be applied to the field of servers.
It has been found that the basic input/output system (BIOS, basic Input Output System) as a system for mainly managing basic input/output of a server can detect a hardware portion of the server, boot an operating system, and the like.
After detecting the hardware part and recognizing the hardware risk, the basic input/output system automatically executes a shutdown program, so that the server cannot be started normally, a worker cannot modify the detection process, and the detection condition cannot be adjusted, and the startup and the use of the server are affected when necessary.
Based on this, the embodiment of the application provides a fault isolation method for a server, which can execute isolation processing on a hardware transmission interface by a basic input/output system according to a fault isolation mechanism set by a substrate control manager when the server is powered on and restarted, so that the server is powered on or restarted normally, the problems that a detection process cannot be modified by a worker in the prior art, detection conditions cannot be regulated, and the use of the server when necessary is affected are solved, and the effect of normal startup and use when a hardware part of the server meets preset conditions is achieved.
Referring to fig. 1, fig. 1 is a flowchart of a fault isolation method of a server according to an embodiment of the present application. As shown in fig. 1, the fault isolation method for a server provided by the embodiment of the present application includes:
it should be noted that the server includes a plurality of hardware transmission interfaces.
Here, the hardware transmission interface may be a hard disk interface (SATA interface), an SAS interface, an NVMe interface, a network interface, a PCIE interface, a VGA interface, an IPMI management interface, or the like.
And S101, when the server is powered on or restarted, the basic input/output system acquires fault information of each hardware transmission interface where the server fails.
For example, the hardware transmission interface is a SATA interface, and when data transmission of the hardware transmission interface fails, for example, the SATA interface cannot transmit data for storage or cannot read stored data, the bios detects that the SATA interface cannot transmit data for storage or cannot read stored data.
The hardware transmission interface is a network interface, and the network interface has a virus risk, so that the basic input/output system can detect that the network interface has a virus input risk.
S102, the basic input/output system analyzes the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller, and judges whether each hardware transmission interface with faults is allowed to execute isolation processing or not.
Specifically, referring to fig. 2, fig. 2 is a flowchart of another fault isolation method of a server according to an embodiment of the present application. As shown in fig. 2, another fault isolation method for a server according to an embodiment of the present application includes:
s201, the basic input/output system determines a fault isolation mechanism corresponding to the interface identifier of each hardware transmission interface according to the interface identifier of each hardware transmission interface with faults;
the interface identifier is used for indicating the position and function of each hardware transmission interface, for example, the SATA interface is used for data transmission stored in a hard disk and is arranged at a first position of the SATA interface group.
Specifically, the fault isolation mechanism may be set by: displaying a plurality of hardware transmission interfaces in the server in a fault isolation setting interface; in response to selection of each hardware transmission interface, displaying a data type which can be transmitted by the interface and a connectable address of the interface in a first area of a display interface; in response to a selection of a type of data that the interface can transfer and a selection of a connectable address of the interface, a fault isolation mechanism of the interface is formed.
Illustratively, all hardware transport interfaces are displayed in the failure isolation setup interface, e.g., for each SATA interface, in response to selection of one SATA interface, the types of data that can be transported by that SATA interface, the storage address of the storage hard disk to which that interface is connected, etc., are displayed in the first area of the display interface. In response to selection of the data type and the storage address of the interface, the interface will only transmit the selected data type and only transmit data between the server and the selected address.
S202, judging whether the power-on or restarting of the server is affected in each hardware transmission interface according to a fault isolation mechanism corresponding to the interface identification of each hardware transmission interface.
For example, the server includes an IPMI interface, and if the IPMI interface is isolated, the server cannot be restarted or powered on, so the failure isolation mechanism for the IPMI interface is to allow all types of data transmission between the IPMI interface and the server during the restart or power on process. If the basic input/output system judges that the IPMI interface allows all types of data transmission in the restarting or powering-on process, the hardware transmission interface is confirmed to influence the powering-on restarting of the server.
If not, step S203 is executed to allow the hardware transmission interface to be subjected to the isolation processing.
If yes, step S204 is executed, and the hardware transmission interface is not allowed to be executed with the isolation processing.
For example, even if the IPMI interface is confirmed to have a virus transmission risk, the bios will not perform isolation processing on the IPMI interface, so as to avoid that the server cannot be started normally when the server needs to be started.
And S103, if each hardware transmission interface with faults is judged not to be allowed to execute isolation processing, allowing the hardware transmission interface to conduct data transmission so as to enable the server to be normally powered on or restarted.
According to the fault isolation method for the server, provided by the embodiment of the application, when the server is powered on and restarted, the basic input/output system can execute isolation processing on the hardware transmission interface according to the fault isolation mechanism set by the substrate control manager, so that the server is powered on or restarted normally, the problems that in the prior art, a worker cannot modify a detection process, cannot adjust detection conditions and influence the startup and use of the server when necessary are solved, and the effect of normal startup and use under the condition that the hardware part of the server meets preset conditions is achieved.
Optionally, the method further comprises: judging whether the server is powered on or restarted; if the server is powered on or restarted, the basic input/output system determines the isolation level of each hardware transmission interface with faults; and according to the isolation level of each hardware transmission interface, carrying out isolation processing on each hardware transmission interface with faults.
Specifically, the isolation level includes a boot isolation level, an operation isolation level, and an isolation level to be corrected.
The method comprises the steps of starting up a server, executing data transmission isolation on a hardware transmission interface at a starting-up isolation level after the server is started up, enabling the hardware transmission interface at the starting-up isolation level to transmit data of a first target address or a first target program when the server is operated, and enabling the hardware transmission interface at the starting-up isolation level to transmit data of a second target address or a second target program when the server is operated.
The step of performing isolation processing on each hardware transmission interface with faults according to the isolation level of each hardware transmission interface comprises the following steps: if the isolation level of the hardware transmission interface is a starting isolation level, after the server is started or restarted, the hardware transmission interface is limited to execute data transmission isolation; if the isolation level of the hardware transmission interface is an operation isolation level, only allowing the hardware transmission interface to transmit the data of the first target address or the first target program when the hardware transmission interface performs data transmission; and if the isolation level of the transmission interface is to be corrected, allowing the hardware transmission interface to transmit the data of the second target address or the second target program when the hardware transmission interface transmits the data.
According to the fault isolation method for the server, provided by the embodiment of the application, when the server is powered on and restarted, the basic input/output system can execute isolation processing on the hardware transmission interface according to the fault isolation mechanism set by the substrate control manager, so that the server is powered on or restarted normally, the problems that in the prior art, a worker cannot modify a detection process, cannot adjust detection conditions and influence the startup and use of the server when necessary are solved, and the effect of normal startup and use under the condition that the hardware part of the server meets preset conditions is achieved.
Based on the same inventive concept, the embodiment of the application also provides a fault isolation device of a server corresponding to the fault isolation method of the server, and since the principle of solving the problem by the device in the embodiment of the application is similar to that of the fault isolation method of the server in the embodiment of the application, the implementation of the device can refer to the implementation of the method, and the repetition is omitted.
Referring to fig. 3, fig. 3 is a schematic structural diagram of a fault isolation apparatus for a server according to an embodiment of the application. As shown in fig. 3, the fault isolation apparatus 300 of the server includes:
the failure information obtaining module 301 is configured to obtain, by using the bios, failure information of each hardware transmission interface where the server fails when the server is powered on or restarted;
the isolation processing judging module 302 is configured to analyze fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller by using the bios to judge whether each hardware transmission interface that fails is allowed to perform isolation processing;
and the isolation processing executing module 303 is configured to allow the hardware transmission interfaces to perform data transmission if it is determined that each hardware transmission interface that fails is not allowed to perform isolation processing, so that the server is powered on or restarted normally.
Optionally, the apparatus further comprises: the power-on or restarting completion judging module is used for judging whether the server is powered on or restarted; the isolation level determining module is used for determining the isolation level of each hardware transmission interface with faults if the server is powered on or restarted; and the interface isolation module is used for carrying out isolation processing on each hardware transmission interface with faults according to the isolation level of each hardware transmission interface.
According to the fault isolation device for the server, provided by the embodiment of the application, the basic input/output system can execute isolation processing on the hardware transmission interface according to the fault isolation mechanism set by the substrate control manager when the server is powered on and restarted, so that the server is powered on or restarted normally, the problems that in the prior art, a worker cannot modify a detection process or adjust detection conditions, and the startup and use of the server are affected when necessary are solved, and the effect of normal startup and use can be achieved under the condition that the hardware part of the server meets preset conditions.
Referring to fig. 4, fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the application. As shown in fig. 4, the electronic device 400 includes a processor 410, a memory 420, and a bus 430.
The memory 420 stores machine-readable instructions executable by the processor 410, when the electronic device 400 is running, the processor 410 communicates with the memory 420 through the bus 430, and when the machine-readable instructions are executed by the processor 410, the steps of the fault isolation method of the server in the method embodiments shown in fig. 1 and fig. 2 can be executed, and detailed implementation can refer to method embodiments and will not be repeated herein.
The embodiment of the present application further provides a computer readable storage medium, where a computer program is stored, where the computer program when executed by a processor may perform the steps of the fault isolation method of the server in the method embodiments shown in fig. 1 and fig. 2, and the specific implementation manner may refer to the method embodiment and will not be described herein.
It will be clear to those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described systems, apparatuses and units may refer to corresponding procedures in the foregoing method embodiments, and are not repeated herein.
In the several embodiments provided by the present application, it should be understood that the disclosed systems, devices, and methods may be implemented in other manners. The above-described apparatus embodiments are merely illustrative, for example, the division of the units is merely a logical function division, and there may be other manners of division in actual implementation, and for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be through some communication interface, device or unit indirect coupling or communication connection, which may be in electrical, mechanical or other form.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a non-volatile computer readable storage medium executable by a processor. Based on this understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (Random Access Memory, RAM), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
Finally, it should be noted that: the above examples are only specific embodiments of the present application, and are not intended to limit the scope of the present application, but it should be understood by those skilled in the art that the present application is not limited thereto, and that the present application is described in detail with reference to the foregoing examples: any person skilled in the art may modify or easily conceive of the technical solution described in the foregoing embodiments, or perform equivalent substitution of some of the technical features, while remaining within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application, and are intended to be included in the scope of the present application. Therefore, the protection scope of the application is subject to the protection scope of the claims.

Claims (10)

1. A method of fault isolation for a server, the server comprising a plurality of hardware transmission interfaces, the method comprising:
when the server is powered on or restarted, the basic input/output system acquires fault information of each hardware transmission interface where the server fails;
the basic input/output system analyzes the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller and judges whether each hardware transmission interface with faults is allowed to execute isolation processing or not;
and if judging that each hardware transmission interface with faults is not allowed to execute isolation processing, allowing the hardware transmission interface to perform data transmission so as to enable the server to be normally powered on or restarted.
2. The method according to claim 1, wherein it is determined whether each hardware transmission interface that has failed is allowed to perform the quarantine processing by:
the basic input/output system determines a fault isolation mechanism corresponding to the interface identifier of each hardware transmission interface according to the interface identifier of each hardware transmission interface with faults;
judging whether the power-on or restarting of the server is affected in each hardware transmission interface according to a fault isolation mechanism corresponding to the interface identification of each hardware transmission interface;
if not, allowing the hardware transmission interface to be subjected to isolation processing;
if yes, the hardware transmission interface is not allowed to be subjected to isolation processing.
3. The method according to claim 1, wherein the method further comprises:
judging whether the server is powered on or restarted;
if the server is powered on or restarted, the basic input/output system determines the isolation level of each hardware transmission interface with faults;
and according to the isolation level of each hardware transmission interface, carrying out isolation processing on each hardware transmission interface with faults.
4. The method of claim 3, wherein the isolation levels include a boot isolation level, an operational isolation level, and a to-be-modified isolation level,
wherein the boot isolation level is used for indicating that after the server is booted, data transmission isolation is performed on the hardware transmission interface at the boot isolation level,
the operation isolation level is used for indicating that the hardware transmission interface at the operation isolation level is allowed to transmit the first target address or the data of the first target program when the server operates,
the isolation level to be corrected is used for indicating that the hardware transmission interface at the isolation level to be corrected is allowed to transmit the data of the second target address or the second target program when the server runs.
5. The method of claim 4, wherein the step of isolating each hardware transport interface that fails according to the isolation level of each hardware transport interface comprises:
if the isolation level of the hardware transmission interface is a starting isolation level, after the server is started or restarted, the hardware transmission interface is limited to execute data transmission isolation;
if the isolation level of the hardware transmission interface is an operation isolation level, only allowing the hardware transmission interface to transmit the data of the first target address or the first target program when the hardware transmission interface performs data transmission;
and if the isolation level of the hardware transmission interface is the isolation level to be corrected, allowing the hardware transmission interface to transmit the data of the second target address or the second target program when the hardware transmission interface transmits the data.
6. The method of claim 1, wherein the fault isolation mechanism is set by:
displaying a plurality of hardware transmission interfaces in the server in a fault isolation setting interface;
in response to selection of each hardware transmission interface, displaying a data type which can be transmitted by the interface and a connectable address of the interface in a first area of a display interface;
in response to a selection of a type of data that the interface can transfer and a selection of a connectable address of the interface, a fault isolation mechanism of the interface is formed.
7. A fault isolation apparatus for a server, the server comprising a plurality of hardware transmission interfaces, the apparatus comprising:
the system comprises a fault information acquisition module, a control module and a control module, wherein the fault information acquisition module is used for acquiring fault information of each hardware transmission interface of a server, which is faulty, when the server is powered on or restarted;
the isolation processing judging module is used for analyzing the fault information of each hardware transmission interface according to a fault isolation mechanism preset in the baseboard management controller by the basic input/output system and judging whether each hardware transmission interface with faults is allowed to execute isolation processing or not;
and the isolation processing execution module is used for allowing the hardware transmission interfaces to perform data transmission if judging that each hardware transmission interface with faults is not allowed to perform isolation processing, so that the server is powered on or restarted normally.
8. The apparatus of claim 7, wherein the apparatus further comprises:
the power-on or restarting completion judging module is used for judging whether the server is powered on or restarted;
the isolation level determining module is used for determining the isolation level of each hardware transmission interface with faults if the server is powered on or restarted;
and the interface isolation module is used for carrying out isolation processing on each hardware transmission interface with faults according to the isolation level of each hardware transmission interface.
9. An electronic device, comprising: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory in communication over the bus when the electronic device is running, the processor executing the machine-readable instructions to perform the steps of the method of any one of claims 1 to 6.
10. A computer-readable storage medium, characterized in that it has stored thereon a computer program which, when executed by a processor, performs the steps of the method according to any of claims 1 to 6.
CN202310639717.XA 2023-06-01 2023-06-01 Fault isolation method, device, equipment and medium of server Pending CN116701055A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310639717.XA CN116701055A (en) 2023-06-01 2023-06-01 Fault isolation method, device, equipment and medium of server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310639717.XA CN116701055A (en) 2023-06-01 2023-06-01 Fault isolation method, device, equipment and medium of server

Publications (1)

Publication Number Publication Date
CN116701055A true CN116701055A (en) 2023-09-05

Family

ID=87830526

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310639717.XA Pending CN116701055A (en) 2023-06-01 2023-06-01 Fault isolation method, device, equipment and medium of server

Country Status (1)

Country Link
CN (1) CN116701055A (en)

Similar Documents

Publication Publication Date Title
US10365961B2 (en) Information handling system pre-boot fault management
US10068015B2 (en) Monitoring a monitoring-target process
WO2018095107A1 (en) Bios program abnormal processing method and apparatus
US10275330B2 (en) Computer readable non-transitory recording medium storing pseudo failure generation program, generation method, and generation apparatus
WO2016206514A1 (en) Startup processing method and device
US11907073B2 (en) Method and apparatus for data interaction between BIOS and BMC of server, and device
US20220214945A1 (en) System Booting Method and Apparatus, Node Device, and Computer-Readable Storage Medium
CN108292342B (en) Notification of intrusions into firmware
KR20040047209A (en) Method for automatically recovering computer system in network and recovering system for realizing the same
US9448888B2 (en) Preventing a rollback attack in a computing system that includes a primary memory bank and a backup memory bank
CN112099825A (en) Method, device and equipment for upgrading component and storage medium
US20050033952A1 (en) Dynamic scheduling of diagnostic tests to be performed during a system boot process
US9411666B2 (en) Anticipatory protection of critical jobs in a computing system
US20200342109A1 (en) Baseboard management controller to convey data
CN116701055A (en) Fault isolation method, device, equipment and medium of server
JP2013109722A (en) Computer, computer system and failure information management method
CN107168819B (en) Method and device for restarting operating system
CN110781517B (en) Method for realizing data interaction by BIOS and BMC communication
WO2017072904A1 (en) Computer system and failure detection method
TWI554876B (en) Method for processing node replacement and server system using the same
JP5683354B2 (en) Monitoring device and monitoring method
CN114139168B (en) TPCM measuring method, device and medium
TWI715005B (en) Monitor method for demand of a bmc
TWI840907B (en) Computer system and method for detecting deviations, and non-transitory computer readable medium
JP7389877B2 (en) Network optimal boot path method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination