CN113448785A - Method, device and equipment for processing bandwidth state exception and readable medium - Google Patents

Publication number
CN113448785A
Authority
CN
China
Prior art keywords
bmc
pcie
command
state information
bandwidth
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110594327.6A
Other languages
Chinese (zh)
Other versions
CN113448785B (en)
Inventor
刘涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Yingxin Computer Technology Co Ltd
Original Assignee
Shandong Yingxin Computer Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Yingxin Computer Technology Co Ltd filed Critical Shandong Yingxin Computer Technology Co Ltd
Priority to CN202110594327.6A priority Critical patent/CN113448785B/en
Publication of CN113448785A publication Critical patent/CN113448785A/en
Application granted granted Critical
Publication of CN113448785B publication Critical patent/CN113448785B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/22: Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2247: Verification or detection of system hardware configuration
    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a method for handling a bandwidth state exception, comprising the following steps: reading the PCIE card state of a slot through the CPLD, reading bandwidth state information through the PCIE Switch, and determining whether the bandwidth state information is consistent with the PCIE card state; if not, unbinding the slot and reconfiguring the bandwidth state information based on the PCIE card state; sending command request content to the PCIE Switch, reading the return data that the PCIE Switch obtains by executing the command request content, and executing this step in a loop based on the return data; in response to the number of loop executions of the previous step not exceeding a preset number of loops and the return data read three consecutive times being consistent, treating the return data as successfully read and further determining, based on the return data, whether the current bandwidth state information is consistent with the PCIE card state; and if the current bandwidth state information is inconsistent with the PCIE card state, unbinding the slot and reconfiguring the current bandwidth state information based on the PCIE card state. The invention also discloses a corresponding processing device, computer equipment and a readable storage medium.

Description

Method, device and equipment for processing bandwidth state exception and readable medium
Technical Field
The present invention relates to the field of server technologies, and in particular, to a method, an apparatus, a device, and a readable medium for handling a bandwidth status exception.
Background
In server products, to improve market competitiveness, a high-performance server needs to support multiple x16 external PCIE cards so that the product can be configured flexibly.
A resource read failure while a server is in use is unacceptable for a server product. The server industry requires that servers neither go down nor lose data, which means that server products have extremely high stability requirements.
At present, the BMC reads the external card status provided by a CPLD register. If the bandwidth configuration information that the BMC reads from the Switch is inconsistent with the external card information, the BMC unbinds the current slot, reconfigures the Switch, and powers it on again. This unbinding and binding of the card in the current slot occurs only after the equipment is powered on for the first time, once the BMC has finished starting and before the BIOS scans the PCIE resources; as long as the whole machine is not powered down, the process is not triggered again. However, while the operating system restarts, the configured Switch may become abnormal even though the card continues to operate on the PCIE resources allocated by the BIOS, and the system may then fail to recognize the card because the misconfigured resources are unavailable. An abnormal bandwidth function of the PCIE chip can therefore affect application vendors, which harms the enterprise's competitiveness and lowers the detection efficiency and yield of the product.
Disclosure of Invention
In view of this, an object of the embodiments of the present invention is to provide a method, an apparatus, a device, and a readable medium for handling a bandwidth state exception that optimize the processing flow and the reliability of the read data, solve the problem of reduced server stability caused by an unstable third-party firmware chip, improve the stability of the server product, avoid serious card loss, other faults, or even downtime caused by data reception and data processing, and reduce the labor and material costs that such problems incur.
Based on the above object, one aspect of the embodiments of the present invention provides a method for handling a bandwidth state exception, including the following steps: the BMC reads the PCIE card state of a slot through the CPLD, reads bandwidth state information through the PCIE Switch, and determines whether the bandwidth state information is consistent with the PCIE card state; if the bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the bandwidth state information based on the PCIE card state; the BMC sends command request content to the PCIE Switch, reads the return data that the PCIE Switch obtains by executing the command request content, and executes this step in a loop based on the return data; in response to the number of loop executions of the previous step not exceeding a preset number of loops and the return data read three consecutive times being consistent, the return data is considered successfully read, and whether the current bandwidth state information is consistent with the PCIE card state is further determined based on the return data; and if the current bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the current bandwidth state information based on the PCIE card state.
In some embodiments, sending, by the BMC, a command request content to the PCIE Switch, reading return data obtained by the PCIE Switch executing based on the command request content, and executing this step in a loop based on the return data includes: sending a command number to the PCIE Switch by the BMC, and judging whether the command number is sent successfully or not; if the command number is successfully sent, the BMC sends command request content corresponding to the command number to the PCIE Switch, and judges whether the command request content is successfully sent; if the command request content is successfully sent, the BMC queries whether the thread is successfully received from the PCIE Switch, and if the thread is successfully received, the BMC queries whether the command is successfully executed from the PCIE Switch; and if the command is successfully executed, reading data from the PCIE Switch by the BMC.
In some embodiments, further comprising: if the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC; and if the command request content corresponding to the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC.
In some embodiments, further comprising: if the thread receiving fails, the BMC queries whether the thread receiving is successful from the PCIE Switch again; and returning to the step of sending a command number to the PCIE Switch by the BMC in response to the number of times of executing the previous step exceeding three times.
In some embodiments, further comprising: if the command is being executed, the BMC queries whether the command is successfully executed from the PCIE Switch again; if the command execution fails, returning to the step of sending a command number to the PCIE Switch by the BMC.
In some embodiments, the method further includes: if the current bandwidth state information is consistent with the PCIE card state, skipping the unbinding and binding steps.
In some embodiments, the BMC reading the PCIE card state of the slot through the CPLD includes: the BMC receives the slot signal through the CPLD and reads the PCIE card state of the slot based on the slot signal, where the PCIE card state is one of absent (no card in place), x8 card, and x16 card.
In another aspect of the embodiments of the present invention, a device for processing a bandwidth status exception is further provided, including: the first module is configured to read the PCIE card state of the slot position through the CPLD by the BMC, read bandwidth state information through the PCIE Switch, and judge whether the bandwidth state information is consistent with the PCIE card state; a second module configured to unbind the slot by the BMC and reconfigure the bandwidth state information based on the PCIE card state if the bandwidth state information is inconsistent with the PCIE card state; a third module, configured to send a command request content to the PCIE Switch by the BMC, and read return data obtained by the PCIE Switch executing based on the command request content; a fourth module, configured to respond that the number of times of executing the previous step in a loop does not exceed a preset number of loop times, and the returned data read for three consecutive times are consistent, consider that the returned data is successful, and further determine whether the current bandwidth state information is consistent with the PCIE card state based on the returned data; and a fifth module configured to, if the current bandwidth state information is inconsistent with the PCIE card state, unbundle by the BMC and reconfigure the current bandwidth state information based on the PCIE card state.
In another aspect of the embodiments of the present invention, there is also provided a computer device, including: at least one processor; and a memory storing computer instructions executable on the processor, the instructions when executed by the processor implementing method steps comprising: the BMC reads the PCIE card state of a slot through the CPLD, reads bandwidth state information through the PCIE Switch, and determines whether the bandwidth state information is consistent with the PCIE card state; if the bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the bandwidth state information based on the PCIE card state; the BMC sends command request content to the PCIE Switch, reads the return data that the PCIE Switch obtains by executing the command request content, and executes this step in a loop based on the return data; in response to the number of loop executions of the previous step not exceeding a preset number of loops and the return data read three consecutive times being consistent, the return data is considered successfully read, and whether the current bandwidth state information is consistent with the PCIE card state is further determined based on the return data; and if the current bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the current bandwidth state information based on the PCIE card state.
In some embodiments, sending, by the BMC, a command request content to the PCIE Switch, reading return data obtained by the PCIE Switch executing based on the command request content, and executing this step in a loop based on the return data includes: sending a command number to the PCIE Switch by the BMC, and judging whether the command number is sent successfully or not; if the command number is successfully sent, the BMC sends command request content corresponding to the command number to the PCIE Switch, and judges whether the command request content is successfully sent; if the command request content is successfully sent, the BMC queries whether the thread is successfully received from the PCIE Switch, and if the thread is successfully received, the BMC queries whether the command is successfully executed from the PCIE Switch; and if the command is successfully executed, reading data from the PCIE Switch by the BMC.
In some embodiments, further comprising: if the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC; and if the command request content corresponding to the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC.
In some embodiments, further comprising: if the thread receiving fails, the BMC queries whether the thread receiving is successful from the PCIE Switch again; and returning to the step of sending a command number to the PCIE Switch by the BMC in response to the number of times of executing the previous step exceeding three times.
In some embodiments, further comprising: if the command is being executed, the BMC queries whether the command is successfully executed from the PCIE Switch again; if the command execution fails, returning to the step of sending a command number to the PCIE Switch by the BMC.
In some embodiments, the method steps further include: if the current bandwidth state information is consistent with the PCIE card state, skipping the unbinding and binding steps.
In some embodiments, the BMC reading the PCIE card state of the slot through the CPLD includes: the BMC receives the slot signal through the CPLD and reads the PCIE card state of the slot based on the slot signal, where the PCIE card state is one of absent (no card in place), x8 card, and x16 card.
In a further aspect of the embodiments of the present invention, a computer-readable storage medium is also provided, which stores a computer program that, when executed by a processor, implements the above method steps.
The invention has the following beneficial technical effects: it optimizes the processing flow, the reliability of the read data, and the handling of abnormal scenarios; it solves the problem of reduced server stability caused by an unstable third-party firmware chip and thereby improves the stability of the server product; it avoids serious card loss, other faults, or even downtime caused by data reception and data processing; and it reduces the labor and material costs that such problems incur.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other embodiments can be obtained by using the drawings without creative efforts.
FIG. 1 is a diagram illustrating a method for handling a bandwidth status exception according to an embodiment of the present invention;
FIG. 2 is a diagram of an embodiment of a device for handling a bandwidth status exception according to the present invention;
FIG. 3 is a schematic diagram of an embodiment of a computer device provided by the present invention;
FIG. 4 is a schematic diagram of an embodiment of a computer-readable storage medium provided by the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the following embodiments of the present invention are described in further detail with reference to the accompanying drawings.
It should be noted that all expressions using "first" and "second" in the embodiments of the present invention serve to distinguish two different entities or two different parameters that share the same name. "First" and "second" are used merely for convenience of description and should not be construed as limiting the embodiments of the present invention; this is not repeated in the following embodiments.
In view of the foregoing, a first aspect of the embodiments of the present invention provides an embodiment of a method for handling a bandwidth status exception. Fig. 1 is a schematic diagram illustrating an embodiment of a method for handling a bandwidth status exception according to the present invention. As shown in fig. 1, the embodiment of the present invention includes the following steps:
S01: the BMC reads the PCIE card state of the slot through the CPLD, reads the bandwidth state information through the PCIE Switch, and determines whether the bandwidth state information is consistent with the PCIE card state;
S02: if the bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the bandwidth state information based on the PCIE card state;
S03: the BMC sends command request content to the PCIE Switch, reads the return data that the PCIE Switch obtains by executing the command request content, and executes this step in a loop based on the return data;
S04: in response to the number of loop executions of the previous step not exceeding the preset number of loops and the return data read three consecutive times being consistent, the return data is considered successfully read, and whether the current bandwidth state information is consistent with the PCIE card state is further determined based on the return data; and
S05: if the current bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the current bandwidth state information based on the PCIE card state.
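Steps S01 to S05 can be pictured as a small control flow for one slot. The following is a minimal sketch, not the patented implementation: the four callables (`read_cpld_state`, `read_switch_bandwidth`, `rebind`, `verified_read`) are hypothetical hooks standing in for the BMC's hardware access, and the string results are illustrative names only.

```python
from enum import Enum


class CardState(Enum):
    ABSENT = "absent"
    X8 = "x8"
    X16 = "x16"


def handle_slot(read_cpld_state, read_switch_bandwidth, rebind, verified_read):
    """Sketch of S01-S05 for a single slot. All four callables are hypothetical."""
    card = read_cpld_state()                 # S01: card state reported by the CPLD
    if read_switch_bandwidth() != card:      # S01: compare with the Switch's view
        rebind(card)                         # S02: unbind and reconfigure
    current = verified_read()                # S03/S04: stable read, or None on failure
    if current is None:
        return "read_failed"                 # no trusted data, so nothing is changed
    if current != card:
        rebind(card)                         # S05: unbind and reconfigure again
        return "rebound"
    return "consistent"
```

Passing the hardware access in as callables keeps the sketch testable without a real BMC; for example, `handle_slot(lambda: CardState.X16, lambda: CardState.X8, print, lambda: CardState.X16)` rebinds once in S02 and then reports `"consistent"`.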
In this embodiment, the BMC reads the external card status provided by the CPLD register, which has three possible values: absent (no card in place), x8 PCIE card, and x16 PCIE card. After the BMC reads that the current slot holds an x16 PCIE card, it reads the bandwidth configuration information of the Switch. If the Switch reports that the PCIE card in the current slot is not x16, the BMC unbinds the PCIE card in the current slot, reconfigures it as an x16 card, and then powers the card on. This unbinding and binding of the card in the current slot occurs only after the equipment is powered on for the first time, once the BMC has finished starting and before the BIOS scans the PCIE resources. As long as the whole machine is not powered down, the process is not triggered again. On this basis, the embodiment adds the following: exception handling for the flow in which the BMC reads the x16 state of the Switch; a protocol format check on the data returned by the Switch; a validity check on the returned data; and handling of abnormal scenarios after the data is returned.
In this embodiment, the BMC first queries the CPLD for the state of the PCIE card in the current slot; the CPLD informs the BMC that the current slot holds an x16 card; the BMC instantiates the Switch x16 status; and the BMC applies exception handling to the flow that reads the x16 state of the Switch. After data is returned successfully, the BMC stores it locally and reads the data two more times. If the three reads are consistent, the read data is considered correct. Otherwise, the BMC keeps reading in a loop until three consecutive reads are consistent, at which point the data is considered correct; if the loop count reaches the maximum without three consecutive consistent reads, the read is considered to have failed.
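The "three consecutive consistent reads" rule can be sketched as a streak counter with a bounded loop. This is an illustrative sketch only: `read_once` is a hypothetical callable that returns the bandwidth value or `None` on a failed read, and the default of 10 cycles is an assumption, since the text only says the maximum is preset.

```python
def read_until_stable(read_once, required=3, max_cycles=10):
    """Return a value seen `required` times in a row, or None if max_cycles runs out.

    `read_once` is a hypothetical hook; `max_cycles=10` is an assumed default.
    """
    last = None
    streak = 0
    for _ in range(max_cycles):
        value = read_once()
        if value is None:            # a failed read breaks the streak
            last, streak = None, 0
            continue
        if value == last:
            streak += 1
        else:                        # a different value starts a new streak
            last, streak = value, 1
        if streak >= required:
            return value
    return None                      # never stabilized: treat as a read error
```

With reads of `x8, x16, x16, x16` the function returns `"x16"` on the fourth read; with values that keep alternating it returns `None` once the cycle budget is spent.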
In this embodiment, the outcomes are handled as follows. If the data is returned successfully and indicates the x16 state, the BMC has already bound the card correctly, so unbinding and binding are skipped. If the data is returned successfully and indicates the x8 state, the card has not been bound correctly, so the BMC unbinds and rebinds it. If the data is returned successfully but indicates neither the x16 nor the x8 state, the Switch has returned erroneous data; the BMC treats this as an abnormal scenario and takes no action. If the data return fails, the Switch has likewise returned erroneous data; the BMC treats this as an abnormal scenario and takes no action.
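The four scenarios above amount to a small decision function. The sketch below assumes, as in the text, that the card in the slot is x16; the `card` parameter and the action names are hypothetical additions for illustration.

```python
def decide_action(read_ok, bandwidth, card="x16"):
    """Map a Switch read result to the BMC's action. Action names are illustrative."""
    if not read_ok:
        return "abnormal_no_action"      # data return failed
    if bandwidth not in ("x8", "x16"):
        return "abnormal_no_action"      # Switch returned unrecognized data
    if bandwidth == card:
        return "skip"                    # already bound correctly
    return "unbind_and_rebind"           # e.g. x8 reported for an x16 card
```

Keeping both abnormal cases on the same branch mirrors the text: the BMC deliberately does nothing rather than reconfigure the Switch from data it cannot trust.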
In this embodiment, data reliability is achieved by an algorithm that is easy to implement and highly stable. It reduces card loss for external cards in the server, solves the problem of reduced server stability caused by an unstable third-party firmware chip, and avoids serious card loss, other faults, or even downtime caused by data reception and data processing. This improves the stability of the server product and reduces the labor and material costs that such problems incur.
In some embodiments of the present invention, sending, by the BMC, a command request content to the PCIE Switch, reading return data obtained by the PCIE Switch executing based on the command request content, and executing the step cyclically based on the return data includes: the BMC sends a command number to the PCIE Switch and judges whether the command number is sent successfully or not; if the command number is successfully sent, the BMC sends command request content corresponding to the command number to the PCIE Switch, and judges whether the command request content is successfully sent; if the command request content is successfully sent, the BMC queries whether the thread is successfully received from the PCIE Switch, and if the thread is successfully received, the BMC queries whether the command is successfully executed from the PCIE Switch; if the command is successfully executed, the BMC reads data from the PCIE Switch.
In this embodiment, the BMC sends a CMD command number to the Switch; if the command number is sent successfully, the flow proceeds to the next step, otherwise the BMC exits and restarts the flow. The BMC then sends the command request content corresponding to the command number to the Switch; if it is sent successfully, the flow proceeds, otherwise the BMC exits and restarts the flow. Next, the BMC queries the Switch as to whether the Switch's thread received the command successfully; if reception failed, the BMC retries, and if the third retry also fails, the BMC exits and sends the command to the Switch again. The BMC then queries the Switch as to whether the command executed successfully. If the Switch reports that execution failed, the BMC exits and restarts the flow. If the Switch reports that the command is still executing, the BMC waits and queries again; if the query reports success within the specified time, the flow proceeds, otherwise the BMC reports a command timeout and restarts the flow. Finally, the BMC reads data from the Switch; the return data includes the bandwidth state (x8/x16) of the PCIE card, the issued CMD command number, and the issued command request data.
In this embodiment, the BMC checks whether the returned command number matches the issued command number and whether the returned request data matches the issued request data. If both match, the returned bandwidth state data of the PCIE card is considered correct; otherwise the check fails and the flow is restarted.
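One pass through this command flow can be sketched as follows. Everything about the Switch interface here is hypothetical: the method names, the `"ok"`/`"busy"`/`"failed"` status strings, the one-second timeout, and the `FakeSwitch` stand-in exist only to make the sketch runnable. The sketch allows three reception queries, which is one reading of the retry limit in the text. A return value of `None` means the caller should restart the flow from the command number.

```python
import time


def run_command_once(switch, cmd_no, request, recv_queries=3,
                     exec_timeout_s=1.0, poll_s=0.01):
    """One attempt at the command flow; returns data, or None to signal a restart."""
    if not switch.send_command_number(cmd_no):
        return None
    if not switch.send_request(cmd_no, request):
        return None
    for _ in range(recv_queries):            # bounded retries for thread reception
        if switch.thread_received():
            break
    else:
        return None                          # reception never confirmed
    deadline = time.monotonic() + exec_timeout_s
    while time.monotonic() < deadline:
        status = switch.exec_status()
        if status == "ok":
            return switch.read_data()
        if status == "failed":
            return None
        time.sleep(poll_s)                   # still executing: wait and query again
    return None                              # command timed out


class FakeSwitch:
    """Hypothetical stand-in for the PCIE Switch, used only to exercise the sketch."""

    def __init__(self, statuses, recv=(True,)):
        self._statuses = iter(statuses)
        self._recv = iter(recv)

    def send_command_number(self, cmd_no):
        return True

    def send_request(self, cmd_no, request):
        return True

    def thread_received(self):
        return next(self._recv, False)

    def exec_status(self):
        return next(self._statuses, "busy")

    def read_data(self):
        return {"bandwidth": "x16"}
```

For example, `run_command_once(FakeSwitch(["busy", "ok"]), 0x10, b"\x01")` polls once while the command is executing and then returns the data, while a switch that keeps answering `"busy"` yields `None` after the timeout.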
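The echo check on the returned command number and request data might look like the sketch below. The `SwitchReply` layout is an assumption made for illustration; the source only states which three items the return data contains, not how they are encoded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SwitchReply:
    """Hypothetical container for the three items the Switch returns."""
    bandwidth: str
    cmd_no: int
    request: bytes


def validate_reply(reply, sent_cmd_no, sent_request):
    """Return the bandwidth state if the echoed fields match, else None (restart)."""
    if reply.cmd_no != sent_cmd_no:
        return None                  # reply belongs to a different command
    if reply.request != sent_request:
        return None                  # request data was altered or mismatched
    return reply.bandwidth
```

Echoing the command number and request lets the BMC reject a stale or corrupted reply before it trusts the bandwidth field.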
In some embodiments of the invention, further comprising: if the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC; and if the command request content corresponding to the command number is not successfully sent, returning to the step of sending the command number to the PCIE Switch by the BMC.
In some embodiments of the invention, further comprising: if the thread receiving fails, the BMC queries whether the thread receiving is successful from the PCIE Switch again; and in response to the number of times of executing the previous step exceeding three times, returning to the step of sending the command number to the PCIE Switch by the BMC.
In some embodiments of the invention, further comprising: if the command is being executed, the BMC queries whether the command is successfully executed from the PCIE Switch again; if the command fails to be executed, returning to the step of sending the command number to the PCIE Switch by the BMC.
In some embodiments of the invention, the method further includes: if the current bandwidth state information is consistent with the PCIE card state, skipping the unbinding and binding steps.
In some embodiments of the present invention, the BMC reading the PCIE card state of the slot through the CPLD includes: the BMC receives the slot signal through the CPLD and reads the PCIE card state of the slot based on the slot signal, where the PCIE card state is one of absent (no card in place), x8 card, and x16 card.
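Decoding a slot signal into one of the three card states could be sketched as below. The register layout is entirely hypothetical: the source does not specify how the CPLD encodes the slot signal, so the two-bits-per-slot encoding and the bit values are assumptions chosen only to make the example concrete.

```python
from enum import Enum


class CardState(Enum):
    ABSENT = "absent"
    X8 = "x8"
    X16 = "x16"


# Hypothetical encoding: two bits per slot. Real CPLD layouts are board-specific.
_SLOT_BITS = {0b00: CardState.ABSENT, 0b01: CardState.X8, 0b10: CardState.X16}


def decode_slot_signal(register_value, slot_index):
    """Extract the two bits for `slot_index` and map them to a CardState."""
    bits = (register_value >> (2 * slot_index)) & 0b11
    state = _SLOT_BITS.get(bits)
    if state is None:
        raise ValueError(f"unrecognized slot signal {bits:#04b} for slot {slot_index}")
    return state
```

Under this assumed layout, a register value of `0b1001` decodes to an x8 card in slot 0 and an x16 card in slot 1.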
It should be particularly noted that, the steps in the embodiments of the processing method for a bandwidth status exception described above may be mutually intersected, replaced, added, or deleted, and therefore, the processing method for a bandwidth status exception based on these reasonable permutation and combination transformations shall also belong to the scope of the present invention, and shall not limit the scope of the present invention to the embodiments.
In view of the above object, according to a second aspect of the embodiments of the present invention, an apparatus for handling a bandwidth status exception is provided. Fig. 2 is a schematic diagram illustrating an embodiment of a device for handling a bandwidth status exception according to the present invention. As shown in fig. 2, the embodiment of the present invention includes the following modules: a first module S11, configured to read, by the BMC, the PCIE card state of the slot through the CPLD, and read bandwidth state information through the PCIE Switch, and determine whether the bandwidth state information is consistent with the PCIE card state; a second module S12 configured to, if the bandwidth state information is inconsistent with the PCIE card state, unbundle the slot by the BMC and reconfigure the bandwidth state information based on the PCIE card state; a third module S13, configured to send a command request content to the PCIE Switch by the BMC, read return data obtained by the PCIE Switch executing based on the command request content, and execute this module cyclically based on the return data; a fourth module S14, configured to respond that the number of times of executing the previous module in a loop does not exceed a preset number of cycles, and if the returned data read for three consecutive times are consistent, consider that the returned data is successful, and further determine whether the current bandwidth state information is consistent with the PCIE card state based on the returned data; and a fifth module S15 configured to, if the current bandwidth state information is inconsistent with the PCIE card state, unbundle by the BMC and reconfigure the current bandwidth state information based on the PCIE card state.
In view of the above object, a third aspect of the embodiments of the present invention provides a computer device. Fig. 3 is a schematic diagram of an embodiment of a computer device provided by the present invention. As shown in fig. 3, an embodiment of the present invention includes: at least one processor S21; and a memory S22, the memory S22 storing computer instructions S23 executable on the processor, the instructions when executed by the processor implementing method steps comprising: the BMC reads the PCIE card state of the slot position through the CPLD, reads the bandwidth state information through the PCIE Switch, and judges whether the bandwidth state information is consistent with the PCIE card state; if the bandwidth state information is inconsistent with the PCIE card state, the BMC unbundles the slot position and reconfigures the bandwidth state information based on the PCIE card state; the BMC sends command request content to the PCIE Switch, reads return data obtained by the PCIE Switch based on the command request content and circularly executes the step based on the return data; responding to the condition that the number of times of executing the last step in a circulating mode does not exceed the preset number of times of circulating, and continuously reading the returned data for three times to be consistent, considering that the returned data is successful, and further judging whether the current bandwidth state information is consistent with the state of the PCIE card or not based on the returned data; and if the current bandwidth state information is inconsistent with the PCIE card state, the BMC unbundles and reconfigures the current bandwidth state information based on the PCIE card state.
In some embodiments of the present invention, the BMC sending command request content to the PCIE Switch, reading the return data obtained by the PCIE Switch executing the command request content, and executing this step cyclically based on the return data includes: the BMC sends a command number to the PCIE Switch and determines whether the command number was sent successfully; if the command number was sent successfully, the BMC sends the command request content corresponding to the command number to the PCIE Switch and determines whether the command request content was sent successfully; if the command request content was sent successfully, the BMC queries the PCIE Switch as to whether thread reception succeeded, and if thread reception succeeded, the BMC queries the PCIE Switch as to whether the command was executed successfully; and if the command was executed successfully, the BMC reads data from the PCIE Switch.
Some embodiments of the invention further include: if the command number is not sent successfully, returning to the step in which the BMC sends the command number to the PCIE Switch; and if the command request content corresponding to the command number is not sent successfully, returning to the step in which the BMC sends the command number to the PCIE Switch.
Some embodiments of the invention further include: if thread reception fails, the BMC again queries the PCIE Switch as to whether thread reception succeeded; and in response to the previous step having been executed more than three times, returning to the step in which the BMC sends the command number to the PCIE Switch.
Some embodiments of the invention further include: if the command is still being executed, the BMC again queries the PCIE Switch as to whether the command was executed successfully; and if the command fails to execute, returning to the step in which the BMC sends the command number to the PCIE Switch.
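The command handshake and its fallback paths can be sketched as a small retry loop. This is an illustrative sketch only: the `link` object and its methods (`send_number`, `send_request`, `thread_received`, `exec_status`, `read_data`), the `ExecStatus` enum, the `FlakyLink` stand-in, and the bound on execution-status polls are hypothetical and do not correspond to any real BMC or PCIE Switch API. The text specifies only the order of the steps, the limit of three reception queries, and that every failure returns to sending the command number.

```python
from enum import Enum


class ExecStatus(Enum):
    RUNNING = "running"
    OK = "ok"
    FAILED = "failed"


def run_command(link, number, payload, max_attempts=5,
                max_receive_polls=3, max_exec_polls=100):
    """Run one command against the switch; return its data or None."""
    for _ in range(max_attempts):
        if not link.send_number(number):
            continue                      # resend the command number
        if not link.send_request(number, payload):
            continue                      # restart from the command number
        # Query reception up to three times, then restart.
        if not any(link.thread_received() for _ in range(max_receive_polls)):
            continue
        status = ExecStatus.RUNNING
        for _ in range(max_exec_polls):   # assumption: the text gives no poll bound
            status = link.exec_status()
            if status is not ExecStatus.RUNNING:
                break
        if status is ExecStatus.OK:
            return link.read_data()
        # FAILED (or still running at the poll bound): restart from the number.
    return None


class FlakyLink:
    """Hypothetical stand-in whose first few command-number sends fail."""

    def __init__(self, fail_first_sends=1, data=b"\x10"):
        self.fail_first_sends = fail_first_sends
        self.sends = 0
        self.polls = 0
        self.data = data

    def send_number(self, number):
        self.sends += 1
        return self.sends > self.fail_first_sends

    def send_request(self, number, payload):
        return True

    def thread_received(self):
        return True

    def exec_status(self):
        self.polls += 1
        return ExecStatus.RUNNING if self.polls < 3 else ExecStatus.OK

    def read_data(self):
        return self.data
```

For example, `run_command(FlakyLink(fail_first_sends=2), 1, b"")` resends the command number until the third attempt succeeds and then returns the stand-in's data. Bounding the outer loop with `max_attempts` mirrors the preset number of cycles mentioned for the verification step.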
Some embodiments of the invention further include: if the current bandwidth state information is consistent with the PCIE card state, skipping the unbinding and binding steps.
In some embodiments of the present invention, the BMC reading the PCIE card state of the slot through the CPLD includes: the BMC receives a slot signal through the CPLD and reads the PCIE card state of the slot based on the slot signal, where the PCIE card state is one of a not-in-place (card absent) state, an x8 card, and an x16 card.
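As a rough illustration of mapping a slot signal to one of the three card states, the following Python sketch decodes a raw value into an enum. The two-bit encoding (0b00, 0b01, 0b10) is purely hypothetical, since the text does not specify how the CPLD encodes the slot signal, and the names `CardState` and `decode_slot_signal` are likewise assumptions.

```python
from enum import Enum


class CardState(Enum):
    ABSENT = "absent"   # no card in the slot
    X8 = "x8"
    X16 = "x16"


# Hypothetical encoding of the slot signal in its two low-order bits.
_SIGNAL_TO_STATE = {
    0b00: CardState.ABSENT,
    0b01: CardState.X8,
    0b10: CardState.X16,
}


def decode_slot_signal(raw: int) -> CardState:
    """Map a raw slot signal to a card state, rejecting unknown codes."""
    code = raw & 0b11
    try:
        return _SIGNAL_TO_STATE[code]
    except KeyError:
        raise ValueError(f"unrecognised slot signal code: {code:#04b}") from None
```

Rejecting an unknown code, rather than defaulting to "absent", keeps a wiring or firmware fault from being mistaken for an empty slot.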
The invention also provides a computer-readable storage medium. Fig. 4 is a schematic diagram of an embodiment of a computer-readable storage medium provided by the present invention. As shown in Fig. 4, the computer-readable storage medium S31 stores a computer program S32 that, when executed by a processor, performs the method described above.
Finally, it should be noted that, as one of ordinary skill in the art will appreciate, all or part of the processes of the methods of the above embodiments can be implemented by a computer program instructing related hardware. The program of the method for handling a bandwidth state exception can be stored in a computer-readable storage medium and, when executed, can include the processes of the method embodiments described above. The storage medium may be a magnetic disk, an optical disk, a Read Only Memory (ROM), a Random Access Memory (RAM), or the like. The embodiments of the computer program may achieve the same or similar effects as any of the above-described method embodiments.
Furthermore, the methods disclosed according to embodiments of the present invention may also be implemented as a computer program executed by a processor, and the computer program may be stored in a computer-readable storage medium. When executed by the processor, the computer program performs the above-described functions defined in the methods disclosed in embodiments of the invention.
Further, the above method steps and system elements may also be implemented using a controller and a computer readable storage medium for storing a computer program for causing the controller to implement the functions of the above steps or elements.
Those of skill would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as software or hardware depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the disclosed embodiments of the present invention.
In one or more exemplary designs, the functions may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over a computer-readable medium as one or more instructions or code. Computer-readable media include both computer storage media and communication media, including any medium that facilitates transfer of a computer program from one place to another. A storage medium may be any available medium that can be accessed by a general-purpose or special-purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, include Compact Disc (CD), laser disc, optical disc, Digital Versatile Disc (DVD), floppy disk, and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
The foregoing is an exemplary embodiment of the present disclosure, but it should be noted that various changes and modifications could be made herein without departing from the scope of the present disclosure as defined by the appended claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. Furthermore, although elements of the disclosed embodiments of the invention may be described or claimed in the singular, the plural is contemplated unless limitation to the singular is explicitly stated.
It should be understood that, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that "and/or" as used herein is meant to include any and all possible combinations of one or more of the associated listed items.
The numbers of the embodiments disclosed in the embodiments of the present invention are merely for description, and do not represent the merits of the embodiments.
It will be understood by those skilled in the art that all or part of the steps for implementing the above embodiments may be implemented by hardware, or may be implemented by a program instructing relevant hardware, and the program may be stored in a computer-readable storage medium, and the above-mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk, etc.
Those of ordinary skill in the art will understand that the discussion of any embodiment above is exemplary only and is not intended to imply that the scope of the disclosure of the embodiments of the invention, including the claims, is limited to these examples. Within the idea of the embodiments of the invention, technical features in the above embodiment or in different embodiments may also be combined, and there are many other variations of the different aspects of the embodiments of the invention as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of the embodiments of the present invention are intended to be included within the scope of the embodiments of the present invention.

Claims (10)

1. A method for handling a bandwidth state exception, characterized by comprising the following steps:
reading, by the BMC, the PCIE card state of a slot through the CPLD, reading bandwidth state information through the PCIE Switch, and determining whether the bandwidth state information is consistent with the PCIE card state;
if the bandwidth state information is inconsistent with the PCIE card state, unbinding the slot by the BMC and reconfiguring the bandwidth state information based on the PCIE card state;
sending, by the BMC, command request content to the PCIE Switch, reading return data obtained by the PCIE Switch executing the command request content, and executing this step cyclically based on the return data;
in response to the number of cyclic executions of the previous step not exceeding a preset number of cycles and the return data read three consecutive times being consistent, considering the return data to have been read successfully, and further determining, based on the return data, whether the current bandwidth state information is consistent with the PCIE card state; and
if the current bandwidth state information is inconsistent with the PCIE card state, unbinding the slot by the BMC and reconfiguring the current bandwidth state information based on the PCIE card state.
2. The method for handling a bandwidth state exception according to claim 1, wherein the sending, by the BMC, command request content to the PCIE Switch, reading return data obtained by the PCIE Switch executing the command request content, and executing this step cyclically based on the return data includes:
sending, by the BMC, a command number to the PCIE Switch, and determining whether the command number was sent successfully;
if the command number was sent successfully, sending, by the BMC, the command request content corresponding to the command number to the PCIE Switch, and determining whether the command request content was sent successfully;
if the command request content was sent successfully, querying, by the BMC, the PCIE Switch as to whether thread reception succeeded, and if thread reception succeeded, querying, by the BMC, the PCIE Switch as to whether the command was executed successfully; and
if the command was executed successfully, reading, by the BMC, data from the PCIE Switch.
3. The method for handling a bandwidth state exception according to claim 2, further comprising:
if the command number is not sent successfully, returning to the step of sending the command number to the PCIE Switch by the BMC; and
if the command request content corresponding to the command number is not sent successfully, returning to the step of sending the command number to the PCIE Switch by the BMC.
4. The method for handling a bandwidth state exception according to claim 2, further comprising:
if thread reception fails, querying again, by the BMC, the PCIE Switch as to whether thread reception succeeded; and
in response to the previous step having been executed more than three times, returning to the step of sending the command number to the PCIE Switch by the BMC.
5. The method for handling a bandwidth state exception according to claim 2, further comprising:
if the command is still being executed, querying again, by the BMC, the PCIE Switch as to whether the command was executed successfully; and
if the command fails to execute, returning to the step of sending the command number to the PCIE Switch by the BMC.
6. The method for handling a bandwidth state exception according to claim 1, further comprising:
if the current bandwidth state information is consistent with the PCIE card state, skipping the unbinding and binding steps.
7. The method for handling a bandwidth state exception according to claim 1, wherein the reading, by the BMC, the PCIE card state of the slot through the CPLD includes:
receiving, by the BMC, a slot signal through the CPLD, and reading the PCIE card state of the slot based on the slot signal, wherein the PCIE card state is one of a not-in-place (card absent) state, an x8 card, and an x16 card.
8. A device for handling a bandwidth state exception, comprising:
a first module, configured so that the BMC reads the PCIE card state of a slot through the CPLD, reads bandwidth state information through the PCIE Switch, and determines whether the bandwidth state information is consistent with the PCIE card state;
a second module, configured so that, if the bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the bandwidth state information based on the PCIE card state;
a third module, configured so that the BMC sends command request content to the PCIE Switch, reads return data obtained by the PCIE Switch executing the command request content, and executes this module cyclically based on the return data;
a fourth module, configured so that, in response to the number of cyclic executions of the previous module not exceeding a preset number of cycles and the return data read three consecutive times being consistent, the return data is considered to have been read successfully, and whether the current bandwidth state information is consistent with the PCIE card state is further determined based on the return data; and
a fifth module, configured so that, if the current bandwidth state information is inconsistent with the PCIE card state, the BMC unbinds the slot and reconfigures the current bandwidth state information based on the PCIE card state.
9. A computer device, comprising:
at least one processor; and
a memory storing computer instructions executable on the processor, the instructions when executed by the processor implementing the steps of the method of any one of claims 1 to 7.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.
CN202110594327.6A 2021-05-28 2021-05-28 Method, device and equipment for processing bandwidth state exception and readable medium Active CN113448785B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110594327.6A CN113448785B (en) 2021-05-28 2021-05-28 Method, device and equipment for processing bandwidth state exception and readable medium

Publications (2)

Publication Number Publication Date
CN113448785A true CN113448785A (en) 2021-09-28
CN113448785B CN113448785B (en) 2023-03-28

Family

ID=77810376

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110594327.6A Active CN113448785B (en) 2021-05-28 2021-05-28 Method, device and equipment for processing bandwidth state exception and readable medium

Country Status (1)

Country Link
CN (1) CN113448785B (en)

Patent Citations (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101702141A (en) * 2009-11-18 2010-05-05 成都市华为赛门铁克科技有限公司 Method and device supporting dynamic working mode of high-density card
US20120151112A1 (en) * 2010-12-09 2012-06-14 Dell Products, Lp System and Method for Mapping a Logical Drive Status to a Physical Drive Status for Multiple Storage Drives Having Different Storage Technologies within a Server
CN103763129A (en) * 2013-12-31 2014-04-30 华为技术有限公司 PCIe equipment management method and device
CN104202194A (en) * 2014-09-10 2014-12-10 华为技术有限公司 Configuration method and device of PCIe (peripheral component interface express) topology
CN104503933A (en) * 2014-12-03 2015-04-08 浪潮集团有限公司 PCIE automatic switching design based on multi-path server system
US20170359275A1 (en) * 2016-06-13 2017-12-14 Dell Products, Lp Virtual PCIe Based Ethernet Controller for In-band Communication Between Server and BMC
CN206147604U (en) * 2016-07-26 2017-05-03 浪潮电子信息产业股份有限公司 PCIE switches module based on automatic GPU that matches of openpower platform
CN106557407A (en) * 2016-11-14 2017-04-05 腾讯科技(深圳)有限公司 A kind of monitoring method and device of apparatus of load
CN206312134U (en) * 2016-12-29 2017-07-07 郑州云海信息技术有限公司 A kind of switching device suitable for multipath server
CN107608926A (en) * 2017-08-18 2018-01-19 郑州云海信息技术有限公司 One kind supports PCIE bandwidth automatic switching control equipment and method based on server
CN107590050A (en) * 2017-09-04 2018-01-16 杭州迪普科技股份有限公司 A kind of method and device for checking external plug-in unit exception
CN109445997A (en) * 2018-06-29 2019-03-08 郑州云海信息技术有限公司 A kind of production line test method, system, medium and the equipment of PCIE Switch product
CN109144860A (en) * 2018-08-08 2019-01-04 广州云测信息技术有限公司 The operating method and terminal device of a kind of pair of control object
CN109271177A (en) * 2018-08-21 2019-01-25 郑州云海信息技术有限公司 A kind of method and system of automatic load PCIE Switch products configuration parameter
CN110399267A (en) * 2019-07-24 2019-11-01 苏州浪潮智能科技有限公司 A kind of server PCIE device monitoring method, system, equipment and readable storage medium storing program for executing
CN111176414A (en) * 2019-12-13 2020-05-19 苏州浪潮智能科技有限公司 Power supply method and device for server network card and readable medium
CN111338907A (en) * 2020-03-09 2020-06-26 山东超越数控电子股份有限公司 Remote state monitoring system and method of PCIE (peripheral component interface express) equipment
CN111488233A (en) * 2020-04-02 2020-08-04 苏州浪潮智能科技有限公司 Method and system for processing bandwidth loss problem of PCIe device
CN112256539A (en) * 2020-09-18 2021-01-22 苏州浪潮智能科技有限公司 PCIE link error statistical method, device, terminal and storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114003535A (en) * 2021-10-14 2022-02-01 苏州浪潮智能科技有限公司 Equipment bandwidth configuration method and system, electronic equipment and storage medium
CN114003535B (en) * 2021-10-14 2023-07-14 苏州浪潮智能科技有限公司 Device bandwidth configuration method and system, electronic device and storage medium
CN113688087A (en) * 2021-10-25 2021-11-23 苏州浪潮智能科技有限公司 PCIE (peripheral component interface express) device enumeration method, system, storage medium and device

Also Published As

Publication number Publication date
CN113448785B (en) 2023-03-28

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant