CN111949466A - System and method capable of quickly reading equipment information - Google Patents

System and method capable of quickly reading equipment information Download PDF

Info

Publication number
CN111949466A
CN111949466A CN202010951703.8A CN202010951703A CN111949466A CN 111949466 A CN111949466 A CN 111949466A CN 202010951703 A CN202010951703 A CN 202010951703A CN 111949466 A CN111949466 A CN 111949466A
Authority
CN
China
Prior art keywords
control unit
equipment
interrupt
devices
interrupt signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010951703.8A
Other languages
Chinese (zh)
Other versions
CN111949466B (en
Inventor
唐传贞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010951703.8A priority Critical patent/CN111949466B/en
Publication of CN111949466A publication Critical patent/CN111949466A/en
Application granted granted Critical
Publication of CN111949466B publication Critical patent/CN111949466B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2205Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2268Logging of test results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Bus Control (AREA)

Abstract

The invention discloses a system and a method capable of rapidly reading equipment information, wherein the system comprises a plurality of pieces of equipment, all the pieces of equipment are divided into a plurality of groups, each group is correspondingly connected to a merging unit, and the merging unit is connected to an interrupt pin of a control unit through an interrupt signal line; meanwhile, the devices within each subgroup are connected to the I2C of the control unit via the same I2C bus. The invention combines the interrupts of a plurality of devices into one interrupt, continues to use the interrupt mechanism, and the control unit can quickly identify the device area with the fault, ensure the quick response and ensure the timeliness of the problem response. According to the scheme, the abnormal interrupts of the plurality of devices are combined into one interrupt signal, so that the use of interrupt resources of the control unit can be reduced, materials with few resources can be selected when the control unit is used, and the material cost is reduced.

Description

System and method capable of quickly reading equipment information
Technical Field
The invention relates to the technical field of control system information query, in particular to a system and a method capable of quickly reading equipment information.
Background
With the continuous development of cloud computing, each large internet operator has more and more requirements on the product configuration and density of a server, and when the requirements on the product configuration and density are too high, a plurality of devices, such as up to four or more power supplies or devices such as hard disks, must exist in one server. Meanwhile, the performance and access timeliness requirements of each large internet operator on server products are higher and higher, for example, for the timeliness problem of all devices in a server accessed by a main control board with a control unit, the function requires that the control unit accesses all device information of the server through an I2C or other buses, including fault information, the conventional design mode is to access all devices through an I2C polling mode, so-called polling is to arrange all devices into a queue, and the control unit accesses the server sequentially from front to back according to the sequence of the queue. When a device behind the queue fails in the polling process, the device also needs to wait for an access command of the control unit to arrive and can send information to the control unit, and this way often results in failure information being not obtained in the first time, and even more, important information is missed. Therefore, for this situation, an interrupt mechanism is generally introduced, each device is configured with an interrupt pin, the interrupt signals of the devices are independent from each other, and the interrupt pins of the control unit are also independent. The control unit connects all devices together via an I2C bus, while keeping the I2C address of each device different. When any one device fails, the device sends a device abnormal interrupt signal to the control unit, when the control unit receives interrupt information corresponding to the interrupt pin, the control unit regards that the device fails, and the control unit accesses the device through the I2C bus to acquire specific fault information. The scheme can quickly acquire specific fault equipment, but the scheme needs to occupy enough interrupt resources, and when the interrupt resources of the control unit are not enough or the enough interrupt resources are not reserved in the reprinting project, the scheme has the defects that the control unit with higher cost is replaced or the project development cycle is prolonged.
In reality, a situation is encountered, and the time urgency requirement of the product development is high for the client. In this case, the development period needs to be shortened by the developers, and at this time, the best way is to reuse the existing product, or to make a partial change development, and make a slight change while ensuring the minimum development workload, which may result in the existing multiplexing part, for example, the main control board with the control unit, not being designed to reserve enough resources to accommodate the additional configuration device in the new requirement. Such as a multiplexed master control board not reserving enough interrupt response pins to handle the newly added device requirements.
Disclosure of Invention
The present invention is directed to solve the above-mentioned problems, and an object of the present invention is to provide a system and a method capable of quickly reading device information, where interrupts of multiple devices are combined into one interrupt, and an interrupt mechanism is continuously used, so that a control unit can quickly identify a device region where a failure occurs, and can ensure quick response, and ensure timeliness of problem response.
In order to achieve the purpose, the invention adopts the following technical scheme:
a system capable of rapidly reading equipment information comprises a plurality of pieces of equipment, wherein all the pieces of equipment are divided into a plurality of groups, each group is correspondingly connected to a merging unit, and the merging unit is connected to an interrupt pin of a control unit through an interrupt signal line; meanwhile, the devices within each subgroup are connected to the I2C of the control unit via the same I2C bus.
Each device within the group is connected to the merging unit by a respective device abort signal line.
There is at least one device in a subgroup.
When all the devices are grouped, grouping the devices according to the set requirement; the grouping can not be averaged, the grouping is averaged after redundant grouping is removed, and redundant grouping is independently performed.
A reading method using the system capable of quickly reading device information, comprising:
the merging unit merges equipment abnormal interrupt signals of one or more equipment into an interrupt signal which is sent to an interrupt pin of the control unit;
the devices in a subgroup are all connected to the I2C of the control unit through the same I2C bus;
when any equipment has a fault, the fault equipment sends an equipment abnormal interrupt signal to the corresponding merging unit, the merging unit outputs the interrupt signal to the control unit, after the control unit receives the interrupt signal information, a polling mechanism of an I2C bus is started, equipment in a group where the fault equipment is located is polled through the corresponding I2C bus, specific fault equipment is found out, and corresponding equipment information is read.
And a corresponding table is arranged in the control unit, and the table contains the corresponding relation between each interrupt signal and a specific equipment number.
The invention has the beneficial effects that:
1. the invention combines the interrupts of a plurality of devices into one interrupt, continues to use the interrupt mechanism, and the control unit can quickly identify the device area with the fault, ensure the quick response and ensure the timeliness of the problem response.
2. According to the scheme, the abnormal interrupts of the plurality of devices are combined into one interrupt signal, so that the use of interrupt resources of the control unit can be reduced, materials with few resources can be selected when the control unit is used, and the material cost is reduced.
3. Aiming at the version changing project, even if a newly added device is provided, the newly added device is connected to the merging unit to share the interrupt signal without increasing interrupt resources by a new board opening card, so that the project development period is shortened.
Drawings
FIG. 1 is a system diagram of the present invention.
Detailed Description
The invention is further described with reference to the following figures and examples.
When the number of newly added devices exceeds the reserved interrupt response pins in the main control board or the control unit in the main control board does not have enough interrupt response pins, the devices are merged and interrupted, so that a plurality of devices share one interrupt, and when the control unit receives the interrupt information, the control unit can poll the device list corresponding to the interrupt preferentially, thereby achieving the purposes of quickly accessing the devices and saving the interrupt resources of the control unit.
A system capable of quickly reading device information is shown in a system diagram as fig. 1, and the scheme combines polling and interrupt response mechanisms, specifically explaining by taking four devices as an example, device abnormal interrupt signals of a device 1 and a device 2 are respectively connected to a combining unit, the combining unit combines the device abnormal interrupt signals of the two devices into one interrupt signal 1, and device abnormal interrupt signals of a device 3 and a device 4 in the same way are combined into an interrupt signal 2 through the combining unit.
Device 1 and device 2 are connected to I2C of the control unit via the same I2C bus 1, and device 3 and device 4 are connected to I2C of the control unit via the same I2C bus 2. A corresponding table is designed in the control unit, and the table contains the corresponding relation between each interrupt signal and a specific equipment number.
A method for rapidly reading device information adopts a mechanism of first interrupt and then polling, when any device has a fault, for example, when the device 1 has a fault, the device 1 sends a device abnormal interrupt signal to a merging unit, the merging unit outputs the interrupt signal 1 to a control unit, when the control unit receives the interrupt information sent by the interrupt signal 1, a polling mechanism of I2C is started, and the device 1 and the device 2 are polled through an I2C bus 1 to find out a specific fault device.
The invention uses a merging unit to merge the interrupts of a plurality of devices into one interrupt signal, and sends the merged interrupt signal to a control unit. And after the control unit receives the interrupt signal, the control unit quickly responds and starts to poll and search specific equipment in the abnormal area. The control unit is provided with corresponding relation between devices and interrupt signals, and each interrupt signal corresponds to more than one device. The correspondence can assist the control unit to quickly find the abnormal device. The problem of slow access timeliness caused by polling can be solved, and the problems of resource waste and long development period caused by independent interruption of use of all equipment can also be solved.
Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the present invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive efforts by those skilled in the art based on the technical solution of the present invention.

Claims (6)

1. A system capable of quickly reading equipment information is characterized by comprising a plurality of pieces of equipment, wherein all the pieces of equipment are divided into a plurality of groups, each group is correspondingly connected to a merging unit, and the merging unit is connected to an interrupt pin of a control unit through an interrupt signal line; meanwhile, the devices within each subgroup are connected to the I2C of the control unit via the same I2C bus.
2. A system for enabling rapid reading of device information as claimed in claim 1, wherein each device within the group is connected to the merging unit via a respective device abort signal line.
3. A system for enabling rapid reading of device information as claimed in claim 1, characterized in that there is at least one device in a subgroup.
4. The system for fast reading device information according to claim 1, wherein all devices are grouped and are grouped according to a set requirement; the grouping can not be averaged, the grouping is averaged after redundant grouping is removed, and redundant grouping is independently performed.
5. A reading method using the system for quickly reading device information according to claim 1, comprising:
the merging unit merges equipment abnormal interrupt signals of one or more equipment into an interrupt signal which is sent to an interrupt pin of the control unit;
the devices in a subgroup are all connected to the I2C of the control unit through the same I2C bus;
when any equipment has a fault, the fault equipment sends an equipment abnormal interrupt signal to the corresponding merging unit, the merging unit outputs the interrupt signal to the control unit, after the control unit receives the interrupt signal information, a polling mechanism of an I2C bus is started, equipment in a group where the fault equipment is located is polled through the corresponding I2C bus, specific fault equipment is found out, and corresponding equipment information is read.
6. The method according to claim 5, wherein a table is provided in the control unit, the table containing a correspondence between each interrupt signal and a specific device number.
CN202010951703.8A 2020-09-11 2020-09-11 System and method capable of quickly reading equipment information Active CN111949466B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010951703.8A CN111949466B (en) 2020-09-11 2020-09-11 System and method capable of quickly reading equipment information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010951703.8A CN111949466B (en) 2020-09-11 2020-09-11 System and method capable of quickly reading equipment information

Publications (2)

Publication Number Publication Date
CN111949466A true CN111949466A (en) 2020-11-17
CN111949466B CN111949466B (en) 2023-01-06

Family

ID=73356315

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010951703.8A Active CN111949466B (en) 2020-09-11 2020-09-11 System and method capable of quickly reading equipment information

Country Status (1)

Country Link
CN (1) CN111949466B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1811480A (en) * 2005-01-26 2006-08-02 华为技术有限公司 Method and apparatus for real-time monitoring level signal
CN111258405A (en) * 2020-01-18 2020-06-09 苏州浪潮智能科技有限公司 Server mainboard burning prevention system and method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1811480A (en) * 2005-01-26 2006-08-02 华为技术有限公司 Method and apparatus for real-time monitoring level signal
CN111258405A (en) * 2020-01-18 2020-06-09 苏州浪潮智能科技有限公司 Server mainboard burning prevention system and method

Also Published As

Publication number Publication date
CN111949466B (en) 2023-01-06

Similar Documents

Publication Publication Date Title
US20160246866A1 (en) Distributed persistent memory using asynchronous streaming of log records
US11372688B2 (en) Resource scheduling method, scheduling server, cloud computing system, and storage medium
US8099522B2 (en) Arrangements for I/O control in a virtualized system
US5987538A (en) Apparatus for initiating generation of an inter-processor interrupt by a peripheral device not directly connected to any of the multi-processor local interrupt controllers
CN101120317A (en) Method, apparatus and system for dynamically reassigning memory from one virtual machine to another
US8041877B2 (en) Distributed computing utilizing virtual memory having a shared paging space
US8626978B2 (en) Computer apparatus, computer system and adapter carry-over method
JPH0661073B2 (en) How the multi-processor system works
JP5373893B2 (en) Configuration for storing and retrieving blocks of data having different sizes
US10445295B1 (en) Task-based framework for synchronization of event handling between nodes in an active/active data storage system
CN110213309B (en) Binding relationship management method, device and storage medium
JPH11161625A (en) Computer system
US20090307460A1 (en) Data Sharing Utilizing Virtual Memory
US20100100776A1 (en) Information processing apparatus, failure processing method, and recording medium in which failure processing program is recorded
CN114265753A (en) Management method and management system of message queue and electronic equipment
CN107145304B (en) Server, storage system and related method
CN111949466B (en) System and method capable of quickly reading equipment information
CN112543922A (en) Method and related device for improving reliability of storage system
CN109697179B (en) Hardware resource expansion system and hot plug management device
US20230137609A1 (en) Data synchronization method and apparatus
CN112612741B (en) Multi-path server
US20110153901A1 (en) Virtual usb key for blade server
US11093301B2 (en) Input output adapter error recovery concurrent diagnostics
US10726046B2 (en) Object storage system with ordered parallel replication
US9652429B2 (en) Horizontally expandable computing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant