CN114697187B

CN114697187B - Master selection method

Info

Publication number: CN114697187B
Application number: CN202210438232.XA
Authority: CN
Inventors: 不公告发明人
Original assignee: Muxi Technology Beijing Co ltd
Current assignee: Muxi Technology Beijing Co ltd
Priority date: 2022-04-25
Filing date: 2022-04-25
Publication date: 2022-12-02
Anticipated expiration: 2042-04-25
Also published as: CN114697187A

Abstract

The invention relates to a master selection method, which comprises the step S1 of acquiring topological structure information { G ] of a computing device when a server is powered on ₁ ，G ₂ ，…G _N If G ₁ ，G ₂ ，…G _N And connecting in sequence to form a linear topological structure, and executing the step S2: each G _i Generating a request packet R _i ，R _i Comprising G _i Id and G of _i Target parameter of G _i R is to be _i From G in a first direction of a linear topology _i‑1 In turn to G ₁ ，G _i R is to be _i From G in a second direction of the linear topology _i+1 In turn to G _N (ii) a S3, obtaining each G _j Received R emitted in a first direction along a linear topology _i Number M of _i1 And R emitted in a second direction of the linear topology _i Number M of _i2 If M is present _i1 =M _i2 Or M is _i1 +1=M _i2 Then G will be _i Determined as master. The invention improves the information interaction efficiency among a plurality of computing devices.

Description

Master selection method

Technical Field

The invention relates to the technical field of computers, in particular to a master selection method.

Background

In the normal operation process of AI operation devices such as a GPU, an FPGA, a DSP, and an AI accelerator in a server, power consumption needs to be maintained near Thermal Design Power (TDP for short), an existing server or a server cluster usually includes a plurality of operation devices, and how to reasonably distribute Power consumption for the plurality of operation devices is crucial in the operation process of the plurality of operation devices. In the related art, one master (master operation device) is generally selected from a plurality of operation devices, and power consumption is allocated to the plurality of operation devices by the master. However, the prior art has at least the following disadvantages: (1) In the prior art, a master is generally selected randomly, which may cause that the distance between part of computing equipment and the master is long, the delay is large, and the efficiency of power consumption distribution is affected. (2) In the prior art, in the process of allocating power consumption to a plurality of computing devices, static allocation is usually performed only based on the TDP values of the computing devices, and the real-time working state of each computing device is not fully considered, so that power consumption waste is caused, and the power consumption utilization rate is low. Therefore, how to determine a reasonable master in a plurality of computing devices and how to reasonably distribute power consumption among the plurality of computing devices become an urgent technical problem to be solved.

Disclosure of Invention

The invention aims to provide a master selection method, which is used for quickly selecting a master for a plurality of operation devices in a server when the server is powered on every time, and selecting the master with a central position by the plurality of operation devices, so that the delay of information interaction among the operation devices is minimum, and the information interaction efficiency among the operation devices is improved.

The invention provides a master selection method, which is applied to a scene of interaction of a plurality of computing devices and comprises the following steps:

step S1, when the server is powered on, obtaining topological structure information { G) of the computing equipment ₁ ，G ₂ ，…G _N If G ₁ ，G ₂ ，…G _N Sequentially connecting to form a linear topological structure, and executing the step S2;

step S2, each G _i Generating a request packet R _i ，R _i Comprising G _i Id and G of _i Target parameter of G _i R is to be _i From G in a first direction of a linear topology _i-1 In turn is transmitted to G ₁ ，G _i R is to be _i From G in a second direction of the linear topology _i+1 In turn is transmitted to G _N ；；

Step S3, obtaining each G _j Received R emitted in a first direction along a linear topology _i Number M of _i1 And R emitted in a second direction of the linear topology _i Number M of _i2 If M is present _i1 = M _i2 Or M is _i1 +1= M _i2 Then G will be _i And determining that the value ranges of i and j are 1 to N when the value range is the master, wherein N is the total number of the computing equipment in the server, and i is not equal to j.

Compared with the prior art, the invention has obvious advantages and beneficial effects. By means of the technical scheme, the master selection method provided by the invention can achieve considerable technical progress and practicability, has wide industrial utilization value and at least has the following advantages:

the invention can select the central master by the plurality of arithmetic devices when the server is powered on each time, so that the delay of information interaction among the arithmetic devices is minimum, the information interaction efficiency among the plurality of arithmetic devices is improved, and the master can be determined in a self-adaptive manner based on the number of the current arithmetic devices and the connection structure when the server is powered on each time because the number of the arithmetic devices is increased or reduced and then the arithmetic devices are powered on again.

The foregoing description is only an overview of the technical solutions of the present invention, and in order to make the technical means of the present invention more clearly understood, the present invention may be implemented in accordance with the content of the description, and in order to make the above and other objects, features, and advantages of the present invention more clearly understood, the following preferred embodiments are described in detail with reference to the accompanying drawings.

Drawings

FIG. 1 is a flowchart of a master selection method for a multi-computing device according to an embodiment of the present invention;

fig. 2 is a flowchart of a power consumption allocation method based on multiple computing devices according to an embodiment of the present invention.

Detailed Description

To further illustrate the technical means and effects of the present invention adopted to achieve the predetermined objects, the following detailed description will be given to the specific implementation and effects of the master selection method according to the present invention with reference to the accompanying drawings and preferred embodiments.

The first embodiment,

An embodiment provides a master selection method, which is applied to a scene where multiple computing devices interact with each other, as shown in fig. 1, and includes:

step S1, when the server is powered on, acquiring topological structure information { G of the computing equipment ₁ ，G ₂ ，…G _N If G ₁ ，G ₂ ，…G _N Sequentially connecting to form a linear topological structure, and executing the step S2;

when the server is powered on, that is, when the server is started, when the computing devices in the server need to increase, decrease or adjust the topology structure, the server needs to be shut down, and the power is re-powered on after the number of the computing devices is adjusted, and after the power is powered on, the method of the embodiment can also adaptively determine the optimal master based on the current latest number of the computing devices and the topology structure of the computing devices. The server is specifically a single server or a cluster formed by a plurality of servers, a CPU in the server may be connected to one or a plurality of computing devices, all the computing devices corresponding to the server are connected according to a preset topology structure, and the topology structure may be a linear topology structure, a full-interconnection topology structure, or a ring topology structure.

Step S2, each G _i Generating a request packet R _i ，R _i Comprising G _i Id and G of _i Target parameter of (1), G _i R is to be _i From G in a first direction of a linear topology _i-1 In turn to G ₁ ，G _i R is to be _i From G in a second direction of the linear topology _i+1 In turn to G _N ；

It is understood that the first direction and the second direction are opposite directions, and taking the id of the computing device as the first direction from large to small and as the second direction from small to large as an example, when i =1, G _i Capable of transferring R only in the second direction _i When i = n, G _i Capable of transferring R only in a first direction _i . The target parameters are determined according to a specific application scenario, for example, a master is selected for Power distribution among a plurality of computing devices, and then the corresponding target parameters are static Power consumption parameters, the static Power consumption parameters may specifically include peak Power (peak Power) and Thermal Design Power (short for) values of the computing devices, the peak Power of the computing devices is the maximum Power of the computing devices, the TDP value is the Power of the computing devices under normal operation, and after the operating devices in the same row normally operate, the Power may be maintained near the TDP, may reach the peak Power occasionally, but may also fall back to the TDP quickly. However, due to the influence of the load operation of the computing device and other factors, the power required by the computing device during the actual operation process is possibleAnd the power distribution is smaller than the TDP and can also be between the TDP and the peak power, so that the power distribution of the computing equipment needs to be adjusted in time based on the working state of the computing equipment, and the power waste is avoided.

Step S3, obtaining each G _j Received R emitted in a first direction of a linear topology _i Number M of _i1 And R emitted in a second direction of the linear topology _i Number M of _i2 If M is present _i1 = M _i2 Or M is _i1 +1= M _i2 Then G will be _i And determining that the value ranges of i and j are 1 to N, wherein N is the total number of the computing equipment in the server, and i is not equal to j.

As will be appreciated by those skilled in the art, steps S1-S3 are performed quickly after the server is powered on, allowing the master to be quickly determined. In addition, it should be noted that, in the first embodiment, determining the master is not limited to a scenario in which power consumption of multiple computing devices is allocated, and other scenarios in which the master needs to be selected are also within the protection scope of the present application, and the computing devices may specifically include an AI computing device such as a GPU, an FPGA, a DSP, and an AI accelerator.

The method of the embodiment one can quickly select the centrally-located master for the plurality of computing devices in the server when the server is powered on each time, so that the delay of information interaction among the computing devices is minimum, the information interaction efficiency among the plurality of computing devices is improved, and the master can be determined in a self-adaptive manner based on the number of the current computing devices and the connection structure each time the computing devices are powered on because the number of the computing devices is increased or reduced and then the computing devices are powered on again.

As an example, in the step S1, if G ₁ ，G ₂ ，…G _N Forming a full-interconnection topological structure, and directly adding a preset G _x Determined as master, x is a preset value, and x =1, 2, \8230, or N. It should be noted that, because any two computing devices in the full interconnect structure are connected, the delay effects of any one computing device are consistent, and as a preferred example, the value of x may be directly set to 1.

As an example, in step S1,if G is ₁ ，G ₂ ，…G _N Forming a ring topology, then set G _y And G _y+1 Does not directly transmit request data packet between them, and G ₁ ，G ₂ ，…G _N Converted into a linear topology, and then step S2 is performed, where y ranges from 1 to N, and when y = N, y +1 is set to 1. By converting the ring structure into a linear topology, it is sufficient to subsequently determine the master directly based on step S2 and step S3. It should be noted that converting the ring topology to the linear topology does not directly convert G to the linear topology _y And G _y+1 The physical connection between the two is disconnected, and G is set only in the process of determining the master _y And G _y+1 Does not directly transmit request data packet therebetween, and has the effect of temporary disconnection, G _y And G _y+1 Other data packets are not affected.

As an example, in the step S2, each G _i Generating a request packet R _i The method comprises the following steps:

step S21, each G _i Generating a request packet R based on a first packet format _i The first data packet format comprises a first packet header section and a first data section, the packet header section comprises a preset request identifier, and the first data section comprises G _i Id and G of _i The target parameter of (1).

It should be noted that the id of the computing device may not only include identification information of the computing device, but also include, for example, receiving address information of the computing device, and the like, where the receiving address information may be a target address for receiving a master parameter and may be directly used by a subsequent second data packet and/or a third data packet, but it is understood that the second data packet and/or the third data packet may also generate an id of a corresponding computing device according to a specific requirement. In addition, when each G _i Receive a terminal-connected G _i The transmitted request data packet is automatically transmitted to the other end one by one after the preset request identification is identified.

When the number of the computing equipment is too large, a master is directly selected, and the computing equipment far away from the master still has larger delay, so that the implementation is realizedFor example, in the step S1, if N is not less than N ₁ ，N ₁ Represents a first preset threshold, which is determined according to the specific application requirements and in combination with specific parameters of the computing device, and may be set to 16, for example. The method further comprises the following steps:

step S10, adding G ₁ ，G ₂ ，…G _N Divided into S groups L ₁ ，L ₂ ，…L _S },L _s Represents the S group, S has a value ranging from 1 to S, U _s Represents L _s Number of arithmetic devices of medium arithmetic device, max (U) _s )≤N ₂ ，N ₂ Representing a second predetermined threshold value, N ₂ <N ₁ ；

In the preferred embodiment, G is set as much as possible ₁ ，G ₂ ，…G _N Divided into S packets equally, e.g. N =32,n ₁ =16，N ₂ =8, then G ₁ ，G ₂ ，…G _N The division is into four groups of 8 arithmetic devices.

Step S20, obtaining each G _i Corresponding M _i1 And M _i2 And G _i The value range of the belonged grouping number h, h is 1 to S based on M _i1 、M _i2 、h、U _s Obtaining G _i Corresponding first intermediate parameter MA _i1 And a second intermediate parameter MA _i2 ：

;

Wherein M is obtained based on steps S2 and S3 _i1 And M _i2 And will not be described in detail herein.

Step S30, grouping each group L _s In, satisfy MA _i1 = MA _i2 Or MA _i1 +1= MA _i2 G of (A) _i Is determined as L _s Corresponding second master, determining the master obtained in step S3 as the first masterA master, the first master being a master of the second master.

As an embodiment, it is specifically possible to determine the first master through step S3, and then go through the first master to each G _i Sending S and U _s , G _i According to S and U _s Can determine G _i The grouping number h to which h belongs can be determined, for example, in such a way that when h satisfies M _i1 > sum(U _s ) Wherein s has a value range of [1, h-1 ]](ii) a And M _i2 >= sum(U _s ) Wherein the value range of s is [ h +1,S']When h is G _i The number of the belonging packet, wherein, sum (U) _s ) Represents U _s Corresponding to the sum of the values. It should be noted that the above is only one algorithm for determining h, and is not limited thereto. In addition, it is also possible to directly uniquely G at the time of grouping _i And setting a corresponding group number identifier, and directly determining a corresponding h value according to the group number identifier.

It should be noted that, still taking power allocation as an example, the second master first performs power allocation on all the first masters, and each first master performs power allocation on a plurality of computing devices in a group based on the allocated power consumption, thereby improving power allocation efficiency and reducing information transmission delay.

As an embodiment, the method further comprises:

s4, the master generates a master notification data packet according to a preset second data packet format, the second data packet format comprises a second header segment and a second data segment, the second header segment comprises a preset master notification identifier, and the second data segment comprises a G corresponding to the master _i Id of (2).

It is understood that after each computing device receives the master notification packet, the computing devices will be transferred one by one toward the computing device in the direction away from the master, and the details are not repeated herein.

And S4, each computing device can acquire the computing device id of the master so as to report corresponding information to the master subsequently, and each request data packet comprises the target parameters of the corresponding computing device, so that the master directly follows each R _i Obtained to G _i The target parameter of (1).

It can be understood that if the method further passes through the operations of grouping and grading determination of the masters in steps S10 to S30, the method in step S4 sends the computing device id of the first master to each second master, and each second master sends the corresponding computing device id to the computing devices in the group, so that the implementation details are consistent with step S4, and are not described herein again.

Besides the target parameter transmission among a plurality of computing devices, other interactive information can be transmitted among the connected computing devices, and as an embodiment, the method further includes:

step S100, G _i Generating an information interaction data packet according to a preset third data packet format, wherein the third data packet format comprises a third packet header section and a third data section, the third packet header section comprises an information interaction identifier, the third data section comprises an initiating terminal id, a receiving terminal id and target interaction information, and the initiating terminal id is used for storing G for generating the information interaction data packet _i The receiving end id is used for storing G of the information interaction data packet _i If the preamble master informs the G in the packet, it should be noted that _i The id of (2) already contains address information used by the third data packet, so that the corresponding receiving end id is not added in the third data packet, and the third data packet is directly sent based on the corresponding receiving end address information;

step S200, G _i And sending the information interaction data packet to a receiving end operation device.

The master according to the first embodiment may be directly applied to the power consumption allocation method based on the multiple computing devices according to the second embodiment, and may also be applied to other application scenarios requiring selection of the master.

Example II,

An embodiment two provides a power consumption allocation method based on multiple computing devices, as shown in fig. 2, including:

step C1, when the server is electrified, all G _i Sending static power consumption parameters to a master, the master being for each G _i Setting initial distributionPower consumption, the master being one of the plurality of computing devices corresponding to the server, G _i The value range of i is 1 to N, and N is the total number of the operation equipment in the server;

wherein the initial allocation of power may be based on G _i The static power consumption parameters and the total power consumption of the server are directly distributed, or default values can be directly set, and as the master can be quickly determined after power-on and the process of dynamically distributing power consumption is quickly entered, reasonable initial distribution power consumption can be set.

Note that the master satisfies all the operations G _i The sum of the time delays for transmitting the information to the master is the minimum value. The master can be directly set, or the master can be adaptively selected according to the method described in the first embodiment, which is not described herein again.

Step C2, G _i G is obtained at intervals of a preset first time interval _i Current { U } _i ,I _i ,T _i ,F _i }，G _i Based on the current { U _i ,I _i ,T _i ,F _i } and the current allocation power consumption PA _i Adjusting the internal power consumption of the computing device, wherein U _i Represents G _i Current value of voltage, I _i Represents G _i Current value of current, T _i Represents G _i Current temperature value, F _i Represents G _i A current frequency value;

it will be appreciated that through U _i ,I _i The instantaneous power, T, can be determined _i ,F _i Are all reacted with G _i Is proportional to the power consumption.

Step C3, G _i Every preset second time interval, based on the { U ] in the historical time window before the current time _i ,I _i ,T _i ,F _i { UV } Generation _i ,IV _i ,TV _i ,FV _i And sending to the master, UV _i Represents G _i Current period voltage value of IV _i Represents G _i Periodic current value of, TV _i Represents G _i Periodic temperature value of (FV) _i Represents G _i The second time interval is greater than the first time interval;

and preferably, the historical time window is equal to the second time interval.

Step C4, presetting total power consumption of the master based on the server and each G _i { UV of (1) } _i ,IV _i ,TV _i ,FV _i Each G _i Generating each G of the static power consumption parameters _i Current allocated power consumption PA _i ^’ Update PA _i = PA _i ^’ 。

In the second embodiment, the internal power consumption of the operation device is adjusted by the fine granularity in the operation device, the dynamic power consumption distribution among the operation devices is adjusted by combining the master coarse granularity, and the power consumption distribution counteracts the internal power consumption adjustment of each G, so that the power consumption utilization rate of the operation devices is improved, the power consumption waste is avoided, and the real-time reasonable distribution of the power consumption among the operation devices is realized.

As an example, in step C2, G _i Based on the current { U _i ,I _i ,T _i ,F _i And current allocation power consumption PA _i Adjusting the internal power consumption of the computing device, including:

step C21, if U _i *I _i -PA _i >PX _i ，PX _i Is G _i Corresponding to the floating threshold of power consumption, the control reduces G _i T of _i And/or F _i If PA _i -U _i *I _i >PX _i Then control to raise G _i T of _i And/or F _i . The temperature adjustment can be realized by adjusting a fan or a cooling device.

It will be understood that, under normal conditions, G _i The power consumption PA should be allocated at the present _i But because of each G _i Real-time changes in workload, etc., will cause G _i Current and PA of _i The difference is large, so that G can be controlled by adjusting the temperature and/or frequency in step C21 _i Power consumption, but it is understood that there is a process for power consumption adjustment, and each G _i The workload and other conditions can change in real time, so that dynamic allocation is needed to be performed in real time through the master based on the power consumption states of all the computing devices, the power consumption utilization rate is improved, and power consumption waste is avoided.

As an embodiment, in the step C3, the current time is based on { U ] in the historical time window before the current time _i ,I _i ,T _i ,F _i { UV } Generation _i ,IV _i ,TV _i ,FV _i And (4) the method comprises the following steps:

all the U in the historical time window before the current time _i Is determined as UV _i All of I _i Is determined as IV _i All of T _i Is determined as TV _i All of F _i Is determined as FV _i 。

As another example, in the step C3, the current time is based on { U ] in the historical time window before the current time _i ,I _i ,T _i ,F _i { UV } Generation _i ,IV _i ,TV _i ,FV _i And (4) the method comprises the following steps:

all U in the historical time window before the current time _i The maximum value in (A) is determined as UV _i All of I _i Is determined as IV _i All of T _i Is determined as TV _i All of F _i Is determined as FV _i 。

It is understood that the above-mentioned means of taking the average or maximum value are only two examples, and can also be based on { U }according to the application requirement _i ,I _i ,T _i ,F _i Take other reasonable values to generate { UV } _i ,IV _i ,TV _i ,FV _i There is no longer a single row.

As an example, the step C4 includes:

step C41, if UV _i *IV _i -PA _i ≥PB _i ，PB _i Assigning a floating threshold for power consumption, then G is set _i Dynamic adjustment coefficient k of _i =(UV _i *IV _i -PA _i )/ PA _i If PA _i -UV _i *IV _i ≥PB _i Then set k _i =-(PA _i -UV _i *IV _i )/ UV _i *IV _i If UV _i *IV _i -PA _i │<PB, then set k _i =0;

Step C42, adjusting the step length lambda and G according to the current dynamic state _i Obtaining the dynamic adjustment coefficient of G _i Current allocated power consumption PA _i ^’ =PA _i +k _i * λ, λ satisfies

Less than or equal to 0, updating PA _i = PA _i ^’ 。

G with margin for the current distribution power consumption can be ensured not to exceed the total power consumption through the steps C41 to C42 _i The current distribution power consumption is reduced, and G can be higher _i The current distribution power consumption is increased, the utilization rate of the power consumption is improved, and in addition, the equipment can run to reach the highest frequency within the allowed range of the power consumption, so that the task on the equipment can be completed more quickly, and the performance of the equipment is improved.

As an example, the first time interval is set to be in the order of milliseconds, and G is set to be within the first time interval _i Is smaller than a preset temperature change threshold, e.g. the first time interval takes 1ms. The second time interval is set to the order of seconds, for example the second time interval is set to 1s.

It should be noted that the same technical details that have already been described in the first embodiment are not described again in the second embodiment, and the technical details related to the first embodiment and the second embodiment may be used in combination.

It should be noted that some of the exemplary embodiments in the first and second embodiments are described as processes or methods depicted as flowcharts. Although a flowchart may describe the steps as a sequential process, many of the steps can be performed in parallel, concurrently or simultaneously. Additionally, the order of many of the steps may be rearranged. A process may be terminated when its operations are completed, but may have additional steps not included in the figure. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc.

An embodiment of the present invention further provides an electronic device, including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor and configured to perform a method according to an embodiment of the invention.

The embodiment of the invention also provides a computer-readable storage medium, and the computer instructions are used for executing the method of the embodiment of the invention.

Although the present invention has been described with reference to the preferred embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the present invention.

Claims

1. A master selection method is applied to a scene of interaction of a plurality of arithmetic devices, and comprises the following steps:

step S2, each G _i Generating a request packet R _i ，R _i Comprising G _i Id and G of _i Target parameter of (1), G _i R is to be _i From G in a first direction of a linear topology _i-1 In turn is transmitted to G ₁ ，G _i R is to be _i From G in a second direction of the linear topology _i+1 In turn is transmitted to G _N ；

Step S3, obtaining each G _j Received R emitted in a first direction of a linear topology _i Number M of _i1 And R emitted in a second direction of the linear topology _i Number M of _i2 If M is present _i1 ＝M _i2 Or M is _i1 +1＝M _i2 Then G will be _i And determining that the value ranges of i and j are 1 to N, wherein N is the total number of the computing equipment in the server, and i is not equal to j.

2. The method of claim 1,

in the step S1, if G ₁ ，G ₂ ，…G _N Forming a full-interconnection topological structure, and directly connecting the preset G _x Determined as master, x is a preset value, and x =1, 2, \8230, or N.

3. The method of claim 2,

the value of x is 1.

4. The method of claim 1,

in the step S1, if G ₁ ，G ₂ ，…G _N Forming a ring topology, then set G _y And G _y+1 Does not directly transmit request data packet between them, and G ₁ ，G ₂ ，…G _N Converted into a linear topology, and then step S2 is performed, where y ranges from 1 to N, and when y = N, y +1 is set to 1.

5. The method of claim 1,

in the step S1, if N is more than or equal to N ₁ ，N ₁ Which is indicative of a first pre-set threshold value,the method further comprises the following steps:

step S10, adding G ₁ ，G ₂ ，…G _N Divided into S packets { L } ₁ ，L ₂ ，…L _S },L _s Represents the S group, S has a value ranging from 1 to S, U _s Represents L _s Number of arithmetic devices, max (U) _s )≤N ₂ ，N ₂ Representing a second predetermined threshold value, N ₂ <N ₁ ；

Step S30, grouping each group L _s In, satisfy MA _i1 ＝MA _i2 Or MA _i1 +1＝MA _i2 G of (A) _i Is determined as L _s And the corresponding second master determines the master obtained in the step S3 as the first master, wherein the first master is the main master of the second master.

6. The method of claim 5,

the first master first allocates power consumption to all the second masters, and each second master allocates power consumption to a plurality of arithmetic devices in a group based on the allocated power consumption.

7. The method of claim 5,

in the step S10, G is added ₁ ，G ₂ ，…G _N Are equally divided into S packets.

8. The method of claim 1,

the operation equipment is a GPU, an FPGA, a DSP or an AI accelerator.