RU2556458C2

RU2556458C2 - Network having extended generalised hypercube topology

Info

Publication number: RU2556458C2
Application number: RU2013112594/08A
Authority: RU
Inventors: Михаил Федорович Каравай; Виктор Сергеевич Подлазов; Владимир Владимирович Соколов
Priority date: 2013-03-21
Filing date: 2013-03-21
Publication date: 2015-07-10
Also published as: RU2013112594A

Abstract

FIELD: physics, computer engineering.

SUBSTANCE: invention relates to high-performance multiprocessor computer systems. A system network with a topology of an extended n-dimensional R-ary generalised hypercube n≥2, R≥3, is characterised by that the network has the appearance of an n-dimensional R-ary extended generalised hypercube of Rⁿ nodes, wherein each node contains a subscriber and an (n+1)×(n+1) full switch, and to link the nodes of each of the n×R^n-1 edges of the generalised hypercube, there is an extended self-routing non-blocking R×R switch consisting of m×m switches, m≥2, each of the R ports of which is connected by one channel with the (n+1)×(n+1) switch of each of the R nodes of the edge, wherein the remaining port of each of the switches is connected to the subscriber of its node.

EFFECT: providing reliable high-efficiency networks with a large number of processor nodes.

5 dwg, 2 tbl

Description

Заявленное изобретение относится к области высокопроизводительных многопроцессорных вычислительных систем. Изобретение может быть применено для организации системных сетей многопроцессорных систем с большим числом процессорных узлов (абонентов), в которых требуется быстрая параллельная передача информации между абонентами с минимальным числом промежуточных скачков (запоминаний в буферах) (Arimili B., Arimili R., Chung V., et al. The PERCS High-Performance Interconnect // 18th IEEE Symposium on High Performance Interconnects. 2009. P. 75 - 82. Alverson R., Roweth D. and Kaplan L. The Gemini System Interconnect // 18th IEEE Symposium on High Performance Interconnects. 2009. P. 83 - 87).The claimed invention relates to the field of high-performance multiprocessor computing systems. The invention can be used to organize system networks of multiprocessor systems with a large number of processor nodes (subscribers), which require fast parallel transmission of information between subscribers with a minimum number of intermediate jumps (memorization in buffers) ( Arimili B., Arimili R., Chung V. .., et al The PERCS High -Performance Interconnect // 18th IEEE Symposium on High Performance Interconnects 2009. P. 75 - 82. Alverson R., Roweth D. and Kaplan L. The Gemini System Interconnect // 18th IEEE Symposium on High Performance Interconnects. 2009. P. 83 - 87).

В таких сетях процессорные узлы (связка многоядерных процессоров) используются в тесной связке с многопортовыми связными узлами. Множество портов в них требуется для возможно большего распараллеливания системной сети, объединяющей связные узлы. Топология связей узлов сети, соответствующая полному графу, обеспечивает неблокируемость и самомаршрутизируемость при прокладывании маршрутов. Это свойство позволяет иметь бесконфликтное расписание на произвольной перестановке пакетов между вершинами графа, которое составляется в каждой вершине независимо от других вершин.In such networks, processor nodes (a bunch of multicore processors) are used in close conjunction with multi-port connected nodes. Many ports in them are required to parallelize the system network connecting the connected nodes as much as possible. The topology of node network connections corresponding to the full graph ensures non-blocking and self-routing when laying routes. This property allows you to have a conflict-free schedule on an arbitrary permutation of packets between the vertices of the graph, which is compiled at each vertex independently of other vertices.

Известна системная сеть суперкомпьютера, который разрабатывался IBM для проекта Blue Waters (Arimili B., Arimili R., Chung V., et al. The PERCS High-Performance Interconnect // 18th IEEE Symposium on High Performance Interconnects. 2009. P. 75 - 82. Hruska J. After Years of Work IBM, NCSA Cancel «BlueWaters» Supercomputer // Aug. 2011. URL: http://hothardware.com/News/After-Years-of-Work-IBM-NCSA-Cancel-Blue-Waters-Supercomputer/). Системная сеть имеет топологию двухуровневого полного графа (фиг. 1). Каждый узел связи (node) имеет 47 межузловых каналов трех видов с разной пропускной способностью. Суперузел (supernode) образуется из 32 узлов связи, которые связаны в нем по схеме полного графа. Суперузлы связаны также по схеме полного графа. Каждый суперузел имеет 512 каналов. В максимальной конфигурации суперкомпьютера каждый такой канал используется для связи с другим суперузлом по схеме полного графа. В этом случае он содержит 513 суперузлов. Передача пакета между любыми двумя узлами занимает не более трёх смен каналов с промежуточной буферизацией пакетов (скачков). The system network of a supercomputer, which was developedIbmfor the projectBlue waters(Arimili B., Arimili R., Chung V., et al. The PERCS High-Performance Interconnect // 18th IEEE Symposium on High Performance Interconnects. 2009.P. 75 - 82.Hruska J. After Years of Work IBM, NCSA Cancel “BlueWaters” Supercomputer // Aug. 2011. URL: http://hothardware.com/News/After-Years-of-Work-IBM-NCSA-Cancel-Blue-Waters-Supercomputer/). The system network has the topology of a two-level complete graph (Fig. 1). Each communication node (node) has 47 inter-nodal channels of three types with different throughputs. Supergirl (supernode) is formed from 32 communication nodes, which are connected in it according to the scheme of a complete graph. Super nodes are also connected according to the full graph scheme. Each supernode has 512 channels. In the maximum configuration of a supercomputer each such channel is used for communication with another supernode according to the full graph scheme. In this case, he contains 513 super nodes. A packet transfer between any two nodes takes no more than three channel changes with intermediate packet buffering (hopping).

Недостатками данной системной сети являются неоптимальное использование портов узла связи для создания связей внутри суперузлов и между суперузлами, а также отсутствие резервных трехскачковых путей, что приводит к ухудшению отказоустойчивости системной сети. The disadvantages of this system network are the non-optimal use of the ports of the communication node to create connections within the super nodes and between the super nodes, as well as the lack of backup three-hop paths, which leads to a deterioration in the fault tolerance of the system network.

Другим примером иерархической сети такого класса является Иерархический толстый гиперкуб, базирующийся на подсетях с топологией гиперкуба (US 5669008 A, 16.09.1997, Hierarchical fat hypercube architecture for parallel processing systems. United States Patent Patent Number 5,669,008. Issued September 16, 1997). Эта сеть выбрана в качестве прототипа. Она строилась для удовлетворения требованиям масштабируемости, высокой пропускной способности, малого диаметра и высокой бисекционной пропускной способности, отражающей параллельность сети. Величина бисекции B определяется как минимальное число каналов «точка-точка» между множествами, разделенных пополам, абонентов. Another example of a hierarchical network of this class is the Hierarchical fat hypercube based on subnets with the hypercube topology (US 5669008 A, September 16, 1997, Hierarchical fat hypercube architecture for parallel processing systems. United States Patent Patent Number 5,669,008. Issued September 16, 1997). This network is selected as a prototype. It was built to meet the requirements of scalability, high bandwidth, small diameter and high bisection bandwidth, reflecting the parallel network. Bisection size B is determined as the minimum number of channels "point-to-point" between the sets divided in half subscribers.

В иерархическом толстом гиперкубе (ИТГ) абоненты подсоединяются к коммутаторам, расположенным в узлах подсетей первого уровня, а те соединяются через подсети второго уровня. Подсети первого уровня, так же как и второго, имеют топологию гиперкуба (группа гиперкубов второго уровня называется авторами метакубом). Пример толстого иерархического гиперкуба с базовыми гиперкубами размерности 2 на 16 абонентов представлен на фиг. 2. На Фиг. 2 гиперкубы первого уровня называются ГК1-1, ГК1-2, ГК1-3 и ГК1-4, а метакуб состоит из гиперкубов ГК2-1, ГК2-2, ГК2-3 и ГК2-4. При этом ГК2-1 связывает первые узлы гиперкубов первого уровня, ГК2-2 - вторые узлы гиперкубов первого уровня и т.д. На Фиг. 2 для наглядности показаны связи только для ГК2-1 и ГК2-4. Соединения ГК2-2, ГК2-3 с гиперкубами нижнего уровня аналогичны. In a hierarchical thick hypercube (ITG), subscribers are connected to switches located in the nodes of the first-level subnets, and those are connected via the second-level subnets. The subnets of the first level, as well as the second, have the topology of the hypercube (the group of hypercubes of the second level is called the authors of the metacub). An example of a thick hierarchical hypercube with basic hypercubes of dimension 2 for 16 subscribers is shown in FIG. 2. In FIG. The two first level hypercubes are called GK1-1, GK1-2, GK1-3, and GK1-4, and the metacub consists of the hypercubes GK2-1, GK2-2, GK2-3, and GK2-4. In this case, GK2-1 connects the first nodes of the first level hypercubes, GK2-2 - the second nodes of the first level hypercubes, etc. In FIG. 2, for clarity, only connections for GK2-1 and GK2-4 are shown. Compounds GK2-2, GK2-3 with lower level hypercubes are similar.

Недостатками ИТГ являются высокая схемная и проводная сложность из-за наличия большого числа коммутаторов в метакубе, сравнительно большой диаметр из-за использования обычных гиперкубов и наличие конфликтов при параллельной передаче в гиперкубах.The disadvantages of ITG are the high circuit and wire complexity due to the large number of switches in the metacube, the relatively large diameter due to the use of conventional hypercubes, and the presence of conflicts during parallel transmission in hypercubes.

Задачей изобретения является упрощение сетей с большим числом процессорных узлов (абонентов), в которых требуется быстрая параллельная передача информации между абонентами, уменьшение числа скачков в них и уменьшение числа конфликтов при параллельной передаче.The objective of the invention is to simplify networks with a large number of processor nodes (subscribers), which require fast parallel transmission of information between subscribers, reducing the number of jumps in them and reducing the number of conflicts in parallel transmission.

При осуществлении изобретения может быть получен следующий технический результат:When carrying out the invention, the following technical result can be obtained:

1. По сравнению с иерархически толстым коммутатором - это, во-первых, уменьшение сложности и, во-вторых, уменьшение диаметра, уменьшение числа конфликтов, увеличение бисекционной пропускной способности и, как следствие, уменьшение задержeк передачи данных по сети.1. Compared with a hierarchically thick switch, this is, firstly, a decrease in complexity and, secondly, a decrease in diameter, a decrease in the number of conflicts, an increase in bisection bandwidth and, as a result, a decrease in the delay in data transmission over the network.

2. По сравнению с сетями, использующими полнографовые связи - это увеличение числа узлов (примерно в квадрат раз), а также повышение отказоустойчивости при сохранении числа проводов и диаметра; или уменьшение числа проводов (примерно в корень квадратный раз) при сохранении числа узлов и диаметра, или многократное увеличение пропускной способности и отказоустойчивости системной сети при сохранении числа узлов.2. Compared with networks using full-graph communications, this means an increase in the number of nodes (approximately a square time), as well as an increase in fault tolerance while maintaining the number of wires and diameter; or a decrease in the number of wires (approximately a square root) while maintaining the number of nodes and diameter, or a multiple increase in the throughput and fault tolerance of the system network while maintaining the number of nodes.

Технический результат достигается тем, что системная сеть с топологией расширенного n-мерного R-ичного обобщенного гиперкуба, n ≥ 2, R ≥ 3, имеет вид n-мерного R-ичного расширенного обобщенного гиперкуба из R ⁿ узлов, при этом в каждом узле расположен абонент и полный коммутатор (n+1)×(n+1), а для связи узлов каждого из n×R ^n-1 ребер обобщённого гиперкуба имеется расширенный самомаршрутизируемый неблокируемый коммутатор R×R, состоящий из коммутаторов m×m, m ≥ 2, каждый из R портов которого соединен одним каналом с коммутатором (n+1)×(n+1) каждого из R узлов ребра, причём оставшийся порт каждого из коммутаторов подсоединен к абоненту своего узла.The technical result is achieved by the fact that a system network with a topology extendedn-dimensionalR- a generalized hypercube,n ≥2R ≥ 3, has the formn-dimensionalR-ary extended generalized hypercube fromR ⁿ nodes, while in each node there is a subscriber and a full switch (n +1) × (n +1), and for the connection of the nodes of each ofn×R ^n-1 edges of the generalized hypercube there is an extended self-routing non-blocking switchR×Rconsisting of switchesm×m,m ≥2, each ofRwhich ports are connected by one channel to the switch (n +1) × (n +1) each ofR nodes of the rib, and the remaining port of each of the switches is connected to the subscriber of its node.

На фиг.1 представлена Структура системной сети Blue Waters. Figure 1 presents the structure of the system network Blue Waters.

На фиг.2 представлена Структура системной сети Иерархического толстого гиперкуба, ИТГ, размерности 2 на 16 абонентов.Figure 2 presents the structure of the system network of the Hierarchical thick hypercube, ITG, dimension 2 for 16 subscribers.

На фиг.3 изображён граф обобщённого гиперкуба, ОГК, с числом вершин 16, размерности 2, арности 4.Figure 3 shows a graph of a generalized hypercube, OGK, with the number of vertices 16, dimension 2, arity 4.

На фиг.4 представлен Расширенный коммутатор РК(7,4,2) из коммутаторов 4×4 и разветвителей/объединителей 1×4.Figure 4 presents the Extended switchboard PK (7,4,2) of the switches 4 × 4 and splitters / combiners 1 × 4.

На фиг. 5 представлена Сеть с топологией обобщенного расширенного гиперкуба РОГК(49, 2, 7, 4, 2).In FIG. Figure 5 shows the Network with the topology of the generalized extended hypercube ROGK (49, 2, 7, 4, 2).

Предлагаемая системная сеть имеет топологию, основанную на расширении топологии обобщённого гиперкуба (ОГК), названной расширенным обобщённым гиперкубом (РОГК). Для аппаратной реализации связей между абонентами используются неблокируемые самомаршрутизируемые расширенные коммутаторы (РК), полученные согласно «Способу построения неблокируемого самомаршрутизируемого расширенного коммутатора (патент ⁽¹⁹⁾RU⁽¹¹⁾ 2 435 295⁽¹³⁾ C2)», предложенному авторами ранее.The proposed system network has a topology based on the extension of the generalized hypercube (OGK) topology, called the extended generalized hypercube (ROGK). For the hardware implementation of communication between subscribers, non-blocking self-routing extended switches (PK) are used, obtained according to the “Method for constructing a non-blocking self-routing extended switch (patent ⁽¹⁹⁾ RU ⁽¹¹⁾ 2,435,295 ⁽¹³⁾ C2)” proposed by the authors earlier.

Обобщенный гиперкуб, ОГК(V, d, s), обычно задается как d-мерный s-ичный гиперкуб, который имеет V=s ^d узлов степени d(s-1) каждый, размещенных в строках по s узлов, задающих «ребра» d-мерного куба. «Ребра» здесь понимаются не в графовом, а в геометрическом смысле. Наоборот, в графовом смысле «ребро» есть полный граф, т.е. все s узлов одного геометрического «ребра» связаны s(s-1) графовыми ребрами. Пример такого обобщённого гиперкуба ОГК(16, 2, 4) представлен на фиг.3. Содержательно ОГК можно представить как результат тиражирования двоичного гиперкуба во всех измерениях в s-1 экземплярах со «склейкой» смежных рёбер и с сохранением полноты связей в каждом геометрическом ребре.Generalized hypercube, OGK (V, d, s), usually given asd-dimensionals-personal hypercube that hasV = s ^d degree nodesd(s-1) each,placed in rows bysnodes defining "edges"d-dimensional cube. Ribs here they are understood not in the graph, but in the geometric sense. On the contrary, in the graphical sense, an “edge” is a complete graph, i.e. alls nodes of one geometric "rib" are connecteds(s-1) graph edges. An example of such a generalized hypercube of OGKs (16, 2, 4) is presented in Fig. 3. OGKs can be represented meaningfully as the result of replication of the binary hypercube in all dimensions ins-1 copies with “gluing” of adjacent edges and preserving the completeness of bonds in each geometric edge.

Рассмотрим сеть с топологией РОГК, использующую РК, полученный выше названным Способом (патент ⁽¹⁹⁾RU⁽¹¹⁾ 2 435 295⁽¹³⁾ C2), использующим базовую таблицу соединений для использования дуплексных каналов (с топологией неполной уравновешенной блок-схемы). По этой таблице может быть построен РК(N, m, σ), являющийся «идеальной» сетью, которая имеет возможность использования прямых каналов (без промежуточной буферизации пакетов) для бесконфликтной реализации произвольной перестановки пакетов данных между узлами. Пример расширенного коммутатора РК(7, 4, 2) из коммутаторов 4×4 и разветвителей/объединителей 1×4 приведён на фиг.4. По построению распределенный полный коммутатор РК(N, m, σ) является неблокируемым самомаршрутизируемым коммутатором N×N, как и исходный коммутатор m×m. Это означает, что произвольная перестановка пакетов данных между абонентами может осуществляться в нем бесконфликтно по прямым (без промежуточной буферизации пакетов) каналам.Consider a network with the ROGK topology using the RK obtained by the aforementioned Method (patent ⁽¹⁹⁾ RU ⁽¹¹⁾ 2,435,295 ⁽¹³⁾ C2) using the base connection table for using duplex channels (with the topology of an incomplete balanced block diagram). Based on this table, a RC ( N , m , σ ) can be built, which is an “ideal” network that has the ability to use direct channels (without intermediate packet buffering) for conflict-free implementation of arbitrary permutation of data packets between nodes. An example of an extended PK switch (7, 4, 2) from 4 × 4 switches and 1 × 4 splitters / combiners is shown in Fig. 4. By construction, the distributed full PK switch ( N , m , σ ) is a non-blocking self-routing N × N switch , as is the original m × m switch. This means that arbitrary rearrangement of data packets between subscribers can be carried out in it without conflict through direct (without intermediate packet buffering) channels.

С формальной точки зрения диаметр РК(N, m, σ) равен 2 (D=2). Однако диаметр можно выражать и в числе скачков (передач по прямым каналам без промежуточной буферизации пакетов). Такой диаметр равен 1 (D=1).From a formal point of view, the diameter of the RC ( N , m , σ ) is 2 ( D = 2). However, the diameter can also be expressed in the number of jumps (transmissions on direct channels without intermediate packet buffering). This diameter is 1 ( D = 1).

Фактически РК(N, m, σ) реализует минимальные квазиполные графы связей между входными портами N, осуществляемые через коммутаторы m×m, с числом независимых связей, равным σ.In fact, the RK ( N , m , σ ) implements minimal quasi-complete graphs of connections between input ports N implemented through m × m switches , with the number of independent connections equal to σ .

Расширение обобщенного гиперкуба можно осуществить за счет замены топологии связей в каждом геометрическом ребре с топологии полного графа на топологию квазиполного графа с помощью РК.An extension of a generalized hypercube can be realized by replacing the topology of connections in each geometric edge from the topology of a complete graph to the topology of a quasi-complete graph using RC.

Рассмотрим устройство системной сети с топологией расширенного n-мерного R -ичного обобщенного гиперкуба. В нём Q=R ⁿ абонентов расположены в узлах ОГК(Q, n, R). На фиг. 5 показан пример сети, имеющей в основе граф ОГК с параметрами Q=49, n=2, R=7. В каждом узле также находится полный коммутатор (n+1)×(n+1), одним портом связанный с абонентом, а оставшиеся порты (число их равно размерности гиперкуба) используются для организации связей следующим образом. Для объединения узлов в каждом геометрическом ребре ОГК используем РК(R, m, σ) (в примере R=7, m=4, σ=2), где R=m(m-1)/σ+1. Всего таких рёбер n×R, по R рёбер в каждой размерности гиперкуба. В результате получим сеть с Q=R ⁿ узлами, в котором узлы каждого геометрического ребра связаны прямыми каналами через свой коммутатор R×R, как показано на фиг.5. Абоненты узлов разных рядов связываются через полные коммутаторы (n+1)×(n+1), расположенные в каждом узле гиперкуба. Топологию такой сети будем называть расширенным обобщенным n-мерным R-ичным гиперкубом и обозначать её РОГК(Q, n, R, m, σ). На фиг.5 для примера показана сеть с топологией РОГК(49, 2, 7, 4, 2).Consider a system network device with a topology extendedn-dimensionalR - a generalized hypercube. In himQ = R ⁿ subscribers are located in the OGK nodes (Q, n, R) In FIG. 5 shows an example of a network based on an OGK graph with parametersQ =49,n =2, R =7. Each node also has a full switch (n +1) × (n +1), one port associated with the subscriber, and the remaining ports (their number is equal to the dimension of the hypercube) are used to organize communications as follows. To combine the nodes in each geometric edge of the OGK, we use the RK (R,m, σ) (in the exampleR =7,m =4, σ = 2), whereR = m(m-one)/σ +one.In total such edgesn×RbyRedges in each dimension of a hypercube. As a result, we get a networkwith Q = R ⁿ nodes, in which the nodes of each geometric edge are connected by direct channels through its switchR×Ras shown in FIG. Subscribers of nodes of different rows communicate through full switches (n +1) × (n +1) located in each node of the hypercube. The topology of such a network will be called extended generalizedn-measuredR-with the other hypercube and denote it by the ROGK (Q, n, R, m,σ). Figure 5 shows, for example, a network with the topology of the ROGK (49, 2, 7, 4, 2).

Подчеркнём, что предложенная сеть может иметь в рёбрах σ>1 параллельных каналов, что позволяет повышать пропускную способность, либо отказоустойчивость сети, а произвольная перестановка пакетов данных между абонентами может осуществляться в ребре бесконфликтно по прямым (без промежуточной буферизации пакетов) каналам.We emphasize that the proposed network can have parallel channels in the edges σ> 1, which allows increasing the throughput or fault tolerance of the network, and arbitrary rearrangement of data packets between subscribers can be performed in the edge without any conflicts along the direct (without intermediate packet buffering) channels.

Оценим преимущества предложенной сети перед прототипом. Рассчитаем параметры 2-уровнего толстого гиперкуба, построенного посредством коммутаторов m×m. Учитывая один процессор и одну связь с метакубом, из них можно построить гиперкуб с числом узлов (абонентов) K=2^m-2. Каждый узел метакуба содержит K коммутаторов, из которых можно построить метакуб с M=2^m-1=2K узлами. В результате толстый гиперкуб содержит Q=KM=2K ² абонентов, имеет схемную сложность S=Km ² MKm ²=QKm ⁴=4K ³ m ⁴≈11,2Q ^3/2 m ⁴ и проводную сложность W=KmWKm/4=K ³ m ²≈2,8 Q ^3/2 m ² . We will evaluate the advantages of the proposed network over the prototype. We calculate the parameters of the 2-level thick hypercube constructed using m × m commutators. Considering one processor and one connection with a metacube, one can construct a hypercube from them with the number of nodes (subscribers) K = 2 ^m-2 . Each node contains metakuba K switches, of which one can construct metakub with M = 2 ^m-1 = 2 K nodes. As a result, the thick hypercube contains Q = KM = 2 K ² subscribers, has the circuit complexity S = Km ² MKm ² = QKm ⁴ = 4 K ³ m ⁴ ≈ 11.2 Q ^3/2 m ^4, and the wire complexity W = KmWKm / 4 = K ³ m ² ≈ 2.8 Q ^3/2 m ² .

Диаметр толстого гиперкуба D=m-2+m-1+m-2=3m-5, бисекционная пропускная способность B=G/2.The diameter of the thick hypercube is D = m -2+ m -1+ m -2 = 3 m -5, the bisectional throughput is B = G / 2.

Пусть m=8, тогда K=64, M=128 и Q=8·1024, S=4·64³·64²=4·64⁵=4·8·512·8·512·64=Let m = 8, then K = 64, M = 128, and Q = 8 · 1024, S = 4 · 64 ³ · 64 ² = 4 · 64 ⁵ = 4 · 8 · 512 · 8 · 512 · 64 =

=64·1024²·64=4·1024³, W=128·1024² , B=4·1024, D=19.= 64 · 1024 ² · 64 = 4 · 1024 ³ , W = 128 · 1024 ² , B = 4 · 1024, D = 19.

Для сравнения рассчитаем параметры нашего 2-мерного расширенного обобщенного гиперкуба, построенного из тех же коммутаторов m×m. Будем использовать схему расширенного обобщённого гиперкуба РОГК(Q, 2, R, m, 1), использующего расширенные коммутаторы РК(R, m, 1) с таблицами построения, рассчитанными на использование симплексных каналов. Тогда в каждом измерении содержится R=m ² узлов, общее число узлов и абонентов - Q=R ²=m ⁴, схемная сложность S=Q(2m ²+2m)=2(m ⁶+m ⁵)=2(Q ^3/2+Q ^5/4) и проводная сложность W=Q2m=2Q ^5/4. Диаметр D=4 или D=2, если считать расширенный коммутатор РК(R, m, 1) 1-скачковым. Бисекционная пропускная способность B=m ² m ³/4=Q ^5/4/4.For comparison, we calculate the parameters of our 2-dimensional extended generalized hypercube constructed from the same m × m commutators . We will use the scheme of the extended generalized hypercube ROGK ( Q, 2, R, m, 1), using the extended commutators of the RK ( R, m, 1) with construction tables designed for the use of simplex channels. Then each dimension contains R = m ² nodes, the total number of nodes and subscribers is Q = R ² = m ⁴ , the circuit complexity S = Q (2 m ² +2 m ) = 2 ( m ⁶ + m ⁵ ) = 2 ( Q ^3/2 + Q ^5/4 ) and the wire complexity W = Q 2 m = 2 Q ^5/4 . The diameter D = 4 or D = 2, if we consider the extended commutator PK ( R, m, 1) 1-hop. Bisectional bandwidth B = m ² m ^3/4 = Q ^5/4 / 4.

Пусть m=8, тогда K=64 и Q=64² =4·1024, S=2·(64·64·64+8·64·64)=2·72·64·64=142·8·512= =564·1024, W=8·1024·8=64·1024, B=8⁵/4=2·64² =8·1024 и D=4.Let m = 8, then K = 64 and Q = 64 ² = 4 · 1024, S = 2 · (64 · 64 · 64 + 8 · 64 · 64) = 2 · 72 · 64 · 64 = 142 · 8 · 512 = = 564 · 1024, W = 8 · 1024 · 8 = 64 · 1024, B = 8 ^5/4 = 2 · 64 ² = 8 · 1024 and D = 4.

При вдвое меньшем числе абонентов предлагаемая сеть с топологией РОГК имеет вдвое большую бисекционную пропускную способность, во много раз меньший диаметр и на много порядков меньшую сложность. Для того чтобы уравнять число узлов достаточно взять в РОГК m=10 (на четверть больше). При этом все остальные сравнения останутся справедливыми.With half the number of subscribers, the proposed network with the ROGK topology has twice as large bisection bandwidth, many times smaller diameter and many orders of magnitude less complexity. In order to equalize the number of nodes, it is enough to take m = 10 in the ROGK (a quarter more). Moreover, all other comparisons will remain valid.

Покажем, что предложенное решение лучше по сравнению с вариантом сети, когда связи в рёбрах обобщённого гиперкуба реализуют полный граф. We show that the proposed solution is better in comparison with the variant of the network when the connections in the edges of a generalized hypercube realize a complete graph.

Табл. 1 показывает, что сеть РОГК(Q, 3, R, m, σ) по сравнению с ОГК(V, 3, s) может иметь во много раз большее число узлов и/или в σ раз большую пропускную способность каждого геометрического ребраTab. 1 shows that the ROGC network ( Q, 3 , R, m, σ), as compared to the WGC ( V, 3 , s ), can have many times more nodes and / or σ times the throughput of each geometric edge

Таблица 1. Фактор Q/V при m=s-1. Table 1. Q / V factor for m = s- 1 .

σ \ m σ \ m 4four 66 88 1010 1212 1one 17,617.6 9696 254254 566566 10711071 22 2,72.7 9,89.8 27,027.0 5656 114114 33 1,01,0 3,93.9 9,49,4 18,318.3 36,236,2 4four -- 1,51,5 4,64.6 8,08.0 16,416,4

В табл. 2 показывается степень узла s (равная числу дуг узла) по каждому измерению в ОГК(V, d, s), имеющего такое же число узлов, как и РОГК(Q, n, R, m, σ), V=Q. Табл. 2 показывает, что РОГК(Q, n, R, m, σ) по сравнению с ОГК(V, d, s) может иметь во много раз меньшее число портов в каждом узле и/или в несколько раз большую пропускную способность каждого геометрического ребра. In the table. Figure 2 shows the degree of the node s (equal to the number of node arcs) for each dimension in the OGK ( V, d, s ), which has the same number of nodes as the ROGK ( Q, n, R, m, σ), V = Q. Tab. 2 shows that ROGK ( Q, n, R, m, σ), compared to OGK ( V, d, s ), can have many times fewer ports at each node and / or several times higher throughput of each geometric edge .

Таблица 2. Значение s при V=Q (s=N) . Table 2. The value of s at V = Q (s = N).

σ \ m σ \ m 4four 66 88 1010 1212 1one 1313 3232 5757 9191 133133 22 77 15fifteen 2727 4242 6363 33 55 11eleven 1919 2929th 4343 4four -- 88 15fifteen 2222 3333

Рассмотрим работу сети с топологией РОГК на примере алгоритма зеркального отображения матрицы в сети РОГК(49, 2, 7, 4, 2), представленной на фиг. 5. В результате выполнения этого алгоритма 1-й элемент матрицы меняется местами с 49-м, 2-й элемент - с 48-м, 3-й элемент - с 47-м и т.д. Перед выполнением алгоритма элементы строк исходной матрицы размерности 7×7 находятся в абонентах соответствующих горизонтальных рёбер РОГК. На первом шаге алгоритма производится зеркальное отображение элементов матрицы во всех абонентах вертикальных рядов одновременно, на втором шаге производится зеркальное отображение элементов матрицы во всех абонентах горизонтальных рядов одновременно. В этом алгоритме на каждом шаге в расширенных коммутаторах РК(7, 4, 2) осуществляется перестановка 7 элементов за один такт параллельно и бесконфликтно. В случае выполнения более сложных операций в сети с топологией РОГК (в частности, с промежуточной буферизацией данных) произвольной размерности и арности (РОГК(Q, n, R, m, σ)) могут быть использованы известные алгоритмы параллельных вычислений и маршрутизации (Гергель В.П., Стронгин Р.Г. Основы параллельных вычислений для многопроцессорных вычислительных систем. Н.Новгород, ННГУ. 2003).Let us consider the operation of the network with the ROGK topology using the example of the matrix mirroring algorithm in the ROGK network (49, 2, 7, 4, 2) shown in FIG. 5. As a result of this algorithm, the 1st element of the matrix is interchanged from the 49th, the 2nd element from the 48th, the 3rd element from the 47th, etc. Before the algorithm is executed, the row elements of the initial 7 × 7 matrix are located in the subscribers of the corresponding horizontal edges of the ROGK. At the first step of the algorithm, the matrix elements are mirrored in all subscribers of the vertical rows at the same time; at the second step, the matrix elements are mirrored in all the subscribers of the horizontal rows at the same time. In this algorithm, at every step in the extended switches of the Republic of Kazakhstan (7, 4, 2), 7 elements are rearranged per cycle in parallel and without conflict. In the case of more complex operations in the network with the topology of the ROGK (in particular, with intermediate data buffering) of arbitrary dimension and arity (the ROGK ( Q, n, R, m, σ)), well-known parallel computing and routing algorithms (Gergel V .P., Strongin RG Fundamentals of parallel computing for multiprocessor computing systems. N. Novgorod, NNGU. 2003).

Предлагаемая организация сети обладает следующими преимуществами:The proposed network organization has the following advantages:

- обеспечивает уменьшение сложности, задержек передачи данных по сети, а также увеличение пропускной способности, - provides a decrease in complexity, delays in data transmission over the network, as well as an increase in throughput,

- позволяет в зависимости от области применения увеличивать либо количество узлов, либо вводить избыточные связи при экономном расходовании аппаратуры и проводов для связей,- allows, depending on the application, to increase either the number of nodes, or to introduce redundant connections with the economical use of equipment and wires for communications,

- приводит к уменьшению числа конфликтов при параллельной передаче данных в сети за счёт применения неблокируемых самомаршрутизируемых расширенных коммутаторов.- leads to a decrease in the number of conflicts during parallel data transmission in the network due to the use of non-blocking self-routing advanced switches.

Claims

System topology network with extended n-dimensional R-ary generalized hypercube, n≥2, R≥3, characterized in that the network is of the form n-dimensional R-ary extended generalized hypercube R ⁿ of nodes, wherein each node is a subscriber and a full switch (n + 1) × (n + 1), and for connecting nodes of each of the n × R ^n-1 edges of a generalized hypercube, there is an extended self-routing non-blocking switch R × R consisting of m × m, m≥2, each of the R ports of which is connected by one channel to the switch (n + 1) × (n + 1) of each of the R nodes of the edge, the remaining I port of each of the switches is connected to the subscriber of my node.