RU2797164C1

RU2797164C1 - Pipeline module multiplier

Info

Publication number: RU2797164C1
Application number: RU2023106826A
Authority: RU
Inventors: Вячеслав Иванович Петренко
Filing date: 2023-03-22
Publication date: 2023-05-31

Abstract

FIELD: computing technology.

SUBSTANCE: conveyor multiplier contains n parallel registers, n keys, (n−1) module multipliers by two, (n−1) module adders, where n is the capacity of the numbers being processed, with the corresponding links.

EFFECT: increasing speed of operations of multiplying numbers by an arbitrary module during conveyor processing of information.

1 cl, 2 dwg

Description

Область техники, к которой относится изобретениеThe field of technology to which the invention belongs

Изобретение относится к вычислительной технике и может быть использовано в цифровых вычислительных устройствах конвейерного типа, а также в устройствах цифровой обработки сигналов и в криптографических приложениях.The invention relates to computer technology and can be used in conveyor-type digital computing devices, as well as in digital signal processing devices and in cryptographic applications.

Уровень техникиState of the art

Из существующего уровня техники известен умножитель по модулю два, содержащий сумматор и мультиплексор, позволяющий выполнять умножение чисел на два по произвольным модулям [1]. Технической проблемой, которая не может быть решена при использовании данного технического решения, является невозможность умножения чисел по произвольным модулям на другие числа, не равные двум, так как для выполнения такой операции потребуются дополнительные технические средства. From the existing prior art known multiplier modulo two, containing an adder and a multiplexer that allows you to multiply numbers by two by arbitrary modules [1]. A technical problem that cannot be solved using this technical solution is the impossibility of multiplying numbers in arbitrary modules by other numbers that are not equal to two, since additional technical means will be required to perform such an operation.

Из существующего уровня техники известно устройство для умножения чисел по произвольному модулю, содержащее (m-1) сумматоров, (m-1) мультиплексоров, m ключей, (m-2) блоков сдвига, инвертор и сумматор по модулю, позволяющее выполнять умножение двух чисел по произвольному модулю [2]. Технической проблемой, которая не может быть решена при использовании данного технического решения, является низкое быстродействие при умножении чисел по произвольному модулю при конвейерной обработке информации, так как умножение очередных чисел по модулю в потоке не может быть начато до тех пор, пока не завершено умножение предыдущих чисел по модулю.From the existing prior art, a device is known for multiplying numbers modulo an arbitrary, containing ( m -1) adders, ( m -1) multiplexers, m keys, ( m -2) shift blocks, an inverter and a modulo adder, which allows multiplication of two numbers modulo [2]. A technical problem that cannot be solved using this technical solution is the low speed when multiplying numbers modulo an arbitrary number during pipeline processing of information, since the multiplication of successive numbers modulo in a stream cannot be started until the multiplication of the previous ones is completed. modulo numbers.

Наиболее близким к заявленному техническому решению по технической сущности и достигаемому техническому результату, выбранному в качестве прототипа, является умножитель по модулю, содержащий ключи, сумматоры, мультиплексоры, позволяющий выполнять приведение чисел по произвольным модулям [3]. Технической проблемой, которая не может быть решена при использовании данного технического решения при конвейерной обработке информации, является низкое быстродействие при умножении чисел по произвольному модулю, так как умножение очередных чисел по модулю в потоке не может быть начато до тех пор, пока не завершено умножение по модулю предыдущих чисел.The closest to the claimed technical solution in terms of technical essence and the achieved technical result, selected as a prototype, is a modulo multiplier containing keys, adders, multiplexers, which allows to perform reduction of numbers by arbitrary modules [3]. A technical problem that cannot be solved when using this technical solution for pipelined processing of information is the low speed when multiplying numbers modulo an arbitrary, since the multiplication of successive numbers modulo in the stream cannot be started until the multiplication modulo is completed. modulus of previous numbers.

Техническим результатом, обеспечиваемым приведенной совокупностью признаков, является повышение быстродействия операций умножения чисел по произвольному модулю при конвейерной обработке информации.The technical result provided by the above set of features is to increase the speed of operations of multiplying numbers by an arbitrary modulus during pipeline processing of information.

Раскрытие сущности изобретенияDisclosure of the essence of the invention

Указанный технический результат при осуществлении изобретения достигается тем, что в конвейерный вычислитель содержащий n ключей, где n – разрядность обрабатываемых чисел, первые информационные входы устройства, вторые информационные входы устройства, третьи информационные входы устройства, информационные выходы устройства, на первые информационные входы устройства подаётся код множимого, на вторые информационные входы устройства подаётся код множителя, на третьи информационные входы устройства подаётся инверсный код модуля, с информационных выходов устройства снимается код произведения чисел по модулю, введены n параллельных регистров, (n−1) умножителей на два по модулю, (n−1) сумматоров по модулю, тактовый вход устройства, который соединен со входами подачи тактовых импульсов n параллельных регистров, первые информационные входы устройства соединены с (1, …, n)-ми разрядами информационных входов первого параллельного регистра, вторые информационные входы устройства соединены с (n+1, …, 2n)-ми разрядами информационных входов первого параллельного регистра, третьи информационные входы устройства соединены со вторыми входами умножителей на два по модулю и с третьими входами сумматоров по модулю, выход (n−1)-го сумматора по модулю соединён с информационными выходами устройства, первый разряд информационных выходов (1, …, n)-го параллельных регистров соединен соответственно с управляющим входом (1, …, n)-го ключей, (1, …, n−i+1)-е разряды информационных входов i-х параллельных регистров, i=(2, …, n), соединены со (2, …, n−i+2)-ми разрядами информационных выходов (i−1)-х параллельных регистров, (n−i+2, …, 2n−i+1)-е разряды соединены с информационными выходами (i-1)-х умножителей на два по модулю, (2n, …, 3n−1)-е разряды информационных входов второго параллельного регистра соединены с выходами первого ключа, а (2n−i+2, …, 3n−i+1)-е разряды информационных входов (3, …, n)-го параллельных регистров с выходами (1, …, n−2)-го сумматоров по модулю, (2, …, n+1)-й разряд информационных выходов n-го параллельного регистра соединен с информационными входами n-го ключа, (2n−i+2, …, 3n-i+1)-е разряды информационных выходов i-х параллельных регистров соединены с первыми информационными входами (i−1)-х сумматоров по модулю, вторые информационные входы которых соединены с информационными выходами i-х ключей, (n−i+2, …, 2n−i+1)-е разряды, i=(1, …, n), информационных выходов (1, …, n−1)-го параллельных регистров соединены соответственно с информационными входами (1, …, n−1)-го ключей и с первыми входами (1, …, n−1)-го умножителей на два по модулю.The specified technical result in the implementation of the invention is achieved by the fact that the pipeline calculator containing n keys, where n is the capacity of the processed numbers, the first information inputs of the device, the second information inputs of the device, the third information inputs of the device, the information outputs of the device, the code is fed to the first information inputs of the device multiplier, the second information inputs of the device are supplied with the multiplier code, the third information inputs of the device are supplied with the inverse code of the module, the code for the product of numbers modulo is removed from the information outputs of the device, n parallel registers are introduced, ( n − 1) multipliers by two modulo, ( n −1) modulo adders, the clock input of the device, which is connected to the clock inputs of n parallel registers, the first information inputs of the device are connected to (1, ..., n )-bits of the information inputs of the first parallel register, the second information inputs of the device are connected to ( n +1, …, 2 n )-bits of the information inputs of the first parallel register, the third information inputs of the device are connected to the second inputs of multipliers by two modulo and to the third inputs of modulo adders, the output of the ( n −1)-th adder modulo the module is connected to the information outputs of the device, the first bit of the information outputs of the (1, ..., n )-th parallel registers is connected respectively to the control input of the (1, ..., n )-th keys, (1, ..., n − i +1) - e bits of information inputs of i -th parallel registers, i =(2, …, n ), are connected to (2, …, n − i +2)-th bits of information outputs of ( i −1) -th parallel registers, ( n − i +2, …, 2 n − i +1)-th bits are connected to information outputs of ( i -1)-th multipliers by two modulo, (2 n , …, 3 n −1)-th bits of information inputs of the second parallel register are connected to the outputs of the first key, and (2 n - i +2, ..., 3 n - i + 1)-th bits of information inputs of the (3, ..., n )-th parallel registers with outputs (1, ..., n − 2)-th modulo adders, (2, …, n+ 1)-th bit of information outputs of the n -th parallel register is connected to information inputs of the n -th key, (2 n − i +2, …, 3 n-i + 1)th bits of the information outputs of the i -th parallel registers are connected to the first information inputs of the ( i −1)th modulo adders, the second information inputs of which are connected to the information outputs of the i -th keys, ( n−i +2, ..., 2 n−i +1)-th bits, i =(1, …, n ), information outputs of the (1, …, n − 1)th parallel registers are connected respectively to information inputs (1, …, n − 1) -th keys and with the first inputs of (1, …, n − 1)-th multipliers by two modulo.

Сущность изобретения заключается в реализации следующего способа умножения двух чисел по модулю. ПустьThe essence of the invention lies in the implementation of the following method of multiplying two numbers modulo. Let

где A и B – целые положительные числа, 0≤A, B<P, называемые соответственно множимым и множителем; where A and B are positive integers, 0≤ A , B < P , called the multiplier and multiplier respectively;

P – целое положительное число, называемое модулем; P is a positive integer called the modulus;

C – целое положительное число, являющееся произведением чисел A и B, приведенным по модулю P. C is a positive integer that is the product of A and B modulo P .

Причем And

где a _i,

- коэффициенты, принимающие значение 0 или 1 в зависимости от значения числа A; where a _i ,

- coefficients that take the value 0 or 1 depending on the value of the number A ;

b _i,

- коэффициенты, принимающие значение 0 или 1 в зависимости от значения числа B; b _i ,

- coefficients that take the value 0 or 1 depending on the value of the number B ;

p _i,

коэффициенты, принимающие значение 0 или 1 в зависимости от значения модуля P; p _i ,

coefficients that take the value 0 or 1 depending on the value of the module P ;

c _i,

- коэффициенты, принимающие значение 0 или 1 в зависимости от значения произведения C; c _i ,

- coefficients that take the value 0 or 1 depending on the value of the product C ;

n – количество разрядов в представлении чисел. n is the number of digits in the representation of numbers.

Задача состоит в том, чтобы по известным A и B отыскать произведение C по модулю P. Произведение чисел A и B по модулю P может быть представлено в следующем виде:The problem is to find the product of C modulo P given the known A and B. The product of numbers A and B modulo P can be represented as follows:

Учитывая, что операция приведения по модулю инвариантна к сложению и умножению, выражение (6) можно записать в виде:Given that the modulo reduction operation is invariant to addition and multiplication, expression (6) can be written as:

Выражение (7) может быть представлено как:Expression (7) can be represented as:

Введем следующее обозначение:Let us introduce the following notation:

Очевидно, чтоIt's obvious that

Тогда выражение (8) может быть записано в следующем виде:Then expression (8) can be written in the following form:

В результате вычисление выражения (11) в конвейерном режиме может быть сведено к следующим вычислительным действиям. As a result, the calculation of expression (11) in the pipeline mode can be reduced to the following computational steps.

На 1-й ступени конвейера вычисляют

и t ₀:At the 1st stage of the pipeline, calculate

_and t0 :

На 2-й ступени конвейера вычисляют

a ₁

и t ₁:At the 2nd stage of the pipeline, they calculate

a ₁

_and t1 :

На n-й ступени конвейера вычисляют значение

а также t _n ₋₁:At the nth stage of the pipeline, the value is calculated

and also t _n ₋₁ :

Значение t _n _-1 и будет являться искомым произведением (6).The value of t _n _-1 will be the desired product (6).

Вычисление

в соответствии с (9) осуществляется последовательным удвоением числа B с одновременным приведением результата по модулю P.calculation

in accordance with (9) is carried out by successive doubling of the number B with simultaneous reduction of the result modulo P .

Описанная выше процедура определяет вычисление произведения от одной пары чисел A и B.The procedure described above defines the calculation of the product of one pair of numbers A and B .

Рассмотрим реализацию вычисления произведений для потока чисел A _j и B _j, поступающего потактово на вход конвейерного умножителя, j=1, 2, 3, …, .Consider the implementation of the calculation of products for the stream of numbers A _j and B _j , coming cycle by cycle to the input of the conveyor multiplier, j =1, 2, 3, …, .

Выражение (1) при этом примет следующий видExpression (1) will then take the following form

Обозначим через t _i _, _j значение i-го частичного произведения j-й пары чисел A _j и B _j по модулю P на i-й ступени конвейера, где i =1, 2, … , n – номер ступени конвейера, j=1, 2, 3, …, – номер такта работы устройства. ТогдаDenote by t _i _, _j the value of the i -th partial product of the j -th pair of numbers A _j and B _j modulo P on the i -th stage of the pipeline, where i =1, 2, … , n is the number of the pipeline stage, j =1 , 2, 3, …, is the number of the device operation cycle. Then

где a _i _-1, _j – коэффициенты в выражении (2) для A _j числа,where a _i _-1, _j are coefficients in expression (2) for A _j numbers,

Очевидно, чтоIt's obvious that

В результате вычисление выражения (11) в конвейерном режиме для пар чисел A _j и B _j по модулю P может быть сведено к следующим вычислительным действиям. As a result, the calculation of expression (11) in the pipeline mode for pairs of numbers A _j and B _j modulo P can be reduced to the following computational steps.

На первом такте работы устройства на 1-й ступени конвейера в соответствии с (19), (20) и (21) для пар чисел A ₁ и B ₁ вычисляют

и t _0,1:On the first stroke of the device at the 1st stage of the conveyor in accordance with (19), (20) and (21) for pairs of numbers A ₁ and B ₁ calculate

_and t0.1 :

На втором такте работы устройства на 1-й ступени конвейера в соответствии с (19), (20) и (21) для пар чисел A ₂ и B ₂ вычисляют

и t _0,2:On the second cycle of operation of the device at the 1st stage of the conveyor in accordance with (19), (20) and (21) for pairs of numbers A ₂ and B ₂ calculate

_and t0.2 :

На j-м такте работы устройства на 1-й ступени конвейера в соответствии с (19), (20) и (21) для пар чисел A _j и B _j вычисляют

и t _0, _j:On the j -th cycle of the device at the 1st stage of the conveyor in accordance with (19), (20) and (21) for pairs of numbers A _j and B _j calculate

and t _0, _j :

На втором такте работы устройства на 2-й ступени конвейера в соответствии с (18) и (19) для пар чисел A ₁ и B ₁ вычисляют

и t _1,1:On the second stroke of the device at the 2nd stage of the conveyor in accordance with (18) and (19) for pairs of numbers A ₁ and B ₁ calculate

and t _1,1 :

На третьем такте работы устройства на 2-й ступени конвейера в соответствии с (18) и (19) для пар чисел A ₂ и B ₂ вычисляют

и t _1,2:On the third stroke of the device at the 2nd stage of the conveyor in accordance with (18) and (19) for pairs of numbers A ₂ and B ₂ calculate

and t _1,2 :

На j-м такте работы устройства на 2-й ступени конвейера в соответствии с (18) и (19) для пар чисел A _j и B _j вычисляют

и t _1, _j:On the j -th cycle of the device at the 2nd stage of the conveyor in accordance with (18) and (19) for pairs of numbers A _j and B _j calculate

and t _1, _j :

На n-ом такте работы устройства на n-й ступени конвейера в соответствии с (18) для пар чисел A ₁ и B ₁ вычисляют

и t _1,1:Onnth cycle of the device onn-th stage of the pipeline in accordance with (18) for pairs of numbersA ₁ AndB ₁ calculate

Andt _1.1:

Значение t _n ₋ _1,1 и будет являться искомым произведением C ₁≡(A ₁·B ₁) mod P.The value t _n ₋ _1,1 will be the desired product C ₁ ≡( A ₁ · B ₁ ) mod P .

На (n+1)-ом такте работы устройства на n-й ступени конвейера в соответствии с (18) для пар чисел A ₂ и B ₂ вычисляют

и t _1,2:On (n+1)-th cycle of the device onn-th stage of the pipeline in accordance with (18) for pairs of numbersA ₂ AndB ₂ calculate

Andt _1.2:

Значение t _n ₋ _1,2 и будет являться искомым произведением C ₂≡(A ₂·B ₂) mod P. t value _n ₋ _1,2 and will be the desired product C ₂ ≡( A ₂ · B ₂ ) mod P .

На (n+j)-ом такте работы устройства на n-й ступени конвейера в соответствии с (18) для пар чисел A _j и B _j вычисляют

и t _1, _j:On the ( n + j )-th cycle of the device at the n -th stage of the conveyor, in accordance with (18) for pairs of numbers A _j and B _j calculate

and t _1, _j :

Значение t _n ₋ _1, _j и будет являться искомым произведением C _j≡(A _j ·B _j) mod P.The value t _n ₋ _1, _j will be the desired product C _j ≡( A _j B _j ) mod P .

Краткое описание чертежейBrief description of the drawings

Сущность изобретения поясняется чертежами.The essence of the invention is illustrated by drawings.

На фиг. 1 представлена схема конвейерного умножителя по модулю. Конвейерный умножитель по модулю содержит n параллельных регистров 1.1 ÷ 1.n, n ключей 2.1 ÷ 2.n, (n−1) умножителей на два по модулю 3.1 ÷ 3.n–1, (n−1) сумматоров по модулю 4.1 ÷ 4.n–1, где n – разрядность обрабатываемых чисел, первые информационные входы устройства 5, вторые информационные входы устройства 6, третьи информационные входы устройства 7, информационные выходы устройства 8, тактовый вход устройства 9. На первые информационные входы устройства 5 подаётся код множимого, на вторые информационные входы устройства 6 подаётся код множителя, на третьи информационные входы устройства 7 подаётся инверсный код модуля, с информационных выходов устройства 8 снимается код произведения чисел по модулю. Тактовый вход устройства 9 соединен со входами подачи тактовых импульсов n параллельных регистров 1.1 ÷ 1.n. Первые информационные входы устройства 5 соединены с (1, …, n)-ми разрядами информационных входов первого параллельного регистра 1.1, вторые информационные входы устройства 6 соединены с (n+1, …, 2n)-ми разрядами информационных входов первого параллельного регистра 1.1, третьи информационные входы устройства 7 соединены со вторыми входами умножителей на два по модулю и с третьими входами сумматоров по модулю 4.1 ÷ 4.n–1. Выход (n−1)-го сумматора по модулю 4.n–1 соединён с информационными выходами устройства 8. Первый разряд информационных выходов (1, …, n)-го параллельных регистров 1.1 ÷ 1.n соединен соответственно с управляющим входом (1, …, n)-го ключей 2.1 ÷ 2.n, (1, …, n−i+1)-е разряды информационных входов i-го параллельного регистра 1.i, i=(2, …, n), соединены со (2, …, n−i+2)-ми разрядами информационных выходов (i−1)-х параллельных регистров 1.i−1, (n−i+2, …, 2n−i+1)-е разряды соединены с информационными выходами (i−1)-х умножителей на два по модулю 3.i−1, (2n, …, 3n−1)-е разряды информационных входов второго параллельного регистра 1.2 соединены с выходами первого ключа 2.1, а (2n−i+2, …, 3n−i+1)-е разряды (3, …, n)-го параллельных регистров 1.3 ÷ 1.n с выходами (1, …, n−2)-го сумматоров по модулю 4.1 ÷ 4.n–2. (2, …, n+1)-й разряд информационных выходов n-го параллельного регистра 1.n соединен с информационными входами 2.n-го ключа, (2n−i+2, …, 3n-i+1)-е разряды информационных выходов i-х параллельных регистров 1.i соединены с первыми информационными входами (i−1)-х сумматоров по модулю 4.i−1, вторые информационные входы которых соединены с информационными выходами i-х ключей 2.i, (n−i+2, …, 2n−i+1)-е разряды, i=(1, …, n), информационных выходов (1.1, …, 1.n−1)-х параллельных регистров соединены соответственно с информационными входами (2.1, …, 2.n−1)-х ключей и с первыми входами (3.1, …, 3.n−1)-х умножителей на два по модулю.In FIG. 1 shows the scheme of the conveyor multiplier modulo. The pipeline multiplier modulo containsn parallel registers 1.1 ÷ 1.n,n keys 2.1 ÷ 2.n, (n−1) multipliers by two modulo 3.1 ÷ 3.n-1, (n−1) adders modulo 4.1 ÷ 4.n–1, wheren - word length of the processed numbers, the first information inputs of the device 5, the second information inputs of the device 6, the third information inputs of the device 7, the information outputs of the device 8, the clock input of the device 9. The multiplier code is fed to the first information inputs of the device 5, the multiplier code is fed to the second information inputs of the device 6, the inverse code of the module is fed to the third information inputs of the device 7, the modulo product code is removed from the information outputs of the device 8. The clock input of the device 9 is connected to the clock inputsn parallel registers 1.1 ÷ 1.n. The first information inputs of device 5 are connected to (1, ...,n)-bits of the information inputs of the first parallel register 1.1, the second information inputs of the device 6 are connected to (n+1, ..., 2n)-bits of the information inputs of the first parallel register 1.1, the third information inputs of the device 7 are connected to the second inputs of multipliers by two modulo and to the third inputs of adders modulo 4.1 ÷ 4.n-1. Exit (n−1)th adder modulo 4.n-1 is connected to the information outputs of the device 8. The first bit of the information outputs (1, ...,n)th parallel registers 1.1 ÷ 1.n connected respectively to the control input (1, ...,n)th keys 2.1 ÷ 2.n, (1, …,n−i+1)th digits of information inputsith parallel register 1.i,i=(2, …,n), connected to (2, ...,n−i+2) digits of information outputs (i−1)th parallel registers 1.i−1, (n−i+2, ..., 2n−i+1)th digits are connected to information outputs (i−1)-x multipliers by two modulo 3.i−1, (2n, …, 3n−1)-th digits of the information inputs of the second parallel register 1.2 are connected to the outputs of the first key 2.1, and (2n−i+2, ..., 3n−i+1)-th digits (3, ...,n)th parallel registers 1.3 ÷ 1.n with outputs (1, ...,n−2)th adders modulo 4.1 ÷ 4.n-2. (2, …,n+1)-th category of information outputsnth parallel register 1.n connected to information inputs 2.n-th key, (2n−i+2, ..., 3ni+1)th digits of information outputsi-x parallel registers 1.i connected to the first information inputs (i−1)-th adders modulo 4.i−1, the second information inputs of which are connected to the information outputsi-x keys 2.i, (n−i+2, ..., 2n−i+1)-th digits,i=(1, …,n), information outputs (1.1, ..., 1.n−1)th parallel registers are connected respectively to the information inputs (2.1, ..., 2.n−1)-th keys and with the first inputs (3.1, ..., 3.n−1)-th multipliers by two modulo.

На фиг. 2 представлено распределение разрядности параллельных регистров 1.1 ÷ 1.n устройства. Первый параллельный регистр 1.1 имеет разрядность 2n, равную сумме разрядности входных чисел A и B. В разряды с 1-го по n-й первого параллельного регистра 1.1 записываются числа A _j, а в разряды с (n+1)-го по 2n-й записываются числа B _j. Параллельные регистры 1.i, где i = (2, …, n) – номер ступени конвейера, имеют разрядность равную (3n−i+1). В разряды с 1-го по (n−i+1)-й i-го параллельного регистра 1.i записываются значения разрядов числа A, причем количество этих разрядов уменьшается на каждой последующей ступени конвейера на один со стороны младших, в разряды с (n−i+2)-го по (2n−i+1)-й записываются соответствующие значения разрядов чисел

а в разряды с (2n−i+2)-го по (3n−i+1)-й записываются значения разрядов чисел t _i _-2. In FIG. 2 shows the distribution of the capacity of parallel registers 1.1 ÷ 1. n devices. The first parallel register 1.1 has a capacity 2 n equal to the sum of the capacity of the input numbers A and B . Numbers A _j are written into bits _from 1st to nth of the first parallel register 1.1, and numbers Bj are written into bits from ( n +1)th to 2nth . Parallel registers 1. i , where i = (2, …, n ) is the number of the pipeline stage, have a capacity equal to (3 n − i +1). The digits from the 1st to ( n − i +1)-th i -th parallel register 1. i are filled with the values of the digits of the number A , and the number of these digits decreases at each subsequent stage of the pipeline by one from the lower ones, into digits with ( n − i +2)-th to (2 n − i +1)-th the corresponding values of the digits of numbers are written

and in the digits from (2 n − i +2)-th to (3 n − i +1)-th the values of the digits of numbers t _i _-2 are written.

Осуществление изобретенияImplementation of the invention

Конвейерный вычислитель работает следующим образом (см. Фиг. 1).Pipeline calculator works as follows (see Fig. 1).

В исходном состоянии параллельные регистры 1.1 ÷ 1.n−1 обнулены. На тактовый вход 9 устройства поступают тактовые импульсы. На первые информационные входы 5 устройства и вторые информационные входы 6 устройства с каждым тактовым импульсом подаются числа A _j и B _j, для которых необходимо вычислить произведение C _j по модулю P. На третьи информационные входы 7 устройства, в течение всего цикла формирования произведений, подаётся инверсный код модуля P. Произведение C _j по модулю P чисел A _j и B _j снимается с информационных выходов 8 устройства.In the initial state, the parallel registers 1.1 ÷ 1. n −1 are set to zero. The clock input 9 of the device receives clock pulses. The first information inputs 5 of the device and the second information inputs 6 of the device with each clock pulse are supplied with the numbers A _j and B _j for which it is necessary to calculate the product C _j modulo P . The inverse code of the module P is fed to the third information inputs 7 of the device during the entire cycle of creating works. The product C _j modulo P numbers A _j and B _j is removed from the information outputs 8 of the device.

На первом такте работы устройства первые два числа A ₁ и B ₁ записываются в параллельный регистр 1.1. При этом значение младшего разряда a _0,1 первого числа A ₁ с выходов параллельного регистра 1.1 поступает на управляющий вход первого ключа 2.1, а оставшиеся значения (a _1,1, …, a _n _−1,1) разрядов первого числа A ₁ поступают на (1, …, n−1)-е разряды информационных входов второго параллельного регистра 1.2. На информационные входы первого ключа 2.1 и на первые информационные входы первого умножителя на два по модулю 3.1 с (n+1, …, 2n)-го разрядов информационных выходов первого параллельного регистра 1.1 поступает код первого числа B ₁. В соответствии с (23) на выходе первого ключа 2.1 образуется произведение t _0,1 = a _0,1·B ₁, а в соответствии с (22) на выходе первого умножителя на два по модулю 3.1 образуется число

mod P. Умножители на два по модулю 3.1 ÷ 3.n–1 могут быть реализованы в соответствии с [1].On the first cycle of the device, the first two numbers A ₁ and B ₁ are written to the parallel register 1.1. In this case, the value of the least significant digit a _0.1 of the first number A ₁ from the outputs of the parallel register 1.1 is fed to the control input of the first key 2.1, and the remaining values ( a _1,1 , …, a _n _−1,1 ) of the digits of the first number A ₁ are received to (1, …, n − 1)-th bits of the information inputs of the second parallel register 1.2. The information inputs of the first key 2.1 and the first information inputs of the first multiplier by two modulo 3.1 with ( n +1, ..., 2 n )-bits of the information outputs of the first parallel register 1.1 receives the code of the first number B ₁ . In accordance with (23), at the output of the first key 2.1, the product t _0.1 = a _0.1 B ₁ is formed, and in accordance with (22), at the output of the first multiplier by two modulo 3.1, the number

mod P . Multipliers by two modulo 3.1 ÷ 3. n –1 can be implemented in accordance with [1].

На втором такте работы устройства в (1, …, n−1)-е разряды информационных входов второго параллельного регистра 1.2 записываются значения (a _1,1, …, a _n _−1,1) разрядов первого числа A ₁, в (n, …, 2n−1)-е разряды записывается значение числа

а в (2n, …, 3n−1)-е разряды записывается значение числа t _0,1. С выхода первого разряда второго параллельного регистра 1.2 значение (a _1,1)-го разряда первого числа A ₁, поступает на управляющий вход второго ключа 2.2. Значения (a _2,1, …, a _n _−1,1) разрядов первого числа A ₁ с (2, …, n−1)-х разрядов информационных выходов второго параллельного регистра 1.2 поступают на (1, …, n−2)-е разряды информационных входов третьего параллельного регистра 1.3. Значение числа

с (n, …, 2n−1)-х разрядов информационных выходов второго параллельного регистра 1.2 поступает на информационные входы второго ключа 2.2 и на первые информационные входы второго умножителя на два по модулю 3.2. Значение числа t _0,1 с (2n, …, 3n−1)-х разрядов информационных выходов второго параллельного регистра 1.2 поступает на первые информационные входы первого сумматора по модулю 4.1, на вторые информационные входы которого поступает значение

а на информационных выходах образуется значение числа t _1,1 в соответствии с (29), которое поступает на (2n−1, …, 3n)-е разряды информационных входов третьего параллельного регистра 1.3, на (n−1, …, 2n−2)-е разряды которых поступает значение числа

в соответствии с (28). Сумматоры по модулю 4.1 ÷ 4.n–1 могут быть реализованы согласно [4].On the second cycle of device operation, the (1, …, n −1)-th bits of the information inputs of the second parallel register 1.2 are filled with the values ( a _1,1 , …, a _n _−1,1 ) of the bits of the first number A ₁ , in ( n , …, 2 n −1)-th digits the value of the number is written

and in (2 n , …, 3 n −1)-th digits the value of the number t _0.1 is written. From the output of the first digit of the second parallel register 1.2 value ( a _1,1 )-th digit of the first number A ₁ is supplied to the control input of the second key 2.2. Values ( a _2,1 , …, a _n _−1,1 ) bits of the first number A ₁ with (2, …, n −1)-th bits of the information outputs of the second parallel register 1.2 are sent to (1, …, n −2 )-th bits of information inputs of the third parallel register 1.3. Number value

with ( n , ..., 2 n −1)-th bits of the information outputs of the second parallel register 1.2 is supplied to the information inputs of the second key 2.2 and to the first information inputs of the second multiplier by two modulo 3.2. The value of the number t _0.1 s (2 n , ..., 3 n −1)-th bits of the information outputs of the second parallel register 1.2 is supplied to the first information inputs of the first adder modulo 4.1, the second information inputs of which receive the value

and the value of the number t _1.1 is formed at the information outputs in accordance with (29), which goes to (2 n −1, …, 3 n )-th bits of the information inputs of the third parallel register 1.3, to ( n −1, …, 2 n − 2)-th bits of which the value of the number comes

according to (28). Adders modulo 4.1 ÷ 4. n –1 can be implemented according to [4].

Также на втором такте устройства вторые два числа A ₂ и B ₂ записываются в параллельный регистр 1.1. При этом значение младшего разряда a _0,2 второго числа A ₂ с выходов параллельного регистра 1.1 поступает на управляющий вход первого ключа 2.1, а оставшиеся значения (a _1,2, …, a _n _−1,2) разрядов второго числа A ₂ поступают на (1, …, n−1)-е разряды информационных входов второго параллельного регистра 1.2. На информационные входы первого ключа 2.1 и на первые информационные входы первого умножителя на два по модулю 3.1 с (n+1, …, 2n)-го разрядов информационных выходов первого параллельного регистра 1.1 поступает код второго числа B ₂. В соответствии с (25) на выходе первого ключа 2.1 образуется произведение t _0,2 = a _0,2·B ₂, а в соответствии с (24) на выходе первого умножителя на два по модулю 3.1 образуется число

Also on the second cycle of the device, the second two numbers A ₂ and B ₂ are written to the parallel register 1.1. In this case, the value of the least significant bit a _0.2 of the second number A ₂ from the outputs of the parallel register 1.1 is fed to the control input of the first key 2.1, and the remaining values ( a _1.2 , ..., a _n _−1.2 ) of the digits of the second number A ₂ are received to (1, …, n − 1)-th bits of the information inputs of the second parallel register 1.2. The information inputs of the first key 2.1 and the first information inputs of the first multiplier by two modulo 3.1 with ( n +1, ..., 2 n )-bits of the information outputs of the first parallel register 1.1 receives the code of the second number B ₂ . In accordance with (25), at the output of the first key 2.1, the product t _0.2 = a _0.2 B ₂ is formed, and in accordance with (24), at the output of the first multiplier by two modulo 3.1, the number

На следующих тактах работа устройства осуществляется аналогичным образом.On the following cycles, the operation of the device is carried out in a similar way.

На n-ом такте работы устройства на информационных выходах 8 устройства в соответствии с (34) для пар чисел A ₁ и B ₁ будет получено значение t _n ₋ _1,1, которое и будет являться искомым произведением

On the nth cycle of the device operation, at the information outputs 8 of the device, in accordance with (34), for pairs of numbers A ₁ and B ₁ , the value t _n ₋ _1.1 will be obtained, which will be the desired product

На (n+1)-ом такте работы устройства на информационных выходах 8 устройства для пар чисел A ₂ и B ₂ в соответствии с (36) будет получено значение t _n ₋ _1,2, которое будет являться искомым произведением

On the ( n +1)-th cycle of the device operation, at the information outputs 8 of the device for pairs of numbers A ₂ and B ₂ , in accordance with (36), the value t _n ₋ _1.2 will be obtained, which will be the desired product

На (n+j)-м такте работы устройства на информационных выходах 8 устройства для пар чисел A _j и B _j в соответствии с (36) будет получено значение t _n ₋ _1, _j которое будет являться искомым произведением

On the ( n + j )th cycle of the device operation, at the information outputs 8 of the device for pairs of numbers A _j and B _j in accordance with (36) the value t _n ₋ _{1 will be obtained,} _j which will be the desired product

Изобретение позволяет в конвейерном режим осуществлять умножение по модулю чисел, поступающих на его вход.The invention allows in the pipeline mode to carry out modulo multiplication of the numbers coming to its input.

Сравним быстродействие вычисления произведений чисел устройства прототипа [3] и предлагаемого устройства при реализации операции вычисления произведений для 100 пар 16-разрядных чисел. Устройство прототип при его реализации для работы с 16-разрядными числами будет содержать 8 последовательных ступеней преобразования. Обозначим через T_пр – время вычисления на одной ступени преобразования. При условии, что вычисление следующего произведения чисел не может быть начато до тех пор, пока не будет выполнено текущее вычисление, время, затрачиваемое устройством прототипом на вычисления произведения 100 пар 16-разрядных чисел, составит 800 T_пр. Предлагаемое устройство для выполнения аналогичной задачи будет содержать 16 последовательных ступеней конвейера. Обозначим через T_из – время обработки информации одной ступенью конвейера. Тогда общее время вычисления произведения 100 пар 16-разрядных чисел составит 132 T_из. Полагая

(хотя фактически T_пр>T_из), получим выигрыш B в быстродействии для указанных условий вычислений Let's compare the speed of calculating the products of numbers of the prototype device [3] and the proposed device when implementing the operation of calculating products for 100 pairs of 16-bit numbers. The prototype device, when implemented to work with 16-bit numbers, will contain 8 consecutive conversion steps. Denote by T _pr - the time of calculation at one stage of the transformation. Provided that the calculation of the next product of numbers cannot be started until the current calculation is completed, the time taken by the prototype device to calculate the product of 100 pairs of 16-bit numbers will be 800 T _pr . The proposed device for performing a similar task will contain 16 successive stages of the conveyor. Let us denote by T _out the time of information processing by one stage of the conveyor. Then the total time for calculating the product of 100 pairs of 16-bit numbers will be 132 T _of . Assuming

(although in fact T _pr >T _from ), we get a gain B in speed for the specified calculation conditions

Таким образом, при организации последовательных вычислений произведений 100 пар 16-разрядных чисел в конвейерном режиме выигрыш в быстродействии заявленного устройства по сравнению с устройством прототипом составит 6 раз. Очевидно, что при увеличении разрядности и увеличении количества чисел, выигрыш будет только возрастать.Thus, when organizing sequential calculations of the products of 100 pairs of 16-bit numbers in the pipeline mode, the speed gain of the claimed device compared to the prototype device will be 6 times. Obviously, with an increase in the bit depth and an increase in the number of numbers, the gain will only increase.

Источники информацииInformation sources

1. Патент на изобретение RU 2015537 C1. МПК G06F 7/49 (1990.01). Умножитель на два по модулю. Опубликован 30.06.1994. 1. Patent for invention RU 2015537 C1. IPC G06F 7/49 (1990.01). Multiplier by two modulo. Published 06/30/1994.

2. Патент на изобретение RU 2316042 C1. МПК G06F 7/523 (2006.01), G06F 7/72 (2006.01). Устройство для умножения чисел по произвольному модулю. Опубликован 27.01.2008. Бюл. № 3.2. Patent for invention RU 2316042 C1. IPC G06F 7/523 (2006.01), G06F 7/72 (2006.01). A device for multiplying numbers modulo arbitrary. Published on 01/27/2008. Bull. No. 3.

3. Патент на изобретение RU 2751802 C1. МПК G06F 7/72 (2006.01), G06F 7/523 (2006.01). Умножитель по модулю. Опубликован 19.07.2021 Бюл. № 20.3. Patent for invention RU 2751802 C1. IPC G06F 7/72 (2006.01), G06F 7/523 (2006.01). Modulo multiplier. Published on 19.07.2021 Bull. No. 20.

4. Патент на изобретение RU 2032934 C1. МПК G06F 7/49 (1995.01). Сумматор по модулю. Опубликован 10.04.1995. 4. Patent for invention RU 2032934 C1. IPC G06F 7/49 (1995.01). Modulo adder. Published 04/10/1995.

Claims

Modulo pipeline multiplier containing n keys, where n is the number of bits being processed, the first information inputs of the device, the second information inputs of the device, the third information inputs of the device, the information outputs of the device, the code of the multiplier is fed to the first information inputs of the device, the code of the multiplier is fed to the second information inputs of the device multiplier code, the inverse code of the module is fed to the third information inputs of the device, the code of the product of numbers modulo is removed from the information outputs of the device, which differs in that it contains n parallel registers, ( n − 1) multipliers by two modulo, ( n − 1 ) modulo adders, the clock input of the device, which is connected to the clock inputs of n parallel registers, the first information inputs of the device are connected to (1, ..., n)-bits of the information inputs of the first parallel register, the second information inputs of the device are connected to ( n +1, ..., 2 n )-bits of the information inputs of the first parallel register, the third information inputs of the device are connected to the second inputs of multipliers by two modulo and to the third inputs of modulo adders, the output of the ( n − 1)-th modulo adder is connected with the information outputs of the device, the first bit of the information outputs of the (1, ..., n )-th parallel registers is connected respectively to the control input of the (1, ..., n )-th keys, (1, ..., n − i +1)-th bits information inputs of i -th parallel registers, i =(2, …, n ), are connected to (2, …, n − i +2)-th bits of information outputs of ( i −1)-th parallel registers, ( n − i +2, …, 2 n − i +1)-th bits are connected to information outputs of ( i -1)-th multipliers by two modulo, (2 n , …, 3 n −1)-th bits of information inputs of the second parallel register are connected to the outputs of the first key, and (2 n - i +2, ..., 3 n - i + 1)-th bits of information inputs of the (3, ..., n )-th parallel registers with outputs (1, ..., n - 2)-th adder modulo, (2, ..., n + 1)-th bit of the information outputs of the n -th parallel register is connected to the information inputs of the n -th key, (2 n−i +2, ..., 3 n-i +1) -th bits of the information outputs of the i -th parallel registers are connected to the first information inputs of ( i −1)-th adders modulo, the second information inputs of which are connected to the information outputs of the i -th keys, ( n−i +2, ..., 2 n −i +1)-th bits, i =(1, …, n ), information outputs of the (1, …, n −1)-th parallel registers are connected respectively with information inputs of the (1, …, n −1)-th keys and with the first inputs of the (1, …, n − 1)th multipliers by two modulo.