JP2002521720A

JP2002521720A - Circuits and methods for modulo multiplication

Info

Publication number: JP2002521720A
Application number: JP2000561553A
Authority: JP
Inventors: ロバート・アイ・フォスター; ジョン・マイケル・バス; ロドニー・シー・テッシュ; ジェイムス・ダグラス・デュウォーキン; マイケル・ジェイ・トーラ
Original assignee: Motorola Inc
Current assignee: Motorola Solutions Inc
Priority date: 1998-07-22
Filing date: 1999-01-13
Publication date: 2002-07-16
Also published as: WO2000005645A1; AU2228199A; CN1248102C; US6182104B1; CN1310816A

Abstract

(57)【要約】コプロセッサ４４は、データを暗号化し、暗号を解読するためにモジュラ指数等式を計算する数学的アルゴリズムを実行する。パイプライン化乗算器５６はA/B RAM７２に格納される１６ビット・データ値を受信し、部分積を生成する。生成された部分積は加算器５８内で、積RAM６４内に格納される前回の部分積と合算される。モジューロ換算器６０は、合計値の特定のデータ・ビット位置が論理１値を有するとき、バイナリ・データ値Ｎを合計値に整合し加算する。Ｎ RAM７０は、モジューロ換算器６０内で合計値に加算されたデータ値Ｎを格納する。コプロセッサ４４は、フォスタ−モンゴメリ換算アルゴリズムを計算し、モンゴメリ換算アルゴリズムで必要とされたように↓の値を最初に計算する必要なしに、（A*Ｂ mod N）の値を換算する。 (57) Summary The coprocessor 44 executes a mathematical algorithm that encrypts data and calculates a modular exponential equation to decrypt the data. Pipelined multiplier 56 receives the 16-bit data value stored in A / B RAM 72 and generates a partial product. The generated partial product is added in the adder 58 with the previous partial product stored in the product RAM 64. Modulo converter 60 aligns and adds binary data value N to the sum when a particular data bit position of the sum has a logical one value. N RAM 70 stores data value N added to the total value in modulo converter 60. The coprocessor 44 calculates the Foster-Montgomery conversion algorithm and converts the value of (A * B mod N) without having to first calculate the value of ↓ as required by the Montgomery conversion algorithm.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

[Industrial applications]

本発明は、一般に乗算器に関し、さらに詳しくは、暗号化乗算器に関する。 The present invention relates generally to multipliers, and more particularly, to cryptographic multipliers.

【０００２】[0002]

[Prior art]

リベスト−シャミル−エイドルマン（RSA：Rivest-Shamir-Adleman）は、電子
装置間のデジタル・データ転送に関して高度な機密性を提供する、広く用いられ
ている暗号化アルゴリズムである。RSAアルゴリズムのモジュラ累乗法数学は、
ハードウェア乗算器に基づくモジュラ換算のためのモンゴメリ法を利用して効率
的に計算することができる。大きな整数のモジュラ累乗法は、モジュラ乗算を繰
り返すことで効率的に計算することができ、RSA演算全体の効率性は乗算器の速
度に直接的に関わる。ハードウェア乗算器アーキテクチャは、モンゴメリ・アル
ゴリズムの膨大な平衡演算に関するパイプライン化法を利用する。モンゴメリ・
アルゴリズムを計算するパイプライン化ハードウェア乗算器は、速度とシリコン
面積のバランスをとることができ、それにより高性能で費用効果の良い解決策が
得られる。また、パイプライン化整数モジュラ乗算器は、多くの用途で必要とさ
れる電力の軽減を可能にする。Rivest-Shamir-Adleman (RSA) is a widely used encryption algorithm that provides a high degree of confidentiality for digital data transfer between electronic devices. The modular exponentiation math of the RSA algorithm is
The calculation can be efficiently performed using the Montgomery method for modular conversion based on a hardware multiplier. Large integer modular exponentiation can be computed efficiently by repeating modular multiplication, and the overall efficiency of the RSA operation is directly related to the speed of the multiplier. The hardware multiplier architecture utilizes the Montgomery algorithm's massively balanced pipelined approach. Montgomery
A pipelined hardware multiplier that computes the algorithm can balance speed and silicon area, resulting in a high-performance, cost-effective solution. Also, pipelined integer modular multipliers allow for the reduction of power required in many applications.

【０００３】 RSAアルゴリズムにより容易になる暗号システムは、高レベルの機密性を提供
するが、実現には費用がかかる。モジュラ累乗を伴うRSAアルゴリズムの数学は
簡単明瞭であるが効率的ハードウェアの実現は簡単ではない。より高速の暗号化
動作とより高い性能に対する需要が高まるにつれて、ハードウェア・モジュラ乗
算器アーキテクチャの改良が高度な機密性を確保するために必要とされる。[0003] Cryptographic systems facilitated by the RSA algorithm provide a high level of confidentiality, but are expensive to implement. The math of the RSA algorithm with modular exponentiation is simple and straightforward, but efficient hardware implementation is not. As the demand for faster cryptographic operations and higher performance increases, improvements in hardware modular multiplier architectures are needed to ensure a high degree of confidentiality.

【０００４】従って、集積回路内に内蔵するための高性能，低コストで電力が低いモジュラ
累乗および乗算システムを有すると有利である。従来技術によるシステムよりも
少ないクロック・サイクルでモンゴメリ・アルゴリズムを演算することにより高
性能を達成する乗算システムが必要である。さらに、ビット数の増大したオペラ
ンドに適応することのできる乗算システムが必要である。Accordingly, it would be advantageous to have a high performance, low cost, low power modular power and multiply system for incorporation within an integrated circuit. There is a need for a multiplication system that achieves high performance by operating the Montgomery algorithm in fewer clock cycles than in prior art systems. Further, there is a need for a multiplication system that can accommodate operands with an increased number of bits.

【０００５】[0005]

DESCRIPTION OF THE PREFERRED EMBODIMENTS

図１は、データ通信網内で動作するように構築されるスマートカード１０のブ
ロック図である。「コンタクト型」のスマートカード構造においては、スマート
カード１０は、いくつかのコンタクト点１３に接続されるインタフェース（I/F
）ブロック１２を備える。コンタクト点１３は、端末装置（図示せず）とスマー
トカード１０との間の電気信号の転送を可能にする。スマートカード１０は、ス
マートカード１０内の機能ブロックにエネルギを供給するコンタクト点１３のう
ちの１つを通じて、端末装置から動作電位を受信する。コンタクト点１３をさら
に利用して、スマートカード１０と端末装置との間の入力／出力（I/O）信号の
転送を行う。FIG. 1 is a block diagram of a smart card 10 constructed to operate in a data communication network. In the “contact type” smart card structure, the smart card 10 has an interface (I / F) connected to several contact points 13.
) Block 12. The contact points 13 enable the transfer of electrical signals between a terminal (not shown) and the smart card 10. The smart card 10 receives an operating potential from a terminal device through one of the contact points 13 that supplies energy to functional blocks in the smart card 10. The input / output (I / O) signal is transferred between the smart card 10 and the terminal device by further utilizing the contact point 13.

【０００６】あるいは、スマートカード１０は、端末装置との物理的接触を行わずに、動作
する「コンタクトレス」スマートカードとすることもできる。この場合、スマー
トカード１０は、搬送周波数上での入力信号の受信と、被変調出力信号の送信の
両方を行う。たとえば、無線周波数（RF）エネルギがスマートカード１０内のコ
イル（図示せず）に放射され、スマートカード１０内の機能ブロックの動作を可
能にする動作電位をコイルが供給する。Alternatively, the smart card 10 can be a “contactless” smart card that operates without making physical contact with the terminal device. In this case, the smart card 10 both receives the input signal on the carrier frequency and transmits the modulated output signal. For example, radio frequency (RF) energy is radiated to a coil (not shown) within the smart card 10 and the coil provides an operating potential that enables operation of functional blocks within the smart card 10.

【０００７】外部端末装置とデータを送受信するI/Fブロック１２に加えて、スマートカー
ド１０は、汎用非同期式受信機−送信機装置（UART: Universal Asynchronous R
eceiver-Transmitter device）１４を備える。UART１４は、マイクロプロセッサ
１８と端末装置との間のインタフェースとなる。インタフェース・ブロック、す
なわちUART１４は、UART１４を通じてデータを動的に移動させるボーレート生成
器１６から、可調整クロック信号を受信する。システム・バス１５は、一般にマ
イクロプロセッサ１８を、UART１４，ランダム・アクセス・メモリ（RAM）２０
，読取専用メモリ（ROM）２２、メモリ・アクセス・コントローラ（MAC：Memory
Access controller）２４および機密メモリ管理ユニット（SMMU: Secure Memor
y Management Unit）２８などの他の機能ブロックに接続する。UART１４から受
信されるデータはRAM２０に格納され、RAM２０の一部は不揮発性であってスマー
トカード１０が動作電位を受信していないときに情報を保持する。不揮発性メモ
リの例としては、特に電気的消去可能（E²：Electrically Erasable）メモリま
たは強電性メモリなどがある。ROM２２はスマートカード１０のオペレーティン
グ・システムのためのデータと命令とを、マイクロプロセッサ１８のプログラム
制御のためのシステム・バスを介して提供する。RAM２０からのデータはMAC２４
を通じてフォスタ−モンゴメリ・ハードウェア・アクセラレータ（FMHA）２６に
転送され、そこで数学的演算が行われてデータを暗号化する。FMHA２６はモジュ
ラ演算ユニット（MAU: Modular Arithmetic Unit）または暗号化アクセラレータ
・ブロックとも呼ばれる。暗号化されたデータはFMHA２６からシステム・バス１
５を介してUART１４と端末装置とに転送される。[0007] In addition to an I / F block 12 for transmitting and receiving data to and from an external terminal device, the smart card 10 includes a universal asynchronous receiver-transmitter device (UART: Universal Asynchronous R).
eceiver-Transmitter device) 14. The UART 14 serves as an interface between the microprocessor 18 and the terminal device. The interface block, UART, receives an adjustable clock signal from a baud rate generator 16 that moves data dynamically through UART. The system bus 15 generally includes a microprocessor 18, a UART 14, a random access memory (RAM) 20.
, Read-only memory (ROM) 22, memory access controller (MAC: Memory)
Access controller (24) and Secure Memory Management Unit (SMMU)
y Management Unit) 28 and other functional blocks. Data received from the UART 14 is stored in a RAM 20, and a portion of the RAM 20 is non-volatile and retains information when the smart card 10 is not receiving an operating potential. Examples of non-volatile memories include, in particular, electrically erasable (E ² ) memories or high-power memories. ROM 22 provides data and instructions for the operating system of smart card 10 via a system bus for program control of microprocessor 18. Data from RAM 20 is MAC 24
To the Foster-Montgomery Hardware Accelerator (FMHA) 26 where mathematical operations are performed to encrypt the data. The FMHA 26 is also called a modular arithmetic unit (MAU) or a cryptographic accelerator block. The encrypted data is sent from the FMHA 26 to the system bus 1
5 to the UART 14 and the terminal device.

【０００８】図１に示されるスマートカード１０は、簡略化された形式にあることに注目さ
れたい。さらに、スマートカード１０は「コンタクト」モードにおいても「コン
タクトレス」モードにおいても動作するプラスチック・クレジット・カード内部
に埋め込まれるコンピュータ・チップであることに注目されたい。とりわけシリ
アル通信インタフェース・ブロック，ウォッチドッグ・タイマ，インターバル・
タイマ，割込コントローラなどの別のブロックを機能ブロックとしてスマートカ
ード１０に追加することもできる。Note that the smart card 10 shown in FIG. 1 is in a simplified form. Further, note that smart card 10 is a computer chip embedded inside a plastic credit card that operates in both a "contact" mode and a "contactless" mode. In particular, serial communication interface block, watchdog timer, interval
Other blocks, such as a timer and an interrupt controller, can be added to the smart card 10 as functional blocks.

【０００９】動作中にスマートカード１０は、スマートカード１０と端末装置との間に送信
されるデータのための安全な通信リンクを開設する。マイクロプロセッサ１８の
制御下で、SMMU２８，MAC２４およびFMHA２６が協働し、暗号化キーおよび他の
情報を用いてRAM２０に格納されるデータの一部を暗号化するためのモジュラ累
乗等式を計算する数学的アルゴリズムを実行する。例として、RAM２０は個人の
健康記録，経済的記録および個人の認証識別子すなわち指紋や網膜アイプリント
などのデータを格納する。個人データはRAM２０からシステム・バス１５を介し
てMAC２４に転送され、MAC２４からデータ・ホスト・バス２５を介してFMHA２６
に転送される。FMHA２６は、モジュラ乗算，加算，減算および累乗法を含む関数
を用いてデータ・ホスト・バス２５上に受信されるデータを暗号化する。データ
暗号化の後で、暗号化された個人データはFMHA２６からUART１４およびI/Fブロ
ック１２に転送される。暗号化された個人データはコンタクトレス・スマートカ
ードにおいてはRF信号を通じて、またコンタクト型スマートカードにおいては１
組のI/Oピンを通じて端末装置に放出される。In operation, the smart card 10 opens a secure communication link for data transmitted between the smart card 10 and the terminal. Under the control of the microprocessor 18, the SMMU 28, MAC 24 and FMHA 26 work together to calculate a modular exponentiation equation for encrypting a portion of the data stored in the RAM 20 using the encryption key and other information. Execute a mathematical algorithm. By way of example, RAM 20 stores personal health records, financial records, and personal identification identifiers, such as fingerprints and retinal eyeprints. The personal data is transferred from the RAM 20 to the MAC 24 via the system bus 15 and from the MAC 24 to the FMHA 26 via the data host bus 25.
Is forwarded to FMHA 26 encrypts data received on data host bus 25 using functions including modular multiplication, addition, subtraction and exponentiation. After data encryption, the encrypted personal data is transferred from FMHA 26 to UART 14 and I / F block 12. Encrypted personal data is transmitted via RF signals for contactless smart cards and 1 for contact smart cards.
Released to the terminal through a set of I / O pins.

【００１０】図２は、インターネットを介してFMHAブロックを含む集積回路とやりとりされ
ているデータを示す図である。キーボード３０が、中央処理装置（CPU）３４へ
のデータ入力を行うためのインタフェースをユーザに提供する。モニタ３２は、
ユーザがCPU３４に格納されるデータを視覚的に表示することを可能にする。集
積回路３６は、フォスタ−モンゴメリ・アルゴリズムを実行する暗号化回路構成
を備える。CPU３４に格納されるデータは、データ・バスを介して集積回路３６
に転送されて暗号化され、暗号データがインターネット３８に転送される。また
、インターネット３８を介して受信されるデータは集積回路３６に転送され暗号
解読することができる。このように、図２は、インターネットなどの通信網にイ
ンタフェースする暗号化システムを示す。FIG. 2 is a diagram showing data exchanged with an integrated circuit including an FMHA block via the Internet. A keyboard 30 provides a user with an interface for inputting data to a central processing unit (CPU). The monitor 32 is
It allows a user to visually display data stored in CPU. The integrated circuit 36 has an encryption circuit configuration for executing the Foster-Montgomery algorithm. Data stored in the CPU 34 is transmitted to the integrated circuit 36 via a data bus.
The encrypted data is transferred to the Internet 38. Data received via the Internet 38 can be transferred to the integrated circuit 36 and decrypted. Thus, FIG. 2 illustrates an encryption system that interfaces to a communication network such as the Internet.

【００１１】図３は、図１のFMHA２６に含まれる機能ブロックを示すブロック図である。同
じ要素を示すために図面内では同じ参照番号が用いられることに留意されたい。
さらに、フォスタ−モンゴメリ・アルゴリズムはオペランドＡ，Ｂの積を形成す
ることに留意されたい。ただし、オペランドＡ，Ｂはいずれも１０２４ビット数
などの大きな整数である。FMHA２６が用いるパイプライン化法により、オペラン
ドＡ，Ｂをディジットと呼ばれる複数の規則正しい１６ビット数に分割すること
ができる。ディジットには１６ビットのデータが含まれるが、これは本発明を制
約するものではない。さらに、オペランドＡの数の集合内の各々の分割された数
を値Ａと呼ぶ。同様に、オペランドＢの数の集合内の各々の分割された数を値Ｂ
と呼ぶ。値Ａの例は、A₀，A₁，...A₆₃であり、値Ｂの例はB₀，B₁，...B₆₃である
。ホスト・インタフェース（I/F）ブロック４０が値Ａと値ＢとをRAM２０からデ
ータ・ホスト・バス２５（図１）を介して受信する。値Ａ，ＢはA/Bランダム・
アクセス・メモリ（RAM）７２に格納される。また、I/Fブロック４０は、ホスト
・プロセッサすなわちマイクロプロセッサ１８（図１）から制御信号を受信し、
これらの信号はFMHA２６内のデータ転送を制御する制御回路７４によりホスト制
御信号に変換される。FIG. 3 is a block diagram showing functional blocks included in the FMHA 26 of FIG. Note that the same reference numbers are used in the drawings to indicate the same elements.
Note further that the Foster-Montgomery algorithm forms the product of operands A and B. However, each of the operands A and B is a large integer such as a 1024-bit number. By the pipeline method used by the FMHA 26, the operands A and B can be divided into a plurality of regular 16-bit numbers called digits. The digits include 16 bits of data, but this is not a limitation of the present invention. Further, each divided number in the set of operand A numbers is referred to as a value A. Similarly, each divided number in the set of operand B numbers is represented by the value B
Call. Examples of the value A are A ₀ , A ₁ ,... A ₆₃ , and examples of the value B are B ₀ , B ₁ _,. The host interface (I / F) block 40 receives the value A and the value B from the RAM 20 via the data host bus 25 (FIG. 1). Values A and B are A / B random
It is stored in an access memory (RAM) 72. The I / F block 40 also receives control signals from the host processor or microprocessor 18 (FIG. 1),
These signals are converted to host control signals by a control circuit 74 for controlling data transfer in the FMHA 26.

【００１２】制御回路７４は、データ・バス４１と呼ばれるバスを介して、ホストI/Fブロ
ック４０の出力に接続される端末を有する。制御回路７４は、ホスト・プロセッ
サから制御信号を受信し、ホストI/Fブロック４０とFMHA２６内の他のブロック
との間の対話を制御する信号を生成する。The control circuit 74 has a terminal connected to the output of the host I / F block 40 via a bus called the data bus 41. The control circuit 74 receives control signals from the host processor and generates signals that control interaction between the host I / F block 40 and other blocks in the FMHA 26.

【００１３】ディジット・ネゲーション・ユニット（DNU: Digit Negation Unit）４２は、
データ・バス４１を介してホストI/Fブロック４０の出力に接続される入力を有
する。値ＢがDNU４２の入力においてデータ・バス４１上でA/B RAM７２から受信
され、コプロセッサ４４の端末４６に転送されるか、あるいはDNU４２により１
の補数の負数に変換され端末４６に転送される。また、コプロセッサ４４は、デ
ータ・バス４１に接続されA/B RAM７２から値Ａを受信する端末４８を有する。
コプロセッサ４４の端末５０，５２は、それぞれ部分積値と値Ｎとを受信するた
めに結合される。オペランドＮは、すべての演算の係数であり、数学的計算が有
効な有限界を定義する。そのため可能な数の範囲は、係数により制限される。The digit negation unit (DNU: Digit Negation Unit) 42
It has an input connected to the output of the host I / F block 40 via the data bus 41. The value B is received from the A / B RAM 72 on the data bus 41 at the input of the DNU 42 and transferred to the terminal 46 of the coprocessor 44 or
And is transferred to the terminal 46. The coprocessor 44 also has a terminal 48 connected to the data bus 41 for receiving the value A from the A / B RAM 72.
The terminals 50, 52 of the coprocessor 44 are coupled to receive the partial product value and the value N, respectively. Operand N is the coefficient of all operations and defines the bounds for which mathematical calculations are valid. The range of possible numbers is therefore limited by the coefficients.

【００１４】コプロセッサ４４は、フォスタ−モンゴメリ・モジュラ換算アルゴリズムを計
算する。コプロセッサ４４は、端末４６に接続される第１入力と端末４８に接続
される第２入力とを有する乗算器５６を備える。加算器回路または加算器５８は
、乗算器５６の出力に接続される第１入力とコプロセッサ４４の端末５０に接続
される第２入力とを有する。モジューロ換算器６０は、加算器５８の出力に接続
される第１入力と、コプロセッサ４４の端末５２に接続される第２入力とを有す
る。ラッチ６２は、モジューロ換算器６０の出力に接続される入力とコプロセッ
サ４４の端末５４に接続される出力とを有する。ラッチ６２は、コプロセッサ４
４のある実施例に関しては必要とされない場合もあり、端末４６，４８，５０，
５２などの入力に含まれる場合も含まれない場合もある。The coprocessor 44 calculates a Foster-Montgomery modular conversion algorithm. Coprocessor 44 includes a multiplier 56 having a first input connected to terminal 46 and a second input connected to terminal 48. Adder circuit or adder 58 has a first input connected to the output of multiplier 56 and a second input connected to terminal 50 of coprocessor 44. Modulo converter 60 has a first input connected to the output of adder 58, and a second input connected to terminal 52 of coprocessor 44. Latch 62 has an input connected to the output of modulo converter 60, and an output connected to terminal 54 of coprocessor 44. The latch 62 is connected to the coprocessor 4
4 may not be required for certain embodiments, and terminals 46, 48, 50,
52, etc., may or may not be included.

【００１５】コプロセッサ４４の出力端末は、積RAM６４の入力に接続される。積RAM６４は
、コプロセッサ４４が生成する中間データ値の一時的な格納場所を提供する。例
として、積RAM６４は、２つの別々のRAMすなわち偶数メモリと奇数メモリを有し
、これらにより１回のサイクル内で二重のアクセスを行うことができる。たとえ
ば、１サイクルの間に、偶数メモリは、コプロセッサ４４が関与する次の計算中
に必要とされるデータを供給し、奇数メモリは、前回の計算でコプロセッサ４４
が生成したデータを格納する。次のサイクルで、奇数メモリは、コプロセッサ４
４が関与する次の計算中に必要なデータを供給し、偶数メモリが前回の計算でコ
プロセッサ４４が生成したデータを格納する。このようにして、偶数メモリと奇
数メモリが、各サイクル毎に交代して読取モードと書込モードになり、メモリが
同一サイクルの間に両方とも読取モードまたは書込モードになることはない。積
RAM６４の偶数メモリと奇数メモリは両方とも、各々が１６ビットのデータ（デ
ィジット）を格納する３２の行に編成される。An output terminal of coprocessor 44 is connected to an input of product RAM 64. Product RAM 64 provides a temporary storage location for intermediate data values generated by coprocessor 44. By way of example, product RAM 64 has two separate RAMs, an even memory and an odd memory, which allow for double access within a single cycle. For example, during one cycle, the even memory supplies the data needed during the next calculation involving the coprocessor 44 and the odd memory stores the coprocessor 44 in the previous calculation.
Stores the data generated by. In the next cycle, the odd memory is
4 provides the necessary data during the next calculation involving, and the even memory stores the data generated by coprocessor 44 in the previous calculation. In this way, the even and odd memories alternate between read and write modes each cycle, and neither memory is in read or write mode during the same cycle. product
Both the even and odd memories of RAM 64 are organized into 32 rows, each storing 16 bits of data (digits).

【００１６】積RAM６４の出力は、データ・スイッチ・ユニット（DSU：Data Switch Unit）
６８の第１入力に接続される。DSU６８の第２入力は、データ・バス４１に接続
される。DSU６８の出力は、コプロセッサ４４の端末５０に接続される。かくし
て、データ・バス４１からのデータまたは積RAM６４からのデータのいずれか一
方が部分積値としてDSU６８内で選択され、コプロセッサ４４の端末５０に転送
される。また、積RAM６４からのデータは、データ・バス４１にも転送すること
ができる。The output of the product RAM 64 is a data switch unit (DSU).
68 is connected to a first input. A second input of DSU 68 is connected to data bus 41. The output of DSU 68 is connected to terminal 50 of coprocessor 44. Thus, either the data from data bus 41 or the data from product RAM 64 is selected in DSU 68 as a partial product value and transferred to terminal 50 of coprocessor 44. The data from the product RAM 64 can also be transferred to the data bus 41.

【００１７】 N RAM７０は、データ・バス４１に接続され、コプロセッサ４４が用いる数体
系のための係数値を受信する入力を有する。N RAM７０は、たとえば、各々が１
６ビットのデータを格納する６４の行に編成される。N RAM７０の出力は、ディ
ジット比較ユニット（DCU：Digit Compare Unit）６６の第１入力に接続される
。DCU６６の第２入力は、データ・バス４１に接続される。DCU６６の出力は、コ
プロセッサ４４の端末５２に接続される。かくして、データ・バス４１からのデ
ータまたはN RAM７０からのデータのいずれか一方が、DCU６６内で値Ｎとして選
択され、コプロセッサ４４の端末５２に転送される。また、データは、N RAM７
０からデータ・バス４１を介してDCU６６にも転送することができる。N RAM 70 is connected to data bus 41 and has an input for receiving coefficient values for the number system used by coprocessor 44. N RAM 70 has, for example, 1
It is organized into 64 rows that store 6 bits of data. An output of the N RAM 70 is connected to a first input of a digit compare unit (DCU) 66. A second input of DCU 66 is connected to data bus 41. The output of DCU 66 is connected to terminal 52 of coprocessor 44. Thus, either data from the data bus 41 or data from the N RAM 70 is selected as the value N in the DCU 66 and transferred to the terminal 52 of the coprocessor 44. Data is stored in N RAM7
0 can be transferred to the DCU 66 via the data bus 41.

【００１８】ＡセクションとＢセクションとを有するA/B RAM７２は、データ・バス４１に
接続され、数学的演算のためのソース・オペランドを受信する。例として、A/B
RAM７２は、Ａセクション内に１０２４ビットを有する第１オペランドのディジ
ットのすべて、すなわち被分割オペランドＡの値Ａの６４のディジットを格納す
る。同様に、A/B RAM７２は、Ｂセクション内に１０２４ビットを有する第２オ
ペランドのディジットのすべて、すなわち被分割オペランドＢの値Ｂの６４のデ
ィジットを格納する。このようにして、A/B RAM７２は、コプロセッサ４４の端
末４８に転送される値Ａの６４ディジットと、DNU４２の入力に転送される値Ｂ
の６４ディジットとを格納する。あるいは、A/B RAM７２は、オペランドＡを格
納するものとオペランドＢを格納するものとの２つの別々のメモリとしてもよい
。さらに、本発明においては、A/B RAM７２のＢセクションは、暗号化動作が終
了した後のオペランドＡ，Ｂの乗算の最終的な積を格納する。積RAM６４の出力
は、最終積が計算されると、DSU６８内でデータ・バス４１に転送される。ホス
トI/Fブロック４０は、A/B RAM７２のＢセクションに格納される最終積すなわち
被暗号化データをデータ・ホスト・バス２５に転送することができる。An A / B RAM 72 having an A section and a B section is connected to the data bus 41 and receives source operands for mathematical operations. For example, A / B
The RAM 72 stores all digits of the first operand having 1024 bits in the A section, that is, 64 digits of the value A of the operand A to be split. Similarly, the A / B RAM 72 stores all of the digits of the second operand having 1024 bits in the B section, ie, 64 digits of the value B of the operand B to be split. In this way, the A / B RAM 72 stores the 64 digits of the value A transferred to the terminal 48 of the coprocessor 44 and the value B transferred to the input of the DNU 42.
And 64 digits are stored. Alternatively, the A / B RAM 72 may be two separate memories, one for storing operand A and one for storing operand B. Further, in the present invention, the B section of the A / B RAM 72 stores the final product of the multiplication of the operands A and B after the end of the encryption operation. The output of product RAM 64 is transferred to data bus 41 in DSU 68 once the final product has been calculated. The host I / F block 40 can transfer the final product, that is, the encrypted data stored in the B section of the A / B RAM 72 to the data host bus 25.

【００１９】 FMHA２６は、暗号化および暗号解読のためのオペランドＡ，Ｂの乗算を実行す
る。オペランドＡ，Ｂは、数値データまたはアメリカ規格協会情報交換標準コー
ド（アスキー符号（ASCII：American Standard Code for Information Intercha
nge））または他の変換済みキャラクタ集合を利用する序数詞に変換される通常
のテキスト・ストリングとすることができる。FMHA２６は、このデータをバイナ
リ整数全数として扱う。モジュラ乗算のためのモンゴメリ換算アルゴリズムは、
次の形式をとる： (A*R mod N)(B*R mod N)+↓＊Ｎただし：Ａは第１オペランドで整数；Ｂは第２オペランドで整数；Ｎは奇数値を有する整数； mod Nは、有限界の要素数を定義する（A*B*R）の剰余数；ＲはＮの値より大きな値を有する２の整数べき数；および ↓は、(A*R mod N)(B*R mod N)+↓＊Ｎが上位ビットを失わずにＲで除算する
ことができる整数になるように計算される換算値である。The FMHA 26 performs multiplication of operands A and B for encryption and decryption. Operands A and B are numeric data or an American Standards Code Information Exchange Standard Code (ASCII).
nge)) or any other text string that is converted to an ordinal utilizing another converted character set. The FMHA 26 treats this data as a binary integer whole number. The Montgomery conversion algorithm for modular multiplication is
Takes the form: (A * R mod N) (B * R mod N) + ↓ * N where: A is an integer in the first operand; B is an integer in the second operand; N is an integer having an odd value; mod N is the remainder of (A * B * R) that defines the bounded number of elements; R is an integer power of 2 having a value greater than the value of N; and ↓ is (A * R mod N) (B * R mod N) + ↓ * is a conversion value calculated so that N is an integer that can be divided by R without losing the upper bits.

【００２０】 FMHA２６の概念を利用するある例においては、２つの１０２４ビット・オペラ
ンドをパイプライン化法およびコプロセッサ４４内の複数のパスまたは回転を用
いて乗算する。このとき、２つの１６ビット・バイナリ数が乗算器５６により乗
算される。しかし、本発明は１０２４ビットのオペランドにも、２つの１６ビッ
ト・バイナリ数を乗算するハードウェア乗算器にも制約されない。簡単にして説
明を行うために、フォスタ−モンゴメリ・モジュラ換算アルゴリズムを、２つの
小さな数を乗算する以下の例を用いて説明する。モンゴメリ法は、オペランドＡ
，ＢをＲで予め乗算することによりモンゴメリ形式に変換して、ハードウェア。
モジュラ換算の問題を簡単にする。In one example utilizing the FMHA 26 concept, two 1024-bit operands are multiplied using a pipelined approach and multiple passes or rotations within coprocessor 44. At this time, the two 56-bit binary numbers are multiplied by the multiplier 56. However, the invention is not limited to 1024-bit operands or hardware multipliers that multiply two 16-bit binary numbers. For simplicity, the Foster-Montgomery modular conversion algorithm will be described using the following example of multiplying two small numbers. The Montgomery method uses operand A
, B are pre-multiplied by R to convert to Montgomery format and hardware.
Simplify modular conversion problems.

【００２１】底２の数を用いて、項（A*R mod N）は、A₁₀＝９，R₁₀＝１６およびN₁₀＝１３
のとき０００１の値を有する。さらに、項（B*R mod N）は、B₁₀＝１１，R₁₀＝
１６およびN₁₀＝１３のとき０１１１の値を有する。以下の例では、フォスタ−
モンゴメリ換算アルゴリズムを（A*R mod N）の乗算、すなわち（０００１）と
（B*R mod N）すなわち（０１１１）に用いる。Using the base 2 number, the terms (A * R mod N) are: A ₁₀ = 9, R ₁₀ = 16 and N ₁₀ = 13
Has a value of 0001. Further, the term (B * R mod N) is B ₁₀ = 11, R ₁₀ =
It has a value of 0111 when 16 and N ₁₀ = 13. In the following example, the foster
The Montgomery conversion algorithm is used for multiplication of (A * R mod N), that is, (0001) and (B * R mod N), that is, (0111).

【００２２】乗算器５６は、２つのデータ値を乗算し、これらのデータ値の積が加算器５８
に転送される。加算器５８は、前回の部分積と乗算器５６から生成される積との
合計値を生成する。フォスタ−モンゴメリ換算アルゴリズムにおいては、合計値
の特定のビット位置の論理値が、合計値が換算されるべきか否かを決定する。最
初は、特定のビット位置が最も右側のビット位置、すなわち第１合計値の最下位
データ・ビットにある。第２データ値のビット位置の値を第１データ値に乗算す
ると、すなわちビット掛け算の後で、特定のビット位置が左側に１ビット位置だ
け移動する。このため、ビット掛け算を生成するたびに、合計値内の特定のビッ
ト位置は左側に１ビット位置だけ移動される。すなわち最下位ビット位置から最
上位ビット位置に向かって移動する。The multiplier 56 multiplies two data values, and the product of these data values is added to an adder 58
Is forwarded to The adder 58 generates a total value of the previous partial product and the product generated from the multiplier 56. In the Foster-Montgomery conversion algorithm, the logical value of a particular bit position in the sum determines whether the sum should be converted. Initially, a particular bit position is in the rightmost bit position, ie, the least significant data bit of the first sum. Multiplying the value of the bit position of the second data value by the first data value, ie, after bit multiplication, moves the particular bit position one bit position to the left. Thus, each time a bit multiplication is generated, a particular bit position in the sum is shifted one bit position to the left. That is, it moves from the least significant bit position to the most significant bit position.

【００２３】フォスタ−モンゴメリ換算アルゴリズムにおいては、特定のビット位置のデー
タ・ビット値が論理１の値を有するとき、Ｎの値は、シフト演算により特定のビ
ット位置に整合され、その合計値に加算される。ビット掛け算毎に特定のビット
位置における論理値をチェックし、Ｎの値を適切に整合および加算することによ
り、コプロセッサ４４の出力で生成される各部分積は、コプロセッサ４４内の回
転毎に適切に換算される。一方で、Ｎの値は特定のビット位置のデータ・ビット
値が論理０値を有する場合は、合計値に加算されない。論理０値は、特定のビッ
ト位置の値がすでに換算されており、Ｎのある倍数が(の成分でないことを示す
。In the Foster-Montgomery conversion algorithm, when the data bit value at a particular bit position has a logical 1 value, the value of N is matched to the particular bit position by a shift operation and added to the sum. Is done. By checking the logical value at a particular bit position for each bit multiplication and properly matching and adding the value of N, each partial product produced at the output of coprocessor 44 is Converted appropriately. On the other hand, the value of N is not added to the sum if the data bit value at a particular bit position has a logical zero value. A logical zero value indicates that the value at the particular bit position has already been converted and that some multiple of N is not a component of (.

【００２４】この例では、乗算器５６は値（A*R mod N）すなわち（０００１）と値（B*R m
od N）すなわち（０１１１）の積を生成する。値（０００１）に（０１１１）の
最下位ビットすなわち論理１値を乗算することにより、第１ビット掛け算が生成
される。ビット掛け算を生成する各乗算毎に、その結果が格納される部分積と合
算される。格納される部分積は最初は０であるので、第１合計値とビット掛け算
とは等しい値を有することに留意されたい。In this example, the multiplier 56 calculates the value (A * R mod N), that is, (0001) and the value (B * R m
od N), that is, the product of (0111). The first bit multiplication is generated by multiplying the value (0001) by the least significant bit of (011 1 ), that is, a logical 1 value. For each multiplication that produces a bit multiplication, the result is summed with the stored partial product. Note that the first sum value and the bit multiplication have equal values because the stored partial product is initially zero.

【００２５】（１）０００１ ← 初期値（A*R mod N）（２）ｘ０００１ ← （B*R mod N）の最下位ビット（３）０００１ ← 第１ビット掛け算フォスタ−モンゴメリ換算アルゴリズムを用いて、合計値の特定のビット位置
のデータの論理値により、Ｎの値を合計値に加算して、生成されている部分積を
換算すべきか否かが決まる。この例では、第１ビット掛け算（０００１）の最下
位ビットは、論理１値を有し、従って、Ｎ（４）の値が第１ビット掛け算（３）
に加算される。(1) 0001 ← Initial value (A * R mod N) (2) Least significant bit of x 0001 ← (B * R mod N) (3) 0001 ← First bit multiplication Using Foster-Montgomery conversion algorithm The logical value of the data at a specific bit position of the total value determines whether the value of N should be added to the total value to convert the generated partial product. In this example, the least significant bit of the first bit multiplication (000 1 ) has a logical one value, so the value of N (4) is the first bit multiplication (3)
Is added to

【００２６】（３）０００１ ← 第１ビット掛け算の積（４）＋１１０１ ← Ｎの値（５）１１１０ ← 第１ビット換算後の結果第２ビット掛け算は、（１）に（B*R mod N）の右から第２番目のビット（０
１１１）を乗算する。(3) 0001 ← product of first bit multiplication (4) + 1101 ← value of N (5) 1110 ← result after conversion of first bit The second bit multiplication is performed by adding (B * R mod N) from the right of the second bit (0
Multiply 1 1 1).

【００２７】（１）０００１ ← 初期値（６）ｘ００１０ ← （B*R mod N）の右から２番目のビット（７）００１０ ← 第２ビット掛け算の積第２ビット掛け算の積（７）は、格納された前回の結果（５）と合算されて、
第２合計値（８）を生成する。(1) 0001 ← Initial value (6) x00 10 ← The second bit from the right of (B * R mod N) (7) 0010 ← Product of second bit multiplication Product (7) ) Is added to the stored previous result (5),
Generate a second sum (8).

【００２８】（７）００１０ ← 第２ビット掛け算の積（５）１１１０ ← 第１ビット換算後の結果（８）１００００ ← 第２合計値この場合も、フォスタ−モンゴメリ換算アルゴリズムにおいては、第２合計値
の特定のビット位置の論理値が、第２合計値を換算すべきか否かを決める。この
場合、特定のビット位置は、最下位データ・ビットの左側の位置（１００００）
である。第２データ・ビットは論理０値を有し、従って、Ｎの値は第２合計値に
加算されない。言い換えると、第２合計値はすでに換算されており、シフトされ
るＮ値の加算を必要としない。(7) 0010 ← The product of the multiplication of the second bit (5) 1110 ← The result after the conversion of the first bit (8) 10000 ← The second sum value In this case, the second sum is also used in the Foster-Montgomery conversion algorithm The logical value of a particular bit position of the value determines whether to convert the second sum. In this case, the particular bit position is the position to the left of the least significant data bit (100 0 0)
It is. The second data bit has a logical zero value, so the value of N is not added to the second sum. In other words, the second sum has already been converted and does not require the addition of the shifted N values.

【００２９】第３ビット掛け算は、（１）に（B*R mod N）の右から３番目のビット（０１
１１）に位置する論理値を乗算する。In the third bit multiplication, (1) is added to the third bit from the right of (B * R mod N) (0 1
Multiply the logical value located in 11).

【００３０】（１）０００１ ← 初期値（９）ｘ０００１ ← （B*R mod N）の右から３番目のビット（１０）０１００ ← 第３ビット掛け算の積第３ビット掛け算の後で、第３ビット掛け算の積（１０）が前回の結果（８）
に加算されて、第３合計値（１１）が得られる。(1) 0001 ← Initial value (9) x00 0 1 ← Third bit from the right of (B * R mod N) (10) 0100 ← Product of third bit multiplication After the third bit multiplication, The product of the third bit multiplication (10) is the previous result (8)
To obtain a third total value (11).

【００３１】（１０）０１００ ← 第３ビット掛け算の積（８）＋１００００ ← 前回の結果（１１）０１０１００ ← 第３合計値第３ビット掛け算の積（１０）を前回の結果（８）と加算した後は、第３合計
値の特定のビット位置の論理値が、合計値を換算すべきか否かを決める。この例
では、特定のビット位置は右から３番目のビット位置（０１０１００）である。
第３合計値の特定のビット位置が論理１値を有するとき、Ｎの値は第３特定ビッ
ト位置に整合され、第３合計値に加算される。一方で、第３合計値の第３特定ビ
ット位置が論理０を有するときは、Ｎの値は第３合計値には加算されない。この
例では、第３合計値の右から３番目のビット位置（１０１００）が論理１値を有
し、Ｎの値は右から３番目のビット位置に整合されて第３合計値に加算される。(10) 0100 ← The product of the third bit multiplication (8) +10000 ← Previous result (11) 010100 ← Third total value The product (10) of the third bit multiplication is added to the previous result (8) Thereafter, the logical value at a particular bit position in the third sum determines whether the sum should be converted. In this example, the specific bit position is the third bit position from the right (010 1 00).
When a particular bit position of the third sum has a logical one value, the value of N is aligned to the third particular bit position and added to the third sum. On the other hand, if the third specific bit position of the third sum has a logical zero, the value of N is not added to the third sum. In this example, the third bit position (10 1 00) from the right of the third sum has a logical 1 value, and the value of N is aligned with the third bit position from the right and added to the third sum. Is done.

【００３２】（１１）０１０１００ ← 第３合計値（１２）＋１１０１ ← 正しく整合されるＮの値（１３）１００１０００ ← 第３ビット換算の結果第４ビット掛け算の生成は、（１）に（B*R mod N）の右から４番目のビット
（０１１１）に位置する論理値を乗算する。(11) 010100 ← Third Total Value (12) + 1101 ← Correctly Matched N Value (13) 1001000 ← Result of Third Bit Conversion The generation of the fourth bit multiplication is performed by adding (B) to (1). * R mod N) is multiplied by the logical value located in the fourth bit ( 0 111) from the right.

【００３３】（１）０００１ ← 初期値（１４）ｘ００００ ← （B*R mod N）の右から４番目のビット（１５）００００ ← 第４ビット掛け算の積第４ビット掛け算の後で、第４ビット掛け算の積（１５）が前回の結果（１３
）に加算されて、第４合計値（１６）が得られる。(1) 0001 ← Initial value (14) x000 0 ← Fourth bit from the right of (B * R mod N) (15) 0000 ← Product of fourth bit multiplication After the fourth bit multiplication, The product of the 4-bit multiplication (15) is the previous result (13
) To obtain a fourth total value (16).

【００３４】（１５）００００ ← 第４ビット掛け算の積（１３）１００１０００ ← 前回の結果（１６）１００１０００ ← 第４合計値第４合計値（１６）の第４特定ビット位置が論理１または論理０値であるかチ
ェックされる。この例については、第４合計値（１６）の第４特定ビット位置は
、右から４番目のビット位置（１００１０００）である。この例では、右から４
番目のビット位置（１００１０００）は論理１値を有し、従って、Ｎの値が右か
ら４番目のビット位置に整合され、第４合計値に加算される。(15) 0000 ← The product of the fourth bit multiplication (13) 1001000 ← Previous result (16) 1001000 ← Fourth total value The fourth specific bit position of the fourth total value (16) is logic 1 or logic 0 The value is checked. For this example, the fourth specific bit position of the fourth sum (16) is the fourth bit position from the right (100 1 000). In this example, 4
The fourth bit position (100 1 000) has a logical 1 value, so the value of N is aligned with the fourth bit position from the right and added to the fourth sum.

【００３５】（１６）０１００１０００ ← 第４合計値（１７）＋１１０１ ← 正しく整合されるＮの値（１８）１０１１００００ ← 第４ビット換算の結果（A*R mod N）と（B*R mod N）すなわち（０００１）と（０１１１）の積は、
（A*B*R² mod N）すなわち（１０１１００００）の値を有する。（A*B*R² mod N
）をＲで除算すると、（A*B*R mod N）すなわち（１０１１）となる。Ｒの値は
底２の整数倍として選択されることに注目されたい。言い換えると、Ｒは２¹，
２²，２³，．．．，２^Iなどの値しか持たないことになる。ただしＩは整数であ
る。この例については、Ｒは２⁴の値を有するように選択される。通常、Ｒは２^S の値を有するよう選択される。ただしＳは係数Ｎのビット数である。このため、
Ｒで除算する演算は、Ｉ回だけ積を右にシフトさせるという簡単な動作で実行さ
れる。さらに、Ｒで除算した後の値（A*B*R mod N）はモンゴメリ形式、すなわ
ち（値＊Ｒ）mod Nの形式にある。モンゴメリ形式の値（A*B*R mod N）は、コプ
ロセッサ４４内の複数のパスを可能にする。コプロセッサ４４が利用するパイプ
ライン化法は、Ａ，Ｂ，Ｒ，Ｎの値が大きいときに（A*R mod N）と（B*R mod N
）とを掛け算するための効率的な方法となる。(16) 01001000 ← Fourth Total Value (17) + 1101 ← Correctly Matched N Value (18) 10110000 ← Fourth Bit Conversion Result (A * R mod N) and (B * R mod N) ) That is, the product of (0001) and (0111) is
It has a value of (A * B * R ² mod N), that is, (1010000). (A * B * R ² mod N
) Divided by R gives (A * B * R mod N), that is, (1011). Note that the value of R is chosen as an integer multiple of base 2. In other words, R is 2 ¹ ,
2 ^2, 2 ^3,. . . , 2 ^I, etc. Here, I is an integer. For example, R is selected to have a value of 2 ^4. Usually, R represents is selected to have a value of 2 ^S. Here, S is the number of bits of the coefficient N. For this reason,
The operation of dividing by R is performed by a simple operation of shifting the product rightward I times. Furthermore, the value (A * B * R mod N) after division by R is in Montgomery format, that is, (value * R) mod N. The value in Montgomery format (A * B * R mod N) allows for multiple passes within coprocessor 44. When the values of A, B, R, and N are large, (A * R mod N) and (B * R mod N)
) Is an efficient way to multiply

【００３６】モンゴメリ換算アルゴリズムとは対照的に、フォスタ−モンゴメリ換算アルゴ
リズムにおける↓の値は、２つのオペランドの乗算前には計算されないが、前述
の例に示されるように、各ビット掛け算が前回の結果に合算された後で換算が行
われる。Ｎの値は奇数である、すなわちＮの値は最下位ビットの位置において論
理１を有することに注目されたい。特定のビット位置の論理値が論理１値を有す
るときに合計値にＮを加算することにより、値（A*B*R² mod N）は、さらに下位
のビット位置にいくつかの０を有して生成される。言い換えると、フォスタ−モ
ンゴメリ換算アルゴリズムは、少なくとも最下位ビット位置の数Ｉに論理０値を
持たせる。In contrast to the Montgomery conversion algorithm, the value of ↓ in the Foster-Montgomery conversion algorithm is not calculated before multiplication of the two operands, but as shown in the above example, each bit multiplication is Conversion is performed after summing up the results. Note that the value of N is odd, that is, the value of N has a logical one at the least significant bit position. By adding N to the sum when the logical value at a particular bit position has a logical one value, the value (A * B * R ² mod N) has some zeros at lower bit positions. Generated. In other words, the Foster-Montgomery conversion algorithm assigns a logical 0 value to at least the number I of the least significant bit position.

【００３７】データが（A*B*R mod N）の形式にあるコプロセッサ４４を通じた演算の後で
は、データに関して望ましい最終的形式は（A*B mod N）である。例の（A*B*R m
od N）の値は（１０１１）である。値（A*B*R mod N）は、Ｒによる除算に備え
て換算される。（A*B*R mod N）の換算は、Ｎの最下位ビット位置を（A*B*R mod
N）の値の論理１を有する最も右側のビット位置に整合することである。例とし
て、値（A*B*R mod N）は、最も右側のビット位置に論理１値を有する（１０１
１）。Ｎの値を正しく整合し、（A*B*R mod N）と合算した後で、第１ビット換
算後の合計値は１１０００の値を有する。After the operation through the coprocessor 44 where the data is in the form (A * B * R mod N)
Is the desired final form for the data is (A * B mod N). Example (A * B * R m
od N) is (1011). The value (A * B * R mod N) is ready for division by R
Is converted. To convert (A * B * R mod N), the least significant bit position of N is calculated as (A * B * R mod N).
N) is to match the rightmost bit position with a logical 1 of the value. As an example
Thus, the value (A * B * R mod N) has a logical 1 value in the rightmost bit position (101
1). After correctly matching the value of N and summing with (A * B * R mod N), the first bit conversion
The calculated sum has a value of 11,000.

【００３８】（１９）１０１１ ← （A*B*R mod N）の値（２０）＋１１０１ ← Ｎの値（２１）１１０００ ← 第１ビット換算後の新しい合計値新しい合計値の換算は、Ｎの最下位ビット位置を論理１値を有する新しい合計
値の最も右側のビット位置（１１０００）に整合する。第２ビット換算後の値は
（１０００００００）である。(19) 1011 ← (A * B * R mod N) value (20) + 1101 ← N value (21) 11000 ← New total value after first bit conversion The conversion of the new total value is N To the rightmost bit position (1 1 000) of the new sum value having a logical 1 value. The value after the second bit conversion is (10000000).

【００３９】（２１）１１０００ ← 第１ビット換算後の値（２２）＋１１０１ ← Ｎの値（２３）１０００００００ ← 第２ビット換算後の値最下位ビット位置のビット数Ｉが論理０値を有するとき、（A*B*R mod N）の
値は換算され、Ｒによる除算に備える。言い換えると、Ｒで除算する演算が換算
後の合計値をＩ回だけ右にシフトする動作に変わる。ただしＩはこの例では４で
ある。(21) 11000 ← value after conversion of first bit (22) +1110 ← value of N (23) 10000000 ← value after conversion of second bit The number of bits I at the least significant bit position has a logical 0 value At this time, the value of (A * B * R mod N) is converted and prepared for division by R. In other words, the operation of dividing by R changes to an operation of shifting the converted total value rightward I times. However, I is 4 in this example.

【００４０】（２３）１０００００００ ← 第２ビット換算後の値（２４）Ｒで除算またはＩ回右にシフトする。(23) 10000000 ← Value after conversion to second bit (24) Divide by R or shift right I times.

【００４１】かくして、Ｒでの除算後、被暗号化データは（１０００）の値を有し、フォス
タ−モンゴメリ換算アルゴリズムは（A*B*R mod N）ｘ（A*B mod N）の乗算に用
いられる。この例の段階１９〜２４は、Ｒによる除算を行い、これで値（A*B*R
mod N）が（A*B mod N）に換算される。実際には、この最終的なＲによる除算は
、（A*B*R mod N）に１を掛けて換算することにより行われる。Thus, after division by R, the data to be encrypted has a value of (1000) and the Foster-Montgomery conversion algorithm uses the multiplication of (A * B * R mod N) × (A * B mod N) Used for Steps 19-24 of this example perform a division by R, which yields the value (A * B * R
mod N) is converted to (A * B mod N). In practice, this final division by R is performed by multiplying (A * B * R mod N) by one for conversion.

【００４２】動作中、A/B RAM７２は、６４の値すなわちA₀,A₁,...A₆₃を有するオペランド
Ａと、６４個の値すなわちB₀,B₁,...B₆₃を有するオペランドＢとを伴って、ホス
トI/Fブロック４０を通じてロードされる。１０２４ビット・オペランドＡは６
４個のディジットA₆₃〜A₀が含まれる。ただし各ディジットは１６ビットのデー
タを有する。かくして、A/B RAM６４のＡセクションには６４行があり、１０２
４ビット・オペランドＡ全体を格納することができ、またA/B RAM６４のＢセク
ションには６４行があり、１０２４ビット・オペランドＢ全体を格納することが
できる。さらに、N RAM７０はホストI/Fブロック４０を通じてロードされ、N₆₃
〜N₀の６４ディジットに分割される１０２４ビットを有する。In operation, A / B RAM 72 stores operand A having 64 values, ie, A ₀ , A ₁ ,... A ₆₃ , and 64 values, ie, B ₀ , B ₁ _,. With the operand B having, it is loaded through the host I / F block 40. 1024-bit operand A is 6
Four digits A ₆₃ -A ₀ are included. However, each digit has 16 bits of data. Thus, section A of A / B RAM 64 has 64 rows and 102
The entire 4-bit operand A can be stored, and the B section of the A / B RAM 64 has 64 rows and can store the entire 1024-bit operand B. Further, the N RAM 70 is loaded through the host I / F block 40 and the N ₆₃
Having 1024 bits are divided into 64 digits of to N _0.

【００４３】乗算器５６は、一度に１つのディジットに関してデータ上で働く。従って、暗
号化または暗号解読のためのオペランドＡ，Ｂの乗算は、値Ａ，Ｂ，Ｎの下位デ
ィジットをコプロセッサ４４に転送することで始まる。コプロセッサ４４の端末
４８がディジットA₀を、コプロセッサ４４の端末４６がディジットB₀を、コプロ
セッサ４４の端末５２がディジットN₀を受信する。値A₀とB₀の積が乗算器５６に
より計算され、１６の最下位データ・ビットが加算器５８の入力となる。最上位
の１６ビットは、乗算器５６の搬送チェーンに一時的に格納される。Multiplier 56 operates on the data one digit at a time. Thus, the multiplication of operands A and B for encryption or decryption begins by transferring the lower digits of values A, B and N to coprocessor 44. The terminal 48 digits A ₀ coprocessor 44, the terminal 46 of the coprocessor 44 is the digit B _0, the terminal 52 of the coprocessor 44 receives the digit N _0. The product of the values A ₀ and B ₀ is calculated by multiplier 56 and the 16 least significant data bits are input to adder 58. The 16 most significant bits are temporarily stored in the transport chain of the multiplier 56.

【００４４】加算器５８は、乗算器５６から受信した１６のデータ・ビットを積RAM６４に
格納される１６データ・ビット（当初は０）に加算し、合計値を生成する。モジ
ューロ換算器６０は、加算器５８から１６ビット合計値を、N RAM７０からディ
ジットN₀を受信する。フォスタ−モンゴメリ換算アルゴリズムにおいては、合計
値の特定のビット位置の論理値により合計値を換算すべきか否かが決まる。モジ
ューロ換算器６０は、まず特定のビット位置から最下位ビット位置として始まり
、それに続いてビット掛け算する毎に特定のビット位置が左に１ビット位置ずつ
移動する。言い換えると、特定のビット位置が最下位ビットから始まり、ビット
掛け算毎に最上位ビット位置に向かって指数を付けられる。特定のビット位置の
N₀の値を正しく整合し、N₀を合計値に加えることにより、値A₀とB₀とを乗算した
第１部分積が換算され、０の値を有する。モジューロ換算器６０が生成した部分
積は積RAM６４に格納される。また、↓の１６ビット値は、特定のビット位置に
見られる論理値に応じて決定され、モジューロ換算器６０内に格納される。The adder 58 adds the 16 data bits received from the multiplier 56 to the 16 data bits (initially 0) stored in the product RAM 64 to generate a total value. Modulo converter 60 receives the 16-bit sum from adder 58 and digit N ₀ from N RAM 70. In the Foster-Montgomery conversion algorithm, the logical value of a specific bit position of the total value determines whether to convert the total value. The modulo converter 60 starts with a particular bit position as the least significant bit position, and then moves the particular bit position one bit position to the left for each subsequent bit multiplication. In other words, a particular bit position starts with the least significant bit and is indexed with each bit multiplication toward the most significant bit position. At a particular bit position
Correctly matching the value of N _0, by adding N ₀ to the total value, the first partial product obtained by multiplying the value A ₀ and B ₀ are converted, it has a value of 0. The partial product generated by the modulo converter 60 is stored in the product RAM 64. The 16-bit value of ↓ is determined according to the logical value found at a specific bit position, and stored in the modulo converter 60.

【００４５】第２部分積を生成するには、新たなバイナリ値B₁をA/B RAM７２からコプロセ
ッサ４４に転送して、値A₀と乗算する。乗算器５６は、値A₀とB₁との積を計算し
、１６の最下位データ・ビットが加算器５８の入力となる。加算器５８は、乗算
器５６から受信した１６データ・ビットを積RAM６４に格納される１６データ・
ビット（当初は０）に加算し、合計値を生成する。モジューロ換算器６０は、加
算器５８から１６ビット合計値を、N RAM７０からディジットN₁を受信する。モ
ジューロ換算器６０は、第１部分積の生成により決定される↓の前回に導かれた
値とN₁の値とを利用して、加算器５８からの合計値を換算し、第２部分積を生成
する。第２部分積は、積RAM６４に格納される。To generate a second partial product, the new binary value B ₁ is transferred from the A / B RAM 72 to the coprocessor 44 and multiplied by the value A ₀ . Multiplier 56 calculates the product of the values A ₀ and B _1, and the 16 least significant data bits are input to adder 58. The adder 58 converts the 16 data bits received from the multiplier 56 into 16 data bits stored in the product RAM 64.
Add to the bit (initially 0) to generate the sum. Modulo converter 60 receives the 16-bit sum from adder 58 and digit N ₁ from N RAM 70. The modulo converter 60 converts the total value from the adder 58 by using the value of N ₁ and the value of ↓ determined by the generation of the first partial product, and converts the total value from the second partial product. Generate The second partial product is stored in the product RAM 64.

【００４６】第１群内の部分積の生成を完了するには、A/B RAM７２からの値B₂〜B₆₃が、順
次コプロセッサ４４に転送され、値A₀と乗算される。↓の同値を用いて、モジュ
ーロ換算器６０は、N RAM７０からN₂〜N₆₃の対応する値を用いて残りの部分積を
生成する。部分積は、積RAM６４に格納される。To complete the generation of the partial products in the first group, the values B _{2 to} B ₆₃ from the A / B RAM 72 are sequentially transferred to the coprocessor 44 and multiplied by the value A ₀ . Using the same value of ↓, the modulo converter 60 generates the remaining partial products from the N RAM 70 using the corresponding values of N _{2 to} N ₆₃ . The partial product is stored in the product RAM 64.

【００４７】 A₀にディジットB₀〜B₆₃を乗算して得られる６４の部分積を完成させると、A₂
にディジットB₀〜B₆₃を乗算して得られる第２群の６４の部分積が生成される。A ₁ ，B₀の部分積を生成するにあたり、A₁を含む残りの部分積を生成する際に用い
られる↓の値が決定されることに注目されたい。さらに、部分積の残りの群は、
A₂にディジットB₀〜B₆₃を、A₃にディジットB₀〜B₆₃．．．をA₆₃にディジットB₀
〜B₆₃を乗算して生成される。B₀に関わる部分積が生成されると↓の値が決まり
各群について設定される。ディジットB₀がコプロセッサ４４の端末４６の値であ
るときに↓の値が決定および設定されること、また、出力５４に生成される部分
積は０の値を有するよう換算されることに注目されたい。各群は、N₀〜N₆₃の対
応値も利用する。A₀Digit B₀~ B₆₃Completing the partial product of 64 obtained by multiplying_Two
Digit B₀~ B₆₃Are multiplied to generate a second group of 64 partial products. A ₁ , B₀In generating the partial product of, A₁Used to generate the remaining partial product containing
Note that the value of ↓ is determined. In addition, the remaining groups of the partial product
A_TwoDigit B₀~ B₆₃And A_ThreeDigit B₀~ B₆₃. . . A₆₃Digit B₀
~ B₆₃Is multiplied by B₀When the partial product related to is generated, the value of ↓ is determined
Set for each group. Digit B₀Is the value of the terminal 46 of the coprocessor 44.
The value of ↓ is determined and set when the
Note that the product is scaled to have a value of zero. Each group is N₀~ N₆₃Pair of
The response value is also used.

【００４８】図４は、モジューロ換算器６０の一部分のブロック図である。モジューロ換算
器６０は、簡潔に説明するために、４ｘ４アレイの加算器として簡略化された形
態で説明される。好適な実施例においては、モジューロ換算器６０は、１６行と
１６列を有する加算器アレイを備える。加算器アレイの行数が列数に一致するこ
とが本発明を制限するものでないことに留意されたい。コプロセッサ４４は、正
方形でないモジューロ換算器６０の加算器アレイ内で動作することができる。FIG. 4 is a block diagram of a part of the modulo converter 60. Modulo converter 60 is described in simplified form as a 4x4 array of adders for simplicity. In the preferred embodiment, the modulo converter 60 comprises an adder array having 16 rows and 16 columns. Note that matching the number of rows in the adder array to the number of columns is not a limitation of the present invention. Coprocessor 44 can operate within the adder array of non-square modulo converter 60.

【００４９】モジューロ換算器６０の加算器アレイはＸ列Ｙ行を有する。ただしＸとＹは整
数である。列X₀が第１列であり、加算器９０，９２，９４，９６を備える。X₁が
第２列であり、加算器１００，１０２，１０４，１０６を備える。列X₂が第３列
であり、加算器１１０，１１２，１１４，１１６を備える。列X₃が第４列であり
、加算器１２０，１２２，１２４，１２６を備える。加算器９０〜９６，１００
〜１０６，１１０〜１１６，１２０〜１２６の各々は、第１および第２データ入
力，搬送入力（CI：carry input），搬送出力（CO： carry output）および合計
出力（Ｓ）を有する。The adder array of the modulo converter 60 has X columns and Y rows. Here, X and Y are integers. Column X ₀ is the first column and includes adders 90, 92, 94 and 96. X ₁ is the second column, an adder 100, 102, 104, 106. Column X ₂ is the third column, an adder 110, 112, 114. Column X ₃ is the fourth column, an adder 120, 122, 124, 126. Adders 90 to 96, 100
Each of -106, 110-116, 120-126 has first and second data inputs, a carry input (CI), a carry output (CO), and a total output (S).

【００５０】列X₀の加算器９０，９２，９４，９６の第１入力は、それぞれ端末８０，８２
，８４，８６に接続される。２入力ANDゲート８９，９１，９３，９５は、互い
に接続され、またラッチ１２８のＱ出力に接続される第１入力を有する。ANDゲ
ート８９，９１，９３，９５の出力は、それぞれ加算器９０，９２，９４，９６
の第２入力に接続される。また、加算器９０の搬送出力（CO）は加算器９２の搬
送入力（CI）に、加算器９２の搬送出力は、加算器９４の搬送入力に、加算器９
４の搬送出力は、加算器９６の搬送入力に接続される。加算器９６の搬送出力は
、ラッチ１５２のデータ入力に接続される。ラッチ１５２の出力は、加算器９０
の搬送入力に接続される。The first inputs of adders 90, 92, 94, 96 in column X ₀ are terminals 80, 82, respectively.
, 84, 86. Two-input AND gates 89, 91, 93, 95 are connected together and have a first input connected to the Q output of latch 128. Outputs of the AND gates 89, 91, 93, and 95 are added to adders 90, 92, 94, and 96, respectively.
Is connected to the second input. The transport output (CO) of the adder 90 is supplied to the transport input (CI) of the adder 92, and the transport output of the adder 92 is supplied to the transport input of the adder 94.
The transport output of 4 is connected to the transport input of adder 96. The carry output of adder 96 is connected to the data input of latch 152. The output of the latch 152 is
Is connected to the transport input.

【００５１】列X₁の加算器１００，１０２，１０４，１０６の第１入力は、列X₀の加算器９
０，９２，９４，９６の個々の出力に接続される。２入力ANDゲート９９，１０
１，１０３，１０５は、互いに接続され、またラッチ１３２のＱ出力に共通して
接続される第１入力を有する。ANDゲート９９，１０１，１０３，１０５の出力
は、加算器１００，１０２，１０４，１０６の第２入力にそれぞれ接続される。
また、加算器１００の搬送出力は加算器１０２の搬送入力に、加算器１０２の搬
送出力は加算器１０４の搬送入力に、加算器１０４の搬送出力は加算器１０６の
搬送入力に接続される。加算器１０６の搬送出力は、ラッチ１５６のデータ入力
に接続される。ラッチ１５６の出力は、加算器１００の搬送入力に接続される。The first input of the adders 100, 102, 104, 106 of column X _{1 is} the adder 9 of column X ₀
0, 92, 94, 96 are connected to the individual outputs. 2-input AND gate 99, 10
1, 103 and 105 are connected together and have a first input commonly connected to the Q output of the latch 132. Outputs of the AND gates 99, 101, 103, and 105 are connected to second inputs of adders 100, 102, 104, and 106, respectively.
The transport output of the adder 100 is connected to the transport input of the adder 102, the transport output of the adder 102 is connected to the transport input of the adder 104, and the transport output of the adder 104 is connected to the transport input of the adder 106. The carry output of adder 106 is connected to the data input of latch 156. The output of latch 156 is connected to the carry input of adder 100.

【００５２】列X₂の加算器１１０，１１２，１１４，１１６の第１入力は、列X₁の加算器１
００，１０２，１０４，１０６の個々の出力に接続される。２入力ANDゲート１
０９，１１１，１１３，１１５は、互いに接続され、またラッチ１３６のＱ出力
に共通して接続される第１入力を有する。ANDゲート１０９，１１１，１１３，
１１５の出力は、加算器１１０，１１２，１１４，１１６の第２入力にそれぞれ
接続される。また、加算器１１０の搬送出力は加算器１１２の搬送入力に、加算
器１１２の搬送出力は加算器１１４の搬送入力に、加算器１１４の搬送出力は、
加算器１１６の搬送入力に接続される。加算器１１６の搬送出力はラッチ１６０
のデータ入力に接続される。ラッチ１６０の出力は、加算器１１０の搬送入力に
接続される。[0052] The first input of the adder 110, 112, 114, 116 columns X ₂ includes an adder 1 column X ₁
00, 102, 104, 106. 2-input AND gate 1
09, 111, 113, and 115 are connected together and have a first input commonly connected to the Q output of latch 136. AND gates 109, 111, 113,
The output of 115 is connected to second inputs of adders 110, 112, 114, 116, respectively. The transport output of the adder 110 is the transport input of the adder 112, the transport output of the adder 112 is the transport input of the adder 114, and the transport output of the adder 114 is
Connected to the transport input of adder 116. The transport output of adder 116 is latch 160
Connected to the data input. The output of latch 160 is connected to the transport input of adder 110.

【００５３】列X₃の加算器１２０，１２２，１２４，１２６の第１入力は、列X₂の加算器１
１０，１１２，１１４，１１６の個々の出力に接続される。２入力ANDゲート１
１９，１２１，１２３，１２５は、互いに接続され、またラッチ１４０のＱ出力
に共通して接続される第１入力を有する。ANDゲート１１９，１２１，１２３，
１２５の出力は、加算器１２０，１２２，１２４，１２６の第２入力にそれぞれ
接続される。また、加算器１２０の搬送出力は加算器１２２の搬送入力に、加算
器１２２の搬送出力は加算器１２４の搬送入力に、加算器１２４の搬送出力は加
算器１２６の搬送入力に接続される。加算器１２６の搬送出力は、ラッチ１６２
のデータ入力に接続される。ラッチ１６２の出力は、加算器１２０の搬送入力に
接続される。加算器１２０，１２２，１２４，１２６の出力Ｓは、個々の出力端
末１６４，１６６，１６８．１７０に接続される。The first input of the adders 120, 122, 124, 126 of column X _{3 is} the adder 1 of column X ₂
10, 112, 114, 116 are connected to the individual outputs. 2-input AND gate 1
19, 121, 123, and 125 are connected together and have a first input commonly connected to the Q output of latch 140. AND gates 119, 121, 123,
The output of 125 is connected to second inputs of adders 120, 122, 124, 126, respectively. The transport output of the adder 120 is connected to the transport input of the adder 122, the transport output of the adder 122 is connected to the transport input of the adder 124, and the transport output of the adder 124 is connected to the transport input of the adder 126. The carry output of the adder 126 is
Connected to the data input. The output of latch 162 is connected to the carrier input of adder 120. The outputs S of the adders 120, 122, 124, 126 are connected to individual output terminals 164, 166, 168.170.

【００５４】さらに、ANDゲート８９，１０１，１１３，１２５の第２入力は互いに接続さ
れ、また入力端末８１に共通接続される。ANDゲート９１，１０３，１１５の第
２入力は互いに接続され、さらにラッチ１５８の入力と入力端末８３とに共通接
続される。ANDゲート９３，１０５の第２入力は互いに接続され、さらにラッチ
１５４の入力と入力端末８５とに共通接続される。ANDゲート９５の第２入力は
、ラッチ１５０の入力と入力端末８７とに共通接続される。ANDゲート９９，１
１１，１２３の第２入力は、互いに接続され、ラッチ１５０の出力に共通接続さ
れる。ANDゲート１０９，１２１の第２入力は互いに接続され、さらにラッチ１
５４の出力に共通接続される。ANDゲート１１９の第２入力は、ラッチ１５８の
出力に接続される。Further, the second inputs of the AND gates 89, 101, 113, and 125 are connected to each other and commonly connected to the input terminal 81. The second inputs of the AND gates 91, 103, and 115 are connected to each other, and are further commonly connected to the input of the latch 158 and the input terminal 83. The second inputs of the AND gates 93 and 105 are connected to each other, and are further commonly connected to the input of the latch 154 and the input terminal 85. A second input of the AND gate 95 is commonly connected to the input of the latch 150 and the input terminal 87. AND gate 99,1
The second inputs of 11 and 123 are connected to each other and commonly connected to the output of the latch 150. The second inputs of the AND gates 109 and 121 are connected to each other.
Commonly connected to 54 outputs. A second input of AND gate 119 is connected to the output of latch 158.

【００５５】ラッチ１２８，１３２，１３６，１４０は、それぞれセット入力（Ｓ），リセ
ット入力（Ｒ）および出力（Ｑ）を有する。ラッチ１２８，１３２，１３６，１
４０は、信号Ｔが高であり、出力Ｑの信号が入力Ｓの信号と同じ値を持つときに
イネーブルになる。出力Ｑの信号は、信号Ｔが高論理値から低論理値へと移行す
るとラッチされる。入力Ｒの信号は、出力Ｑの信号をリセットする。ラッチ１２
８，１３２，１３６，１４０のリセット入力Ｒは、互いに、また端末７９に共通
接続される。端末７９は、リセット信号Ｒを受信するよう結合される。２入力AN
Dゲート１３０は、ラッチ１２８のセット入力に接続される出力を有する。ANDゲ
ート１３０の第１入力は加算器９０の第１入力に接続される。２入力ANDゲート
１３４は、ラッチ１３２のセット入力に接続される出力を有する。ANDゲート１
３４の第１入力は加算器１０２の第１入力に接続される。２入力ANDゲート１３
８は、ラッチ１３６のセット入力に接続される出力を有する。ANDゲート１３８
の第１入力は加算器１１４の第１入力に接続される。２入力ANDゲート１４２は
、ラッチ１４０のセット入力に接続される出力を有する。ANDゲート１４２の第
１入力は加算器１２６の第１入力に接続される。ANDゲート１３０，１３４，１
３８，１４２の第２入力は互いに接続され、また端末７８に共通接続される。端
末７８は、信号Ｔを受信するよう結合される。Each of the latches 128, 132, 136, and 140 has a set input (S), a reset input (R), and an output (Q). Latches 128, 132, 136, 1
40 is enabled when signal T is high and the signal at output Q has the same value as the signal at input S. The signal at output Q is latched when signal T transitions from a high logic value to a low logic value. The signal at input R resets the signal at output Q. Latch 12
The reset inputs R of 8, 132, 136, 140 are commonly connected to each other and to the terminal 79. Terminal 79 is coupled to receive reset signal R. 2-input AN
D-gate 130 has an output connected to the set input of latch 128. A first input of AND gate 130 is connected to a first input of adder 90. Two-input AND gate 134 has an output connected to the set input of latch 132. AND gate 1
A first input of 34 is connected to a first input of adder 102. 2-input AND gate 13
8 has an output connected to the set input of latch 136. AND gate 138
Is connected to the first input of adder 114. Two-input AND gate 142 has an output connected to the set input of latch 140. A first input of AND gate 142 is connected to a first input of adder 126. AND gate 130, 134, 1
The second inputs of 38, 142 are connected together and are commonly connected to terminal 78. Terminal 78 is coupled to receive signal T.

【００５６】動作中は、モジューロ換算器６０は、加算器５８（図３）を介して乗算器５６
から入力を受信し、換算された部分積を生成する。前述の例では、数字０００１
と０１１１とが、フォスタ−モンゴメリ換算アルゴリズムを用いて乗算される。
前述の如く、特定のビット位置の論理値により、Ｎの値が合計値に整合され加算
されるか否かが決まる。特定のビット値の論理値が論理１値を有するとき、モジ
ューロ換算器６０のアーキテクチャがＮの値を合計値に整合し、さらにそれに加
算されるようシフトする。これにより、モジューロ換算器６０のアーキテクチャ
によって、↓の値を決定して、ラッチ１２８，１３２，１３６，１４０に格納す
ることができる。言い換えると、↓の値はディジットＡ，Ｂの乗算前には決まら
ず、Ａ，Ｂの特定のディジットの乗算中に決定される。In operation, modulo converter 60 provides a multiplier 56 via adder 58 (FIG. 3).
Receives input from and generates a reduced partial product. In the example above, the number 0001
And 0111 are multiplied using the Foster-Montgomery conversion algorithm.
As described above, the logical value of a particular bit position determines whether the value of N is matched to the sum and added. When the logic value of a particular bit value has a logic one value, the architecture of the modulo converter 60 shifts the value of N to match and add to the sum value. Thus, the value of ↓ can be determined and stored in the latches 128, 132, 136, 140 according to the architecture of the modulo converter 60. In other words, the value of ↓ is not determined before the multiplication of digits A and B, but is determined during the multiplication of specific digits of A and B.

【００５７】ラッチ１２８，１３２，１３６，１４０は、信号Ｒによりリセットされて、Ｑ
出力に論理０値を有する。値０１１１が乗算器５６により生成されて、加算器５
８を介して端末８０，８２，８４，８６に転送される。ANDゲート１３０は、端
末８０から合計値の最下位データ・ビットを受信し、論理１値の信号Ｔと共にラ
ッチ１２８をセットさせる。すなわち、Ｑ出力の信号が論理１値を有する。信号
Ｔは、B₀が乗算器４６に供給されN₀の値がモジューロ換算器６０に供給される間
は、論理１値を有することに注目されたい。信号Ｔが論理１力論理０値に移行す
ると、ラッチ１２８，１３２，１３６，１４０内のデータの論理値がラッチされ
る。N₀の値が端末８１，８３，８５，８７において供給され、前述の例ではN₀は
１１０１の値を有する。N₀の最下位データ・ビットが端末８１において供給され
る。The latches 128, 132, 136 and 140 are reset by the signal R,
The output has a logical zero value. The value 0111 is generated by the multiplier 56 and
8 to the terminals 80, 82, 84, 86. AND gate 130 receives the least significant data bit of the sum from terminal 80 and causes latch 128 to be set with a logic one signal T. That is, the signal of the Q output has a logical 1 value. Note that signal T has a logical 1 value while B ₀ is provided to multiplier 46 and the value of N ₀ is provided to modulo converter 60. When the signal T transitions to a logic one force logic zero value, the logic values of the data in the latches 128, 132, 136, 140 are latched. The value of N ₀ is provided at terminals 81, 83, 85, 87, where N ₀ has a value of 1101 in the above example. N ₀ least significant data bits are provided at terminal 81.

【００５８】 ANDゲート８９，９１，９３，９５は、論理１値を有する、ラッチ１２８のＱ
出力における信号によりイネーブルになる。このため、端末８１，８３，８５，
８７において受信されるN₀の値がそれぞれ、加算器９０，９２，９４，９６の第
２入力に転送される。第１および第２入力において論理１を有する加算器９０は
、論理０値を有する、出力Ｓにおける合計出力信号を与える。さらに、加算器９
０は、出力COにおいて搬送信号を生成する。加算器９２は、端末８３の論理０値
に応答して、第１入力において論理０値を受信する。論理１値が加算器９２の第
２入力に受信され、論理１値が入力CIにおいて搬送信号に関して受信されると、
出力Ｓの合計信号は論理０値を有し、出力COの搬送信号は論理１値を有する。The AND gates 89, 91, 93 and 95 are connected to the Q of the latch 128 having a logical 1 value.
It is enabled by a signal at the output. Therefore, the terminals 81, 83, 85,
The values of N ₀ received at 87 are forwarded to the second inputs of adders 90, 92, 94, 96, respectively. Adder 90 having a logic one at the first and second inputs provides a sum output signal at output S having a logic zero value. Further, the adder 9
0 generates a carrier signal at the output CO. Adder 92 receives a logical zero value at a first input in response to the logical zero value of terminal 83. When a logical one value is received at the second input of adder 92 and a logical one value is received for the carrier signal at input CI,
The sum signal at output S has a logic zero value and the carrier signal at output CO has a logic one value.

【００５９】加算器９４は、第１入力においてANDゲート９３から論理１を、また第２入力
において端末８４から論理１を受信する。加算器９４の合計出力Ｓは論理１値を
有し、搬送出力信号は搬送出力COにおいて論理１値を有する。同様に、加算器９
６は、第１入力においてANDゲート９５から論理１を、また第２入力において端
末８６から論理１を受信する。加算器９６の合計出力Ｓは論理０値を有し、搬送
出力COの搬送信号は論理１値を有する。かくして、加算器９０〜９６は、加算器
５８を通じて乗算器５６の合計値を受信することに応答して、０１００の値を生
成した。さらに、特定のビット位置、すなわち最下位ビット位置が論理１値を有
し、値１は、フォスタ−モンゴメリ換算アルゴリズムにより、合計値に整合およ
び加算された。Adder 94 receives a logic one from AND gate 93 at a first input and a logic one from terminal 84 at a second input. The sum output S of the adder 94 has a logic one value, and the carrier output signal has a logic one value at the carrier output CO. Similarly, adder 9
6 receives a logic one from the AND gate 95 at a first input and a logic one from the terminal 86 at a second input. The sum output S of the adder 96 has a logical 0 value, and the carry signal of the carry output CO has a logical 1 value. Thus, adders 90-96 produced a value of 0100 in response to receiving the sum of multiplier 56 through adder 58. Further, a particular bit position, the least significant bit position, has a logical one value, and the value one has been matched and added to the sum by the Foster-Montgomery conversion algorithm.

【００６０】列X₁における加算器により生成されるデータは、特定のデータ・ビット位置に
おいてデータに依存する値を有する。この場合の特定のデータ・ビット位置は、
加算器９２の出力Ｓに対応する。ANDゲート１３４は加算器９２の出力Ｓにおい
て合計信号から論理０値を受信することに注目されたい。ラッチ１３２はセット
されず、ラッチ１３２のＱ出力は論理０値のままになる。ANDゲート９９，１０
１，１０３，１０５は、それぞれ、加算器１００，１０２，１０４，１０６の第
２入力において論理０値を生成する。加算器１００は、第１および第２入力の両
方において論理０値を有し、出力Ｓに論理０値を生成する。同様に、加算器１０
２は、第１および第２入力の両方において論理０値を有し、出力Ｓに論理０値を
生成する。加算器１０４は、第１入力に論理０値を、第２入力に論理１値を有し
、出力Ｓに論理１値を生成する。加算器１０６は、第１および第２入力の両方に
おいて論理０値を有し、出力Ｓに論理０値を生成する。かくして、列X₁内の加算
器は値０１００を生成する。The data generated by the adder in column X ₁ has a data-dependent value at a particular data bit position. The specific data bit position in this case is
This corresponds to the output S of the adder 92. Note that AND gate 134 receives a logical zero value from the sum signal at output S of adder 92. Latch 132 is not set and the Q output of latch 132 remains at a logic zero value. AND gate 99, 10
1, 103 and 105 generate logic 0 values at the second inputs of the adders 100, 102, 104 and 106, respectively. Adder 100 has a logic zero value at both the first and second inputs and produces a logic zero value at output S. Similarly, the adder 10
2 has a logic zero value at both the first and second inputs and produces a logic zero value at the output S. The adder 104 has a logical 0 value at a first input and a logical 1 value at a second input, and generates a logical 1 value at an output S. Adder 106 has a logical zero value at both the first and second inputs and produces a logical zero value at output S. Thus, the adder in column X ₁ produces the value 0100.

【００６１】列X₂における加算器により生成されるデータも、特定のデータ・ビット位置に
おいてデータに依存する値を有する。ANDゲート１３８は加算器１０４の出力Ｓ
において合計信号から論理１値を受信することに注目されたい。ラッチ１３６が
セットされ、ラッチ１３６のＱ出力は論理１値を有する。ANDゲート１０９，１
１１，１１３，１１５は、ラッチ１３６により生成される論理１値によりイネー
ブルになる。すなわち、加算器１００，１０２，１０４，１０６の出力のデータ
が加算器１１０，１１２，１１４，１１６の第１入力にそれぞれ転送される。加
算器１１０は、第１および第２入力の両方において論理０値を有し、出力Ｓに論
理０値を生成する。同様に、加算器１１２は、第１および第２入力の両方におい
て論理０値を有し、出力Ｓに論理０値を生成する。加算器１１４は、第１および
第２入力の両方において論理１値を有し、出力Ｓに論理０値を、また出力COの搬
送出力信号について論理１値を生成する。加算器１１６は、第１および第２入力
の両方において論理０値を、搬送入力に論理１を有し、出力Ｓに論理１値を生成
する。かくして、列X₂内の加算器は値１０００を生成する。The data generated by the adder in column X ₂ also has a data-dependent value at a particular data bit position. The AND gate 138 outputs the output S of the adder 104.
Note that a logic one value is received from the sum signal at. Latch 136 is set and the Q output of latch 136 has a logic one value. AND gate 109, 1
11, 113 and 115 are enabled by a logic 1 value generated by latch 136. That is, the output data of the adders 100, 102, 104, and 106 are transferred to the first inputs of the adders 110, 112, 114, and 116, respectively. Adder 110 has a logical zero value at both the first and second inputs and produces a logical zero value at output S. Similarly, adder 112 has a logical zero value at both the first and second inputs and produces a logical zero value at output S. Adder 114 has a logic one value at both the first and second inputs, produces a logic zero value at output S, and a logic one value for the carrier output signal at output CO. Adder 116 has a logical 0 value at both the first and second inputs, a logical 1 at the carry input, and produces a logical 1 value at output S. Thus, the adder in column X ₂ generates the value 1000.

【００６２】列X₃における加算器により生成されるデータも、特定のデータ・ビット位置に
おいてデータに依存する値を有する。この場合の特定のデータ・ビットは、加算
器１１６の出力における論理値であることに注目されたい。ANDゲート１４２は
加算器１１６の出力Ｓにおいて合計信号から論理１値を受信する。ラッチ１４０
がセットされ、ラッチ１４０のＱ出力は論理１値を有する。ANDゲート１０９，
１１１，１１３，１１５は、ラッチ１４０により生成される論理１値によりイネ
ーブルになる。すなわち、加算器１１０，１１２，１１４，１１６の出力のデー
タが加算器１２０，１２２，１２４，１２６の第１入力にそれぞれ転送される。
加算器１２０は、第１および第２入力の両方において論理０値を有し、出力Ｓに
論理０値を生成する。同様に、加算器１２２は、第１および第２入力の両方にお
いて論理０値を有し、出力Ｓに論理０値を生成する。加算器１２４も、第１およ
び第２入力の両方において論理０値を有し、出力Ｓに論理０値を生成する。加算
器１２６は、第１および第２入力の両方において論理１値を有し、搬送出力にお
いて搬送出力として論理１値を生成する。かくして、列X₃内の加算器は値０００
０を生成する。The data generated by the adder in column X ₃ also has a data-dependent value at a particular data bit position. Note that the particular data bit in this case is a logical value at the output of adder 116. AND gate 142 receives a logical 1 value from the sum signal at output S of adder 116. Latch 140
Is set, and the Q output of latch 140 has a logical 1 value. AND gate 109,
111, 113, and 115 are enabled by the logic 1 value generated by the latch 140. That is, the output data of the adders 110, 112, 114, and 116 are transferred to the first inputs of the adders 120, 122, 124, and 126, respectively.
Adder 120 has a logical zero value at both the first and second inputs and produces a logical zero value at output S. Similarly, adder 122 has a logic zero value at both the first and second inputs and produces a logic zero value at output S. Adder 124 also has a logic zero value at both the first and second inputs and produces a logic zero value at output S. Adder 126 has a logic one value at both the first and second inputs and produces a logic one value at the carry output as the carry output. Thus, the adder in the column X ₃ is the value 000
Generate 0.

【００６３】列X₀，X₁，X₂，Ｘ₃内の加算器がＡのディジット，ディジットB₀およびN₀の値
に関する部分積値を換算した後は、↓の値が後の乗算で用いるために決定される
ことに注目されたい。詳しくは、各群の第１部分積が０の値を持つようにさせる
換算プロセスの間に、適切なラッチ１２８，１３２，１３６，１４０がセットさ
れ、↓に関する値を有する。第１部分積の０への換算後、信号Ｔは論理１から論
理０値に移行し、↓の値をラッチ１２８，１３２，１３６，１４０に格納する。
格納された↓の値とN₁〜N₆₃の対応する値とが、Ａのディジットが乗算器５６に
おいてディジットB₁〜B₆₃により乗算されるときに用いられる。After the adders in columns X ₀ , X ₁ , X ₂ , and X ₃ have converted the partial product values for the digits of A, digits B ₀ and N ₀ , the value of ↓ is Note that it is determined to be used. Specifically, during the conversion process that causes the first partial product of each group to have a value of zero, the appropriate latches 128, 132, 136, 140 are set and have a value for ↓. After the conversion of the first partial product to 0, the signal T transitions from a logical 1 to a logical 0 value and stores the value of ↓ in the latches 128, 132, 136, 140.
The stored value of ↓ and the corresponding value of N _{1 to} N ₆₃ are used when the digit of A is multiplied by the digits B _{1 to} B ₆₃ in the multiplier 56.

【００６４】例として、デジットA₀，B₀の積に関する１６の最下位データ・ビットを、値N₀ を用いてモジューロ換算器６０により論理０の１６ビットに換算する。モジュー
ロ換算器６０は、第１部分積を０値に換算する↓値を決定し格納する。乗算器５
６が値A₀，B₁に関わる次の部分積を生成する際に用いられる最上位データ・ビッ
トを格納する。モジューロ換算器６０は、↓の被格納値とN₁の値とを用いて、第
２部分積を生成する。A₀とB₂〜B₆₃に関する他の積が乗算器５６により生成され
、↓の被格納値およびN₂〜N₆₃の値とを用いてモジューロ換算器６０内で換算さ
れる。As an example, the 16 least significant data bits for the product of digits A ₀ and B ₀ are converted to 16 logic zero bits by modulo converter 60 using the value N ₀ . The modulo converter 60 determines and stores a ↓ value for converting the first partial product to a zero value. Multiplier 5
6 stores the most significant data bit used in generating the next partial product for the values A ₀ and B ₁ . Modulo conversion unit 60 uses the value of the stored value and N ₁ of ↓, generating a second partial product. Another product for A ₀ and B ₂ -B ₆₃ is generated by multiplier 56 and converted in modulo converter 60 using the stored value of ↓ and the values of N ₂ -N ₆₃ .

【００６５】モジューロ換算器６０は、第２群の第１部分積、すなわちA₁とB₀の積を０値に
換算させる↓の新しい値を決定し格納する。↓のこの新しい値は、A₁にB₀〜B₆₃
を乗算した部分積の群を生成する際に利用される。これにより、ある群内のすべ
ての部分積の生成に続いて、↓の新しい値が次の群のためにモジューロ換算器６
０により決定される。各群に関して生成される第１部分積は、↓の新しい値によ
り０値を有するよう換算されることに注目されたい。The modulo converter 60 determines and stores a first partial product of the second group, that is, a new value of ↓ that causes the product of A ₁ and B ₀ to be converted to a zero value. This new value of ↓ is B _{0 to} B _{63 in} A ₁
Is used to generate a group of partial products multiplied by. Thus, following the generation of all partial products in one group, the new value of ↓
Determined by 0. Note that the first partial product generated for each group is scaled to have a zero value by the new value of ↓.

【００６６】図５は、図３のコプロセッサで用いられる乗算器５６と組み合わせたモジュー
ロ換算器６０の部分のブロック図である。乗算器構造または合併型フォスタ−モ
ンゴメリ（F-M）乗算器１７１は、簡単に説明するために４ｘ４アレイの加算器
として、簡略化された形で説明される。合併型F-M乗算器１７１は、同数の行と
列を有する加算器アレイとして説明されるが、これは本発明を制約するものでは
ない。図３の乗算器５６，加算器５８およびモジューロ換算器６０の主な機能が
共に、合併型F-M乗算器１７１により実行されることに注目されたい。FIG. 5 is a block diagram of a portion of a modulo converter 60 combined with a multiplier 56 used in the coprocessor of FIG. The multiplier structure or merged Foster-Montgomery (FM) multiplier 171 is described in simplified form as a 4x4 array of adders for simplicity. The merged FM multiplier 171 is described as an adder array having the same number of rows and columns, but this is not a limitation of the present invention. Note that the main functions of multiplier 56, adder 58, and modulo converter 60 of FIG. 3 are all performed by merged FM multiplier 171.

【００６７】図５に示される合併型F-M乗算器１７１の形態は、図４に示されるモジューロ
換算器６０の形態と同様である。合併型F-M乗算器１７１もモジューロ換算器６
０も、列X₀内に加算器９０，９２，９４，９６を、列X₁内に加算器１００，１０
２，１０４，１０６を、列X₂内に加算器１１０，１１２，１１４，１１６を、列
X₃内に加算器１２０，１２２，１２４，１２６を有する。また、ラッチ１５２，
１５６，１６０，１６２は、次の部分積を生成する際に用いられる搬送出力信号
を格納する。列X₀〜X₃内の各加算器の入力に接続される出力を有するモジューロ
換算器６０のANDゲートは、合併型F-M乗算器１７１においてはマルチプレクサに
よって置き換えられている。マルチプレクサは加算器の第１入力に接続される出
力を有して図示されるが、代替に、マルチプレクサの出力を加算器の第２入力に
接続することもできる。The form of the merged FM multiplier 171 shown in FIG. 5 is the same as the form of the modulo converter 60 shown in FIG. Combined FM multiplier 171 is also modulo converter 6
0. The adders 90, 92, 94, 96 in column X _0, the adder in the column X ₁ 100, 10
₂ , 104, 106, adders 110, 112, 114, 116 in column X2,
An adder 120, 122, 124, 126 in the X _3. Latch 152,
156, 160 and 162 store the carrier output signals used when generating the next partial product. AND gates modulo conversion unit 60 having an output connected to an input of each adder in the column X ₀ to X ₃ is replaced by the multiplexer in the merged type FM multiplier 171. Although the multiplexer is shown with an output connected to a first input of the adder, alternatively, the output of the multiplexer can be connected to a second input of the adder.

【００６８】合併型F-M乗算器１７１のマルチプレクサ（mux）は、それぞれ４つの入力と、
１つの出力と、２つのセレクタ入力とを有する。詳しくは、mux１７２，１７４
，１７６，１７８は、それぞれ加算器９０〜９６に接続される出力を有する。mu
x１８２，１８４，１８６，１８８は、それぞれ加算器１００〜１０６に接続さ
れる出力を有する。mux１９２，１９４，１９６，１９８は、それぞれ加算器１
１０〜１１６に接続される出力を有する。mux２０２，２０４，２０６，２０８
は、それぞれ加算器１２０〜１２６に接続される出力を有する。さらに、mux１
７２〜１７８の第１セレクタ入力は共に接続され、mux１７２〜１７８の第２セ
レクタ入力は共に接続されて、各muxの４つの入力における４つの信号のうちの
１つを加算器９０〜９６の対応する第１入力に転送する。同様に、mux１８２〜
１８８の第１セレクタ入力は共に接続され、mux１８２〜１８８の第２セレクタ
入力は共に接続されて、各muxの４つの入力における４つの信号のうちの１つを
加算器１００〜１０６の対応する第１入力に転送する。mux１９２〜１９８の第
１セレクタ入力は共に接続され、mux１９２〜１９８の第２セレクタ入力は共に
接続されて、各muxの４つの入力における４つの信号のうちの１つを加算器１１
０〜１１６の対応する第１入力に転送する。mux２０２〜２０８の第１セレクタ
入力は共に接続され、mux２０２〜２０８の第２セレクタ入力は共に接続されて
、各muxの４つの入力における４つの信号のうちの１つを加算器１２０〜１２６
の対応する第１入力に転送する。The multiplexer (mux) of the merged FM multiplier 171 has four inputs,
It has one output and two selector inputs. For details, mux 172, 174
, 176, 178 have outputs connected to adders 90-96, respectively. mu
x182, 184, 186, 188 have outputs connected to adders 100-106, respectively. mux 192, 194, 196, and 198 are adders 1 respectively.
It has an output connected to 10-116. mux 202, 204, 206, 208
Have outputs connected to adders 120-126, respectively. Furthermore, mux1
The first selector inputs 72-178 are connected together, and the second selector inputs mux 172-178 are connected together to provide one of the four signals at the four inputs of each mux to the corresponding one of adders 90-96. To the first input. Similarly, mux182-
188 are connected together, and the second selector inputs of muxes 182 to 188 are connected together to output one of the four signals at the four inputs of each mux to the corresponding one of the adders 100 to 106. Transfer to one input. The first selector inputs of muxes 192-198 are connected together, and the second selector inputs of muxes 192-198 are connected together to add one of the four signals at the four inputs of each mux to adder 11
Forward to corresponding first inputs 0-116. The first selector inputs of muxes 202-208 are connected together and the second selector inputs of muxes 202-208 are connected together to add one of the four signals at the four inputs of each mux to adders 120-126.
To the corresponding first input.

【００６９】 mux１７２〜１７８，１８２〜１８８，１９２〜１９８および２０２〜２０８
の４つの入力のうち第１入力が、論理０値を受信するよう結合される。mux１７
２〜１７８の第２入力はディジットＢの値を受信し、mux１７２〜１７８の第３
入力がＮの値を受信する。例として、ディジットA₀，B₀，N₀が合併型F-M乗算器
１７１に供給される。ディジットB₀の最下位データ・ビット、すなわち信号Ｂ（
ビット０）がmux１７２の第２入力に供給される。同様に、ディジットN₀の最下
位データ・ビット、すなわち信号Ｎ（ビット０）がmux１７２の第３入力に供給
される。mux１７２の第４入力は、ビットN₀およびB₀の最下位データ・ビットの
論理合計値、すなわち信号N+B（ビット０）を受信する。Mux 172 to 178, 182 to 188, 192 to 198 and 202 to 208
Of the four inputs are coupled to receive a logical zero value. mux17
A second input of 2 to 178 receives the value of digit B and a third input of mux 172 to 178.
The input receives the value of N. As an example, the digits A ₀ , B ₀ , N ₀ are supplied to a merged FM multiplier 171. Least significant data bits of the digit B _0, i.e. signal B (
Bit 0) is provided to the second input of mux 172. Similarly, the least significant data bit of digit N ₀ , signal N (bit 0), is provided to the third input of mux 172. fourth input of mux172 the bit N ₀ and logic sum of the least significant data bits B _0, that is, receives the signal N + B (bit 0).

【００７０】ディジットB₀の次に下位のデータ・ビット、すなわち信号Ｂ（ビット１）がmu
x１７４の第２入力に供給される。同様に、ディジットN₀の次に下位のデータ・
ビット、すなわち信号Ｎ（ビット１）がmux１７４の第３入力に供給される。mux
１７４の第４入力は、ビットN₀およびB₀の次に下位のデータ・ビットの論理合計
値、すなわち信号N+B（ビット）１を受信する。ディジットB₀の次のデータ・ビ
ット、すなわち信号Ｂ（ビット２）がmux１７６の第２入力に供給される。同様
に、ディジットN₀の次のデータ・ビット、すなわち信号Ｎ（ビット２）がmux１
７６の第３入力に供給される。mux１７６の第４入力は、ビットN₀およびB₀の次
のデータ・ビットの論理合計値、すなわち信号N+B（ビット２）を受信する。同
様に、合併型F-M乗算器１７１が４ビットｘ４ビットの乗算を行うこの例では、
ディジットB₀の最上位データ・ビット、すなわち信号Ｂ（ビット３）がmux１７
８の第２入力に供給される。同様に、ディジットB₀の最上位データすなわち信号
Ｂ（ビット３）がmux１７８の第３入力に供給される。mux１７８の第４入力は、
muxの第２および第３入力に供給される値の論理合計値、すなわちディジットN₀
，B₀の最上位データ・ビットの和を受信する。各muxの第４入力は、そのmuxの第
２および第３入力に供給される値の論理合計値を受信することに注目されたい。The next lower data bit after digit B ₀ , that is, signal B (bit 1)
x174 is provided to the second input. Similarly, the lower the data of the next digit N ₀
A bit, signal N (bit 1), is provided to a third input of mux 174. mux
Fourth input of 174 bits N ₀ and the next lower data bits of the logical sum of B _0, that is, receives the signal N + B (bits) 1. Next data bit of the digit B _0, that is, the signal B (bit 2) is supplied to the second input of Mux176. Similarly, the next data bit of the digit N _0, i.e. the signal N (bit 2) is mux1
76 is provided to a third input. The fourth input of mux 176 receives the logical sum of the data bits next to bits N ₀ and B ₀ , signal N + B (bit 2). Similarly, in this example where the merged FM multiplier 171 performs a 4-bit by 4-bit multiplication,
Most significant data bit of the digit B _0, that is, the signal B (bit 3) is mux17
8 to a second input. Similarly, the top-level data or signals B of the digit B ₀ (bit 3) is supplied to the third input of Mux178. The fourth input of mux 178 is
The logical sum of the values supplied to the second and third inputs of mux, ie, digit N ₀
, To receive the sum of the most significant data bits of B _0. Note that the fourth input of each mux receives the logical sum of the values provided to the second and third inputs of that mux.

【００７１】 mux１７２〜１７８の第１セレクタ入力信号が、ラッチ２１２から受信される
。ラッチ２１２は、信号Ｔが論理１から論理０値に移行すると、論理回路２１０
からデータ信号をラッチする。論理回路２１０が生成するデータ信号は、信号Ａ
（ビット０）とＢ（ビット０）の積をＰ（０）で排他的論理和演算したものであ
る。ただしＰ（０）は、以前の部分積値の最下位ビットである。mux１７２〜１
７８は、第２セレクタ入力において信号Ａ（ビット０）を受信する。The first selector input signals of mux 172-178 are received from latch 212. Latch 212 activates logic circuit 210 when signal T transitions from a logic one to a logic zero value.
Latch the data signal. The data signal generated by the logic circuit 210 is a signal A
Exclusive OR operation of the product of (bit 0) and B (bit 0) with P (0). Here, P (0) is the least significant bit of the previous partial product value. mux 172-1
78 receives the signal A (bit 0) at the second selector input.

【００７２】 mux１８２〜１８８の第１セレクタ入力信号が、ラッチ２１６から受信される
。ラッチ２１６は、信号Ｔが論理１から論理０値に移行すると、論理回路２１４
からデータ信号をラッチする。論理回路２１４が生成するデータ信号は、信号Ａ
（ビット１）とＢ（ビット１）の積を加算器９２の合計出力信号で排他的論理和
演算したものである。mux１８２〜１８８は、第２セレクタ入力において信号Ａ
（ビット１）を受信する。The first selector input signals of mux 182-188 are received from latch 216. When the signal T transitions from logic 1 to logic 0, the latch 216 activates the logic circuit 214.
Latch the data signal. The data signal generated by the logic circuit 214 is a signal A
The exclusive OR operation is performed on the product of (bit 1) and B (bit 1) with the total output signal of the adder 92. mux 182-188 provide signal A at the second selector input.
(Bit 1) is received.

【００７３】 mux１９２〜１９８の第１セレクタ入力信号が、ラッチ２２０から受信される
。ラッチ２２０は、信号Ｔが論理１から論理０値に移行すると、論理回路２１８
からデータ信号をラッチする。論理回路２１８が生成するデータ信号は、信号Ａ
（ビット２）とＢ（ビット２）の積を加算器１０４の合計出力信号で排他的論理
和演算したものである。mux１９２〜１９８は、第２セレクタ入力において信号
Ａ（ビット２）を受信する。The first selector input signals of mux 192-198 are received from latch 220. Latch 220 activates logic circuit 218 when signal T transitions from a logic one to a logic zero value.
Latch the data signal. The data signal generated by the logic circuit 218 is a signal A
It is obtained by performing an exclusive OR operation on the product of (bit 2) and B (bit 2) with the total output signal of the adder 104. mux 192-198 receive signal A (bit 2) at the second selector input.

【００７４】 mux２０２〜２０８の第１セレクタ入力信号が、ラッチ２２４から受信される
。ラッチ２２４は、信号Ｔが論理１から論理０値に移行すると、論理回路２２２
からデータ信号をラッチする。論理回路２２２が生成するデータ信号は、信号Ａ
（ビット３）とＢ（ビット３）の積を加算器１１６の合計出力信号で排他的論理
和演算したものである。mux２０２〜２０８は、第２セレクタ入力において信号
Ａ（ビット３）を受信する。The first selector input signals of mux 202-208 are received from latch 224. When the signal T transitions from a logic 1 to a logic 0 value, the latch 224
Latch the data signal. The data signal generated by the logic circuit 222 is a signal A
It is obtained by performing an exclusive OR operation on the product of (bit 3) and B (bit 3) with the total output signal of the adder 116. Muxes 202-208 receive signal A (bit 3) at the second selector input.

【００７５】第１および第２セレクタ入力が００の個別論理値を受信すると、mux１７２〜
１７８の第１入力の信号が対応するmuxの出力に転送される。第１および第２セ
レクタ入力が０１の個別論理値を受信すると、mux１７２〜１７８の第２入力の
信号が対応するmuxの出力に転送される。第１および第２セレクタ入力が１０の
個別論理値を受信すると、mux１７２〜１７８の第３入力の信号が対応するmuxの
出力に転送される。第１および第２セレクタ入力が１１の個別論理値を受信する
と、mux１７２〜１７８の第４入力の信号が対応するmuxの出力に転送される。When the first and second selector inputs receive the individual logical value of 00, mux 172-
The signal of the first input at 178 is transferred to the output of the corresponding mux. When the first and second selector inputs receive the individual logical value of 01, the signals of the second inputs of muxes 172 to 178 are transferred to the corresponding mux outputs. When the first and second selector inputs receive ten individual logical values, the signals of the third inputs of muxes 172 to 178 are transferred to the corresponding mux outputs. When the first and second selector inputs receive 11 individual logical values, the signals of the fourth inputs of muxes 172 to 178 are transferred to the corresponding mux outputs.

【００７６】合併型F-M乗算器１７１のアーキテクチャにより、↓の値を決定し、ラッチ２
１２，２１６，２２０，２２４内に格納することができることに注目されたい。
言い換えると、↓値はＡとＢの乗算前には計算されず、ディジットA₀，B₀の乗算
中に合併型F-M乗算器１７１のアーキテクチャにより↓の第１値が決定されてラ
ッチ２１２，２１６，２２０，２２４に格納されることになる。すなわち、A₀，
B₀の積を計算するために用いられるのと同じ乗算サイクルが↓の値を決定するた
めに用いられる。↓の第１値は、他のディジットB₁〜B₆₃とA₀の乗算中に用いら
れる。↓の第２値は、ラッチ２１２，２１６，２２０，２２４内にラッチされて
、ディジットB₁〜B₆₃とA₁との乗算中に用いられる。このように、↓の新しい値
はＡとB₀の各ディジットの乗算中に決定される。According to the architecture of the merged FM multiplier 171, the value of ↓ is determined, and
Note that it can be stored within 12,216,220,224.
In other words, the value of ↓ is not calculated before the multiplication of A and B, the first value of ↓ is determined by the architecture of the merged FM multiplier 171 during the multiplication of the digits A ₀ , B ₀ and the latches 212, 216 , 220, and 224. That is, A ₀ ,
The same multiplication cycle used to calculate the product of B ₀ is used to determine the value of ↓. The first value of ↓ is used during the multiplication of A ₀ with other digits B ₁ -B ₆₃ . The second value of ↓ is latched in the latch 212,216,220,224, used during the multiplication of the digit B ₁ .about.B ₆₃ and A _1. Thus, the new value of ↓ is determined in the multiplication of each digit of A and B _0.

【００７７】図６は、フォスタ−モンゴメリ換算アルゴリズムで用いられる（R² mod N）の
値を決定する方法の流れ図である。フォスタ−モンゴメリ換算アルゴリズムのオ
ペランドＡ，Ｂは、（A*R mod Ｎ）および（B*R mod N）の形を持ち、乗算され
て積（A*B*R² mod N）が生成される。積（A*B*R² mod N）は、値Ｒにより合併型
F-M乗算器１７１（図５）内で換算され、合併型F-M乗算器１７１により生成され
る出力が（A*B*R mod N）の値を有するようになる。値（R² mod N）の生成は、
合併型F-M乗算器１７１の入力値として、データ・ホスト・バス（図３）を介し
て受信されるオペランドＡの初期値を、（A*R mod Ｎ）の正しい形に変換し、オ
ペランドＢの初期値を（B*R mod N）の正しい形に変換するために必要とされる
。これにより、オペランドＡ，Ｂの初期値がモンゴメリ形式に変換される。たと
えば、合併型F-M乗算器１７１においてオペランドＡに値（R² mod N）を乗算す
ると、換算された出力値を有し、（A*R mod Ｎ）となる。同様に、合併型F-M乗
算器１７１においてオペランドＢに値（R² mod N）を乗算すると、これも換算さ
れた出力値を有し、（B*R mod Ｎ）となる。FIG. 6 is a flowchart of a method for determining the value of (R ² mod N) used in the Foster-Montgomery conversion algorithm. Operands A and B of the Foster-Montgomery conversion algorithm have the form (A * R mod N) and (B * R mod N), and are multiplied to generate a product (A * B * R ² mod N). . The product (A * B * R ² mod N) is merged with the value R
The output converted by the FM multiplier 171 (FIG. 5) and generated by the merged FM multiplier 171 has a value of (A * B * R mod N). The generation of the value (R ² mod N) is
The input value of the merged FM multiplier 171 is converted from the initial value of the operand A received via the data host bus (FIG. 3) to the correct form of (A * R mod N), and Required to convert the initial values to the correct form of (B * R mod N). As a result, the initial values of the operands A and B are converted to the Montgomery format. For example, when the operand A is multiplied by the value (R ² mod N) in the merged FM multiplier 171, it has a converted output value and becomes (A * R mod N). Similarly, when the operand B is multiplied by the value (R ² mod N) in the merged FM multiplier 171, this also has a converted output value and becomes (B * R mod N).

【００７８】図６は、値（R² mod N）を生成する方法を示す流れ図２３０である。値（R² m
od N）は、（R mod N）とＲの成分とを有する。ただし、Ｒは２¹⁶＊Ｉまたは２
の整数乗である。Ｒは、Ｎを表すディジット数よりも１ディジット多い大きさを
有するものとして選択される。ブロック２３２は、値（R mod N）に関する初期
値Ｐを有する変数の生成を示す。Ｐの値は、Ｒの値をＮの値から減ずることによ
り計算される。初期値Ｐの生成に続き、ブロック２３６では、Ｐの値がＮの値と
比較される。Ｐ値がＮ値より大きい場合は、ブロック２４０で乗数Ａ値が計算さ
れる。ただし乗数Ａ値はＮ値がＰ値により乗算され、依然としてＰ値よりも小さ
い２の最大べき数である。ブロック２４２で、乗数Ａの値がＮ値により乗算され
、その積（A*N）がＰ値から減じられて、新しいＰ値が得られる。ブロック２３
６で、Ｎ値がＰ値よりも大きい場合は、Ｐがシフトされた回数がブロック２４４
に示される。Ｐ値がＲの最下位ビット内で０の数だけシフトされると、計算は終
了して、ブロック２３８のＰ値は所望の値（R² mod N）となる。あるいは、ブロ
ック２４４で、Ｐ値についてさらにシフトが必要な場合には、ブロック２３４に
示すようにＰを整数回だけシフトさせる。この整数値は、システムのディジット
寸法として選択され、値「１」を有するＰの最上位ビットを適切な回数だけ左に
シフトさせると、値Ｒが得られる。FIG. 6 is a flowchart 230 illustrating a method of generating a value (R ² mod N). Value (R ² m
od N) has (R mod N) and an R component. Where R is 2 ¹⁶ * I or 2
To the power of an integer. R is selected as having a size one digit greater than the number of digits representing N. Block 232 illustrates the generation of a variable having an initial value P for the value (R mod N). The value of P is calculated by subtracting the value of R from the value of N. Following the generation of the initial value P, at block 236, the value of P is compared to the value of N. If the P value is greater than the N value, a multiplier A value is calculated at block 240. However, the multiplier A value is the maximum power of two, the N value being multiplied by the P value and still smaller than the P value. At block 242, the value of multiplier A is multiplied by the N value, and the product (A * N) is subtracted from the P value to obtain a new P value. Block 23
At 6, if the N value is greater than the P value, the number of times P has been shifted is
Is shown in When the P value has been shifted by the number of zeros in the least significant bits of R, the calculation is finished and the P value of block 238 is the desired value (R ² mod N). Alternatively, if further shifting of the P value is needed at block 244, P is shifted an integer number of times, as shown at block 234. This integer value is selected as the digit size of the system, and shifting the most significant bit of P with the value "1" to the left an appropriate number of times gives the value R.

【００７９】ブロック２４０に示されるＡの計算は、２つの異なる方法で実行することがで
きる。第１の方法は、Ａの値がＮ値をＰ値で乗算してなおかつＰの値よりも小さ
い値を有するような２の最大べき数となるようにＡを生成することである。第２
の方法は、Ｐの最上位ディジットをＮの最上位ディジットで除算した結果の整数
よりも１小さくなるようにＡを計算する。流れ図２３０においては、Ａ値を求め
る第１方法を行うと、Ａ値を計算する第２方法よりも、ブロック２３６，２４０
，２４２のループを数回多く実行することになる。しかし、第２方法に必要とさ
れる回路構成は、第１方法に必要とされる回路構成よりも複雑である。The calculation of A shown in block 240 can be performed in two different ways. The first method is to generate A such that the value of A is the largest power of 2 such that the value of N is multiplied by the value of P and still has a value less than the value of P. Second
Calculates A so that it is one less than the integer resulting from dividing the most significant digit of P by the most significant digit of N. In flow chart 230, performing the first method for determining the A value is greater than the second method for calculating the A value, as shown in blocks 236 and 240.
, 242 are executed several times. However, the circuit configuration required for the second method is more complicated than the circuit configuration required for the first method.

【００８０】図７は、図６に説明される値（R² mod N）の生成を示すブロック図である。ま
ず、mux２４０がＲの値をmux２４０の第１入力から減算器２４４の入力に転送す
る。また、Ａ値を最初は０として、乗算器２４２が減算器２４４の他方の入力に
Ａ値を生成するようにする。減算器２４４は、mux２４６の第１入力に値（R-N）
またはＰを与える（図６のブロック２３２参照）。mux２４６は、Ｐ値を、デー
タを１６桁シフトさせる、すなわちデータを２¹⁶倍するシフト回路２４８に転送
する（図６のブロック２３４参照）。比較器２５０が、シフトされたデータがＮ
値よりも大きな値を有するか否かを確認する（図６のブロック２３６参照）。例
として、比較器２５０は、Ｐ_SHIFTED値とＮ値の差を与える減算器である。比較
器２５０は、Ｐ_SHIFTED値とＮ値のビット幅の比較を行う排他的論理和ゲートを
備えることもできる。FIG. 7 is a block diagram showing the generation of the value (R ² mod N) described in FIG. First, mux 240 transfers the value of R from the first input of mux 240 to the input of subtractor 244. Further, the A value is initially set to 0, so that the multiplier 242 generates the A value at the other input of the subtractor 244. The subtractor 244 inputs a value (RN) to the first input of the mux 246.
Or P (see block 232 of FIG. 6). mux246 is the P value, is the data the 16-digit shift, i.e. to transfer data to 2 ¹⁶ multiplying shift circuit 248 (see block 234 of FIG. 6). Comparator 250 determines that the shifted data is N
Check if it has a value greater than the value (see block 236 in FIG. 6). As an example, the comparator 250 is a subtractor that provides a difference between the P _SHIFTED value and the N value. The comparator 250 may include an exclusive OR gate for comparing the bit width of the P _SHIFTED value with the bit width of the N value.

【００８１】Ｐのシフトされた値、すなわちＰ_SHIFTEDがＮ値よりも大きな値を持たない場
合は、Ｐ_SHIFTED値をmux２４６の第２入力に転送し、新しいＰ_SHIFTED値を生成
する。新しいＰ_SHIFTED値は、シフト回路２４８によりさらに１６桁シフトされ
たデータを有する。シフト回路２４８内でデータをシフトするたびに、比較器２
５０内で新たなシフト値がＮ値と比較される。新たなＰ_SHIFTED値がＮ値よりも
大きな値を有するときは、値（R² mod N）が生成される。mux２４０は、新たな
Ｐ_SHIFTED値を減算器２４４の第１入力に転送する。乗算器２４２が生成したＡ
とＮの積が、減算器２４４の第２入力に転送される。ただし、ＡはＮの値で乗算
された結果が新しいＰ_SHIFTED値よりも小さい値を有する積になる２の整数の最
大べき数である。（Ｐ_SHIFTED−A*N）の差値が所望の値（R² mod N）に関して減
算器２４４により生成される。If the shifted value of P, P _SHIFTED , does not have a value greater than the N value, then transfer the P _SHIFTED value to the second input of _{mux 246} to generate a new P _SHIFTED value. The new P _SHIFTED value has data shifted further 16 digits by shift circuit 248. Each time data is shifted in the shift circuit 248, the comparator 2
Within 50, the new shift value is compared to the N value. If the new P _SHIFTED value has a value greater than the N value, a value (R ² mod N) is generated. mux 240 forwards the new P _SHIFTED value to the first input of subtractor 244. A generated by the multiplier 242
And N are forwarded to a second input of subtractor 244. Where A is the largest power of two integers resulting in the product multiplied by the value of N resulting in a product having a value less than the new P _SHIFTED value. A difference value of (P _SHIFTED -A * N) is generated by subtractor 244 for the desired value (R ² mod N).

【００８２】値（R² mod N）を決定する代替の方法には、₍₂₎２ⁿなどの値を有するＲの値を
選択する段階が含まれる。ただし、「ｎ」は整数である。言い換えるとＲは２²
，２⁴，２⁸，．．．，２²⁵⁶などの値を有するよう選択される。バイナリ表現に
おいては、Ｒの値は最上位データ・ビットに関しては論理１を有し、それに０の
列が続く。Ｎのバイナリ値の最上位データ・ビット、すなわちＮの最も左側の論
理１ビット位置が、Ｒの値の論理１に隣接する論理０の値に整合される。Ｎの整
合された値、Ｎ_ALIGNEDがＲから減じられてＲ−Ｎ_ALIGNEDの差値が求められる。
Ｎ値の最も左側の論理１を差値内で論理１値を有する最上位データ・ビットに隣
接する論理０値に整合する段階が、値（R mod N）が得られるまで繰り返される
。言い換えると、Ｎ値より小さい値を有する差値は、値（R mod N）に等しい。An alternative method of determining the value (R ² mod N) includes: ₍₂₎ selecting a value of R having a value such as 2 ⁿ . Here, “n” is an integer. In other words, R is 2 ²
, ²⁴ , 28 ^,. . . , 2 ²⁵⁶ and so on. In the binary representation, the value of R has a logical one for the most significant data bit, followed by a sequence of zeros. The most significant data bit of the binary value of N, the leftmost logical one bit position of N, is matched to a logical zero value adjacent to the logical one of the value of R. The aligned value of N, N _ALIGNED, is subtracted from R to _determine the difference value of R-N _ALIGNED .
The process of matching the left most logic one of the N values to the logic zero value adjacent to the most significant data bit having a logic one value in the difference value is repeated until a value (R mod N) is obtained. In other words, the difference value having a value smaller than the N value is equal to the value (R mod N).

【００８３】値（R mod N）を左に１ビット位置シフトさせて、必要に応じて、Ｎ値を値（R
mod N）から減ずることにより、値（2R mod N）が得られる。値（2R mod N）は
、乗算器５６（図３参照）のオペランドＡ，Ｂの両方に関して用いられる。新た
に求められる値（2²R mod N）を乗算器５６のオペランドＡ，Ｂの両方に関して
用いて、コプロセッサ４４の出力に値（2⁴R mod N）を生成する。コプロセッサ
４４から新たに生成される値は、新しい値が₍₂₎2ⁿR mod Ｎ）を有するまで新し
い値の生成において、オペランドＡ，Ｂの両方に関する値として繰り返し利用さ
れる。ただし₍₂₎2ⁿは、Ｒの値に等しい。これにより、値（R² mod N）が求めら
れる。The value (R mod N) is shifted to the left by one bit position, and the N value is changed to the value (R
mod N) gives the value (2R mod N). The value (2R mod N) is used for both operands A and B of multiplier 56 (see FIG. 3). The newly determined value (2 ² R mod N) is used for both operands A and B of multiplier 56 to generate a value (2 ⁴ R mod N) at the output of coprocessor 44. The value newly generated from the coprocessor 44 is repeatedly used as a value for both the operands A and B in generating a new value until the new value has ₍₂₎ 2 ⁿ R mod N). However, ₍₂₎ 2 ⁿ is equal to the value of R. Thereby, a value (R ² mod N) is obtained.

【００８４】以上、本発明は集積回路内に構築するための高性能で低価格および低電力を実
現する暗号化乗算システムを提供することが理解頂けよう。フォスタ−モンゴメ
リ・ハードウェア・アクセラレータは、フォスタ−モンゴメリ換算アルゴリズム
を計算し、従来技術によるシステムよりも少ないクロック・サイクル内で大きな
オペランドの乗算を実行することにより高性能を実現する。本方法および回路構
成は、ビット数を増大させたオペランドに適用可能である。Thus, it can be seen that the present invention provides a high performance, low cost and low power cryptographic multiplication system for construction in an integrated circuit. The Foster-Montgomery hardware accelerator achieves high performance by computing the Foster-Montgomery conversion algorithm and performing large operand multiplications in fewer clock cycles than prior art systems. The method and circuitry are applicable to operands with an increased number of bits.

[Brief description of the drawings]

【図１】フォスタ−モンゴメリ・ハードウェア・アクセラレータ（FMHA：
Foster-Montgomery Hardware Accelerator）ブロックを備えるスマートカードの
ブロック図である。[Figure 1] Foster-Montgomery Hardware Accelerator (FMHA:
1 is a block diagram of a smart card including a Foster-Montgomery Hardware Accelerator) block.

【図２】 FMHAブロックを備える集積回路からインターネット上に転送され
るデータを示す図である。FIG. 2 is a diagram showing data transferred from an integrated circuit having an FMHA block to the Internet.

【図３】図１のFMHAブロックに含まれる機能ブロックを示すブロック図で
ある。FIG. 3 is a block diagram showing functional blocks included in the FMHA block of FIG. 1;

【図４】モジューロ換算器の部分のブロック図である。FIG. 4 is a block diagram of a modulo converter.

【図５】図１のFMHAで用いられる乗算器と組み合わせたモジューロ換算器
の部分のブロック図である。FIG. 5 is a block diagram of a part of a modulo converter combined with a multiplier used in the FMHA of FIG. 1;

【図６】フォスタ−モンゴメリ換算アルゴリズムで用いられる値（R² mod
Ｎ）を生成する方法を示す流れ図２３０である。FIG. 6 shows a value (R ² mod) used in the Foster-Montgomery conversion algorithm.
N) is a flowchart 230 illustrating a method of generating N).

【図７】図６に説明される値（R² mod Ｎ）の生成を示すブロック図で
ある。FIG. 7 is a block diagram illustrating generation of a value (R ² mod N) illustrated in FIG. 6;

───────────────────────────────────────────────────── フロントページの続き (81)指定国ＥＰ(ＡＴ，ＢＥ，ＣＨ，ＣＹ，ＤＥ，ＤＫ，ＥＳ，ＦＩ，ＦＲ，ＧＢ，ＧＲ，ＩＥ，ＩＴ，ＬＵ，ＭＣ，ＮＬ，ＰＴ，ＳＥ)，ＯＡ(ＢＦ，ＢＪ，ＣＦ，ＣＧ，ＣＩ，ＣＭ，ＧＡ，ＧＮ，ＧＷ，ＭＬ，ＭＲ，ＮＥ，ＳＮ，ＴＤ，ＴＧ)，ＡＰ(ＧＨ，ＧＭ，ＫＥ，ＬＳ，ＭＷ，ＳＤ，ＳＺ，ＵＧ，ＺＷ)，ＥＡ(ＡＭ，ＡＺ，ＢＹ，ＫＧ，ＫＺ，ＭＤ，ＲＵ，ＴＪ，ＴＭ) ，ＡＬ，ＡＭ，ＡＴ，ＡＵ，ＡＺ，ＢＡ，ＢＢ，ＢＧ，ＢＲ，ＢＹ，ＣＡ，ＣＨ，ＣＮ，ＣＵ，ＣＺ，ＤＥ，ＤＫ，ＥＥ，ＥＳ，ＦＩ，ＧＢ，ＧＥ，ＧＨ，ＧＭ，ＨＲ，ＨＵ，ＩＤ，ＩＬ，ＩＮ，ＩＳ，ＪＰ，ＫＥ，ＫＧ，ＫＰ，ＫＲ，ＫＺ，ＬＣ，ＬＫ，ＬＲ，ＬＳ，ＬＴ，ＬＵ，ＬＶ，ＭＤ，ＭＧ，ＭＫ，ＭＮ，ＭＷ，ＭＸ，ＮＯ，ＮＺ，ＰＬ，ＰＴ，ＲＯ，ＲＵ，ＳＤ，ＳＥ，ＳＧ，ＳＩ，ＳＫ，ＳＬ，ＴＪ，ＴＭ，ＴＲ，ＴＴ，ＵＡ，ＵＧ，ＵＺ，ＶＮ，ＹＵ，ＺＷ (72)発明者ロドニー・シー・テッシュアメリカ合衆国アリゾナ州 85020 フェニックスノース・13ス・ストリート 7026 (72)発明者ジェイムス・ダグラス・デュウォーキンアメリカ合衆国アリゾナ州 85226 チャンドラーウェスト・シャノン・ストリート6802 (72)発明者マイケル・ジェイ・トーラアメリカ合衆国アリゾナ州 85226 チャンドラーウェスト・デル・リオ・ストリート5301 Ｆターム(参考） 5B022 AA05 BA04 CA03 CA04 FA01 5B056 AA01 FF01 FF02 FF05 FF16 5J104 AA22 NA18 ──────────────────────────────────────────────────続き Continuation of front page (81) Designated country EP (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE ), OA (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG), AP (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), EA (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG , KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZW (72) Inventor Rodney Sea Tesh 85020 Phoenix, Arizona USA 13th Street, Phoenix 7026 (72) Inventor James Douglas Duwalkin Arizona, U.S.A. 85226 Chandler West Shannon Street 6802 (72) Inventor Michael J. Tora, U.S.A. 5301 F-term (Reference) 5B022 AA05 BA04 CA03 CA04 FA01 5B056 AA01 FF01 FF02 FF05 FF16 5J104 AA22 NA18

Claims

[Claims]

1. A data processing system (26) for performing modulo multiplication comprising: a multiplier (5) having inputs (48, 46) for receiving binary data values A and B;
6); an adder (58) having a first input coupled to the output of the multiplier, a second input coupled to receive the partial product, and an output providing a sum value; And a first input coupled to the output of the adder (58), a second input (52) coupled to receive a binary data value N, and (A * B / R mod N). An output providing a data value having a format.
When a predetermined bit position of the sum has a first logic state,
A modulo converter (60) that matches the data value N and adds the binary data value N to the sum to produce the least significant data bit of the reduced value ↓
A data processing system comprising:

2. The data processing system according to claim 1, wherein when all bits of the conversion value ↓ are determined, the data value is converted to a zero value.

3. A smart card (10) comprising: a data bus (15) for transferring data to an output (13) of said smart card.
And coupled to the data bus, multiplying a first digit (A * R mod N) by a second digit (B * R mod N) to produce a product (A * B * R mod N) A coprocessor (26), the product of which is reduced by dividing the product by the value of R during multiplication, where A and B are integer values, and N is an odd integer value in modulo count. And
R is an integer value, and the modulo multiplication is performed based on (↓ * N), where ↓ is a coprocessor (26) determined when multiplying the first and second digits. A smart card.

4. The multiplier (56) coupled to the data bus for receiving the data, the coprocessor (26) receiving the data at a first input (46) of the multiplier. A multiplier (56) comprising: a first operand to be obtained and a second operand received at a second input (48) of the multiplier, the multiplier generating a product from the first and second operands (56); A first input coupled to receive the product, a second input coupled to receive a previous partial product, and an output providing a sum of the product and the previous partial product. A first input coupled to the output of the adder circuit; a second input coupled to receive the binary value N; A module having an output for providing a conversion product. 4. The smart card according to claim 3, wherein the smart card is constituted by:

5. An encryption system (10) for interfacing with the Internet.
A central processing unit (18) having a data bus (15) for transferring data; and a first digit (A * R mod N) and a second digit (B *) coupled to said data bus. R mod N) to generate a product (A * B * R mod N) that is a modulo N converted by dividing by the value of R during multiplication of the first and second digits. An accelerator block (26), where A and B are integer values, N is an odd integer value in modulo counting, R is an integer value, and modulo multiplication is based on the value (↓ * N) And an encryption accelerator block (26) determined when multiplying the first and second digits by an encryption accelerator block (26).

6. The encryption accelerator block (26) is: a multiplier (56) coupled to the data bus for receiving the data, the data being a first input (46) of the multiplier. A) comprising a first value received at a second input of the multiplier and a second value received at a second input of the multiplier, the multiplier generating a product from the first and second values; A first input coupled to the output of the multiplier, a second input coupled to receive a previous partial product, and an output providing a sum of the product and the previous partial product. An adder circuit (58) having a first input coupled to the output of the adder circuit, a second input coupled to receive the integer value N, and a reduced product. A modulo converter (60) having an output Encryption system according to claim 5, characterized in that it is made (10)
.

7. A memory (64) having an input coupled to the output of the modulo converter (60) and an output coupled to the second input of the adder circuit.
The encryption system (10) according to claim 6, further comprising:

8. An input coupled to the data bus for receiving the first value and coupled to the first input of the multiplier to provide a two's negative complement of the first value. The encryption system (10) according to claim 6, further comprising a digit negation unit (42) having an output to output.

9. An architecture for a Foster-Montgomery hardware accelerator (FMHA) (44) on which mathematical operations are performed, comprising: a first and a second combined to receive operands A and B, respectively. The second input (
A multiplier having an output for providing a partial product; a first input coupled to the output of the multiplier; and a first input coupled to receive a previous reduced partial product. An adder (58) having a second input (50) and an output for providing a sum value; and a first input coupled to the output of the adder, and a second input coupled to receive coefficients. An architecture comprising: a modulo converter (60) having an input (52) and an output for providing a reduced partial product.

10. A data bus (41) for transferring data; a first memory coupled to said data bus and storing said operands A and B;
72) having a first input coupled to the data bus (41), and a second input coupled to receive the reduced partial product, storing the reduced partial product; A second memory (64) for providing the previous reduced partial product; and an input coupled to the data bus (41); and an output coupled to the second input of the modulo converter. 10. The architecture of claim 9, further comprising: a third memory (70) for storing the coefficients.