JP2001265592A

JP2001265592A - Information processor

Info

Publication number: JP2001265592A
Application number: JP2000076086A
Authority: JP
Inventors: Nobuo Higaki; 信生檜垣; Kenichi Kawaguchi; 謙一川口
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2000-03-17
Filing date: 2000-03-17
Publication date: 2001-09-28

Abstract

PROBLEM TO BE SOLVED: To provide an information processor that, for SIMD instructions which store plural pieces of data in a register and operate on them in a batch, performs operations based on comparison results at high speed. SOLUTION: When a conditional execution instruction is executed, the data designated by an operand is divided into plural pieces of data of a specific number of bits each, and the verification result held in a state holding means for each divided piece is examined. The processor is controlled so that the operation designated in the instruction is performed on a divided piece of data when its condition holds, and is not performed on that piece when its condition does not hold. Thus, for each of the divided pieces of data, the conditional execution instruction operates only on the data whose condition holds, based on the verification results held by the state holding means, and the operation can be performed at high speed.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an information processing apparatus that has a flag register and has SIMD operation instructions which operate on a plurality of data according to the contents of the flag register, which reflect instruction execution results.

[0002]

[Description of the Related Art] In recent years, the processing capability of information processing apparatuses such as microcomputers has improved dramatically, and they are now used in every field.

[0003] In particular, recent information processing apparatuses improve their data processing capability by implementing SIMD (Single Instruction Multiple Data) instructions, which store a plurality of data in a register and operate on them all at once (see, for example, "Intel Architecture Software Developer's Manual Volume 1: Basic Architecture", published by Intel Corporation, pp. 6-9 to 6-11).

[0004] FIG. 20 is a diagram of a first conventional example, showing the specifications of some of the comparison instructions among the SIMD instructions (hereinafter, MMX instructions) of the "Pentium (registered trademark) II" processor, a conventional information processing apparatus.

[0005] In the figure, the operands "sd" and "dd" are 64-bit registers, and "[]" indicates a range of bit positions within a register. For example, "sd[15:0]", "sd[31:16]", "sd[47:32]", and "sd[63:48]" denote, respectively, the 16 bits from bit 0 (LSB) to bit 15 of register sd, the 16 bits from bit 16 to bit 31, the 16 bits from bit 32 to bit 47, and the 16 bits from bit 48 to bit 63 (MSB). A number prefixed with "0x" is hexadecimal, and "←" means that the value on the right-hand side is stored in the left-hand side.

[0006] This information processing apparatus divides the registers into 16-bit fields and compares each pair of fields; it stores "0xffff" when the condition holds, and "0x0000" when it does not, in the corresponding 16-bit position of register dd. The PCMPEQW instruction tests whether the corresponding 16 bits of register dd and register sd are equal, and the PCMPGTW instruction tests whether the 16 bits of register dd are greater than the corresponding 16 bits of register sd.
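
The per-field behavior described above can be sketched in Python. This is only an illustrative model, not Intel's definition: the helper names (`fields16`, `pack16`, `signed16`) and the sample values are assumptions introduced here, and the sketch treats each field as a signed 16-bit value, which is how the MMX PCMPGTW instruction compares; the text above does not state the signedness.

```python
MASK16 = 0xFFFF

def fields16(reg):
    """Split a 64-bit value into four 16-bit fields, lowest field first."""
    return [(reg >> (16 * i)) & MASK16 for i in range(4)]

def pack16(fields):
    """Reassemble four 16-bit fields (lowest first) into a 64-bit value."""
    reg = 0
    for i, f in enumerate(fields):
        reg |= (f & MASK16) << (16 * i)
    return reg

def signed16(x):
    """Interpret a 16-bit pattern as a signed value (assumption, see lead-in)."""
    return x - 0x10000 if x & 0x8000 else x

def pcmpeqw(dd, sd):
    """0xffff in every field where dd and sd are equal, 0x0000 elsewhere."""
    return pack16([MASK16 if d == s else 0
                   for d, s in zip(fields16(dd), fields16(sd))])

def pcmpgtw(dd, sd):
    """0xffff in every field where dd is greater than sd, 0x0000 elsewhere."""
    return pack16([MASK16 if signed16(d) > signed16(s) else 0
                   for d, s in zip(fields16(dd), fields16(sd))])

dd = 0x0005_0002_0003_0009  # hypothetical sample data
sd = 0x0005_0007_0001_0009
print(hex(pcmpeqw(dd, sd)))  # 0xffff00000000ffff
print(hex(pcmpgtw(dd, sd)))  # 0xffff0000
```

The result is itself a 64-bit mask, which is why a later AND with another register can select whole fields.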

[0007] In the first conventional example above, all 16 bits of the corresponding field are set to 0 or 1 according to the result of each 16-bit comparison, but some information processing apparatuses reflect the result in only a single bit (see, for example, "Ultra SPARC-IIi User's Manual", published by Sun Microsystems, Inc., pp. 135 to 180).

[0008] FIG. 21 is a diagram of a second conventional example, showing the specifications of some of the comparison instructions among the SIMD instructions (hereinafter, VIS instructions) of the "Ultra SPARC-IIi" processor, a conventional information processing apparatus.

[0009] In the figure, the operands "rs1", "rs2", and "rd" are 64-bit registers. The meanings of "[]", "0x", and "←" are the same as in the first conventional example, so their description is omitted. However, where only one number appears inside "[]", it denotes that single bit position of the register. For example, "rd[0]", "rd[1]", "rd[2]", and "rd[3]" denote the single bits at bit 0, bit 1, bit 2, and bit 3 of register rd, respectively.

[0010] Regarding bit positions, the bit ordering of the "Ultra SPARC-IIi" processor is originally big-endian (MSB is bit 0, LSB is bit 63), but the figure uses little-endian notation (MSB is bit 63, LSB is bit 0) to make comparison with the other conventional examples easier.

[0011] This information processing apparatus divides the registers into 16-bit fields and compares each pair of fields; it stores "0x1" when the condition holds, and "0x0" when it does not, one bit per field in the lower 4 bits of register rd.
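
A minimal Python sketch of this single-bit style of result follows. It is a model under stated assumptions, not the VIS definition: the function name `compare_to_bits` is hypothetical, the mapping of field i (counted from the LSB) to bit i of rd is assumed because the figure is not reproduced here, and the fields are compared as unsigned values.

```python
import operator

def fields16(reg):
    """Split a 64-bit value into four 16-bit fields, lowest field first."""
    return [(reg >> (16 * i)) & 0xFFFF for i in range(4)]

def compare_to_bits(rs1, rs2, cond):
    """Return a 4-bit result: bit i is 1 when cond holds for field i (assumed mapping)."""
    rd = 0
    for i, (a, b) in enumerate(zip(fields16(rs1), fields16(rs2))):
        if cond(a, b):
            rd |= 1 << i
    return rd

rs1 = 0x0001_0008_0008_0001  # hypothetical sample data
rs2 = 0x0002_0003_0003_0002
result = compare_to_bits(rs1, rs2, operator.lt)
print(format(result, "04b"))  # 1001
```

Unlike the 0xffff masks of the first conventional example, this compact result cannot be ANDed directly with a data register.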

[0012] As a result, in the first and second conventional examples, four 16-bit data can be stored in one register and compared at once, which improves processing capability when a plurality of data are processed together. This can be used, for example, in image processing, where the RGB data of one pixel is kept in one register and only particular RGB data is to be extracted.

[0013] The first and second conventional examples implement comparison instructions that reflect the comparison result in 16 bits or 1 bit of a register, but some processors implement no comparison instruction and instead implement a maximum-value or minimum-value instruction that selects the larger or smaller of two numbers (see, for example, "Alpha Architecture Handbook", published by Digital Equipment Corporation, pp. 4-151 to 4-156).

[0014] FIG. 22 is a diagram of a third conventional example, showing the specifications of some of the minimum-value instructions among the SIMD instructions (hereinafter, MVI instructions) of the "Alpha" processor, a conventional information processing apparatus.

[0015] In the figure, the operands "Ra", "Rb", and "Rc" are 64-bit registers. The meanings of "[]" and "←" are the same as in the first conventional example, so their description is omitted. "min(A, B)" selects the smaller of the values A and B. For example, "C ← min(A, B)" means "store the smaller of A and B in C".

[0016] This information processing apparatus divides the registers into 16-bit fields, compares each pair of fields, and stores the smaller value of each pair in register Rc.
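
The field-wise minimum can be modeled in a few lines of Python. This is a sketch only: the name `min_w4` and the sample values are made up for illustration, and the fields are compared as unsigned 16-bit values, which the text above does not specify.

```python
def fields16(reg):
    """Split a 64-bit value into four 16-bit fields, lowest field first."""
    return [(reg >> (16 * i)) & 0xFFFF for i in range(4)]

def pack16(fields):
    """Reassemble four 16-bit fields (lowest first) into a 64-bit value."""
    reg = 0
    for i, f in enumerate(fields):
        reg |= (f & 0xFFFF) << (16 * i)
    return reg

def min_w4(ra, rb):
    """Rc <- min(Ra, Rb), taken independently for each 16-bit field."""
    return pack16([min(a, b) for a, b in zip(fields16(ra), fields16(rb))])

ra = 0x0001_0008_0004_0009  # hypothetical sample data
rb = 0x0002_0003_0004_0007
rc = min_w4(ra, rb)
print(hex(rc))  # 0x1000300040007
```

Note that only the selected values survive; which operand won each comparison is not recorded anywhere.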

[0017] Another instruction specification of an information processing apparatus that implements maximum-value or minimum-value instructions like those of the third conventional example is the SIMD instruction set of the "V830R/AV" processor (hereinafter, MIX2 instructions) (see, for example, "NEC Giho (NEC Technical Journal) Vol. 51 No. 3 / 1998", published by NEC Corporation, pp. 50 to 54).

[0018] As a result, in the third conventional example, four 16-bit data can be stored in one register and their maximum or minimum values obtained at once, which improves processing capability when a plurality of data are processed together.

[0019] The first and second conventional examples implement comparison instructions, and the third conventional example implements maximum-value or minimum-value instructions, but some processors implement both (see, for example, "AltiVec Technology Programming Environments Manual", published by Motorola Inc., pp. 4-1 to 4-41).

[0020] FIG. 23 is a diagram of a fourth conventional example, showing the specifications of some of the comparison and minimum-value instructions among the SIMD instructions (hereinafter, AltiVec instructions) of the "PowerPC" processor, a conventional information processing apparatus.

[0021] In the figure, the operands "A", "B", and "C" are 128-bit registers, and "CR6" is a 4-bit condition register. "[]", "0x", "←", and "min(A, B)" are the same as in the first and third conventional examples, so their description is omitted.

[0022] This information processing apparatus divides the registers into 32-bit fields and compares each pair of fields; it stores "0xffffffff" when the condition holds, and "0x00000000" when it does not, in the corresponding 32-bit position of register D. It also divides the registers into 32-bit fields, compares each pair, and stores the smaller value of each pair in register D.

[0023] As a result, in the fourth conventional example, four 32-bit data can be stored in one register and either compared or reduced to their maximum or minimum values at once, which improves processing capability when a plurality of data are processed together.

[0024]

[Problems to Be Solved by the Invention] However, with the conventional techniques above, a large number of instructions is needed to carry out processing of the form "perform an operation based on the result of a comparison".

[0025] For example, suppose that two registers A and B each hold four data, a1, a2, a3, a4 and b1, b2, b3, b4, and that their magnitude relations are a1 < b1, a2 > b2, a3 > b3, and a4 < b4. Consider a process often used in image processing such as image compositing: the data in A and B are compared pairwise, the data of register B is added wherever the data of register A is smaller, and the final result is a1 + b1, a2, a3, a4 + b4.

[0026] In an information processing apparatus that executes the MMX instructions, comparing registers A and B sets the bits of the fields corresponding to the smaller data, so the value "0xffff00000000ffff" is stored. ANDing this value with register B produces the data b1, 0, 0, b4, and adding that to register A gives the desired data a1 + b1, a2, a3, a4 + b4. The same applies to an information processing apparatus that executes the AltiVec instructions.
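
The three-step sequence above (compare, AND, add) can be traced with a short Python sketch. Everything specific here is an assumption made for illustration: the helper names, the concrete values standing in for a1..a4 and b1..b4, the placement of a1 and b1 in the top field, unsigned comparison, and wraparound addition modulo 2^16.

```python
M = 0xFFFF

def split(reg):
    """Return the fields [63:48], [47:32], [31:16], [15:0], in that order."""
    return [(reg >> s) & M for s in (48, 32, 16, 0)]

def join(fields):
    """Inverse of split: pack four 16-bit fields, highest field first."""
    reg = 0
    for f in fields:
        reg = (reg << 16) | (f & M)
    return reg

def cmp_gt_mask(x, y):
    """0xffff in every field where x is greater than y, 0x0000 elsewhere."""
    return join([M if p > q else 0 for p, q in zip(split(x), split(y))])

def add_fields(x, y):
    """Field-wise 16-bit addition (wraparound is an assumption)."""
    return join([(p + q) & M for p, q in zip(split(x), split(y))])

A = join([1, 8, 8, 1])  # a1 < b1, a2 > b2, a3 > b3, a4 < b4
B = join([2, 3, 3, 2])

mask = cmp_gt_mask(B, A)          # step 1: fields where A is smaller
masked_b = B & mask               # step 2: b1, 0, 0, b4
result = add_fields(A, masked_b)  # step 3: a1+b1, a2, a3, a4+b4

print(hex(mask))      # 0xffff00000000ffff
print(split(result))  # [3, 8, 8, 3]
```

Three dependent SIMD instructions are needed for what is conceptually one conditional add, which is the overhead the invention targets.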

[0027] In an information processing apparatus that executes the VIS instructions, comparing registers A and B stores the comparison result in the lower 4 bits, so the lower 4 bits hold "1001". The desired data a1 + b1, a2, a3, a4 + b4 is then obtained by writing a program with ordinary, non-SIMD instructions that analyzes this value with ordinary comparison instructions, extracts from register B the data corresponding to each set bit, and adds it to the corresponding data of register A.
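
The scalar fallback described above amounts to a loop over the result bits. The Python sketch below models that loop; the function name `add_where_bits_set`, the sample values, the bit-i-to-field-i mapping, and wraparound addition are all assumptions for illustration.

```python
def add_where_bits_set(a, b, bits):
    """For each field i, add B's field to A's field only if bit i of bits is 1."""
    out = 0
    for i in range(4):                 # one iteration per 16-bit field
        shift = 16 * i
        fa = (a >> shift) & 0xFFFF
        if (bits >> i) & 1:            # test one result bit (scalar compare)
            fb = (b >> shift) & 0xFFFF # extract the matching field of B
            fa = (fa + fb) & 0xFFFF    # scalar add, wraparound assumed
        out |= fa << shift
    return out

A = 0x0001_0008_0008_0001  # hypothetical sample data
B = 0x0002_0003_0003_0002
result = add_where_bits_set(A, B, 0b1001)
print(hex(result))  # 0x3000800080003
```

Each iteration stands for several scalar instructions (test, extract, add, insert), so the SIMD benefit of the comparison is lost in the follow-up work.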

[0028] In an information processing apparatus that executes the MVI or MIX2 instructions, only maximum-value and minimum-value instructions exist, and the result of the comparison made to find the maximum or minimum is not stored anywhere. The desired data a1 + b1, a2, a3, a4 + b4 is therefore obtained by writing ordinary, non-SIMD instructions that compare the four data in registers A and B one by one with ordinary comparison instructions and, where the data of register A is smaller, extract the corresponding data of register B and add it to the corresponding data of register A.

[0029] Thus there has been the problem that processing becomes extremely slow when an operation is performed based on a comparison result, as is common in image processing.

[0030] In view of this problem, an object of the present invention is to provide an information processing apparatus that performs operations based on comparison results at high speed with SIMD instructions, which store a plurality of data in a register and operate on them all at once.

[0031]

[Means for Solving the Problems] To solve this problem, the processor of the present invention comprises: instruction decoding means for decoding machine language instructions including a first instruction that treats the data designated by an operand as a single datum and verifies whether a condition is satisfied, a second instruction that treats the data designated by an operand as a plurality of data and verifies whether a condition is satisfied, and a conditional execution instruction that treats the data designated by an operand as a plurality of data and executes a designated operation only when the condition holds; instruction execution means for executing instructions in accordance with the instruction decoding means; first state holding means for indicating that the designated condition held as a result of the verification by the first instruction; and second state holding means for indicating that the designated condition held as a result of the verification by the second instruction. When the instruction decoding means decodes the first instruction, it performs control so that the condition is verified with the data designated by the operand treated as a single datum and, if the condition is satisfied, the first state holding means indicates that the condition held. When it decodes the second instruction, it performs control so that the data designated by the operand is divided into a plurality of data of a specific number of bits each, the condition is verified for each of the divided data and, if the condition is satisfied, the second state holding means indicates for each datum that the condition held. When it decodes the conditional execution instruction, it performs control so that the data designated by the operand is divided into a plurality of data of a specific number of bits each and the verification result in the second state holding means corresponding to each of the divided data is examined; when the condition holds, the instruction execution means performs the operation designated in the instruction on the corresponding divided datum, and when the condition does not hold, the instruction execution means does not perform the operation designated in the instruction on the corresponding divided datum.

[0032] As a result, for each of the divided data, the conditional execution instruction can perform the operation only on data whose condition holds, based on the verification results held in the state holding means, which enables high-speed operation.

[0033] Here, the conditional execution instruction of the processor may be an instruction that performs a transfer, or an instruction that performs an arithmetic operation, when the second state holding means indicates that the condition holds.

[0034] Furthermore, the conditional execution instruction may perform a first operation when the second state holding means indicates that the condition holds, and a second operation different from the first operation when it indicates that the condition does not hold.

[0035]

[Embodiments of the Invention] An information processing apparatus (hereinafter called a processor) according to embodiments of the present invention is described below with reference to the drawings.

[0036] (Embodiment 1) FIG. 1 is a diagram showing the bit configuration of the SIMD operation flag register SFR provided in the processor according to the first embodiment of the present invention.

[0037] As shown in the figure, the SIMD operation flag register SFR consists of four flag bits: the C0 flag, C1 flag, C2 flag, and C3 flag. These four flags are updated when a comparison instruction among the SIMD instructions is executed. Each flag is set when the comparison condition holds for its field: the C0 flag for the comparison of the 16-bit data from bit 48 to bit 63 (MSB), the C1 flag for the 16-bit data from bit 32 to bit 47, the C2 flag for the 16-bit data from bit 16 to bit 31, and the C3 flag for the 16-bit data from bit 0 (LSB) to bit 15.

[0038] The C0, C1, C2, and C3 flags may be provided in a single register as in FIG. 1, or in separate registers.

[0039] The first embodiment of the present invention also has a flag register for ordinary operations, but it is omitted here to simplify the description.

[0040] The SIMD operation flag register may be provided separately from the flag register for ordinary operations, or in the same register.

[0041] FIG. 2 is a diagram showing the instruction specifications of the comparison instructions among the SIMD instructions of the first embodiment of the present invention. In the figure, "vcmpeh Rm, Rn", "vcmpgh Rm, Rn", and "vcmpgeh Rm, Rn" are machine language instructions, written in mnemonic form, that each compare the four data stored in the registers. The operation of each instruction is also shown.

[0042] "vcmpeh", "vcmpgh", and "vcmpgeh" are OP codes. "Rm" and "Rn" are operands and denote the 64-bit registers Rm and Rn.

[0043] "[]" indicates a range of bit positions within a register. For example, "Rm[15:0]", "Rm[31:16]", "Rm[47:32]", and "Rm[63:48]" denote, respectively, the 16 bits from bit 0 (LSB) to bit 15 of register Rm, the 16 bits from bit 16 to bit 31, the 16 bits from bit 32 to bit 47, and the 16 bits from bit 48 to bit 63 (MSB). "C0", "C1", "C2", and "C3" denote the C0, C1, C2, and C3 flags of the flag register described above.

[0044] "vcmpeh Rm, Rn" is a comparison instruction that tests for "equal". It compares the four 16-bit data stored in registers Rm and Rn, and sets the C3 flag if Rm[15:0] equals Rn[15:0], the C2 flag if Rm[31:16] equals Rn[31:16], the C1 flag if Rm[47:32] equals Rn[47:32], and the C0 flag if Rm[63:48] equals Rn[63:48].

[0045] "vcmpgh Rm, Rn" is a comparison instruction that tests for "greater than". It compares the four 16-bit data stored in registers Rm and Rn, and sets the C3 flag if Rm[15:0] is greater than Rn[15:0], the C2 flag if Rm[31:16] is greater than Rn[31:16], the C1 flag if Rm[47:32] is greater than Rn[47:32], and the C0 flag if Rm[63:48] is greater than Rn[63:48].

[0046] "vcmpgeh Rm, Rn" is a comparison instruction that tests for "equal or greater than". It compares the four 16-bit data stored in registers Rm and Rn, and sets the C3 flag if Rm[15:0] is equal to or greater than Rn[15:0], the C2 flag if Rm[31:16] is equal to or greater than Rn[31:16], the C1 flag if Rm[47:32] is equal to or greater than Rn[47:32], and the C0 flag if Rm[63:48] is equal to or greater than Rn[63:48].
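
The three comparison instructions differ only in the predicate applied to each field, which the following Python sketch makes explicit. It is a behavioral model under assumptions, not the patented circuit: the dictionary representation of SFR and the `FLAG_SHIFTS` table are illustrative, the fields are compared as unsigned values, and a flag is cleared to 0 when its condition does not hold (the text only states when a flag is set).

```python
import operator

# Flag-to-field mapping from the SFR description: C0 is the top field, C3 the bottom.
FLAG_SHIFTS = {"C0": 48, "C1": 32, "C2": 16, "C3": 0}

def vcmp(rm, rn, cond):
    """Return the SFR contents after comparing each 16-bit field of Rm and Rn."""
    return {flag: int(cond((rm >> s) & 0xFFFF, (rn >> s) & 0xFFFF))
            for flag, s in FLAG_SHIFTS.items()}

def vcmpeh(rm, rn):
    return vcmp(rm, rn, operator.eq)   # "equal"

def vcmpgh(rm, rn):
    return vcmp(rm, rn, operator.gt)   # "greater than"

def vcmpgeh(rm, rn):
    return vcmp(rm, rn, operator.ge)   # "equal or greater than"

rm = 0x0005_0002_0003_0009  # hypothetical sample data
rn = 0x0005_0007_0001_0009
print(vcmpeh(rm, rn))   # {'C0': 1, 'C1': 0, 'C2': 0, 'C3': 1}
print(vcmpgh(rm, rn))   # {'C0': 0, 'C1': 0, 'C2': 1, 'C3': 0}
print(vcmpgeh(rm, rn))  # {'C0': 1, 'C1': 0, 'C2': 1, 'C3': 1}
```

Because the result lives in a dedicated flag register rather than in a data register, neither Rm nor Rn is overwritten by the comparison.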

[0047] FIG. 3 is a diagram showing the instruction specifications of the conditional instructions among the SIMD instructions of the first embodiment of the present invention. In the figure, "vmovh Rm, Rn", "vswaph Rm, Rn", and "vaddh Rm, Rn" are machine language instructions that operate on each of the four data stored in the registers according to the contents of the SIMD operation flag register SFR. As in FIG. 2, they are written in mnemonic form, and the operation of each instruction is also shown.

[0048] "vmovh", "vswaph", and "vaddh" are OP codes. "Rm", "Rn", "[]", "C0", "C1", "C2", and "C3" are the same as in FIG. 2, so their description is omitted.

[0049] "vmovh Rm, Rn" is an instruction that "transfers" register contents according to the contents of the SIMD operation flag register SFR. When the C3 flag is 1 it transfers the contents of Rn[15:0] to Rm[15:0], when the C2 flag is 1 it transfers the contents of Rn[31:16] to Rm[31:16], when the C1 flag is 1 it transfers the contents of Rn[47:32] to Rm[47:32], and when the C0 flag is 1 it transfers the contents of Rn[63:48] to Rm[63:48]. When a flag is 0, the corresponding 16-bit data is not transferred.

[0050] "vswaph Rm, Rn" is an instruction that "exchanges" register contents according to the contents of the SIMD operation flag register SFR. When the C3 flag is 1 it exchanges the contents of Rn[15:0] and Rm[15:0], when the C2 flag is 1 it exchanges the contents of Rn[31:16] and Rm[31:16], when the C1 flag is 1 it exchanges the contents of Rn[47:32] and Rm[47:32], and when the C0 flag is 1 it exchanges the contents of Rn[63:48] and Rm[63:48]. When a flag is 0, the corresponding 16-bit data is not exchanged.
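
The two conditional data-movement instructions, vmovh and vswaph, can both be modeled as a merge under a mask derived from the flags. The Python sketch below is one possible reading of the descriptions above; the helper `field_mask`, the dictionary form of SFR, and the sample values are assumptions, and the functions return new register values instead of modifying state.

```python
FLAG_SHIFTS = {"C0": 48, "C1": 32, "C2": 16, "C3": 0}

def field_mask(flags):
    """64-bit mask with all ones in every 16-bit field whose flag is 1."""
    mask = 0
    for flag, shift in FLAG_SHIFTS.items():
        if flags[flag]:
            mask |= 0xFFFF << shift
    return mask

def vmovh(rm, rn, flags):
    """Return the new Rm: flagged fields copied from Rn, other fields unchanged."""
    m = field_mask(flags)
    return (rm & ~m) | (rn & m)

def vswaph(rm, rn, flags):
    """Return (new Rm, new Rn) with the flagged fields exchanged."""
    m = field_mask(flags)
    return (rm & ~m) | (rn & m), (rn & ~m) | (rm & m)

flags = {"C0": 1, "C1": 0, "C2": 0, "C3": 1}  # hypothetical SFR contents
rm = 0x1111_2222_3333_4444
rn = 0xAAAA_BBBB_CCCC_DDDD

print(hex(vmovh(rm, rn, flags)))  # 0xaaaa22223333dddd
new_rm, new_rn = vswaph(rm, rn, flags)
print(hex(new_rm), hex(new_rn))   # 0xaaaa22223333dddd 0x1111bbbbcccc4444
```

In hardware the same effect would more naturally come from per-field write enables than from an explicit mask; the mask is used here only because it is compact in software.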

[0051] "vaddh Rm, Rn" is an instruction that "adds" register contents according to the contents of the SIMD operation flag register SFR. When the C3 flag is 1 it adds the contents of Rn[15:0] and Rm[15:0] and stores the sum in Rn[15:0]. When the C2 flag is 1 it adds the contents of Rn[31:16] and Rm[31:16] and stores the sum in Rn[31:16]. When the C1 flag is 1 it adds the contents of Rn[47:32] and Rm[47:32] and stores the sum in Rn[47:32]. When the C0 flag is 1 it adds the contents of Rn[63:48] and Rm[63:48] and stores the sum in Rn[63:48]. When a flag is 0, the corresponding 16-bit data is not added.
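
Combining a flag-setting comparison with vaddh yields the motivating example (add B to A only where A is smaller) in two instructions. The Python sketch below models this under assumptions: the sample values stand in for a1..a4 and b1..b4 with a1 and b1 in the top field, comparison is unsigned, addition wraps modulo 2^16, and the functions return values instead of updating registers.

```python
FLAG_SHIFTS = {"C0": 48, "C1": 32, "C2": 16, "C3": 0}

def vcmpgh(rm, rn):
    """Set each flag where the Rm field is greater than the Rn field."""
    return {flag: int(((rm >> s) & 0xFFFF) > ((rn >> s) & 0xFFFF))
            for flag, s in FLAG_SHIFTS.items()}

def vaddh(rm, rn, flags):
    """Return the new Rn: Rn + Rm in flagged fields, Rn unchanged elsewhere."""
    out = 0
    for flag, s in FLAG_SHIFTS.items():
        n = (rn >> s) & 0xFFFF
        if flags[flag]:
            n = (n + ((rm >> s) & 0xFFFF)) & 0xFFFF  # wraparound assumed
        out |= n << s
    return out

A = 0x0001_0008_0008_0001  # a1 < b1, a2 > b2, a3 > b3, a4 < b4
B = 0x0002_0003_0003_0002

flags = vcmpgh(B, A)         # Rm = B, Rn = A: flag set where b > a
result = vaddh(B, A, flags)  # Rn = A is the destination

print(flags)        # {'C0': 1, 'C1': 0, 'C2': 0, 'C3': 1}
print(hex(result))  # 0x3000800080003
```

No mask register and no AND step are needed, since the flags carry the per-field condition from the comparison directly to the add.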

[0052] FIG. 4 is a block diagram showing the configuration of the main part of the processor according to the first embodiment of the present invention.

[0053] The processor comprises an instruction register 101, an instruction decoder 102, a register file 103, a 64-bit arithmetic unit 104, 16-bit arithmetic units 105 to 108, a SIMD operation control unit 109, a register write control unit 110, a SIMD operation flag storage unit 111, latches 112 to 121, and buses 122 to 125.

[0054] The instruction register 101 holds, one after another, the instructions fetched from memory.

[0055] The instruction decoder 102 decodes the instruction held in the instruction register 101 and outputs various control signals for controlling its execution. In particular, when it decodes the vcmpeh, vcmpgh, or vcmpgeh instruction shown in FIG. 2, the instruction decoder 102 asserts the flag update request signal 620, which it outputs to the SIMD operation flag storage unit 111, in order to request that the SIMD operation flag register SFR be updated.

[0056] The register file 103 consists of a plurality of 64-bit general-purpose registers. In accordance with the register write control signals that the register write control unit 110 generates from the register write request signals 606 and 607 output by the instruction decoder 102 and the SIMD operation flag information 608 to 611 output by the SIMD operation flag storage unit 111, it outputs register data onto buses 122 and 123 and loads the data on buses 124 and 125 into registers.

【００５７】The 64-bit arithmetic unit 104 is a scalar arithmetic unit that operates on scalar data, treating 64-bit data as a single value. Under the control of the instruction decoder 102, it takes in the data on the buses 122 and 123 via the latches 112 and 113 and performs the operation specified by the OP code of the instruction.

【００５８】The 16-bit arithmetic units 105 to 108 operate on SIMD data, in which 64-bit data is handled as four separate 16-bit data items (hereinafter, the four 16-bit arithmetic units 105 to 108 are collectively called the SIMD arithmetic unit). The SIMD operation control unit 109 generates the SIMD operation control signals 602 to 605 from the SIMD operation request signal 601 output from the instruction decoder 102 and the SIMD operation flag information 608 to 611 output from the SIMD operation flag storage unit 111. In accordance with these control signals, the 16-bit arithmetic units take in the data on the buses 122 and 123 via the latches 114 to 121, perform the operation specified by the OP code of the instruction, and output the SIMD flag generation information 621 to 628 that is needed to generate the SIMD operation flags from the operation results.

【００５９】The SIMD operation control unit 109 outputs the SIMD operation control signals 602 to 605, which control the 16-bit arithmetic units 105 to 108, based on the SIMD operation request signal 601 output from the instruction decoder 102 and the SIMD operation flag information 608 to 611 output from the SIMD operation flag storage unit 111.

【００６０】The register write control unit 110 outputs the register write control signals 612 to 619, which control the writing of the data on the buses 124 and 125 into registers of the register file 103, based on the register write request signals 606 and 607 output from the instruction decoder 102 and the SIMD operation flag information 608 to 611 output from the SIMD operation flag storage unit 111.

【００６１】The SIMD operation flag storage unit 111 updates the SIMD operation flag register SFR based on the flag update request signal 620 output from the instruction decoder 102 and the SIMD flag generation information 621 to 628 output from the SIMD arithmetic unit.

【００６２】FIG. 5 is an explanatory diagram (truth table) showing the input/output logic of the instruction decoder 102 of FIG. 4 when the SIMD instructions shown in FIGS. 2 and 3 are input.

【００６３】In the figure, both inputs and outputs are expressed in binary. The figure shows the outputs of the SIMD operation request signal 601, the register write request signals 606 and 607, and the flag update request signal 620 when the machine-language instructions "vcmpeh", "vcmpgh", "vcmpgeh", "vmovh", "vswaph", and "vaddh" are input. The values "0000", "0001", "0010", "0100", and "1000" of the SIMD operation request signal 601 denote "nop", "mov", "swap", "add", and "cmp", respectively, and request the SIMD arithmetic unit to "do nothing", "transfer", "swap", "add", and "compare", in that order. The values "0011", "0101" to "0111", and "1001" to "1111" are "reserved" and are never output. The register write request signals 606 and 607 each correspond to a write to one register, so together they can request writes to two registers in total; a value of "1" requests a write. For instructions other than SIMD instructions, the register write request signals 606 and 607 depend on the individual instruction, so the figure marks them "instruction dependent". For example, an ordinary add instruction writes its result to only one register, so the register write request signal 606 is "1" and the register write request signal 607 is "0". The values "00", "01", "10", and "11" of the flag update request signal 620 denote "nop", "eq", "gt", and "ge", respectively, and request, in that order, "do not update", "test for equality and update", "test for greater-than and update", and "test for greater-than-or-equal and update".
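The decoder truth table described above can be sketched as a lookup table. This is only an illustrative model: the encodings of signals 601 and 620 are taken from the text, but the names `SIMD_OP`, `FLAG_REQ`, `DECODE`, `decode`, and `is_reserved` are hypothetical, and the register write requests shown for vswaph and vaddh are assumptions, since FIG. 5 itself is not reproduced here.

```python
# Encodings stated in the text for the SIMD operation request signal 601
# and the flag update request signal 620.
SIMD_OP = {"nop": 0b0000, "mov": 0b0001, "swap": 0b0010, "add": 0b0100, "cmp": 0b1000}
FLAG_REQ = {"nop": 0b00, "eq": 0b01, "gt": 0b10, "ge": 0b11}

# mnemonic -> (signal 601, signal 606, signal 607, signal 620)
DECODE = {
    "vcmpeh":  (SIMD_OP["cmp"],  0, 0, FLAG_REQ["eq"]),
    "vcmpgh":  (SIMD_OP["cmp"],  0, 0, FLAG_REQ["gt"]),
    "vcmpgeh": (SIMD_OP["cmp"],  0, 0, FLAG_REQ["ge"]),
    "vmovh":   (SIMD_OP["mov"],  1, 0, FLAG_REQ["nop"]),
    "vswaph":  (SIMD_OP["swap"], 1, 1, FLAG_REQ["nop"]),  # assumed: both registers written
    "vaddh":   (SIMD_OP["add"],  1, 0, FLAG_REQ["nop"]),  # assumed: one register written
}


def decode(mnemonic):
    """Return the decoder outputs (601, 606, 607, 620) for a SIMD mnemonic."""
    return DECODE[mnemonic]


def is_reserved(op):
    """True for values of signal 601 that the text marks as reserved."""
    return op not in SIMD_OP.values()
```

Because every non-nop operation is a one-hot code, the eleven remaining 4-bit values are exactly the reserved ones listed in the text.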

【００６４】FIG. 6 is a block diagram showing the internal configuration of the SIMD operation control unit 109 of FIG. 4.

【００６５】The SIMD operation control unit 109 consists of flip-flops 201 to 204 and logic gates 205 to 207. The portions enclosed by dotted lines are all the same circuit, so they are abbreviated in the figure.

【００６６】The flip-flops 201 to 204 adjust timing and latch the SIMD operation request signal 601 output from the instruction decoder 102. The logic gates 205 to 207 output the SIMD operation control signals 602 to 605 based on the contents latched in the flip-flops 201 to 204 and the SIMD operation flag information 608 to 611 output from the SIMD operation flag storage unit 111. The SIMD operation control signal 602 controls the operation on bits 63 to 48, the SIMD operation control signal 603 the operation on bits 47 to 32, the SIMD operation control signal 604 the operation on bits 31 to 16, and the SIMD operation control signal 605 the operation on bits 15 to 0.

【００６７】FIG. 7 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 205 to 207. In the figure, both inputs and outputs are expressed in binary.

【００６８】In the figure, the meanings of the SIMD operation request signal 601 and the SIMD operation control signal 602 are the same as in FIG. 5, so their description is omitted. In the SIMD operation control signal 602, however, values other than "0000", "0001", "0010", "0100", and "1000" are "reserved" and are never output. The SIMD operation flag information 608 is an output of the SIMD operation flag storage unit 111 and carries the content of the C0 flag of the SIMD operation flag register SFR. The truth tables for the SIMD operation control signals 603 to 605 are omitted because their logic is the same as that of the SIMD operation control signal 602.

【００６９】When the SIMD operation flag information 608 to 611 is "1", the SIMD operation control unit 109 outputs the SIMD operation request signal 601 unchanged as the corresponding SIMD operation control signal 602 to 605. When it is "0", the unit forces the lower three bits to "0", so that any SIMD operation request signal 601 other than "1000 (cmp)" becomes "0000 (nop)", and outputs the result as the corresponding SIMD operation control signal 602 to 605.
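The gating rule above can be modeled in a few lines. This is a minimal behavioral sketch, not the gate-level circuit of FIG. 6; the function names `simd_control` and `simd_control_signals` are hypothetical, and the flags are passed as a list ordered C0 to C3 (signals 608 to 611).

```python
def simd_control(request, flag):
    """Per-lane SIMD operation control signal derived from request signal 601.

    flag == 1: the request passes through unchanged.
    flag == 0: the lower three bits are cleared, so mov/swap/add become nop
               while cmp (0b1000) survives.
    """
    if flag:
        return request
    return request & 0b1000


def simd_control_signals(request, flags):
    """Return signals 602..605 for flags given in the order C0..C3."""
    return [simd_control(request, f) for f in flags]
```

For example, a mov request (0b0001) with flags C0 to C3 equal to 1, 0, 1, 0 yields 0b0001 for lanes 0 and 2 and 0b0000 for lanes 1 and 3, whereas a cmp request reaches all four lanes regardless of the flags.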

【００７０】FIG. 8 is a block diagram showing the internal configuration of the register write control unit 110 of FIG. 4.

【００７１】The register write control unit 110 consists of flip-flops 202 to 204 and 301 to 306 and logic gates 307 to 310. The portions enclosed by dotted lines are all the same circuit, so they are abbreviated in the figure.

【００７２】The flip-flops 202 to 204 are the same as those shown in FIG. 6 and latch the lower three bits of the SIMD operation request signal 601 output from the instruction decoder 102. The flip-flops 301 to 306 adjust timing and latch the register write request signals 606 and 607 output from the instruction decoder 102. The flip-flops 301 to 304 latch on the rising edge of the clock, and the flip-flops 305 and 306 latch on the falling edge. In this embodiment, registers are written on the rising edge of the clock; so that the signals are stable at that moment, the flip-flops 305 and 306 latch on the falling edge and their outputs do not change at the rising edge. The logic gates 307 to 310 output the register write control signals 612 to 619 based on the contents latched in the flip-flops 301 and 302, the SIMD operation request signal 601 as latched in the flip-flops 202 to 204, and the SIMD operation flag information 608 to 611 output from the SIMD operation flag storage unit 111. The register write control signals 612 and 613 control writes to bits 63 to 48, the register write control signals 614 and 615 writes to bits 47 to 32, the register write control signals 616 and 617 writes to bits 31 to 16, and the register write control signals 618 and 619 writes to bits 15 to 0. In addition, the register write control signals 612, 614, 616, and 618 together control writes to one register, and the register write control signals 613, 615, 617, and 619 control writes to another register.

【００７３】FIG. 9 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 307 to 310. In the figure, both inputs and outputs are expressed in binary.

【００７４】In the figure, the meanings of the SIMD operation request signal 601, the register write request signals 606 and 607, and the SIMD operation flag information 608 are the same as in FIGS. 5 and 7, so their description is omitted. The truth tables for the register write control signals 614 to 619 are omitted because their logic is the same as that of the register write control signals 612 and 613.

【００７５】When the SIMD operation flag information 608 to 611 is "1", the register write control unit 110 outputs the register write request signals 606 and 607 unchanged as the register write control signals 612 to 619. When it is "0", the register write control signals 612 to 619 depend on the SIMD operation request signal 601: for "0000 (nop)" and "1000 (cmp)" the register write request signals 606 and 607 are passed through unchanged, whereas for "0001 (mov)", "0010 (swap)", and "0100 (add)" they are forced to "0" before being output.
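The write-suppression rule can be sketched as follows. This is a behavioral model under stated assumptions, not the circuit of FIG. 8: the function name `register_write_control` is hypothetical, the flags are passed in the order C0 to C3, and the result is a flat list in the order 612, 613, ..., 619.

```python
def register_write_control(request, w606, w607, flags):
    """Return register write control signals [612, 613, ..., 619].

    request: SIMD operation request signal 601 (4 bits)
    w606, w607: register write request signals 606 and 607
    flags: SIMD operation flag information 608..611 (C0..C3)
    """
    signals = []
    for flag in flags:
        # A write is suppressed only when the lane's flag is 0 and the
        # request is mov, swap, or add (any of the lower three bits set).
        suppressed = flag == 0 and (request & 0b0111) != 0
        signals += [0, 0] if suppressed else [w606, w607]
    return signals
```

With a mov request, write requests 606 = 1 and 607 = 0, and flags 1, 0, 1, 0, the sketch returns 1, 0, 0, 0, 1, 0, 0, 0, so only the lanes at bits 63 to 48 and bits 31 to 16 are written.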

【００７６】FIG. 10 is a block diagram showing the internal configuration of the SIMD operation flag storage unit 111 of FIG. 4.

【００７７】The SIMD operation flag storage unit 111 consists of flip-flops 401 to 407 and logic gates 408 to 413. The portions enclosed by dotted lines are all the same circuit, so they are abbreviated in the figure.

【００７８】The flip-flops 401 to 404 latch the flags generated by SIMD operations and correspond to the C0, C1, C2, and C3 flags, respectively, of the SIMD operation flag register SFR of FIG. 1. The flip-flops 405 to 407 adjust timing: the flip-flops 405 and 406 latch the flag update request signal 620 output from the instruction decoder 102, and the flip-flop 407 latches the signal generated by the logic gate 408 and outputs the flag update control signal 629. The flip-flops 405 and 406 latch on the rising edge of the clock, and the flip-flop 407 latches on the falling edge. In this embodiment, the SIMD operation flag register is updated on the rising edge of the clock; so that the signal is stable at that moment, the flip-flop 407 latches on the falling edge and its output does not change at the rising edge. Based on the contents latched in the flip-flops 405 and 406, the logic gate 408 generates the flag update control signal 629, which is "1" when the flags are to be updated. Specifically, the logic gate 408 outputs "1" when either bit of the flag update request signal 620 is "1". As a result, flags are latched into the flip-flops 401 to 404 when the flag update request signal 620 is "01 (eq)" (test for equality and update), "10 (gt)" (test for greater-than and update), or "11 (ge)" (test for greater-than-or-equal and update). The logic gates 409 to 413 generate the flags to be stored in the C0, C1, C2, and C3 flags of the SIMD operation flag register SFR, based on the contents latched in the flip-flops 405 and 406 and the SIMD flag generation information 621 to 628 output by the 16-bit arithmetic units 105 to 108. The SIMD flag generation information 621 and 622 corresponds to bits 63 to 48, the SIMD flag generation information 623 and 624 to bits 47 to 32, the SIMD flag generation information 625 and 626 to bits 31 to 16, and the SIMD flag generation information 627 and 628 to bits 15 to 0. The SIMD flag generation information 621, 623, 625, and 627 is "1" when the comparison result for the respective 16-bit data is "equal", and the SIMD flag generation information 622, 624, 626, and 628 is "1" when the comparison result for the respective 16-bit data is "greater". Among the logic gates 409 to 413, the logic gate 412 acts as a selector. When the flag update request signal 620 is "00 (nop)", the logic gates 409 to 411 all output "0", so the logic gate 412 selects none of the SIMD flag generation information 621, the SIMD flag generation information 622, and the output of the logic gate 413. When the flag update request signal 620 is "01 (eq)", the logic gate 409 outputs "1", so the logic gate 412 selects the SIMD flag generation information 621. When the flag update request signal 620 is "10 (gt)", the logic gate 410 outputs "1", so the logic gate 412 selects the SIMD flag generation information 622. When the flag update request signal 620 is "11 (ge)", the logic gate 411 outputs "1", so the logic gate 412 selects the output of the logic gate 413 (the logical OR of the SIMD flag generation information 621 and 622). Consequently, the flip-flops 401 to 404 latch the following: when the flag update request signal 620 is "01 (eq)", the SIMD flag generation information 621, 623, 625, and 627, which is "1" for "equal"; when it is "10 (gt)", the SIMD flag generation information 622, 624, 626, and 628, which is "1" for "greater"; and when it is "11 (ge)", the logical OR of the two items of the SIMD flag generation information 621 to 628 that correspond to the same 16-bit data.
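The selection logic above reduces to a small function. This is a behavioral sketch only; the name `next_flags` is hypothetical, `eq` stands for the SIMD flag generation information 621, 623, 625, 627 and `gt` for 622, 624, 626, 628, each given as a list ordered from bits 63 to 48 down to bits 15 to 0.

```python
def next_flags(update_req, eq, gt, current):
    """Return the next values of flags C0..C3.

    update_req: flag update request signal 620 (2 bits)
    eq, gt: per-lane "equal" and "greater" comparison results
    current: present contents of C0..C3
    """
    if update_req == 0b00:   # nop: signal 629 stays 0, SFR is not updated
        return list(current)
    if update_req == 0b01:   # eq: select the "equal" information
        return list(eq)
    if update_req == 0b10:   # gt: select the "greater" information
        return list(gt)
    # ge: logical OR of "equal" and "greater" for each lane
    return [e | g for e, g in zip(eq, gt)]
```

For instance, with `eq = [1, 0, 1, 0]` and `gt = [0, 1, 0, 0]`, a ge request produces flags 1, 1, 1, 0.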

【００７９】FIG. 11 is a block diagram showing the configuration of one register in the register file 103 of FIG. 4.

【００８０】This register consists of flip-flops 501 to 508 and logic gates 509 to 540. In the figure, the circuits for the individual bits 63 to 48 are identical, so bits 62 to 49 are omitted and only bits 63 and 48 are drawn. Likewise, the per-bit circuits are identical within bits 47 to 32, within bits 31 to 16, and within bits 15 to 0, so bits 46 to 33, bits 30 to 17, and bits 14 to 1 are omitted.

【００８１】The flip-flops 501 to 508 correspond to the individual bits of one register and latch data from either the bus 124 or the bus 125 via the logic gates 517 to 524. The logic gates 509 to 516 generate the signals (enable signals) that permit latching into the flip-flops 501 to 508, based on the register write control signals 612 to 619 and a write-target register select signal that is "1" when this register is the write target. Specifically, the flip-flops 501 and 502 latch when either of the register write control signals 612 and 613 is "1" and the write-target register select signal is "1"; the flip-flops 503 and 504 latch when either of the register write control signals 614 and 615 is "1" and the write-target register select signal is "1"; the flip-flops 505 and 506 latch when either of the register write control signals 616 and 617 is "1" and the write-target register select signal is "1"; and the flip-flops 507 and 508 latch when either of the register write control signals 618 and 619 is "1" and the write-target register select signal is "1". The logic gates 517 to 524 generate the data to be written into the flip-flops 501 to 508 (the register), based on the data on the buses 124 and 125 and the register write control signals 612 to 619. Each of the logic gates 517 to 524 acts as a selector. The logic gates 517 and 518 select the bus 124 when the register write control signal 612 is "1" and the bus 125 when the register write control signal 613 is "1". The logic gates 519 and 520 select the bus 124 when the register write control signal 614 is "1" and the bus 125 when the register write control signal 615 is "1". The logic gates 521 and 522 select the bus 124 when the register write control signal 616 is "1" and the bus 125 when the register write control signal 617 is "1". The logic gates 523 and 524 select the bus 124 when the register write control signal 618 is "1" and the bus 125 when the register write control signal 619 is "1". As a result, the flip-flops 501 to 508 (the register) take in and latch the data on the bus 124 when the register write control signals 612, 614, 616, and 618 are "1", and the data on the bus 125 when the register write control signals 613, 615, 617, and 619 are "1". The logic gates 525 to 540 are tri-state buffers that output the data stored in the flip-flops 501 to 508 (the register) onto the bus 122 or the bus 123.
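The per-lane write path of one register can be sketched as below. This is an illustrative model, not the circuit of FIG. 11: the name `write_register` is hypothetical, each register and bus is represented as a list of four 16-bit lane values (bits 63 to 48 first), and `ctrl` is the flat list of signals 612 to 619. The text does not describe what a single register does when both control signals of a lane are "1"; the sketch arbitrarily prefers the bus 124 in that case, which is an assumption.

```python
def write_register(reg, bus124, bus125, ctrl, selected):
    """Return the register contents after one write cycle.

    reg, bus124, bus125: four 16-bit lane values each, bits 63-48 first
    ctrl: register write control signals [612, 613, ..., 619]
    selected: write-target register select signal (1 if this register is the target)
    """
    new = list(reg)
    for lane in range(4):
        from_124 = ctrl[2 * lane]        # 612, 614, 616, 618
        from_125 = ctrl[2 * lane + 1]    # 613, 615, 617, 619
        # Enable: either control signal is 1 and this register is selected.
        if selected and (from_124 or from_125):
            # Assumption: bus 124 wins if both signals are 1.
            new[lane] = bus124[lane] if from_124 else bus125[lane]
    return new
```

A register that is not selected keeps all four lanes, and a selected register keeps every lane whose two control signals are both "0", which is how a flag of "0" leaves the corresponding 16-bit data untouched.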

【００８２】The operation of the processor configured as described above will now be described.

【００８３】FIG. 12 is a time chart of the processor executing the instructions shown in FIGS. 2 and 3.

【００８４】The figure shows the operation when the following instructions are executed.

【００８５】
    vcmpeh Rm, Rn
    vmovh Rm, Rn
    …
    vcmpgh Rm, Rn
    vswaph Rm, Rn
    …
    vcmpgeh Rm, Rn
    vaddh Rm, Rn
T1 to T10 in the figure each denote the duration of one machine cycle and occur in the chronological order T1, T2, T3, T4, T5, T6, T7, T8, T9, and T10.

【００８６】First, the operation when the vcmpeh instruction is executed will be described.

【００８７】In machine cycle T1, the vcmpeh instruction latched in the instruction register 101 is decoded by the instruction decoder 102. Because the OP code of the instruction indicates "vcmpeh", the instruction decoder 102 outputs "1000 (cmp)" as the SIMD operation request signal 601 to the SIMD operation control unit 109, so that a comparison is performed on each 16-bit data item in T2; outputs "0" and "0" as the register write request signals 606 and 607, respectively, to the register write control unit 110, so that no register is written; and outputs "01 (eq)" to the SIMD operation flag storage unit 111, so that equality is tested and the flags are updated. In addition, because the operands of the instruction are "Rm" and "Rn", the instruction decoder 102 outputs arithmetic unit control signals to the register file 103 so that the registers Rm and Rn are read out and their data are latched, via the buses 122 and 123, into the latches 114 to 121 in T2 and input to the 16-bit arithmetic units 105 to 108.

【００８８】In machine cycle T2, the SIMD operation control unit 109 receives the SIMD operation request signal 601 and the SIMD operation flag information 608 to 611 and outputs the SIMD operation control signals 602 to 605, which control the 16-bit arithmetic units 105 to 108. Because the SIMD operation request signal is "1000", the outputs of the logic gates 205 to 207 are "0" whatever the values of the SIMD operation flag information 608 to 611, and "1000" is output as each of the SIMD operation control signals 602 to 605. In accordance with the SIMD operation control signals 602 to 605, the 16-bit arithmetic units 105 to 108 compare the data latched in the latches 114 to 121 and output the SIMD flag generation information 621 to 628. Supposing that the 16-bit arithmetic units 105 and 107 find their operands equal, the SIMD flag generation information 621 to 628 becomes "1", "0", "0", "0", "1", "0", "0", "0", respectively. Because the flag update request signal 620 was "01" in machine cycle T1, the SIMD operation flag storage unit 111 outputs "1" as the flag update control signal 629 so that the flags are updated. Because the register write request signals 606 and 607 were "0" and "0" in machine cycle T1, the register write control unit 110 outputs "0" for all of the register write control signals 612 to 619 so that no register is written.

【００８９】In machine cycle T3, because the flag update control signal 629 is "1", the SIMD operation flag storage unit 111 takes in the SIMD flag generation information 621 to 628 and updates the C0 to C3 flags of the SIMD operation flag register SFR. Because, in machine cycle T2, the flag update request signal 620 was "01" and the SIMD flag generation information 621 to 628 was "1", "0", "0", "0", "1", "0", "0", "0", the C0 to C3 flags are updated to "1", "0", "1", "0", respectively. Because the register write control signals 612 to 619 are all "0", the register file 103 performs no register write.

【００９０】Next, the operation when the vmovh instruction is executed will be described.

【００９１】In machine cycle T2, the vmovh instruction latched in the instruction register 101 is decoded by the instruction decoder 102. Because the OP code of the instruction indicates "vmovh", the instruction decoder 102 outputs "0001 (mov)" as the SIMD operation request signal 601 to the SIMD operation control unit 109, so that in T3 a transfer is performed on each 16-bit data item whose C0 to C3 flag in the SIMD operation flag register SFR is "1"; outputs "1" and "0" as the register write request signals 606 and 607, respectively, to the register write control unit 110, so that one register is written; and outputs "00 (nop)" to the SIMD operation flag storage unit 111, so that the flags are not updated. In addition, because the operand of the instruction is "Rm", the instruction decoder 102 outputs arithmetic unit control signals to the register file 103 so that the register Rm is read out and its data is latched, via the bus 122, into the latches 114 to 117 in T3 and input to the 16-bit arithmetic units 105 to 108.

[0092] In machine cycle T3, the SIMD operation control unit 109 receives the SIMD operation request signal 601 and the SIMD operation flag information 608 to 611, and outputs the SIMD operation control signals 602 to 605 that control the 16-bit arithmetic units 105 to 108. Because the SIMD operation flag information 608 and 610 are "1", the logic gates 205 to 207 pass their input data through unchanged, and the SIMD operation control signals 602 and 604 become "0001". Because the SIMD operation flag information 609 and 611 are "0", the outputs of the logic gates 205 to 207 are "0", and "0000 (nop)" is output as the SIMD operation control signals 603 and 605. The 16-bit arithmetic units 105 to 108 therefore follow the SIMD operation control signals 602 to 605 and output the data held in latches 114 and 116 to the bus 124 unchanged. Because the flag update request signal 620 was "00" in machine cycle T2, the SIMD operation flag storage unit 111 outputs "0" as the flag update control signal 629 so that the flags are not updated. Because the register write request signals 606 and 607 were "1" and "0" in machine cycle T2, the register write control unit 110 outputs "1", "0", "1", "0", "0", "0", "0", "0" as the register write control signals 612 to 619, respectively, so as to "write to one register".
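The per-unit gating described for machine cycle T3 can be illustrated with a small software model. This is only a sketch under stated assumptions: the function name `simd_control_signals` and the list representation of the flag information are hypothetical, and only the codes "0001 (mov)" and "0000 (nop)" come from the text.

```python
# Operation codes quoted in the description.
NOP = 0b0000  # "0000 (nop)"
MOV = 0b0001  # "0001 (mov)"


def simd_control_signals(request, flags):
    """Hypothetical model of the SIMD operation control unit 109.

    For each 16-bit arithmetic unit, pass the requested operation code
    through when the corresponding flag is 1, and force NOP when it is 0.
    """
    return [request if flag else NOP for flag in flags]


# Flag information 608 to 611 in the walkthrough is 1, 0, 1, 0.
signals = simd_control_signals(MOV, [1, 0, 1, 0])
```

With these inputs the model yields mov for the first and third units and nop for the second and fourth, which matches the values given for control signals 602 to 605.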

[0093] In machine cycle T4, because the flag update control signal 629 is "0", the SIMD operation flag storage unit 111 does not update the C0 to C3 flags of the SIMD operation flag register SFR. In the register file 103, because the register write control signals 612 and 616 are "1", the logic gates 509 and 511 corresponding to the register to be written become "1", the data on the bus 124 are stored in flip-flops 501, 502, 505, and 506, and the write to the register is carried out.
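The effect of the partial register write in machine cycle T4 can be sketched as a write with one enable bit per 16-bit field. This is an illustrative model, not the circuit itself: the helper names are hypothetical, and numbering the fields from the least significant 16 bits is an assumption that the text does not state.

```python
LANE_BITS = 16
LANES = 4
LANE_MASK = (1 << LANE_BITS) - 1


def split_lanes(value):
    """Split a 64-bit value into four 16-bit fields, least significant first (assumed order)."""
    return [(value >> (LANE_BITS * i)) & LANE_MASK for i in range(LANES)]


def join_lanes(lanes):
    """Reassemble four 16-bit fields into one 64-bit value."""
    value = 0
    for i, lane in enumerate(lanes):
        value |= (lane & LANE_MASK) << (LANE_BITS * i)
    return value


def conditional_write(dest, bus, write_enable):
    """Overwrite only the fields of dest whose write enable is 1 with the bus data."""
    merged = [
        bus_lane if enable else dest_lane
        for dest_lane, bus_lane, enable in zip(split_lanes(dest), split_lanes(bus), write_enable)
    ]
    return join_lanes(merged)


rn = 0x1111_2222_3333_4444   # destination register contents (example values)
bus = 0xAAAA_BBBB_CCCC_DDDD  # data on the bus (example values)
result = conditional_write(rn, bus, [1, 0, 1, 0])
```

In this example only the first and third fields take the bus data, so `result` is `0x1111_BBBB_3333_DDDD`; the other two fields keep their previous contents.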

[0094] The vcmpgh, vswaph, vcmpgeh, and vaddh instructions are executed next, but because they operate in basically the same way as the vcmpeh and vmovh instructions described above, their description is omitted here.

[0095] (Embodiment 2) In the embodiment above, no operation is performed (nop) when a C0 to C3 flag of the SIMD operation flag register SFR is "0", but another operation may be performed instead.

[0096] FIG. 13 shows the instruction specifications of the instructions that the second embodiment of the present invention adds to the instruction specifications of the first embodiment. In the figure, "vswmvh Rm, Rn", "vmvadh Rm, Rn", and "vswadh Rm, Rn" are machine language instructions, written in mnemonic form, that perform a different operation when a C0 to C3 flag is "0" than when it is "1". The operation of each instruction is also illustrated.
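The idea of selecting between two operations per 16-bit field can be sketched as follows. The exact semantics of vswmvh, vmvadh, and vswadh are defined in FIG. 13 and are not reproduced here, so this model deliberately takes the two operations as parameters. All names are hypothetical, and the pairing of "move" with flag 1 and "add" with flag 0 in the example is an assumption made purely for illustration.

```python
MASK16 = 0xFFFF


def split16(value):
    """Split a 64-bit value into four 16-bit fields, least significant first (assumed order)."""
    return [(value >> (16 * i)) & MASK16 for i in range(4)]


def join16(lanes):
    """Reassemble four 16-bit fields into one 64-bit value."""
    value = 0
    for i, lane in enumerate(lanes):
        value |= (lane & MASK16) << (16 * i)
    return value


def simd_two_way(rm, rn, flags, op_if_set, op_if_clear):
    """Apply op_if_set where the flag is 1 and op_if_clear where it is 0, field by field."""
    out = []
    for a, b, flag in zip(split16(rm), split16(rn), flags):
        op = op_if_set if flag else op_if_clear
        out.append(op(a, b) & MASK16)  # keep each result within 16 bits
    return join16(out)


def move(a, b):
    """Take the Rm field."""
    return a


def add(a, b):
    """Add the Rm and Rn fields (wraps at 16 bits after masking)."""
    return a + b


result = simd_two_way(0x0001_0002_0003_0004, 0x0010_0020_0030_0040, [1, 0, 1, 0], move, add)
```

Here the fields with flag 1 receive the Rm value and the fields with flag 0 receive the sum, giving `0x0011_0002_0033_0004`. Passing a function that returns the Rn field unchanged as `op_if_clear` reproduces the nop behavior of the first embodiment.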

[0097] FIGS. 14 to 18 show the parts that differ from the first embodiment of the present invention in order to realize this instruction specification. FIG. 14 is an explanatory diagram (truth table) showing the input/output logic when the SIMD instructions shown in FIGS. 2, 3, and 13 are input to the instruction decoder 102 of FIG. 4. FIG. 15 is a block diagram showing the internal configuration of the SIMD operation control unit 109 of FIG. 4. FIG. 16 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 707 to 714. FIG. 17 is a block diagram showing the internal configuration of the register write control unit 110 of FIG. 4. FIG. 18 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 807 to 813.

[0098] FIG. 19 is a time chart for the case where the instructions shown in FIGS. 2, 3, and 13 are executed on this processor.

[0099] The figure shows the operation when the following instructions are executed.

[0100]
vcmpeh Rm, Rn
vswmvh Rm, Rn
…
vcmpgh Rm, Rn
vmvadh Rm, Rn
…
vcmpgeh Rm, Rn
vswadh Rm, Rn

The operation of these instructions is basically the same as in the first embodiment of the present invention. The only difference is that when a C0 to C3 flag of the SIMD operation flag register SFR is "0", an operation different from the one for "1" is performed, so the description is omitted here.

【０１０１】[0101]

[Effects of the Invention] As is apparent from the above description, the processor of the present invention comprises: instruction decoding means for decoding machine language instructions including a first instruction that verifies whether a condition is satisfied by treating the data specified by an operand as a single piece of data, a second instruction that verifies whether a condition is satisfied by treating the data specified by an operand as a plurality of pieces of data, and a conditional execution instruction that performs a specified operation on the data specified by an operand, treated as a plurality of pieces of data, only where the condition is satisfied; instruction execution means for executing instructions in accordance with the instruction decoding means; first state holding means for indicating that the specified condition was satisfied as a result of verification by the first instruction; and second state holding means for indicating that the specified condition was satisfied as a result of verification by the second instruction.

On decoding the first instruction, the instruction decoding means performs control so that the condition is verified with the data specified by the operand treated as a single piece of data and, if the condition is satisfied, the first state holding means indicates that it is satisfied. On decoding the second instruction, it performs control so that the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the condition is verified for each divided piece and, if the condition is satisfied, the second state holding means indicates this for each piece. On decoding the conditional execution instruction, it performs control so that the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each and the verification result in the second state holding means corresponding to each divided piece is examined. Where the condition is satisfied, the instruction execution means performs the operation specified in the instruction on the corresponding divided piece; where the condition is not satisfied, the instruction execution means does not perform the operation specified in the instruction on the corresponding divided piece.

[0102] As a result, for each of the divided pieces of data, the conditional execution instruction can operate only on the data for which the condition is satisfied, based on the verification results held in the second state holding means, which enables high-speed operation.

[0103] In addition, separately from the first state holding means, which holds the result of verifying a condition with the data specified by an operand treated as a single piece of data (an ordinary comparison operation), the processor has second state holding means, which holds the result of verifying a condition with the data specified by an operand treated as a plurality of pieces of data (a comparison operation on SIMD data). Ordinary comparison operations and SIMD data comparison operations can therefore be performed independently, which makes programs more flexible.
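The independence of the two state holding means can be illustrated with a minimal software model. This is a sketch only: the class `Flags` and the functions `cmp_eq` and `vcmp_eq_h` are hypothetical names, the equality condition is just one example of a verified condition, and the least-significant-first field order is an assumption.

```python
class Flags:
    """Hypothetical model holding both kinds of comparison result."""

    def __init__(self):
        self.scalar = 0           # first state holding means: one result for whole-word compares
        self.simd = [0, 0, 0, 0]  # second state holding means: one result per 16-bit field


def cmp_eq(flags, a, b):
    """Ordinary comparison: treats each operand as a single piece of data."""
    flags.scalar = int(a == b)


def vcmp_eq_h(flags, a, b):
    """SIMD comparison: compares four 16-bit fields independently."""
    flags.simd = [
        int(((a >> (16 * i)) & 0xFFFF) == ((b >> (16 * i)) & 0xFFFF))
        for i in range(4)
    ]


f = Flags()
vcmp_eq_h(f, 0x0001_0002_0003_0004, 0x0001_FFFF_0003_FFFF)
simd_before = list(f.simd)
cmp_eq(f, 5, 6)  # an ordinary compare in between does not disturb the SIMD results
```

Because the ordinary compare writes only `scalar`, the per-field results recorded by the SIMD compare are still available to a later conditional execution instruction.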

[0104] As described above, the practical value of the technology of the present invention is great.

[Brief description of the drawings]

FIG. 1 is a diagram showing the bit configuration of the SIMD operation flag register SFR provided in the processor according to the first embodiment of the present invention.

FIG. 2 is a diagram showing the instruction specifications of the comparison instructions among the SIMD instructions of the embodiment.

FIG. 3 is a diagram showing the instruction specifications of the conditional instructions among the SIMD instructions of the embodiment.

FIG. 4 is a block diagram showing the configuration of the main part of the processor according to the embodiment of the present invention.

FIG. 5 is an explanatory diagram (truth table) showing the input/output logic when a SIMD instruction is input to the instruction decoder 102 of the embodiment.

FIG. 6 is a block diagram showing the internal configuration of the SIMD operation control unit 109 of the embodiment.

FIG. 7 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 205 to 207 of the SIMD operation control unit 109 of the embodiment.

FIG. 8 is a block diagram showing the internal configuration of the register write control unit 110 of the embodiment.

FIG. 9 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 307 to 310 of the register write control unit 110 of the embodiment.

FIG. 10 is a block diagram showing the internal configuration of the SIMD operation flag storage unit 111 of the embodiment.

FIG. 11 is a block diagram showing the configuration of one register in the register file 103 of the embodiment.

FIG. 12 is a time chart for the case where SIMD instructions are executed in the embodiment.

FIG. 13 is a diagram showing the instruction specifications of the instructions that the second embodiment of the present invention adds to the instruction specifications of the first embodiment.

FIG. 14 is an explanatory diagram (truth table) showing the input/output logic when a SIMD instruction is input to the instruction decoder 102 in the embodiment.

FIG. 15 is a block diagram showing the internal configuration of the SIMD operation control unit 109 of the embodiment.

FIG. 16 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 707 to 714 of the SIMD operation control unit 109 of the embodiment.

FIG. 17 is a block diagram showing the internal configuration of the register write control unit 110 of the embodiment.

FIG. 18 is an explanatory diagram (truth table) showing the input/output logic of the logic gates 807 to 813 of the register write control unit 110 of the embodiment.

FIG. 19 is a time chart for the case where SIMD instructions are executed in the embodiment.

FIG. 20 is a diagram showing the specifications of some comparison instructions among the SIMD instructions of the "Pentium II" processor, a conventional information processing apparatus.

FIG. 21 is a diagram showing the specifications of some comparison instructions among the SIMD instructions of the "Ultra SPARC-IIi" processor, a conventional information processing apparatus.

FIG. 22 is a diagram showing the specifications of some minimum value instructions among the SIMD instructions of the "Alpha" processor, a conventional information processing apparatus.

FIG. 23 is a diagram showing the specifications of some comparison instructions and minimum value instructions among the SIMD instructions of the "PowerPC" processor, a conventional information processing apparatus.

[Explanation of symbols]

101 Instruction register
102 Instruction decoder
103 Register file
104 64-bit arithmetic unit
105 to 108 16-bit arithmetic units
109 SIMD operation control unit
110 Register write control unit
111 SIMD operation flag storage unit
112 to 121 Latches
122 to 125 Buses
201 to 204 Flip-flops
205 to 207 Logic gates
301 to 306 Flip-flops
307 to 310 Logic gates
401 to 407 Flip-flops
408 to 413 Logic gates
501 to 508 Flip-flops
509 to 540 Logic gates
601 SIMD operation request signal
602 to 605 SIMD operation control signals
606, 607 Register write request signals
608 to 611 SIMD operation flag information
612 to 619 Register write control signals
620 Flag update request signal
621 to 628 SIMD flag generation information
701 to 706 Flip-flops
707 to 714 Logic gates
801 to 806 Flip-flops
807 to 813 Logic gates

Claims

[Claims]

1. A processor comprising: instruction decoding means for decoding machine language instructions including a first instruction that verifies whether a condition is satisfied by treating data specified by an operand as a single piece of data, and a second instruction that verifies whether a condition is satisfied by treating data specified by an operand as a plurality of pieces of data; instruction execution means for executing instructions in accordance with the instruction decoding means; first state holding means for indicating that a specified condition is satisfied as a result of verification by the first instruction; and second state holding means for indicating that a specified condition is satisfied as a result of verification by the second instruction, wherein the instruction decoding means performs control such that, when the first instruction is decoded, the condition is verified with the data specified by the operand treated as a single piece of data and, if the condition is satisfied, the first state holding means indicates that the condition is satisfied, and, when the second instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the condition is verified for each of the divided pieces of data and, if the condition is satisfied, the second state holding means indicates for each piece of data that the condition is satisfied.

2. The processor according to claim 1, wherein said plurality of divided data all have the same number of bits.

3. A processor comprising: instruction decoding means for decoding machine language instructions including a first instruction that verifies whether a condition is satisfied by treating data specified by an operand as a single piece of data, and a second instruction that verifies whether a condition is satisfied by treating data specified by an operand as a plurality of pieces of data; instruction execution means for executing instructions in accordance with the instruction decoding means; and state holding means for indicating that a specified condition is satisfied as a result of the verification, wherein the instruction decoding means performs control such that, when the first instruction is decoded, the condition is verified with the data specified by the operand treated as a single piece of data and, if the condition is satisfied, the state holding means indicates that the condition is satisfied, and, when the second instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the condition is verified for each of the divided pieces of data and, if the condition is satisfied, the state holding means indicates for each piece of data that the condition is satisfied.

4. The processor according to claim 3, wherein said plurality of divided data all have the same number of bits.

5. A processor comprising: instruction decoding means for decoding machine language instructions including a first instruction that verifies whether a condition is satisfied by treating data specified by an operand as a single piece of data, a second instruction that verifies whether a condition is satisfied by treating data specified by an operand as a plurality of pieces of data, and a conditional execution instruction that performs a specified operation on data specified by an operand, treated as a plurality of pieces of data, only where a condition is satisfied; instruction execution means for executing instructions in accordance with the instruction decoding means; first state holding means for indicating that a specified condition is satisfied as a result of verification by the first instruction; and second state holding means for indicating that a specified condition is satisfied as a result of verification by the second instruction, wherein the instruction decoding means performs control such that, when the first instruction is decoded, the condition is verified with the data specified by the operand treated as a single piece of data and, if the condition is satisfied, the first state holding means indicates that the condition is satisfied; when the second instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the condition is verified for each of the divided pieces of data and, if the condition is satisfied, the second state holding means indicates for each piece of data that the condition is satisfied; and, when the conditional execution instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the verification result in the second state holding means corresponding to each of the divided pieces of data is examined, the instruction execution means performs the operation specified in the instruction on the corresponding divided piece of data where the condition is satisfied, and the instruction execution means does not perform the operation specified in the instruction on the corresponding divided piece of data where the condition is not satisfied.

6. The processor according to claim 5, wherein the conditional execution instruction is an instruction that performs a transfer where the condition indicated by the second state holding means is satisfied.

7. The processor according to claim 5, wherein the conditional execution instruction is an instruction that performs an arithmetic operation where the condition indicated by the second state holding means is satisfied.

8. A processor comprising: instruction decoding means for decoding machine language instructions including a first instruction that verifies whether a condition is satisfied by treating data specified by an operand as a single piece of data, a second instruction that verifies whether a condition is satisfied by treating data specified by an operand as a plurality of pieces of data, and a conditional execution instruction that performs a specified operation on data specified by an operand, treated as a plurality of pieces of data, only where a condition is satisfied; instruction execution means for executing instructions in accordance with the instruction decoding means; first state holding means for indicating that a specified condition is satisfied as a result of verification by the first instruction; and second state holding means for indicating that a specified condition is satisfied as a result of verification by the second instruction, wherein the instruction decoding means performs control such that, when the first instruction is decoded, the condition is verified with the data specified by the operand treated as a single piece of data and, if the condition is satisfied, the first state holding means indicates that the condition is satisfied; when the second instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the condition is verified for each of the divided pieces of data and, if the condition is satisfied, the second state holding means indicates for each piece of data that the condition is satisfied; and, when the conditional execution instruction is decoded, the data specified by the operand is divided into a plurality of pieces of data of a specific number of bits each, the verification result in the second state holding means corresponding to each of the divided pieces of data is examined, the instruction execution means performs the first operation specified in the instruction on the corresponding divided piece of data where the condition is satisfied, and the instruction execution means does not perform the second operation specified in the instruction on the corresponding divided piece of data where the condition is not satisfied.