JP2006338684A

JP2006338684A - Processor

Info

Publication number: JP2006338684A
Application number: JP2006210393A
Authority: JP
Inventors: Tetsuya Tanaka; 哲也田中; Hazuki Okabayashi; はづき岡林; Takehito Heiji; 岳人瓶子; Hajime Ogawa; 一小川; Yoshihiro Koga; 義宏古賀; Manabu Kuroda; 学黒田; Masato Suzuki; 正人鈴木; Tokuzo Kiyohara; 督三清原; Takeshi Tanaka; 健田中; Hideshi Nishida; 英志西田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2006-08-01
Filing date: 2006-08-01
Publication date: 2006-12-14

Abstract

<P>PROBLEM TO BE SOLVED: To provide a processor which executes an SIMD operation with high flexibility with less constraints regarding a position of an operand to be an object for the SIMD operation. <P>SOLUTION: The processor has a decoding part 20 and an operation part 40, etc., when the decode part 20 decodes instructions "vhaddh Rc, Ra, Rb", an arithmetic logic/comparison arithmetic unit 41, etc. (i) add high-order 16 bits of a register Ra to high-order 16 bits of a register Rb, store the result in high-order 16 bits of a register Rc, in parallel with this, (ii) add low-order 16 bits of the register Ra to the high-order 16 bits of the register Rb and store the result in low-order 16 bits of the register Rc. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、ＤＳＰやＣＰＵ等のプロセッサに関し、特にＳＩＭＤ型の命令を実行するプロセッサに関する。 The present invention relates to a processor such as a DSP or a CPU, and more particularly to a processor that executes SIMD type instructions.

従来、ＳＩＭＤ（Single Instruction Multiple Data）型の命令をサポートしているプロセッサとして、米国インテル社のＰｅｎｔｉｕｍ（登録商標）／同III／同４のＭＭＸ／ＳＳＥ／ＳＳＥ２等がある。 Conventionally, as processors that support SIMD (Single Instruction Multiple Data) type instructions, there are Pentium (registered trademark) / III // 4 MMX / SSE / SSE2 of Intel Corporation.

例えば、ＭＭＸであれば、６４ビット長のＭＭＸレジスタに格納された最大８個の整数を対象として、１つの命令で同一のオペレーションを実行することができる。 For example, in the case of MMX, the same operation can be executed with one instruction for up to eight integers stored in a 64-bit MMX register.

しかしながら、上記のような従来のプロセッサは、ＳＩＭＤ演算の対象となるオペランドの位置に関する制約が多いという問題がある。 However, the conventional processor as described above has a problem that there are many restrictions on the position of the operand to be subjected to the SIMD operation.

例えば、第１レジスタの上位桁及び下位桁それぞれに数値Ａ及びＢが格納され、第２レジスタの上位桁及び下位桁それぞれに数値Ｃ及びＤが格納された状態で、第１レジスタと第２レジスタをオペランドとするＳＩＭＤ型の加算命令を従来のプロセッサが実行した場合には、得られる加算値は、Ａ＋Ｃ、及び、Ｂ＋Ｄである。つまり、両レジスタの上位桁に格納されたデータ同士の加算値と両レジスタの下位桁に格納されたデータ同士の加算値が得られることとなり、レジスタでのデータの格納位置によって演算対象が一義的に決定されてしまう。 For example, in a state in which numerical values A and B are stored in the upper and lower digits of the first register and numerical values C and D are stored in the upper and lower digits of the second register, respectively, When a conventional processor executes a SIMD type addition instruction with オペランド as an operand, the obtained addition values are A + C and B + D. In other words, an addition value between the data stored in the upper digits of both registers and an addition value between the data stored in the lower digits of both registers are obtained, and the operation target is uniquely determined by the data storage position in the registers. Will be decided.

そのために、例えば、上記のような第１レジスタ及び第２レジスタに対して、Ａ＋Ｄ、及び、Ｂ＋Ｃという加算値を求めたい場合には、一方のレジスタに格納された上位桁のデータと下位桁のデータの格納位置を入れ替えてから、ＳＩＭＤ型の加算命令を実行したり、あるいは、ＳＩＭＤ型の加算命令を用いることなく、通常のＳＩＳＤ（Single Instruction Single Data）型の加算命令を２回実行したりする必要がある。 Therefore, for example, when it is desired to obtain addition values of A + D and B + C for the first register and the second register as described above, the upper digit data and the lower digit data stored in one register are obtained. Execute SIMD type addition instruction after changing data storage location, or execute normal SISD (Single Instruction Single Data) type addition instruction twice without using SIMD type addition instruction There is a need to.

ところで、近年の通信のデジタル化に伴い、フーリエ変換やフィルタ処理等のデジタル信号処理が必要とされる画像処理や音声処理の分野では、複数のデータに対して同一の演算処理を施すことが必要とされるが、その場合に、例えば、データの並びにおける中心位置から左右対称に位置するデータに対して同一の演算を施す等の処理が必要とされるケースが多い。このようなケースでは、演算の対象となる２種類のオペランドは逆順に並べられ、例えば、２つのレジスタにおける一方の上位桁に格納されたデータと他方の下位桁に格納されたデータとを対象として演算をしなければならない。 By the way, with the recent digitalization of communication, in the field of image processing and sound processing that require digital signal processing such as Fourier transform and filter processing, it is necessary to perform the same arithmetic processing on a plurality of data. However, in that case, for example, there are many cases where a process such as performing the same operation on data located symmetrically from the center position in the data arrangement is required. In such a case, the two types of operands to be operated are arranged in reverse order, for example, data stored in one upper digit and data stored in the other lower digit in two registers. You have to do the operation.

ところが、従来のプロセッサによるＳＩＭＤ演算では、上述のように、演算の対象となるオペランドは同じ順序で並べられている必要があるために、オペランドの並び替え等が必要とされ、デジタル信号処理に多くの時間を要してしまうという問題がある。 However, in the SIMD calculation by the conventional processor, as described above, the operands to be calculated need to be arranged in the same order. Therefore, the operands need to be rearranged, which is often used for digital signal processing. There is a problem that it takes a long time.

そこで、本発明は、このような状況に鑑みてなされたものであり、ＳＩＭＤ演算の対象となるオペランドの位置に関する制約が少ない、柔軟性の高いＳＩＭＤ演算を実行するプロセッサを提供することを目的とする。より具体的には、例えば、デジタル信号処理を高速に実行するマルチメディア用途に適したプロセッサを提供することを目的とする。 Therefore, the present invention has been made in view of such a situation, and an object of the present invention is to provide a processor that executes a highly flexible SIMD operation with few restrictions on the position of an operand to be subjected to SIMD operation. To do. More specifically, for example, an object of the present invention is to provide a processor suitable for multimedia use that executes digital signal processing at high speed.

上記目的を達成するために、本発明に係るプロセッサは、１つの命令で複数のデータを演算するＳＩＭＤ型の命令を実行するプロセッサであって、命令を解読する解読手段と、解読手段による解読結果に基づいて、命令を実行する実行手段とを備え、前記実行手段は、演算の種類を指定する命令コードとｎ（≧２）個のデータの並びからなる第１データ群を指定する第１オペランドとｎ個のデータの並びからなる第２データ群を指定する第２オペランドとを含むＳＩＭＤ型の命令が前記解読手段によって解読された場合に、前記第１データ群を構成するｎ個のデータの並びにおける第ｉ番目のデータと前記第２データ群を構成するｎ個のデータの並びにおける第ｊ番目のデータとからなるｎ組それぞれに対して前記命令コードで指定される演算を行い、前記ｉ＝１，２，…，ｎ、前記ｊ＝一定であることを特徴とする。 In order to achieve the above object, a processor according to the present invention is a processor that executes a SIMD type instruction that calculates a plurality of data with one instruction, and includes a decoding means for decoding the instruction, and a decoding result by the decoding means And an execution means for executing an instruction, wherein the execution means specifies a first data group consisting of an instruction code for specifying the type of operation and a sequence of n (≧ 2) pieces of data. And a second operand designating a second data group consisting of a sequence of n data, when the SIMD type instruction is decoded by the decoding means, the n data constituting the first data group An operation specified by the instruction code for each of n sets of the i-th data in the sequence and the j-th data in the sequence of n data constituting the second data group. Was carried out, the i = 1, 2, ..., n, wherein said a j = constant.

たとえば、前記ｎは、２であり、前記第１データ群は、第１番目データと第２番目データとからなり、前記第２データ群は、第１番目データと第２番目データとからなり、前記実行手段は、前記第１データ群の第１番目データと第２データ群の第１番目データとの組に対して前記演算を行うとともに、前記第１データ群の第２番目データと第２データ群の第１番目データとの組に対して前記演算を行う。 For example, the n is 2, the first data group is composed of first data and second data, the second data group is composed of first data and second data, The execution means performs the calculation on a set of the first data of the first data group and the first data of the second data group, and the second data and second of the first data group. The calculation is performed on a set of the data group and the first data.

ここで、前記命令コードが指定する演算の種類は、乗算、積和又は積差であり、前記命令は、演算の結果を格納するための第３データを指定する第３オペランドを含み、前記実行手段は、前記２つの組に対して前記演算を行って得られた結果それぞれの下位部分又は上位部分を前記第３データに格納してもよい。 Here, the type of operation specified by the instruction code is multiplication, product-sum or product-difference, and the instruction includes a third operand that specifies third data for storing the result of the operation. The means may store the lower part or the upper part of each result obtained by performing the calculation on the two sets in the third data.

また、たとえば、前記ｎは、４であり、前記第１データ群は、第１〜第４番目データの並びからなり、前記第２データ群は、第１〜第４番目データの並びからなり、前記実行手段は、前記第１データ群の第１番目データと第２データ群の第１番目データとの組、前記第１データ群の第２番目データと第２データ群の第１番目データとの組、前記第１データ群の第３番目データと第２データ群の第１番目データとの組、及び、前記第１データ群の第４番目データと第２データ群の第１番目データとの組に対して前記演算を行う。 Further, for example, the n is 4, the first data group is composed of a sequence of first to fourth data, and the second data group is composed of a sequence of first to fourth data, The execution means includes a set of the first data of the first data group and the first data of the second data group, the second data of the first data group and the first data of the second data group, A set of the third data of the first data group and the first data of the second data group, and the fourth data of the first data group and the first data of the second data group, The above operation is performed on a set of

ここで、前記命令コードが指定する演算の種類は、乗算、積和又は積差であり、前記命令は、演算の結果を格納するための第３データを指定する第３オペランドを含み、前記実行手段は、前記４つの組に対して前記演算を行って得られた結果それぞれの下位部分又は上位部分を前記第３データに格納してもよい。 Here, the type of operation specified by the instruction code is multiplication, product-sum or product-difference, and the instruction includes a third operand that specifies third data for storing the result of the operation. The means may store the lower part or the upper part of each result obtained by performing the calculation on the four sets in the third data.

なお、本発明は、このような特徴的な命令を実行するプロセッサとして実現することができるだけでなく、複数のデータ等に対する演算処理方法として実現したり、特徴的な命令を含むプログラムとして実現したりすることもできる。そして、そのようなプログラムは、ＣＤ−ＲＯＭ等の記録媒体やインターネット等の伝送媒体を介して流通させることができるのは言うまでもない。 The present invention can be realized not only as a processor that executes such characteristic instructions, but also as an arithmetic processing method for a plurality of data or the like, or as a program including characteristic instructions. You can also Needless to say, such a program can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.

本発明に係るプロセッサは、１つの命令で複数のデータを演算するＳＩＭＤ型の命令を実行するプロセッサであって、２つのデータ列に対して、同一順位のデータ同士だけでなく、一方のデータ列の演算対象を共通（位置を固定）にしたデータ組に対して並列に演算を実行する。したがって、複数のデータ要素に対して共通のデータ要素との演算を繰り返すが高速化され、マルチメディア処理等に好適なプロセッサが実現される。 The processor according to the present invention is a processor that executes a SIMD type instruction that calculates a plurality of data with one instruction, and not only two pieces of data in the same order but also one data row with respect to two data rows. The calculation is executed in parallel on the data set having the same calculation target (fixed position). Therefore, the calculation with a common data element is repeated for a plurality of data elements, but the processing speed is increased and a processor suitable for multimedia processing or the like is realized.

ここで、演算の種類が乗算、積和又は積差である場合には、得られた演算結果それぞれの下位部分だけを出力したり、上位部分だけを出力したり、一部の演算結果だけを出力してもよい。これによって、複素数の内積等において、整数データを扱う場合や固定小数点データを扱う場合に必要となるビット切り出し処理が演算とともに実行されるので、複素数等の２次元データを用いる演算、例えば、２次元座標を用いる画像処理や振幅と位相の２次元表現が用いられる音声信号の処理等が高速化される。 Here, when the type of operation is multiplication, sum of products or product difference, only the lower part of each obtained operation result is output, only the upper part is output, or only part of the operation results are output. It may be output. As a result, in the inner product of complex numbers, etc., the bit cut-out processing required when handling integer data or fixed-point data is executed together with the calculation, so that an operation using two-dimensional data such as a complex number, for example, two-dimensional Image processing using coordinates, audio signal processing using a two-dimensional representation of amplitude and phase, and the like are accelerated.

以上のように、本発明に係るプロセッサは、通常のマイコンに比べて並列性が高く、ＡＶメディア系信号処理が高速であり、携帯電話、モバイルＡＶ機器、デジタルＴＶ、ＤＶＤ等に共通のコアプロセッサとして使用可能であり、高性能・高コストパフォーマンスなマルチメディア機器の出現が望まれる今日における実用的価値は極めて高いと言える。 As described above, the processor according to the present invention has higher parallelism than ordinary microcomputers, high-speed AV media signal processing, and a common core processor for mobile phones, mobile AV devices, digital TVs, DVDs, and the like. It can be said that the practical value is extremely high today when the appearance of high performance and high cost performance multimedia equipment is desired.

本発明に係るプロセッサのアーキテクチャについて説明する。本プロセッサの命令は通常のマイコンに比べて並列性が高く、ＡＶメディア系信号処理技術分野をターゲットとして開発された汎用プロセッサである。携帯電話、モバイルＡＶ機器、デジタルＴＶ、ＤＶＤ等に共通コアを使用することにより、ソフト再利用性を向上させることができる。また、本プロセッサは、高性能・高コストパフォーマンスで多くのメディア処理を実現することができ、さらに、開発効率向上を目的とした高級言語開発環境を提供する。 The architecture of the processor according to the present invention will be described. The instructions of this processor have higher parallelism than ordinary microcomputers, and are general-purpose processors developed for the AV media signal processing technology field. By using a common core for mobile phones, mobile AV devices, digital TVs, DVDs, etc., software reusability can be improved. In addition, this processor can realize a lot of media processing with high performance and high cost performance, and provides a high-level language development environment for the purpose of improving development efficiency.

図１は、本プロセッサの概略ブロック図である。本プロセッサ１は、命令制御部１０、デコード部２０、レジスタファイル３０、演算部４０、Ｉ／Ｆ部５０、命令メモリ部６０、データメモリ部７０、拡張レジスタ部８０及びＩ／Ｏインターフェース部９０から構成される。演算部４０は、ＳＩＭＤ型命令の演算を実行する算術論理・比較演算器４１〜４３、乗算・積和演算器４４、バレルシフタ４５、除算器４６及び変換器４７からなる。乗算・積和演算器４４は、ビット精度を落とさないように、最長で６５ビットで累算する。また、乗算・積和演算器４４は、算術論理・比較演算器４１〜４３と同様、ＳＩＭＤ型命令の実行が可能である。更に、このプロセッサ１は、算術論理・比較演算命令が最大３並列実行可能である。 FIG. 1 is a schematic block diagram of the processor. The processor 1 includes an instruction control unit 10, a decoding unit 20, a register file 30, a calculation unit 40, an I / F unit 50, an instruction memory unit 60, a data memory unit 70, an expansion register unit 80, and an I / O interface unit 90. Composed. The arithmetic unit 40 includes arithmetic logic / comparison arithmetic units 41 to 43 for executing arithmetic operations of SIMD type instructions, a multiplication / product-sum arithmetic unit 44, a barrel shifter 45, a divider 46 and a converter 47. The multiplication / product-sum calculator 44 accumulates up to 65 bits so as not to degrade the bit precision. Further, the multiplication / product-sum operation unit 44 can execute SIMD type instructions similarly to the arithmetic logic / comparison operation units 41 to 43. Further, the processor 1 can execute a maximum of three arithmetic logic / comparison operation instructions in parallel.

図２は、算術論理・比較演算器４１〜４３の概略図を示す。算術論理・比較演算器４１〜４３それぞれは、ＡＬＵ部４１ａ、飽和処理部４１ｂ及びフラグ部４１ｃから構成される。ＡＬＵ部４１ａは、算術演算器、論理演算器、比較器、ＴＳＴ器からなる。対応する演算データのビット幅は、８ビット(演算器を４並列で使用)、１６ビット(演算器を２並列で使用)、３２ビットである(全演算器で３２ビットデータ処理)。更に算術演算結果に対しては、フラグ部４１ｃ等により、オーバーフローの検出とコンディションフラグの生成が行われる。各演算器、比較器、ＴＳＴ器の結果は、算術右シフト、飽和処理部４１ｂによる飽和、最大・最小値検出、絶対値生成処理が行われる。 FIG. 2 shows a schematic diagram of the arithmetic logic / comparison arithmetic units 41-43. Each of the arithmetic logic / comparison arithmetic units 41 to 43 includes an ALU unit 41a, a saturation processing unit 41b, and a flag unit 41c. The ALU unit 41a includes an arithmetic operation unit, a logical operation unit, a comparator, and a TST unit. The corresponding arithmetic data has a bit width of 8 bits (4 arithmetic units are used in parallel), 16 bits (2 arithmetic units are used in parallel), and 32 bits (32-bit data processing in all arithmetic units). Further, for the arithmetic operation result, overflow detection and condition flag generation are performed by the flag unit 41c and the like. The arithmetic unit, the comparator, and the TST unit are subjected to arithmetic right shift, saturation by the saturation processing unit 41b, maximum / minimum value detection, and absolute value generation processing.

図３は、バレルシフタ４５の構成を示すブロック図である。バレルシフタ４５は、セレクタ４５ａ、４５ｂ、上位バレルシフタ４５ｃ、下位バレルシフタ４５ｄ及び飽和処理部４５ｅから構成され、データの算術シフト(２の補数体系のシフト)または、論理シフト(符号なしシフト)を実行する。通常は、３２ビットもしくは、６４ビットのデータを入出力としている。レジスタ３０ａ、３０ｂに格納された被シフトデータに対して、別のレジスタまたは即値でシフト量が指定される。データは、左６３ビット〜右６３ビットの算術または論理シフトが行われ、入力ビット長で出力される。 FIG. 3 is a block diagram showing the configuration of the barrel shifter 45. The barrel shifter 45 includes selectors 45a and 45b, an upper barrel shifter 45c, a lower barrel shifter 45d, and a saturation processing unit 45e, and executes an arithmetic shift (shift of 2's complement system) or logical shift (unsigned shift) of data. Normally, 32-bit or 64-bit data is input / output. For the data to be shifted stored in the registers 30a and 30b, the shift amount is designated by another register or an immediate value. The data is arithmetically or logically shifted from the left 63 bits to the right 63 bits and output with an input bit length.

また、バレルシフタ４５は、ＳＩＭＤ型命令に対して、８、１６、３２、６４ビットのデータをシフトすることができる。例えば、８ビットデータのシフトを４並列で処理することができる。 In addition, the barrel shifter 45 can shift 8, 16, 32, and 64-bit data with respect to the SIMD type instruction. For example, a shift of 8-bit data can be processed in 4 parallel.

算術シフトは、２の補数体系のシフトであり、加算や減算時の小数点の位置合わせや、２のべき乗の乗算(２、２の２乗、２の（−１）乗、２の（−２）乗倍など)等のために行われる。 Arithmetic shift is a shift of 2's complement system, and alignment of decimal point at the time of addition or subtraction, multiplication by power of 2 (2, 2 squared, 2 (-1) power, 2 (-2) Etc.) etc.

図４は、変換器４７の構成を示すブロック図である。変換器４７は、飽和ブロック(SAT)４７ａ、BSEQブロック４７ｂ、MSKGENブロック４７ｃ、VSUMBブロック４７、BCNTブロック４７ｅ及びILブロック４７ｆから構成される。 FIG. 4 is a block diagram showing the configuration of the converter 47. The converter 47 includes a saturation block (SAT) 47a, a BSEQ block 47b, an MSKGEN block 47c, a VSUMB block 47, a BCNT block 47e, and an IL block 47f.

飽和ブロック(SAT)４７ａは、入力データに対する飽和処理を行う。３２ビットデータを飽和処理するブロックを２つ持つことにより、２並列のＳＩＭＤ型命令をサポートする。 The saturation block (SAT) 47a performs saturation processing on the input data. By having two blocks that saturate 32-bit data, two parallel SIMD instructions are supported.

BSEQブロック４７ｂは、MSBから連続する０か１をカウントする。
MSKGENブロック４７ｃは、指定されたビット区間を１、それ以外を０として出力する。 The BSEQ block 47b counts consecutive 0s or 1s from the MSB.
The MSKGEN block 47c outputs the designated bit interval as 1, and outputs the other as 0.

VSUMBブロック４７ｄは、入力データを指定されたビット幅に区切り、その総和を出力する。 The VSUMB block 47d divides input data into designated bit widths and outputs the sum.

BCNTブロック４７ｅは、入力データで１となっているビットの数をカウントする。
ILブロック４７ｆは、入力データを指定されたビット幅に区切り、各データブロックを入れ換えた値を出力する。 The BCNT block 47e counts the number of bits that are 1 in the input data.
The IL block 47f divides the input data into a designated bit width and outputs a value obtained by replacing each data block.

図５は、除算器４６の構成を示すブロック図である。除算器４６は、被除数を６４ビット、除数を３２ビットとし、商と剰余を３２ビットずつ出力する。商と剰余を求めるまでに３４サイクルを必要とする。符号付き、符号なし、両方のデータを扱うことが可能である。ただし、被除数と除数において符号の有無の設定は共通とする。その他、オーバーフローフラグ、０除算フラグを出力する機能を有する。 FIG. 5 is a block diagram showing a configuration of the divider 46. The divider 46 sets the dividend to 64 bits and the divisor to 32 bits, and outputs the quotient and the remainder each 32 bits. 34 cycles are required to find the quotient and the remainder. Both signed and unsigned data can be handled. However, the setting of the presence / absence of a sign is common to the dividend and the divisor. In addition, it has a function of outputting an overflow flag and a division by zero flag.

図６は、乗算・積和演算器４４の構成を示すブロック図である。乗算・積和演算器４４は、２つの３２ビット乗算器（MUL）４４ａ、４４ｂ、３つの６４ビット加算器（Adder）４４ｃ〜４４ｅ、セレクタ４４ｆ及び飽和処理部（Saturation）４４ｇから構成され、以下の乗算、積和演算を行う。 FIG. 6 is a block diagram showing a configuration of the multiplication / product-sum calculator 44. The multiplier / product-sum calculator 44 includes two 32-bit multipliers (MUL) 44a and 44b, three 64-bit adders (Adder) 44c to 44e, a selector 44f, and a saturation processing unit (Saturation) 44g. Multiply and multiply and accumulate.

・３２×３２ビットのsignedの乗算、積和、積差演算
・３２×３２ビットのunsignedの乗算
・１６×１６ビットの２並列のsignedの乗算、積和、積差演算
・３２×１６ビットの２並列のsignedの乗算、積和、積差演算 32 × 32 bit signed multiplication, product sum, product difference operation 32 × 32 bit unsigned multiplication 16 × 16 bit 2 parallel signed multiplication, product sum, product difference operation 32 × 16 bit 2 parallel signed multiplication, product sum, product difference operation

これらの演算を整数、固定小数点フォーマット（ｈ１、ｈ２、ｗ１、ｗ２）のデータに対して行う。また、これらの演算に対し、丸め、飽和を行う。 These operations are performed on data in integer and fixed-point format (h1, h2, w1, w2). Also, rounding and saturation are performed for these operations.

図７は、命令制御部１０の構成を示すブロック図である。命令制御部１０は、命令キャッシュ１０ａ、アドレス管理部１０ｂ、命令バッファ１０ｃ〜１０ｅ、ジャンプバッファ１０ｆ及びローテーション部（rotation）１０ｇから構成され、通常時及び分岐時の命令供給を行う。１２８ビットの命令バッファを３つ（命令バッファ１０ｃ〜１０ｅ）持つことにより、最大並列実行数に対応している。分岐処理に関しては、分岐実行前に、ジャンプバッファ１０ｆ等を介して、後述するＴＡＲレジスタに予め分岐先アドレスを格納する(settar命令)。ＴＡＲレジスタに格納された分岐先アドレスを使用して、分岐を行う。 FIG. 7 is a block diagram illustrating a configuration of the instruction control unit 10. The instruction control unit 10 includes an instruction cache 10a, an address management unit 10b, instruction buffers 10c to 10e, a jump buffer 10f, and a rotation unit (rotation) 10g, and supplies instructions at normal time and branch time. By having three 128-bit instruction buffers (instruction buffers 10c to 10e), the maximum number of parallel executions is supported. Regarding branch processing, a branch destination address is stored in advance in a TAR register, which will be described later, via the jump buffer 10f or the like before branch execution (settar instruction). Branch is performed using the branch destination address stored in the TAR register.

なお、本プロセッサ１はＶＬＩＷアーキテクチャを持つプロセッサである。ここで、ＶＬＩＷアーキテクチャとは、１つの命令語中に複数の命令(ロード、ストア、演算、分岐など)を格納し、それらを全て同時に実行するアーキテクチャである。プログラマは、並列実行可能な命令を１つの発行グループとして記述することによって、その発行グループを並列処理させることができる。本明細書では、発行グループの区切りを“;;”で示す。以下に表記例を示す。 The processor 1 is a processor having a VLIW architecture. Here, the VLIW architecture is an architecture that stores a plurality of instructions (load, store, operation, branch, etc.) in one instruction word and executes them all at the same time. The programmer can process the issue groups in parallel by describing instructions that can be executed in parallel as one issue group. In this specification, the issue group delimiters are indicated by “;;”. A notation example is shown below.

（例１）
mov r1, 0x23;; (Example 1)
mov r1, 0x23 ;;

この命令記述は、命令movのみを実行することを意味する。 This instruction description means that only the instruction mov is executed.

（例２）
mov r1, 0x38
add r0, r1, r2
sub r3, r1, r2;; (Example 2)
mov r1, 0x38
add r0, r1, r2
sub r3, r1, r2 ;;

これらの命令記述は、命令mov、add、subを３並列で実行することを意味する。 These instruction descriptions mean that instructions mov, add, and sub are executed in parallel.

命令制御部１０は、発行グループを識別し、デコード部２０に送る。デコード部２０では、発行グループの命令を解析し、必要な資源を制御する。 The instruction control unit 10 identifies the issue group and sends it to the decoding unit 20. The decoding unit 20 analyzes the instruction of the issuing group and controls necessary resources.

次に、本プロセッサ１が備えるレジスタについて説明する。
本プロセッサ１のレジスタセットは、以下の表１に示される通りである。 Next, the registers included in the processor 1 will be described.
The register set of the processor 1 is as shown in Table 1 below.

また、本プロセッサ１のフラグセット（後述する条件フラグレジスタ等で管理されるフラグ）は、以下の表２に示される通りである。 The flag set of the processor 1 (flags managed by a condition flag register or the like described later) is as shown in Table 2 below.

図８は、汎用レジスタ（Ｒ０〜Ｒ３１）３０ａの構造を示す図である。汎用レジスタ（Ｒ０〜Ｒ３１）３０ａは、実行対象となっているタスクのコンテキストの一部を構成し、データまたはアドレスを格納する３２ビットのレジスタ群である。なお、汎用レジスタＲ３０およびＲ３１は、それぞれグローバルポインタ、スタックポインタとして、ハードウェアが使用する。 FIG. 8 is a diagram illustrating the structure of the general-purpose registers (R0 to R31) 30a. The general-purpose registers (R0 to R31) 30a constitute a part of the context of the task to be executed and are a 32-bit register group for storing data or addresses. The general-purpose registers R30 and R31 are used by hardware as a global pointer and a stack pointer, respectively.

図９は、リンクレジスタ（ＬＲ）３０ｃの構造を示す図である。なお、このリンクレジスタ（ＬＲ）３０ｃと関連して、本プロセッサ１は、図示されていない退避レジスタ（ＳＶＲ）も備える。リンクレジスタ（ＬＲ）３０ｃは、関数コール時のリターンアドレスを格納する３２ビットのレジスタである。なお、退避レジスタ（ＳＶＲ）は、関数コール時の条件フラグレジスタのコンディションフラグ（CFR.CF）を退避する１６ビットのレジスタである。リンクレジスタ（ＬＲ）３０ｃは、後述する分岐レジスタ（ＴＡＲ）と同様に、ループ高速化にも使用される。下位１ビットは常に０が読み出されるが、書き込み時には０を書き込む必要がある。 FIG. 9 is a diagram illustrating the structure of the link register (LR) 30c. In connection with the link register (LR) 30c, the processor 1 also includes a save register (SVR) not shown. The link register (LR) 30c is a 32-bit register that stores a return address at the time of a function call. The save register (SVR) is a 16-bit register that saves the condition flag (CFR.CF) of the condition flag register at the time of function call. The link register (LR) 30c is also used for increasing the loop speed, similarly to a branch register (TAR) described later. The lower 1 bit is always read as 0, but 0 must be written when writing.

例えば、call(brl,jmpl)命令を実行した場合には、本プロセッサ１は、リンクレジスタ（ＬＲ）３０ｃに戻りアドレスを退避し、退避レジスタ（ＳＶＲ）にコンディションフラグ(CFR.CF)を退避する。また、jmp命令を実行した場合には、リンクレジスタ（ＬＲ）３０ｃから戻りアドレス(分岐先アドレス)を取り出し、プログラムカウンタ（ＰＣ）を復帰させる。さらに、ret(jmpr)命令を実行した場合には、リンクレジスタ（ＬＲ）３０ｃから分岐先アドレス(戻りアドレス)を取り出し、プログラムカウンタ（ＰＣ）に格納(復帰)する。さらに、退避レジスタ（ＳＶＲ）からコンディションフラグを取り出し、条件フラグレジスタ（ＣＦＲ）３２のコンディションンフラグ領域CFR.CFに格納(復帰)する。 For example, when the call (brl, jmpl) instruction is executed, the processor 1 returns the address to the link register (LR) 30c and saves the condition flag (CFR.CF) to the save register (SVR). . When the jmp instruction is executed, the return address (branch destination address) is extracted from the link register (LR) 30c, and the program counter (PC) is restored. Further, when the ret (jmpr) instruction is executed, the branch destination address (return address) is extracted from the link register (LR) 30c and stored (returned) in the program counter (PC). Further, the condition flag is extracted from the save register (SVR) and stored (returned) in the condition flag area CFR.CF of the condition flag register (CFR) 32.

図１０は、分岐レジスタ（ＴＡＲ）３０ｄの構造を示す図である。分岐レジスタ（ＴＡＲ）３０ｄは、分岐ターゲットアドレスを格納する３２ビットのレジスタである。主に、ループの高速化に用いられる。下位１ビットは常に０が読み出されるが、書き込み時には０を書き込む必要がある。 FIG. 10 is a diagram showing the structure of the branch register (TAR) 30d. The branch register (TAR) 30d is a 32-bit register that stores a branch target address. Mainly used to speed up loops. The lower 1 bit is always read as 0, but 0 must be written when writing.

例えば、jmp,jloop命令を実行した場合には、本プロセッサ１は、分岐レジスタ（ＴＡＲ）３０ｄから分岐先アドレスを取り出し、プログラムカウンタ（ＰＣ）に格納する。分岐レジスタ（ＴＡＲ）３０ｄに格納されたアドレスの命令が分岐用命令バッファに格納されている場合は、分岐ペナルティが０になる。分岐レジスタ（ＴＡＲ）３０ｄにループの先頭アドレスを格納しておくことでループを高速化することができる。 For example, when a jmp, jloop instruction is executed, the processor 1 extracts the branch destination address from the branch register (TAR) 30d and stores it in the program counter (PC). When the instruction at the address stored in the branch register (TAR) 30d is stored in the branch instruction buffer, the branch penalty is zero. By storing the top address of the loop in the branch register (TAR) 30d, the speed of the loop can be increased.

図１１は、プログラム状態レジスタ（ＰＳＲ）３１の構造を示す図である。プログラム状態レジスタ（ＰＳＲ）３１は、実行対象となっているタスクのコンテキストの一部を構成し、以下に示されるプロセッサ状態情報を格納する３２ビットのレジスタである。 FIG. 11 is a diagram showing the structure of the program status register (PSR) 31. The program status register (PSR) 31 is a 32-bit register that constitutes a part of the context of the task to be executed and stores the processor status information shown below.

ビットＳＷＥ：ＶＭＰ（Virtual Multi-Processor）のＬＰ（Logical Processor）切替えイネーブルを示す。「０」はＬＰ切替え不許可を示し、「１」はＬＰ切替え許可を示す。 Bit SWE: indicates an LP (Logical Processor) switching enable of VMP (Virtual Multi-Processor). “0” indicates that LP switching is not permitted, and “1” indicates that LP switching is permitted.

ビットＦＸＰ：固定小数点モードを示す。「０」はモード０を示し、「１」はモード１を示す。 Bit FXP: indicates a fixed point mode. “0” indicates mode 0, and “1” indicates mode 1.

ビットＩＨ：割込み処理フラグであり、マスカブル割込み処理中であることを示す。「１」は割込み処理中であることを示し、「０」は割込み処理中でないことを示す。割込みが発生すると自動的にセットされる。rti命令で割込みから復帰したところが、他の割込み処理中かプログラム処理中であるのかを見分けるために使用される。 Bit IH: This is an interrupt processing flag and indicates that maskable interrupt processing is in progress. “1” indicates that interrupt processing is in progress, and “0” indicates that interrupt processing is not in progress. Set automatically when an interrupt occurs. It is used to determine whether the place where the rti instruction has returned from the interrupt is processing another interrupt or program.

ビットＥＨ：エラーまたはＮＭＩを処理中であることを示すフラグである。「０」はエラー/ＮＭＩ割込み処理中でないことを示し、「１」はエラー/ＮＭＩ割込み処理中であることを示す。ＥＨ＝１のとき、非同期エラーまたはＮＭＩが発生した場合は、マスクされる。また、ＶＭＰイネーブル時はＶＭＰのプレート切り替えがマスクされる。 Bit EH: a flag indicating that an error or NMI is being processed. “0” indicates that an error / NMI interrupt is not being processed, and “1” indicates that an error / NMI interrupt is being processed. When EH = 1, if an asynchronous error or NMI occurs, it is masked. When VMP is enabled, VMP plate switching is masked.

ビットＰＬ［１：０］：特権レベルを示す。「００」は特権レベル０、つまり、プロセッサアブストラクションレベルを示し、「０１」は特権レベル１（設定できない）を示し、「１０」は特権レベル２、つまり、システムプログラムレベルを示し、「１１」は特権レベル３、つまり、ユーザプログラムレベルを示す。 Bit PL [1: 0]: indicates a privilege level. “00” indicates privilege level 0, that is, processor abstraction level, “01” indicates privilege level 1 (cannot be set), “10” indicates privilege level 2, that is, system program level, and “11” indicates Indicates privilege level 3, that is, the user program level.

ビットＬＰＩＥ３：ＬＰ固有割込み３イネーブルを示す。「１」は割込み許可を示し、「０」は割込み不許可を示す。 Bit LPIE3: indicates LP specific interrupt 3 enable. “1” indicates interrupt permission, and “0” indicates interrupt disapproval.

ビットＬＰＩＥ２：ＬＰ固有割込み２イネーブルを示す。「１」は割込み許可を示し、「０」は割込み不許可を示す。 Bit LPIE2: indicates LP specific interrupt 2 enable. “1” indicates interrupt permission, and “0” indicates interrupt disapproval.

ビットＬＰＩＥ１：ＬＰ固有割込み１イネーブルを示す。「１」は割込み許可を示し、「０」は割込み不許可を示す。 Bit LPIE1: Indicates LP specific interrupt 1 enable. “1” indicates interrupt permission, and “0” indicates interrupt disapproval.

ビットＬＰＩＥ０：ＬＰ固有割込み０イネーブルを示す。「１」は割込み許可を示し、「０」は割込み不許可を示す。 Bit LPIE0: indicates LP specific interrupt 0 enable. “1” indicates interrupt permission, and “0” indicates interrupt disapproval.

ビットＡＥＥ：ミスアライメント例外イネーブルを示す。「１」はミスアライメント例外許可を示し、「０」はミスアライメント例外不許可を示す。 Bit AEE: indicates misalignment exception enable. “1” indicates that misalignment exception is permitted, and “0” indicates that misalignment exception is not permitted.

ビットＩＥ：レベル割込みイネーブルを示す。「１」はレベル割込み許可を示し、「０」はレベル割込み不許可を示す。 Bit IE indicates level interrupt enable. “1” indicates that a level interrupt is permitted, and “0” indicates that a level interrupt is not permitted.

ビットＩＭ［７：０］：割込みマスクを示す。レベル０〜７まで定義され、個々のレベルでマスクすることができる。レベル０が最も高いレベルとなる。ＩＭによりマスクされていない割込み要求のうち最も高いレベルを持った割込み要求のみがプロセッサ１に受理される。割込み要求を受理すると受理したレベル以下のレベルはハードウェアで自動的にマスクされる。IM[0]はレベル0のマスクであり、IM[1]はレベル1のマスクであり、IM[2]はレベル2のマスクであり、IM[3]はレベル3のマスクであり、IM[4]はレベル4のマスクであり、IM[5]はレベル5のマスクであり、IM[6]はレベル6のマスクであり、IM[7]はレベル7のマスクである。 Bit IM [7: 0]: indicates an interrupt mask. Levels 0-7 are defined and can be masked at individual levels. Level 0 is the highest level. Only the interrupt request having the highest level among the interrupt requests not masked by the IM is accepted by the processor 1. When an interrupt request is accepted, the level below the accepted level is automatically masked by hardware. IM [0] is a level 0 mask, IM [1] is a level 1 mask, IM [2] is a level 2 mask, IM [3] is a level 3 mask, and IM [ 4] is a level 4 mask, IM [5] is a level 5 mask, IM [6] is a level 6 mask, and IM [7] is a level 7 mask.

ｒｅｓｅｒｖｅｄ：予約ビットを示す。常に０が読み出される。書き込む時は０を書き込む必要がある。 reserved: Indicates a reserved bit. 0 is always read. When writing, it is necessary to write 0.

図１２は、条件フラグレジスタ（ＣＦＲ）３２の構造を示す図である。条件フラグレジスタ（ＣＦＲ）３２は、実行対象となっているタスクのコンテキストの一部を構成する３２ビットのレジスタであり、コンディションフラグ(条件フラグ)、オペレーションフラグ(演算フラグ)、ベクタコンディションフラグ(ベクタ条件フラグ)、演算命令用ビット位置指定フィールド、SIMDデータアライン情報フィールドから構成される。 FIG. 12 is a diagram showing the structure of the condition flag register (CFR) 32. The condition flag register (CFR) 32 is a 32-bit register that constitutes a part of the context of the task to be executed, and includes a condition flag (condition flag), an operation flag (operation flag), and a vector condition flag (vector). Condition flag), a bit position designation field for operation instructions, and a SIMD data alignment information field.

ビットＡＬＮ［１：０］：アラインモードを示す。valnvc命令のアラインモードを設定する。 Bit ALN [1: 0]: indicates an alignment mode. Sets the alignment mode of the valnvc instruction.

ビットＢＰＯ［４：０］：ビットポジションを示す。ビット位置指定の必要な命令で使用する。 Bit BPO [4: 0]: indicates a bit position. Used in instructions that require bit position specification.

ビットＶＣ０〜ＶＣ３：ベクタ条件フラグである。ＬＳＢ側のバイトあるいはハーフワードから順にＶＣ０に対応し、ＭＳＢ側がＶＣ３に対応する。 Bits VC0 to VC3: Vector condition flags. The LSB side byte or halfword sequentially corresponds to VC0, and the MSB side corresponds to VC3.

ビットＯＶＳ：オーバーフローフラグ(サマリー)である。飽和発生やオーバーフロー検出でセットされる。検出されなかった場合は、命令実行前の値を保持する。クリアはソフトで行う必要がある。 Bit OVS: an overflow flag (summary). Set when saturation occurs or overflow is detected. If not detected, the value before instruction execution is held. Clearing must be done by software.

ビットＣＡＳ：キャリーフラグ(サマリー)である。addc命令でキャリーまたはsubc命令でボローが発生した場合セットされる。addc命令でキャリーもしくはsubc命令でボローが発生しなかった場合は、命令実行前の値を保持する。クリアはソフトで行う必要がある。 Bit CAS: carry flag (summary). Set when a carry occurs with the addc instruction or a borrow occurs with the subc instruction. If no carry occurs with the addc instruction or a borrow does not occur with the subc instruction, the value before the instruction execution is retained. Clearing must be done by software.

ビットＣ０〜Ｃ７：コンディションフラグである。フラグＣ７は常に値が１である。フラグＣ７へのＦＡＬＳＥ条件の反映(０書き込み)は無視される。 Bits C0 to C7: Condition flags. The value of the flag C7 is always 1. Reflecting the FALSE condition (writing 0) to the flag C7 is ignored.

図１３は、アキュームレータ（Ｍ０，Ｍ１）３０ｂの構造を示す図である。このアキュームレータ（Ｍ０，Ｍ１）３０ｂは、実行対象となっているタスクのコンテキストの一部を構成し、図１３（ａ）に示される３２ビットレジスタMH0-MH1（乗除算・積和用レジスタ(上位３２ビット)）と、図１３（ｂ）に示される３２ビットレジスタML0-ML1乗除算・積和用レジスタ(下位３２ビット)とからなる。 FIG. 13 is a diagram showing the structure of the accumulator (M0, M1) 30b. This accumulator (M0, M1) 30b constitutes a part of the context of the task to be executed, and is a 32-bit register MH0-MH1 (multiplication / multiplication / sum of products register (high order) shown in FIG. 13 (a). 32 bits)) and the 32-bit register ML0-ML1 multiplication / division / product-sum register (lower 32 bits) shown in FIG. 13 (b).

レジスタMH0-MHは、乗算命令では結果の上位３２ビットを格納するのに使用される。積和命令ではアキュムレータの上位３２ビットとして使用される。また、ビットストリームを取り扱う場合に汎用レジスタと組み合わせて使用することができる。レジスタML0-ML1は、乗算命令では結果の下位３２ビットを格納するのに使用される。積和命令ではアキュムレータの下位３２ビットとして使用される。 Registers MH0-MH are used to store the upper 32 bits of the result in multiply instructions. In the product-sum instruction, it is used as the upper 32 bits of the accumulator. Further, when handling a bit stream, it can be used in combination with a general-purpose register. Registers ML0-ML1 are used in the multiply instruction to store the lower 32 bits of the result. In the product-sum instruction, it is used as the lower 32 bits of the accumulator.

図１４は、プログラムカウンタ（ＰＣ）３３の構造を示す図である。このプログラムカウンタ（ＰＣ）３３は、実行対象となっているタスクのコンテキストの一部を構成し、実行中の命令のアドレスを保持する３２ビットのカウンタである。下位１ビットは常に０が格納される。 FIG. 14 is a diagram showing the structure of the program counter (PC) 33. This program counter (PC) 33 is a 32-bit counter that constitutes a part of the context of the task to be executed and holds the address of the instruction being executed. 0 is always stored in the lower 1 bit.

図１５は、ＰＣ退避用レジスタ（ＩＰＣ）３４の構造を示す図である。このＰＣ退避用レジスタ（ＩＰＣ）３４は、実行対象となっているタスクのコンテキストの一部を構成する３２ビットのレジスタであり、下位１ビットは常に０が読み出されるが、書き込み時には０を書き込む必要がある。 FIG. 15 is a diagram showing the structure of the PC save register (IPC) 34. The PC save register (IPC) 34 is a 32-bit register that constitutes a part of the context of the task to be executed. The lower 1 bit is always read as 0, but 0 must be written when writing. There is.

図１６は、ＰＳＲ退避用レジスタ（ＩＰＳＲ）３５の構造を示す図である。このＰＳＲ退避用レジスタ（ＩＰＳＲ）３５は、実行対象となっているタスクのコンテキストの一部を構成し、プログラム状態レジスタ（ＰＳＲ）３１を退避するための３２ビットのレジスタであり、プログラム状態レジスタ（ＰＳＲ）３１の予約ビットに対応する部分は常に０が読み出されるが、書き込み時には０を書き込む必要がある。 FIG. 16 is a diagram showing the structure of the PSR save register (IPSR) 35. The PSR save register (IPSR) 35 is a 32-bit register that constitutes a part of the context of the task to be executed and saves the program status register (PSR) 31. 0 is always read out from the portion corresponding to the reserved bit of (PSR) 31, but it is necessary to write 0 when writing.

次に、本プロセッサ１のメモリ空間について説明する。本プロセッサ１では、４ＧＢのリニアなメモリ空間を３２分割し、１２８ＭＢ単位の空間に命令ＳＲＡＭ（Static RAM）とデータＳＲＡＭが割り当てられる。この１２８ＭＢの空間を１ブロックとして、ＳＡＲ(ＳＲＡＭ Area Register)にアクセスしたいブロックを設定する。アクセスされたアドレスがＳＡＲで設定された空間である場合は、直接命令ＳＲＡＭ/データＳＲＡＭに対してアクセスを行うが、ＳＡＲで設定された空間でない場合は、バスコントローラ（ＢＣＵ）に対してアクセス要求を出する。ＢＣＵにはオン・チップ・メモリ（ＯＣＭ）、外部メモリ、外部デバイス、Ｉ／Ｏポート等が接続されており、それらのデバイスに対して読み書きを行うことができる。 Next, the memory space of the processor 1 will be described. In the processor 1, a 4 GB linear memory space is divided into 32, and an instruction SRAM (Static RAM) and a data SRAM are allocated to a 128 MB unit space. With this 128 MB space as one block, a block to be accessed in the SAR (SRAM Area Register) is set. When the accessed address is a space set by the SAR, the instruction SRAM / data SRAM is directly accessed. When the accessed address is not the space set by the SAR, an access request is sent to the bus controller (BCU). Take out. An on-chip memory (OCM), an external memory, an external device, an I / O port, and the like are connected to the BCU, and reading and writing can be performed on these devices.

図１７は、本プロセッサ１のパイプライン動作を示すタイミング図である。本プロセッサ１は、本図に示されるように、基本的に命令フェッチ、命令割り当て(ディスパッチ)、デコード、実行、書き込みの５段パイプラインで構成されている。 FIG. 17 is a timing chart showing the pipeline operation of the processor 1. As shown in the figure, the processor 1 basically includes a five-stage pipeline of instruction fetch, instruction allocation (dispatch), decode, execution, and write.

図１８は、本プロセッサ１による命令実行時の各パイプライン動作を示すタイミング図である。命令フェッチステージでは、プログラムカウンタ（ＰＣ）３３で指定されるアドレスの命令メモリをアクセスし、命令を命令バッファ１０ｃ〜１０ｅ等に転送する。命令割り当てステージでは、分岐系命令に対する分岐先アドレス情報の出力、入力レジスタ制御信号の出力、可変長命令の割り当てを行い、命令をインストラクションレジスタ（ＩＲ）に転送する。デコードステージでは、ＩＲをデコード部２０に入力し、演算器制御信号、メモリアクセス信号を出力する。実行ステージでは、演算を実行、演算結果をデータメモリか汎用レジスタ（Ｒ０〜Ｒ３１）３０ａに出力する。書き込みステージでは、データ転送、演算結果を汎用レジスタに格納する。 FIG. 18 is a timing chart showing each pipeline operation when an instruction is executed by the processor 1. In the instruction fetch stage, the instruction memory at the address specified by the program counter (PC) 33 is accessed, and the instruction is transferred to the instruction buffers 10c to 10e and the like. In the instruction allocation stage, branch destination address information is output to a branch system instruction, an input register control signal is output, a variable length instruction is allocated, and the instruction is transferred to an instruction register (IR). In the decode stage, IR is input to the decode unit 20 and an arithmetic unit control signal and a memory access signal are output. In the execution stage, the operation is executed, and the operation result is output to the data memory or the general-purpose register (R0 to R31) 30a. In the write stage, data transfer and operation results are stored in general purpose registers.

本プロセッサ１は、ＶＬＩＷアーキテクチャにより上記の処理を最高３並列で行うことができる。したがって、図１８に示された動作については、本プロセッサ１は、図１９に示されるタイミングで並列に実行する。 The processor 1 can perform the above-described processing in a maximum of three in parallel using the VLIW architecture. Therefore, the processor 1 executes the operations shown in FIG. 18 in parallel at the timing shown in FIG.

次に、以上のように構成された本プロセッサ１の命令セットについて説明する。
以下の表３〜表５は、本プロセッサ１が実行する命令をカテゴリー別に分類した表である。 Next, the instruction set of the processor 1 configured as described above will be described.
Tables 3 to 5 below are tables in which instructions executed by the processor 1 are classified by category.

なお、表中の「演算器」は、その命令が使用する演算器を示す。演算器の略号の意味は次の通りである。つまり、「Ａ」はＡＬＵ命令、「Ｂ」は分岐命令、「Ｃ」は変換命令、「ＤＩＶ」は除算命令、「ＤＢＧＭ」はデバッグ命令、「Ｍ」はメモリアクセス命令、「Ｓ１」、「Ｓ２」はシフト命令、「Ｘ１」、「Ｘ２」は乗算命令を意味する。 Note that “operator” in the table indicates an arithmetic unit used by the instruction. The meaning of the abbreviation of the arithmetic unit is as follows. That is, “A” is an ALU instruction, “B” is a branch instruction, “C” is a conversion instruction, “DIV” is a division instruction, “DBGM” is a debug instruction, “M” is a memory access instruction, “S1”, “ “S2” means a shift instruction, and “X1” and “X2” mean a multiplication instruction.

図２０は、本プロセッサ１が実行する命令のフォーマットを示す図である。そのフォーマットには、図２０（ａ）に示される１６ビット命令フォーマットと、図２０（ｂ）に示される３２ビット命令フォーマットとがある。 FIG. 20 is a diagram showing a format of instructions executed by the processor 1. The formats include a 16-bit instruction format shown in FIG. 20A and a 32-bit instruction format shown in FIG.

なお、図中における略号の意味は次の通りである。つまり、「Ｅ」はエンドビット（並列実行の境界）、「Ｆ」はフォーマットビット（００、０１、１０：１６ビット命令フォーマット、１１：３２ビット命令フォーマット）、「Ｐ」はプレディケート（実行条件：８個の条件フラグＣ０〜Ｃ７のいずれかを指定）、「ＯＰ」はオペコードフィールド、「Ｒ」はレジスタフィールド、「Ｉ」は即値フィールド、「Ｄ」ディスプースメントフィールドを意味する。なお、「Ｅ」フィールドはＶＬＩＷに特有のもので、Ｅ＝０の命令は次の命令と並列に実行される。つまり、「Ｅ」フィールドによって並列度が可変のＶＬＩＷを実現している。 In addition, the meaning of the symbol in the figure is as follows. That is, “E” is an end bit (parallel execution boundary), “F” is a format bit (00, 01, 10:16 bit instruction format, 11:32 bit instruction format), and “P” is a predicate (execution condition: "OP" means an operation code field, "R" means a register field, "I" means an immediate field, and "D" a displacement field. Note that the “E” field is unique to VLIW, and an instruction with E = 0 is executed in parallel with the next instruction. That is, a VLIW having a variable parallelism is realized by the “E” field.

図２１〜図３６は、本プロセッサ１が実行する命令の概略的な機能を説明する図である。つまり、図２１は、カテゴリー「ALUadd（加算）系」に属する命令を説明する図であり、図２２は、カテゴリー「ALUsub（減算）系」に属する命令を説明する図であり、図２３は、カテゴリー「ALUlogic（論理演算）系ほか」に属する命令を説明する図であり、図２４は、カテゴリー「CMP（比較演算）系」に属する命令を説明する図であり、図２５は、カテゴリー「mul（乗算）系」に属する命令を説明する図であり、図２６は、カテゴリー「mac（積和演算）系」に属する命令を説明する図であり、図２７は、カテゴリー「msu（積差演算）系」に属する命令を説明する図であり、図２８は、カテゴリー「MEMｌd（メモリ読み出し）系」に属する命令を説明する図であり、図２９は、カテゴリー「MEMstore（メモリ書き出し）系」に属する命令を説明する図であり、図３０は、カテゴリー「BRA（分岐）系」に属する命令を説明する図であり、図３１は、カテゴリー「BSasl（算術バレルシフト）系ほか」に属する命令を説明する図であり、図３２は、カテゴリー「BSlsr（論理バレルシフト）系ほか」に属する命令を説明する図であり、図３３は、カテゴリー「CNVvaln（算術変換）系」に属する命令を説明する図であり、図３４は、カテゴリー「CNV（一般変換）系」に属する命令を説明する図であり、図３５は、カテゴリー「SATvlpk（飽和処理）系」に属する命令を説明する図であり、図３６は、カテゴリー「ETC（その他）系」に属する命令を説明する図である。 FIG. 21 to FIG. 36 are diagrams for explaining the schematic functions of the instructions executed by the processor 1. That is, FIG. 21 is a diagram for explaining instructions belonging to the category “ALUadd (addition) system”, FIG. 22 is a diagram for explaining instructions belonging to the category “ALUsub (subtraction) system”, and FIG. FIG. 24 is a diagram illustrating instructions belonging to the category “ALUlogic (logical operation) system, etc.”, FIG. 24 is a diagram illustrating instructions belonging to the category “CMP (comparison operation) system”, and FIG. FIG. 26 is a diagram illustrating instructions belonging to the category “mac (product-sum operation) system”, and FIG. 27 is a category “msu (product-difference operation)”. FIG. 28 is a diagram illustrating instructions belonging to the category “MEMld (memory read) system”, and FIG. 29 is a category “MEMstore (memory write) system”. Diagram explaining the instruction to which it belongs FIG. 30 is a diagram for explaining instructions belonging to the category “BRA (branch) system”, and FIG. 31 is a diagram for explaining instructions belonging to the category “BSasl (arithmetic barrel shift) system, etc.” 32 is a diagram for explaining instructions belonging to the category “BSlsr (logical barrel shift) system, etc.”, FIG. 33 is a diagram for explaining instructions belonging to the category “CNVvaln (arithmetic conversion) system”, and FIG. FIG. 35 is a diagram illustrating instructions belonging to the category “SATvlpk (saturation processing) system”, and FIG. 36 is a diagram illustrating the category “ETC”. It is a figure explaining the command which belongs to (other) type | system | group.

これらの図において、項目「ＳＩＭＤ」は、その命令の型（ＳＩＳＤ（SINGLE）かＳＩＭＤかの区別）を示し、項目「サイズ」は、演算の対象となる個々のオペランドのサイズを示し、項目「命令」は、その命令のオペコードを示し、項目「オペランド」は、その命令のオペランドを示し、項目「ＣＦＲ」は、条件フラグレジスタの変化を示し、項目「ＰＳＲ」は、プロセッサ状態レジスタの変化を示し、項目「代表的な動作」は、動作の概要を示し、項目「演算器」は、使用される演算器を示し、項目「３１１６」は、命令のサイズを示す。 In these figures, item “SIMD” indicates the type of instruction (distinguishing between SISD (SINGLE) and SIMD), item “size” indicates the size of each operand to be operated, and item “SIMD” "Instruction" indicates the opcode of the instruction, item "operand" indicates the operand of the instruction, item "CFR" indicates a change in the condition flag register, and item "PSR" indicates a change in the processor status register The item “representative operation” indicates an outline of the operation, the item “operation unit” indicates the operation unit to be used, and the item “3116” indicates the size of the instruction.

図３７〜図７４８は、本プロセッサ１が実行する命令の詳細な機能を説明する図である。なお、命令の説明に用いられている各種記号の意味は、以下の表６〜表１０の通りである。 37 to 748 are diagrams illustrating detailed functions of instructions executed by the processor 1. The meanings of various symbols used in the description of the instructions are as shown in Tables 6 to 10 below.

図３７〜図１１９は、「load」関連の命令を説明する図である。
図１２０〜図１８４は、「store」関連の命令を説明する図である。
図１８５〜図１８６は、「memory(etc)」関連の命令を説明する図である。
図１８７〜図２０６は、「external register」関連の命令を説明する図である。
図２０７〜図２４７は、「branch」関連の命令を説明する図である。
図２４８〜図２６４は、「VMP/interupt」関連の命令を説明する図である。
図２６５〜図２５８は、「program interrupt」関連の命令を説明する図である。
図２５９〜図３０３は、「arithmetic」関連の命令を説明する図である。
図３０４〜図３１７は、「logic」関連の命令を説明する図である。
図３１８〜図３５９は、「compare」関連の命令を説明する図である。
図３６０〜図４２０は、「move」関連の命令を説明する図である。
図４２１は、「nop」関連の命令を説明する図である。
図４２２〜図４４１は、「shift(S1)」関連の命令を説明する図である。
図４４２〜図４６０は、「shift(S2)」関連の命令を説明する図である。
図４６１〜図３８０は、「extract」関連の命令を説明する図である。
図４７１〜図４７４は、「mask」関連の命令を説明する図である。
図４７５〜図４８０は、「saturation」関連の命令を説明する図である。
図４８１〜図５１２は、「conversion」関連の命令を説明する図である。
図５１３〜図５１６は、「bit count」関連の命令を説明する図である。
図５１７〜図５２０は、「etc」関連の命令を説明する図である。
図５２１〜図５２６は、「mul(X1)」関連の命令を説明する図である。
図５２７〜図５３１は、「mul(X2)」関連の命令を説明する図である。
図５３２〜図５４３は、「mac(X1)」関連の命令を説明する図である。
図５４４〜図５４７は、「mac(X2)」関連の命令を説明する図である。
図５４８〜図５５９は、「msu(X1)」関連の命令を説明する図である。
図５６０〜図５６３は、「msu(X2)」関連の命令を説明する図である。
図５６４〜図５６５は、「divide」関連の命令を説明する図である。
図５６６〜図５６９は、「debug」関連の命令を説明する図である。
図５７０〜図６１５は、「SIMD arithmetic」関連の命令を説明する図である。
図６１６〜図６３９は、「SIMD compare」関連の命令を説明する図である。
図６４０〜図６６４は、「SIMD shift(S1)」関連の命令を説明する図である。
図６６５〜図６７５は、「SIMD shift(S2)」関連の命令を説明する図である。
図６７６〜図６８０は、「SIMD saturation」関連の命令を説明する図である。
図６８１〜図６８２は、「SIMD etc」関連の命令を説明する図である。
図６８３〜図６９９は、「SIMD mul(X2)」関連の命令を説明する図である。
図７００〜図７２８は、「SIMD mac(X2)」関連の命令を説明する図である。
図７２９〜図７４８は、「SIMD msu(X2)」関連の命令を説明する図である。 FIG. 37 to FIG. 119 are diagrams for explaining a command related to “load”.
120 to 184 are diagrams for explaining a “store” related command.
FIG. 185 to FIG. 186 are diagrams for explaining instructions related to “memory (etc)”.
FIG. 187 to FIG. 206 are diagrams for explaining instructions related to “external register”.
FIG. 207 to FIG. 247 are diagrams for explaining “branch” related instructions.
FIG. 248 to FIG. 264 are diagrams for explaining commands related to “VMP / interupt”.
265 to 258 are diagrams for explaining instructions related to “program interrupt”.
FIG. 259 to FIG. 303 are diagrams for explaining “arithmetic” related instructions.
304 to 317 are diagrams for explaining “logic” related instructions.
FIG. 318 to FIG. 359 are diagrams for explaining instructions related to “compare”.
360 to 420 are diagrams for explaining instructions related to “move”.
FIG. 421 is a diagram illustrating “nop” related instructions.
FIG. 422 to FIG. 441 are diagrams for explaining a command related to “shift (S1)”.
FIG. 442 to FIG. 460 are diagrams for explaining a command related to “shift (S2)”.
FIG. 461 to FIG. 380 are diagrams for explaining “extract” related instructions.
FIG. 471 to FIG. 474 are diagrams for explaining “mask” related instructions.
FIG. 475 to FIG. 480 are diagrams for explaining “saturation” related instructions.
481 to 512 are diagrams for explaining a command related to “conversion”.
FIG. 513 to FIG. 516 are diagrams for explaining instructions related to “bit count”.
FIG. 517 to FIG. 520 are diagrams for explaining commands related to “etc”.
FIG. 521 to FIG. 526 are diagrams illustrating instructions related to “mul (X1)”.
FIG. 527 to FIG. 531 are diagrams for explaining an instruction related to “mul (X2)”.
FIG. 532 to FIG. 543 are diagrams for explaining a command related to “mac (X1)”.
FIG. 544 to FIG. 547 are diagrams for explaining commands related to “mac (X2)”.
FIG. 548 to FIG. 559 are diagrams for explaining instructions related to “msu (X1)”.
560 to 563 are diagrams for explaining an instruction related to “msu (X2)”.
FIG. 564 to FIG. 565 are diagrams for explaining “divide” related commands.
FIG. 566 to FIG. 569 are diagrams for explaining “debug” related instructions.
570 to 615 are diagrams for explaining instructions related to “SIMD arithmetic”.
FIGS. 616 to 639 are diagrams for explaining instructions related to “SIMD compare”.
FIG. 640 to FIG. 664 are diagrams for explaining commands related to “SIMD shift (S1)”.
FIG. 665 to FIG. 675 are diagrams for explaining instructions related to “SIMD shift (S2)”.
FIG. 676 to FIG. 680 are diagrams for explaining instructions related to “SIMD saturation”.
681 to 682 are diagrams for explaining instructions related to “SIMD etc”.
FIGS. 683 to 699 are diagrams for explaining instructions related to “SIMD mul (X2)”.
700 to 728 are diagrams for explaining instructions related to “SIMD mac (X2)”.
FIG. 729 to FIG. 748 are diagrams for explaining instructions related to “SIMD msu (X2)”.

次に、いくつかの特徴的な命令について、本プロセッサ１の動作を説明する。 Next, the operation of the processor 1 will be described for some characteristic instructions.

（１）ＳＩＭＤ２項演算を、オペランドをクロスして行う命令
まず、２並列ＳＩＭＤ演算のうち、クロスの位置関係にあるオペランドを対象として演算を行う命令について説明する。 (1) Instructions for performing SIMD binary operations by crossing operands First, among two parallel SIMD operations, instructions for performing operations on operands having a cross positional relationship will be described.

［命令ｖｘａｄｄｈ］
命令ｖｘａｄｄｈは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドを加算するＳＩＭＤ型の命令である。例えば、
ｖｘａｄｄｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを加算し、結果をレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを加算し、結果をレジスタＲｃの下位１６ビットに格納する。 [Instruction vxaddh]
The instruction vxaddh is a SIMD type instruction that adds two sets of operands in a cross positional relationship in units of half words (16 bits). For example,
vxaddh Rc, Ra, Rb
If so, the processor 1 uses an arithmetic logic / comparison unit 41 or the like to
(I) The upper 16 bits of the register Ra and the lower 16 bits of the register Rb are added, and the result is stored in the upper 16 bits of the register Rc.
(Ii) Add the lower 16 bits of the register Ra and the upper 16 bits of the register Rb, and store the result in the lower 16 bits of the register Rc.

このような命令は、対称型フィルタ（中心から左右対称の係数）で乗算を減らすために、予め同じ係数が乗算される値同士を加算（又は、減算）しておく場合に有効である。 Such an instruction is effective when adding (or subtracting) values multiplied by the same coefficient in advance in order to reduce multiplication by a symmetric filter (coefficient which is symmetrical from the center).

なお、本プロセッサ１は、減算命令（ｖｘｓｕｂｈ等）についても、この加算命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as this addition instruction for a subtraction instruction (vxsubh or the like).

［命令ｖｘｍｕｌ］
命令ｖｘｍｕｌは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドを乗算し、それらの結果の下位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｍｕｌＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果の下位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納するとともに、その乗算結果の下位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vxmul]
The instruction vxmul is a SIMD type instruction that multiplies two sets of operands in a cross positional relationship in units of halfwords (16 bits) and leaves (stores in SIMD) each lower halfword of the result. For example,
vxmul Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb, store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm, and 16 bits are stored in the upper 16 bits of the register Rc, and in parallel,
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, store the multiplication result in the lower 16 bits of the operation register MHm and the lower 16 bits of the operation register MLm, and The 16 bits are stored in the lower 16 bits of the register Rc.

このような命令は、複素数の内積に有効である。結果の下位を取り出すのは整数データを扱う場合に有効である（主に画像）。 Such an instruction is effective for inner products of complex numbers. Extracting the lower order of the result is effective when handling integer data (mainly images).

［命令ｖｘｆｍｕｌｈ］
命令ｖｘｍｕｌｈは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドを乗算し、それらの結果の上位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｆｍｕｌｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果の上位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納するとともに、その乗算結果の上位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vxfmulh]
The instruction vxmulh is an SIMD type instruction that multiplies two sets of operands in a cross positional relationship in units of halfwords (16 bits) and leaves the upper halfwords of those results (stores them in SIMD). For example,
vxfmulh Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb and store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm. 16 bits are stored in the upper 16 bits of the register Rc, and in parallel,
(Ii) Multiply the lower 16 bits of the register Ra and the upper 16 bits of the register Rb, store the multiplication result in the lower 16 bits of the arithmetic register MHm and the lower 16 bits of the arithmetic register MLm, and The 16 bits are stored in the lower 16 bits of the register Rc.

このような命令は、複素数の内積に有効である。結果の上位を取り出すのは固定小数点データを扱う場合に有効である。Ｑ３１／Ｑ１５と呼ばれる標準的なファーマット（ＭＳＢ詰め）に適用することができる。 Such an instruction is effective for inner products of complex numbers. Extracting the top of the result is effective when dealing with fixed-point data. It can be applied to a standard format (MSB stuffing) called Q31 / Q15.

［命令ｖｘｆｍｕｌｗ］
命令ｖｘｆｍｕｌｗは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドを乗算し、一方の乗算結果のみ残す（非ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｆｍｕｌｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果（ワード）をレジスタＲｃに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納する（レジスタＲｃには格納しない）。 [Instruction vxfmulw]
The instruction vxfmulw is a SIMD type instruction that multiplies two sets of operands in a cross positional relationship in units of half words (16 bits) and leaves only one multiplication result (non-SIMD storage). For example,
vxfmulh Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb, store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm, and the multiplication result (word ) Is stored in the register Rc, and in parallel therewith,
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, and store the multiplication result in the lower 16 bits of the arithmetic register MHm and the lower 16 bits of the arithmetic register MLm (not stored in the register Rc) ).

このような命令は、精度が１６ビットで足りなくなりＳＩＭＤが維持できなくなる場合に有効である（オーディオなど）。 Such an instruction is effective when the precision is insufficient at 16 bits and SIMD cannot be maintained (such as audio).

［命令ｖｘｍａｃ］
命令ｖｘｍａｃは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドの積和演算を行い、それらの結果の下位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｍａｃＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の下位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の下位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vxmac]
The instruction vxmac is a SIMD type instruction that performs a product-sum operation on two pairs of operands in a cross positional relationship in units of halfwords (16 bits) and leaves each of the lower halfwords of those results (stores in SIMD). is there. For example,
vxmac Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result Store in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, and store the lower 16 bits of the addition result in the upper 16 bits of the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result The lower 16 bits of the addition result are stored in the lower 16 bits of the register Rc, while being stored in a 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm.

［命令ｖｘｆｍａｃｈ］
命令ｖｘｍａｃは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドの積和演算を行い、それらの結果の上位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｆｍａｃｈＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の上位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の上位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vxfmach]
The instruction vxmac is a SIMD type instruction that performs a product-sum operation on two sets of operands in a cross positional relationship in units of halfwords (16 bits) and leaves each of the higher halfwords of those results (stores them in SIMD). is there. For example,
vxfmach Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result In addition to storing in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, the upper 16 bits of the addition result are stored in the upper 16 bits of the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result The upper 16 bits of the addition result are stored in the lower 16 bits of the register Rc, while being stored in the 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm.

［命令ｖｘｆｍａｃｗ］
命令ｖｘｆｍａｃｈは、ハーフワード（１６ビット）単位で、クロスの位置関係にある２組のオペランドを乗算し、一方の乗算結果のみ残す（非ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｘｆｍａｃｈＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの下位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の３２ビットをレジスタＲｃに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納する（レジスタＲｃには格納しない）。 [Instruction vxfmacw]
The instruction vxfmach is a SIMD type instruction that multiplies two sets of operands in a cross positional relationship in units of halfwords (16 bits) and leaves only one multiplication result (non-SIMD storage). For example,
vxfmach Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the lower 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result In addition to storing in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, 32 bits of the addition result are stored in the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result Store in the 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm (not stored in the register Rc).

なお、本プロセッサ１は、積差命令（ｖｘｍｓｕ、ｖｘｍｓｕｈ、ｖｘｍｓｕｗ等）についても、これらの積和命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as the product-sum instruction for product difference instructions (vxmsu, vxmsuh, vxmsuw, etc.).

また、本プロセッサ１は、以上のようなクロスの位置関係にある２組のオペランドに対する演算（２並列ＳＩＭＤによる加算、減算、乗算、積和、積差）だけでなく、これを拡張したｎ組のオペランドに対する演算（４並列、８並列ＳＩＭＤ演算など）を実行する機能を備えてもよい。 Further, the processor 1 not only performs operations (addition, subtraction, multiplication, product-sum, product-difference by two parallel SIMD) on the two sets of operands having the cross positional relationship as described above, but n sets obtained by expanding this May be provided with a function for executing operations on the operands (4 parallel, 8 parallel SIMD, etc.).

例えば、いま、レジスタＲａに格納されている４つのバイトデータを上位からＲａ１、Ｒａ２，Ｒａ３，Ｒａ４とし、レジスタＲｂに格納されている４つのバイトデータを上位からＲｂ１、Ｒｂ２，Ｒｂ３，Ｒｂ４とした場合に、レジスタＲａとレジスタＲｂとをオペランドとするＳＩＭＤ演算命令であって、以下のようなクロスの位置関係にあるバイトデータに対する演算を並列に実行する命令を備えてもよい。つまり、
（i）１対称クロスの命令
Ｒａ１とＲｂ４、Ｒａ２とＲｂ３、Ｒａ３とＲｂ２、Ｒａ４とＲｂ１、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。
（ii）２対称クロスの命令
Ｒａ１とＲｂ２、Ｒａ２とＲｂ１、Ｒａ３とＲｂ４、Ｒａ４とＲｂ３、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。
（iii）２重クロスの命令
Ｒａ１とＲｂ３、Ｒａ２とＲｂ４、Ｒａ３とＲｂ１、Ｒａ４とＲｂ２、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。 For example, the four byte data currently stored in the register Ra are Ra1, Ra2, Ra3, Ra4 from the top, and the four byte data stored in the register Rb are Rb1, Rb2, Rb3, Rb4 from the top. In this case, a SIMD operation instruction using the register Ra and the register Rb as operands, and an instruction for executing operations on byte data having a cross positional relationship as described below in parallel may be provided. That means
(I) Instructions with 1 Symmetric Crossing A 4-parallel SIMD type operation instruction for Ra1 and Rb4, Ra2 and Rb3, Ra3 and Rb2, and Ra4 and Rb1 may be provided.
(Ii) Two-symmetrical cross instruction It is possible to provide four parallel SIMD type arithmetic instructions for Ra1 and Rb2, Ra2 and Rb1, Ra3 and Rb4, and Ra4 and Rb3.
(Iii) Double-crossing instruction It is possible to provide four parallel SIMD type arithmetic instructions for Ra1 and Rb3, Ra2 and Rb4, Ra3 and Rb1, and Ra4 and Rb2, respectively.

そして、これらの４並列ＳＩＭＤ型の演算の種類は、上記２並列ＳＩＭＤ型の演算と同様に、加算、減算、乗算、積和、積差のいずれについても適用することができる。さらに、乗算、積和、積差については、上記２並列ＳＩＭＤ型の演算命令（ｖｘｍｕｌ，ｖｘｆｍｕｌｈ，ｖｘｆｍｕｌｗなど）と同様に、４つの演算結果それぞれの下位バイトだけをレジスタＲｃ等にＳＩＭＤ格納したり、４つの演算結果それぞれの上位バイトだけをレジスタＲｃ等にＳＩＭＤ格納したり、４つの演算結果のうちの２つだけをレジスタＲｃ等にＳＩＭＤ格納したりする命令を備えてもよい。 These types of 4-parallel SIMD type operations can be applied to any of addition, subtraction, multiplication, product-sum, and product-difference similarly to the above-described 2-parallel SIMD type operations. Further, with respect to multiplication, product sum, and product difference, similar to the above-described 2-parallel SIMD type operation instructions (vxmul, vxfmulh, vxfmulw, etc.), only the lower byte of each of the four operation results can be stored in the register Rc or the like. There may be provided an instruction for SIMD storing only the upper byte of each of the four calculation results in the register Rc or the like, or storing only two of the four calculation results in the register Rc or the like.

なお、上記のようなクロスの位置関係にあるデータに対する３種類の演算は、一般化すると以下のように表現することができる。つまり、いま、演算の対象となるデータの組が、第１データ群を構成するｎ個のデータの並びにおける第ｉ番目のデータと第２データ群を構成するｎ個のデータの並びにおける第ｊ番目のデータとすると、
上記（i）１対称クロスの命令では、ｊ＝ｎ−ｉ＋１が成立し、
上記（ii）２対称クロスの命令では、ｊ＝ｉ−（−１）＾（ｉｍｏｄ２）が成立し、
上記（iii）２重クロスの命令では、ｊ＝ｎ−ｉ＋１＋（−１）＾（ｉｍｏｄ２）が成立する。ここで、「＾」はべき乗を意味し、「ｍｏｄ」は剰余を意味する。 It should be noted that the three types of operations on the data having the cross positional relationship as described above can be expressed as follows when generalized. That is, now, the set of data to be calculated is jth in the arrangement of the i-th data and the n data constituting the second data group in the arrangement of the n data constituting the first data group. As the second data,
In the above (i) 1-symmetric cross instruction, j = n−i + 1 holds.
In the above (ii) 2-symmetric cross instruction, j = i − (− 1) ^ (i mod 2) holds,
In the above (iii) double cross instruction, j = n−i + 1 + (− 1) ^ (i mod 2) holds. Here, “^” means power and “mod” means remainder.

以上のような命令は、複素数の内積等、２個の複素数を同時に演算する場合に有効である。 The above instruction is effective when computing two complex numbers simultaneously, such as inner product of complex numbers.

（２）ＳＩＭＤ２項演算を、一方のオペランドを固定して行う命令
次に、２並列ＳＩＭＤ演算のうち、一方のオペランドを固定して行う（一方のオペランドを共通とする）演算命令について説明する。 (2) Instruction for Performing SIMD Binary Operation with One Operand Fixed Next, an operation instruction performed with one operand fixed (with one operand in common) out of two parallel SIMD operations will be described.

［命令ｖｈａｄｄｈ］
命令ｖｈａｄｄｈは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドを加算するＳＩＭＤ型の命令である。例えば、
ｖｈａｄｄｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを加算し、結果をレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを加算し、結果をレジスタＲｃの下位１６ビットに格納する。 [Instruction vhaddh]
The instruction vhaddh is a SIMD type instruction that adds two sets of operands that share one operand (the upper 16 bits in the register) in units of half words (16 bits). For example,
vhaddh Rc, Ra, Rb
If so, the processor 1 uses an arithmetic logic / comparison unit 41 or the like to
(I) The upper 16 bits of the register Ra and the upper 16 bits of the register Rb are added, and the result is stored in the upper 16 bits of the register Rc.
(Ii) Add the lower 16 bits of the register Ra and the upper 16 bits of the register Rb, and store the result in the lower 16 bits of the register Rc.

このような命令は、２つの配列の要素間の加減算で２つの配列のアライメントがずれておりＳＩＭＤ適用が困難な場合に有効である。 Such an instruction is effective when the two arrays are misaligned by addition / subtraction between the elements of the two arrays and it is difficult to apply SIMD.

なお、本プロセッサ１は、減算命令（ｖｈｓｕｂｈ等）についても、この加算命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as this addition instruction for a subtraction instruction (vhsubh or the like).

［命令ｖｈｍｕｌ］
命令ｖｈｍｕｌは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドを乗算し、それらの結果の下位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｍｕｌＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果の下位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納するとともに、その乗算結果の下位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vhmul]
The instruction vhmul multiplies two sets of operands that share one operand (the upper 16 bits in the register) in units of halfwords (16 bits), and leaves the lower halfwords of those results (SIMD storage). ) SIMD type instructions. For example,
vhmul Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb and store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm. 16 bits are stored in the upper 16 bits of the register Rc, and in parallel,
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, store the multiplication result in the lower 16 bits of the operation register MHm and the lower 16 bits of the operation register MLm, and The 16 bits are stored in the lower 16 bits of the register Rc.

このような命令は、ゲインコントロールなど全要素に係数を乗ずる場合、ループをイタレーションで展開し、ＳＩＭＤで並列処理するとき、要素がずれてＳＩＭＤ適用が困難な場合に有効である。基本的には、下記の下位固定の命令とペア（交互）で使用する。結果の下位を取り出すのは整数データを扱う場合に有効である（主に、画像）。 Such an instruction is effective when multiplying all elements by a coefficient such as gain control, when the loop is expanded by iteration and parallel processing is performed by SIMD, and when it is difficult to apply SIMD. Basically, it is used as a pair (alternate) with the following lower fixed instructions. Extracting the lower order of the result is effective when handling integer data (mainly images).

［命令ｖｈｆｍｕｌｈ］
命令ｖｈｍｕｌｈは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドを乗算し、それらの結果の上位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｆｍｕｌｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果の上位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納するとともに、その乗算結果の上位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vhfmulh]
The instruction vhmulh multiplies two sets of operands that share one operand (upper 16 bits in the register) in units of halfwords (16 bits), and leaves the upper halfwords of those results (SIMD storage). ) SIMD type instructions. For example,
vhfmulh Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb, store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm, and 16 bits are stored in the upper 16 bits of the register Rc, and in parallel,
(Ii) Multiply the lower 16 bits of the register Ra and the upper 16 bits of the register Rb, store the multiplication result in the lower 16 bits of the arithmetic register MHm and the lower 16 bits of the arithmetic register MLm, and The 16 bits are stored in the lower 16 bits of the register Rc.

このような命令は、上記と同様に有効である。結果の上位を取り出すのは固定小数点データを扱う場合に有効である。Ｑ３１／Ｑ１５と呼ばれる標準的なファーマット（ＭＳＢ詰め）に適用することができる。 Such an instruction is valid as described above. Extracting the top of the result is effective when dealing with fixed-point data. It can be applied to a standard format (MSB stuffing) called Q31 / Q15.

［命令ｖｈｆｍｕｌｗ］
命令ｖｈｆｍｕｌｗは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドを乗算し、一方の乗算結果のみ残す（非ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｆｍｕｌｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納するとともに、その乗算結果（ワード）をレジスタＲｃに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納する（レジスタＲｃには格納しない）。 [Instruction vhfmulw]
The instruction vhfmulw is a SIMD type that multiplies two operands that share one operand (upper 16 bits in the register) in units of halfword (16 bits) and leaves only one multiplication result (non-SIMD storage). It is an instruction. For example,
vhfmulh Rc, Ra, Rb
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb, store the multiplication result in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm, and the multiplication result (word ) Is stored in the register Rc, and in parallel therewith,
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, and store the multiplication result in the lower 16 bits of the arithmetic register MHm and the lower 16 bits of the arithmetic register MLm (not stored in the register Rc) ).

このような命令は精度確保に有効である。 Such an instruction is effective in ensuring accuracy.

［命令ｖｈｍａｃ］
命令ｖｈｍａｃは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドの積和演算を行い、それらの結果の下位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｍａｃＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の下位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の下位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vhmac]
The instruction vhmac performs a product-sum operation on two sets of operands that share one operand (the upper 16 bits in the register) in units of halfwords (16 bits), and leaves each of the lower halfwords of those results ( This is an SIMD type instruction (stored in SIMD). For example,
vhmac Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result Store in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, and store the lower 16 bits of the addition result in the upper 16 bits of the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result The lower 16 bits of the addition result are stored in the lower 16 bits of the register Rc, while being stored in a 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm.

このような命令は、ＦＩＲ（フィルタ）などでループをイタレーションで展開し、ＳＩＭＤで並列処理するとき、要素がずれてＳＩＭＤ適用が困難な場合に有効である。基本的には、下記の下位固定の命令とペア（交互）で使用する。結果の下位を取り出すのは整数データを扱う場合に有効である（主に、画像）。 Such an instruction is effective when it is difficult to apply SIMD when the loop is expanded by iteration using FIR (filter) or the like and parallel processing is performed using SIMD. Basically, it is used as a pair (alternate) with the following lower fixed instructions. Extracting the lower order of the result is effective when handling integer data (mainly images).

［命令ｖｈｆｍａｃｈ］
命令ｖｈｍａｃは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドの積和演算を行い、それらの結果の上位ハーフワードそれぞれを残す（ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｆｍａｃｈＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の上位１６ビットをレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の上位１６ビットをレジスタＲｃの下位１６ビットに格納する。 [Instruction vhfmach]
The instruction vhmac performs a product-sum operation on two sets of operands that share one operand (the upper 16 bits in the register) in units of halfwords (16 bits), and leaves each of the upper halfwords of those results ( This is an SIMD type instruction (stored in SIMD). For example,
vhfmach Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result In addition to storing in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, the upper 16 bits of the addition result are stored in the upper 16 bits of the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result The upper 16 bits of the addition result are stored in the lower 16 bits of the register Rc, while being stored in the 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm.

［命令ｖｈｆｍａｃｗ］
命令ｖｈｆｍａｃｈは、ハーフワード（１６ビット）単位で、一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドを乗算し、一方の乗算結果のみ残す（非ＳＩＭＤ格納する）ＳＩＭＤ型の命令である。例えば、
ｖｈｆｍａｃｈＭｍ，Ｒｃ，Ｒａ，Ｒｂ，Ｍｎ
であれば、プロセッサ１は、乗算・積和演算器４４等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの上位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの上位１６ビットからなる３２ビット領域に格納するとともに、その加算結果の３２ビットをレジスタＲｃに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの上位１６ビットとを乗算し、その乗算結果と演算レジスタＭＨｎ及びＭＬｎの下位１６ビットからなる３２ビットとを加算し、その加算結果の３２ビットを演算レジスタＭＨｍ及びＭＬｍの下位１６ビットからなる３２ビット領域に格納する（レジスタＲｃには格納しない）。 [Instruction vhfmacw]
The instruction vhfmach is a SIMD type that multiplies two operands that share one operand (upper 16 bits in the register) in halfword (16 bits) units and leaves only one multiplication result (non-SIMD storage). It is an instruction. For example,
vhfmach Mm, Rc, Ra, Rb, Mn
If so, the processor 1 uses the multiplication / product-sum calculator 44 or the like,
(I) Multiply the upper 16 bits of the register Ra and the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the upper 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result In addition to storing in the 32-bit area consisting of the upper 16 bits of the arithmetic registers MHm and MLm, 32 bits of the addition result are stored in the register Rc.
(Ii) Multiply the lower 16 bits of the register Ra by the upper 16 bits of the register Rb, add the multiplication result and 32 bits consisting of the lower 16 bits of the operation registers MHn and MLn, and add the 32 bits of the addition result Store in the 32-bit area consisting of the lower 16 bits of the arithmetic registers MHm and MLm (not stored in the register Rc).

このような命令は、精度確保に有効である。
なお、本プロセッサ１は、積差命令（ｖｈｍｓｕ、ｖｈｍｓｕｈ、ｖｈｍｓｕｗ等）についても、これらの積和命令と同様の処理を実行する。 Such an instruction is effective in ensuring accuracy.
Note that the processor 1 executes the same processing as the product-sum instruction for product difference instructions (vhmsu, vhmsuh, vhmsuw, etc.).

また、これらの命令では、レジスタ中の上位１６ビットが固定（共通化）されたが、レジスタ中の下位１６ビットを固定（共通化）する命令（ｖｌａｄｄｈ、ｖｌｓｕｂｈ、ｖｌｍｕｌ、ｖｌｆｍｕｌｈ、ｖｌｆｍｕｌｗ、ｖｌｍａｃ、ｖｌｍｓｕ、ｖｌｆｍａｃｈ、ｖｌｍｓｕｈ、ｖｌｆｍａｃｗ、ｖｌｍｓｕｗ等）についても、本プロセッサ１は、これらの演算命令と同様の処理を実行する。このような命令は、上記の上位固定の命令とペアで使用すると有効である。 In these instructions, the upper 16 bits in the register are fixed (shared), but the lower 16 bits in the register are fixed (shared) (vladdh, vlsubh, vlmul, vlfmulh, vlfmulw, vlmulw, vlmac, (vlmsu, vlfmach, vlmsuh, vlfmacw, vlmsuw, etc.), the processor 1 executes the same processing as these operation instructions. Such an instruction is effective when used as a pair with the upper fixed instruction.

また、本プロセッサ１は、以上のような一方のオペランド（レジスタ中の上位１６ビット）を共通とする２組のオペランドに対する演算（２並列ＳＩＭＤによる加算、減算、乗算、積和、積差）だけでなく、これを拡張したｎ組のオペランドに対する演算（４並列、８並列ＳＩＭＤ演算など）を実行する機能を備えてもよい。 Further, the processor 1 performs only operations (addition, subtraction, multiplication, product sum, product difference by two parallel SIMD) on two sets of operands that share one operand (the upper 16 bits in the register) as described above. Instead, it may have a function of executing operations (4 parallel, 8 parallel SIMD operations, etc.) on n sets of operands that are expanded from this.

例えば、いま、レジスタＲａに格納されている４つのバイトデータを上位からＲａ１、Ｒａ２，Ｒａ３，Ｒａ４とし、レジスタＲｂに格納されている４つのバイトデータを上位からＲｂ１、Ｒｂ２，Ｒｂ３，Ｒｂ４とした場合に、レジスタＲａとレジスタＲｂとをオペランドとするＳＩＭＤ演算命令であって、以下のような一方のオペランド（レジスタ中の１バイト）を共通とするバイトデータに対する演算を並列に実行する命令を備えてもよい。つまり、
（i）最上位固定の命令
Ｒａ１とＲｂ１、Ｒａ２とＲｂ１、Ｒａ３とＲｂ１、Ｒａ４とＲｂ１、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。
（i）中上位固定の命令
Ｒａ１とＲｂ２、Ｒａ２とＲｂ２、Ｒａ３とＲｂ２、Ｒａ４とＲｂ２、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。
（i）中下位位固定の命令
Ｒａ１とＲｂ３、Ｒａ２とＲｂ３、Ｒａ３とＲｂ３、Ｒａ４とＲｂ３、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。
（i）最下位固定の命令
Ｒａ１とＲｂ４、Ｒａ２とＲｂ４、Ｒａ３とＲｂ４、Ｒａ４とＲｂ４、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。 For example, the four byte data stored in the register Ra are Ra1, Ra2, Ra3, and Ra4 from the top, and the four byte data stored in the register Rb are Rb1, Rb2, Rb3, and Rb4 from the top. In this case, a SIMD operation instruction having the register Ra and the register Rb as operands, and an instruction for executing an operation on byte data sharing one operand (one byte in the register) as follows in parallel is provided. May be. That means
(I) Highest level fixed instruction Ra1 and Rb1, Ra2 and Rb1, Ra3 and Rb1, and Ra4 and Rb1 may be provided as 4-parallel SIMD type operation instructions.
(I) Middle-upper fixed instructions Ra1 and Rb2, Ra2 and Rb2, Ra3 and Rb2, and Ra4 and Rb2 may each be provided with a 4-parallel SIMD type arithmetic instruction.
(I) Middle and lower order fixed instructions Ra1 and Rb3, Ra2 and Rb3, Ra3 and Rb3, Ra4 and Rb3 may be provided as 4-parallel SIMD type operation instructions.
(I) The lowest fixed instruction Ra1 and Rb4, Ra2 and Rb4, Ra3 and Rb4, Ra4 and Rb4 may be provided as 4-parallel SIMD type operation instructions.

そして、これらの４並列ＳＩＭＤ型の演算の種類は、上記２並列ＳＩＭＤ型の演算と同様に、加算、減算、乗算、積和、積差のいずれについても適用することができる。さらに、乗算、積和、積差については、上記２並列ＳＩＭＤ型の演算命令（ｖｈｍｕｌ，ｖｈｆｍｕｌｈ，ｖｈｆｍｕｌｗなど）と同様に、４つの演算結果それぞれの下位バイトだけをレジスタＲｃ等にＳＩＭＤ格納したり、４つの演算結果それぞれの上位バイトだけをレジスタＲｃ等にＳＩＭＤ格納したり、４つの演算結果のうちの２つだけをレジスタＲｃ等にＳＩＭＤ格納したりする命令を備えてもよい。このような命令は、以下のようなケースに有効である。つまり、２つの要素を１要素ずつずらしながら要素ごとの演算をする場合に有効である。１個ずれ、２個ずれ、３個ずれの演算が必要となるからである。 These types of 4-parallel SIMD type operations can be applied to any of addition, subtraction, multiplication, product-sum, and product-difference similarly to the above-described 2-parallel SIMD type operations. Further, for multiplication, sum of products, and product difference, similar to the above-described two-parallel SIMD type operation instructions (vhmul, vhfmulh, vhfmulw, etc.), only the lower byte of each of the four operation results can be stored in the register Rc or the like. There may be provided an instruction for SIMD storing only the upper byte of each of the four calculation results in the register Rc or the like, or storing only two of the four calculation results in the register Rc or the like. Such an instruction is effective in the following cases. In other words, this is effective when performing the calculation for each element while shifting the two elements one element at a time. This is because it is necessary to calculate one deviation, two deviations, and three deviations.

以上のようなオペランドの一方を固定した演算を一般化して表現すると、次のようになる。つまり、本プロセッサ１は、ｎ（≧２）個のデータの並びからなる第１データ群を指定する第１オペランドとｎ個のデータの並びからなる第２データ群を指定する第２オペランドとを含むＳＩＭＤ型の命令に対して、ｉ＝１，２，…，ｎ、ｊ＝一定値とした場合における第１データ群の第ｉ番目データと第２データ群の第ｊ番目データとの組からなるｎ組それぞれに対して演算を行ってもよい。 The above generalized operation with one of the operands fixed is expressed as follows. That is, the processor 1 includes a first operand that specifies a first data group that includes a sequence of n (≧ 2) data and a second operand that specifies a second data group that includes a sequence of n data. For a SIMD type instruction to be included, from a set of i-th data in the first data group and j-th data in the second data group when i = 1, 2,..., N, j = constant values. An operation may be performed on each of the n sets.

（３）ＳＩＭＤ２項演算を実行し、その結果をビットシフトする命令
次に、２並列ＳＩＭＤ演算のうち、クロスの位置関係にあるオペランドを対象として演算を行う命令について説明する。 (3) Instruction for executing SIMD binary operation and bit-shifting the result Next, of 2 parallel SIMD operations, an instruction for performing an operation on operands in a cross positional relationship will be described.

［命令ｖａｄｄｓｈ］
命令ｖａｄｄｓｈは、ハーフワード（１６ビット）単位で２組のオペランドを加算し、その結果を１ビットだけ算術右シフトするＳＩＭＤ型の命令である。例えば、
ｖａｄｄｓｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、
（i）レジスタＲａの上位１６ビットとレジスタＲｂの上位１６ビットとを加算し、結果を１ビットだけ算術右シフトした値をレジスタＲｃの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットとレジスタＲｂの下位１６ビットとを加算し、結果を結果を１ビットだけ算術右シフトした値をレジスタＲｃの下位１６ビットに格納する。 [Instruction vaddsh]
The instruction vaddsh is a SIMD type instruction that adds two sets of operands in units of halfwords (16 bits) and arithmetically shifts the result by one bit to the right. For example,
vaddsh Rc, Ra, Rb
If so, the processor 1 uses an arithmetic logic / comparison unit 41 or the like to
(I) The upper 16 bits of the register Ra and the upper 16 bits of the register Rb are added, and a value obtained by arithmetically shifting the result by one bit is stored in the upper 16 bits of the register Rc.
(Ii) Add the lower 16 bits of the register Ra and the lower 16 bits of the register Rb, and store the result obtained by arithmetically shifting the result by one bit to the lower 16 bits of the register Rc.

このような命令は、加算により１６ビット精度におさまらなくなる前にシフトダウンして精度を確保するときに有効である。ケースによっては丸めが必要である。ＦＦＴ（バタフライ）では複素数同士の加減算が繰り返されるので多用される。 Such an instruction is effective for ensuring accuracy by shifting down before it is not reduced to 16-bit accuracy by addition. Some cases require rounding. FFT (butterfly) is frequently used because addition and subtraction of complex numbers are repeated.

なお、本プロセッサ１は、減算命令（ｖｓｕｂｓｈ等）についても、この加算命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as this addition instruction for a subtraction instruction (vsubsh or the like).

また、本プロセッサ１は、以上のような２組のオペランドに対する演算（２並列ＳＩＭＤによる加算、減算）だけでなく、これを拡張したｎ組のオペランドに対する演算（４並列、８並列ＳＩＭＤ演算など）を実行する機能を備えてもよい。 Further, the processor 1 performs not only operations on the above two sets of operands (addition and subtraction by 2-parallel SIMD), but also operations on n sets of operands (4-parallel, 8-parallel SIMD operations, etc.) that are expanded. May be provided with a function of executing

例えば、いま、レジスタＲａに格納されている４つのバイトデータを上位からＲａ１、Ｒａ２，Ｒａ３，Ｒａ４とし、レジスタＲｂに格納されている４つのバイトデータを上位からＲｂ１、Ｒｂ２，Ｒｂ３，Ｒｂ４とした場合に、レジスタＲａとレジスタＲｂとをオペランドとするＳＩＭＤ演算命令であって、以下のような演算とビットシフトとを行う命令を備えてもよい。つまり、Ｒａ１とＲｂ１、Ｒａ２とＲｂ２、Ｒａ３とＲｂ３、Ｒａ４とＲｂ４、それぞれを演算対象とする４並列ＳＩＭＤ型の演算命令を備えてもよい。例えば、バイト単位で４組のオペランドを加算し、その結果を１ビットだけ算術右シフトする命令ｖａｄｄｓｂなどである。 For example, the four byte data currently stored in the register Ra are Ra1, Ra2, Ra3, Ra4 from the top, and the four byte data stored in the register Rb are Rb1, Rb2, Rb3, Rb4 from the top. In this case, a SIMD operation instruction having operands of the register Ra and the register Rb and an instruction for performing the following operation and bit shift may be provided. That is, a 4-parallel SIMD type operation instruction for Ra1 and Rb1, Ra2 and Rb2, Ra3 and Rb3, and Ra4 and Rb4 may be provided. For example, an instruction vaddsb that adds four operands in byte units and arithmetically shifts the result by one bit to the right.

このような命令は、同様に精度確保に有効である。主に平均をとる場合に使用する（縦の平均）。 Such an instruction is also effective in ensuring accuracy. Used mainly for averaging (vertical averaging).

また、このようなＳＩＭＤ演算とシフトとを行う特徴的な命令は、上記のような右方向に１ビットだけシフトさせる命令に限られない。つまり、そのシフト量は、固定及び可変のいずれであってもよいし、右方向及び左方向のいずれであってもよい。また、右シフト時にこぼれたビットを丸めるようにしてもよい（例えば、命令ｖａｄｄｓｒｈや命令ｖａｄｄｓｒｂなど）。 Further, the characteristic instruction for performing such SIMD operation and shift is not limited to the instruction for shifting by one bit in the right direction as described above. That is, the shift amount may be either fixed or variable, and may be either the right direction or the left direction. Further, bits spilled during the right shift may be rounded (for example, instruction vaddsrh or instruction vaddsrb).

（４）ＳＩＭＤ（ベクトル）データを塁加算してスカラ化又はベクトルの低次元化をする命令 (4) An instruction to add a SIMD (vector) data to make it scalar or vector

次に、ベクトルデータをスカラ化又はベクトルの低次元化をするＳＩＭＤ型の命令について説明する。 Next, a SIMD type instruction for making vector data scalar or vector reduction will be described.

［命令ｖｓｕｍｈ］
命令ｖｓｕｍｈは、ハーフワード（１６ビット）単位で、２つのＳＩＭＤデータ（ベクトルデータ）を加算し、スカラ化するＳＩＭＤ型の命令である。例えば、
ｖｓｕｍｈＲｂ，Ｒａ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、レジスタＲａの上位１６ビットとレジスタＲａの下位１６ビットとを加算し、結果をレジスタＲｂに格納する。 [Instruction vsumh]
The instruction vsumh is a SIMD type instruction that adds two SIMD data (vector data) in units of halfwords (16 bits) to make a scalar. For example,
vsumh Rb, Ra
If so, the processor 1 adds the upper 16 bits of the register Ra and the lower 16 bits of the register Ra using the arithmetic logic / comparison unit 41 and the like, and stores the result in the register Rb.

このような命令は、様々な用途に使うことができる。平均（横の平均）、個別に演算（積和、加算）した結果を足し合わせる等である。 Such an instruction can be used for various purposes. For example, the average (horizontal average) and the results of individual calculations (sum of products, addition) are added.

［命令ｖｓｕｍｈ２］
命令ｖｓｕｍｈ２は、バイト単位で、２つのＳＩＭＤデータ（ベクトルデータ）を組とする２組それぞれについて、要素を塁加算し、スカラ化するＳＩＭＤ型の命令である。例えば、
ｖｓｕｍｈ２Ｒｂ，Ｒａ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、
（i）レジスタＲａの最上位バイトと中上位バイトとを塁加算し、結果をレジスタＲｂの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの中下位バイトと最下位バイトとを塁加算し、結果をレジスタＲｂの下位１６ビットに格納する。 [Instruction vsumh2]
The instruction vsumh2 is a SIMD type instruction that adds elements to each of two sets each including two SIMD data (vector data) in units of bytes to make a scalar. For example,
vsumh2 Rb, Ra
If so, the processor 1 uses an arithmetic logic / comparison unit 41 or the like to
(I) Add the most significant byte and the middle high-order byte of the register Ra and store the result in the high-order 16 bits of the register Rb.
(Ii) The middle lower byte and the least significant byte of the register Ra are added together and the result is stored in the lower 16 bits of the register Rb.

このような命令は、画像処理、動き補償（ＭＣ）、ハーフペル用命令として有効である。 Such a command is effective as a command for image processing, motion compensation (MC), and half-pel.

なお、本プロセッサ１は、以上のような２並列ＳＩＭＤデータをスカラ化する演算だけでなく、これを拡張したｎ（４、８など）個の要素からなるｎ並列ＳＩＭＤデータをスカラ化する演算を実行する機能を備えてもよい。 The processor 1 performs not only the above-described operation for converting the 2-parallel SIMD data into a scalar, but also the operation for converting the n-parallel SIMD data including n (4, 8, etc.) elements, which is an extension of the 2-parallel SIMD data, into a scalar. A function to be executed may be provided.

例えば、いま、レジスタＲａに格納されている４つのバイトデータを上位からＲａ１、Ｒａ２，Ｒａ３，Ｒａ４とした場合に、それらＲａ１、Ｒａ２，Ｒａ３及びＲａ４を塁加算し、その結果をレジスタＲｂに格納する命令を備えてもよい。 For example, when the four byte data stored in the register Ra is set to Ra1, Ra2, Ra3, Ra4 from the top, the Ra1, Ra2, Ra3, and Ra4 are added together and the result is stored in the register Rb. Instructions may be provided.

また、本プロセッサ１は、単に複数の要素データからなるベクトルを１つの要素データからなるスカラに変換するだけでなく、その要素データの個数を減少させるというベクトルの低次元化を行ってもよい。 Further, the processor 1 may not only simply convert a vector composed of a plurality of element data into a scalar composed of one element data, but also reduce the number of vectors so as to reduce the number of element data.

また、演算の種類は、加算だけに限られず、平均値を算出する演算であってもよい。このような命令は、平均、演算結果の足し合わせ等に有効である。 Further, the type of calculation is not limited to addition, but may be calculation for calculating an average value. Such an instruction is effective for averaging, addition of calculation results, and the like.

（５）その他ＳＩＭＤに関する命令
次に、以上の分類に属さないその他のＳＩＭＤ型の命令について説明する。 (5) Other SIMD Instructions Next, other SIMD type instructions not belonging to the above classification will be described.

［命令ｖｅｘｔｈ］
命令ｖｅｘｔｈは、ハーフワード（１６ビット）単位で、２つのＳＩＭＤデータそれぞれを符号拡張するＳＩＭＤ型の命令である。例えば、
ｖｅｘｔｈＭｍ，Ｒｂ，Ｒａ
であれば、プロセッサ１は、変換器４７の飽和ブロック(SAT)４７ａ等により、
（i）レジスタＲａの上位１６ビットを３２ビットに符号拡張し、その結果を演算レジスタＭＨｍの上位１６ビットと演算レジスタＭＬｍの上位１６ビットに格納し、それと並行して、
（ii）レジスタＲａの下位１６ビットを３２ビットに符号拡張し、その結果を演算レジスタＭＨｍの下位１６ビットと演算レジスタＭＬｍの下位１６ビットに格納し、それと並行して、
（iii）レジスタＲａの３２ビットをレジスタＲｂに格納する。 [Instruction vexth]
The instruction vexth is a SIMD type instruction that sign-extends each of two SIMD data in units of half words (16 bits). For example,
vexth Mm, Rb, Ra
If so, the processor 1 uses the saturation block (SAT) 47a of the converter 47, etc.
(I) The upper 16 bits of the register Ra are sign-extended to 32 bits, and the result is stored in the upper 16 bits of the arithmetic register MHm and the upper 16 bits of the arithmetic register MLm.
(Ii) The lower 16 bits of the register Ra are sign-extended to 32 bits, and the result is stored in the lower 16 bits of the arithmetic register MHm and the lower 16 bits of the arithmetic register MLm.
(Iii) 32 bits of the register Ra are stored in the register Rb.

なお、符号拡張とは、符号情報を維持したままデータの長さを大きくすることをいい、例えば、ハーフワードで表現された符号付き数値をワードで表現された同じ数値に変換することである。具体的には、元のデータの符号ビット（最上位ビット）の値で拡張した上位ビットを埋め尽くす処理である。 Note that sign extension refers to increasing the length of data while maintaining code information, for example, converting a signed numeric value expressed in half words into the same numeric value expressed in words. Specifically, it is a process of filling up the higher bits expanded with the value of the sign bit (most significant bit) of the original data.

このような命令は、ＳＩＭＤデータのアキュムレータへの転送（精度が必要な場合）に有効である。 Such an instruction is effective for transferring SIMD data to an accumulator (when accuracy is required).

［命令ｖａｓｕｂｂ］
命令ｖａｓｕｂｂは、バイト単位で、４組のＳＩＭＤデータそれぞれを減算し、その結果得られる４つの符号を条件フラグレジスタに格納するＳＩＭＤ型の命令である。例えば、
ｖａｓｕｂｂＲｃ，Ｒｂ，Ｒａ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、
（i）レジスタＲｂの最上位８ビットからレジスタＲａの最上位８ビットを減算し、その結果をレジスタＲｃの最上位８ビットに格納するとともに、その符号を条件フラグレジスタ（ＣＦＲ）３２のＶＣ３に格納し、それと並行して、
（ii）レジスタＲｂの中上位８ビットからレジスタＲａの中上位８ビットを減算し、その結果をレジスタＲｃの中上位８ビットに格納するとともに、その符号を条件フラグレジスタ（ＣＦＲ）３２のＶＣ２に格納し、それと並行して、
（iii）レジスタＲｂの中下位８ビットからレジスタＲａの中下位８ビットを減算し、その結果をレジスタＲｃの中下位８ビットに格納するとともに、その符号を条件フラグレジスタ（ＣＦＲ）３２のＶＣ１に格納し、それと並行して、
（iv）レジスタＲｂの最下位８ビットからレジスタＲａの最下位８ビットを減算し、その結果をレジスタＲｃの最下位８ビットに格納するとともに、その符号を条件フラグレジスタ（ＣＦＲ）３２のＶＣ０に格納する。 [Instruction vasubb]
The instruction vassub is a SIMD type instruction that subtracts four sets of SIMD data in units of bytes and stores the four codes obtained as a result in the condition flag register. For example,
Vasubb Rc, Rb, Ra
If so, the processor 1 uses an arithmetic logic / comparison unit 41 or the like to
(I) The most significant 8 bits of the register Ra is subtracted from the most significant 8 bits of the register Rb, the result is stored in the most significant 8 bits of the register Rc, and the sign thereof is stored in the VC3 of the condition flag register (CFR) 32 Store and in parallel,
(Ii) Subtract the upper 8 bits of the register Ra from the upper 8 bits of the register Rb, store the result in the upper 8 bits of the register Rc, and store the sign in the VC2 of the condition flag register (CFR) 32 Store and in parallel,
(Iii) The lower 8 bits of the register Ra are subtracted from the lower 8 bits of the register Rb, the result is stored in the 8 lower bits of the register Rc, and the sign is stored in the VC1 of the condition flag register (CFR) 32 Store and in parallel,
(Iv) The least significant 8 bits of the register Ra are subtracted from the least significant 8 bits of the register Rb, the result is stored in the least significant 8 bits of the register Rc, and the sign thereof is stored in the VC0 of the condition flag register (CFR) 32 Store.

このような命令は、絶対値差分和を得ることに一時的に９ビット精度が必要とされる場合に有効である。 Such an instruction is effective when 9-bit precision is temporarily required to obtain the absolute value difference sum.

［命令ｖａｂｓｓｕｍｂ］
命令ｖａｂｓｓｕｍｂは、バイト単位で、４組のＳＩＭＤデータそれぞれを絶対値加算し、その結果と他の４バイトデータとを加算するＳＩＭＤ型の命令である。例えば、
ｖａｂｓｓｕｍｂＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、レジスタＲａの最上位８ビットの絶対値と中上位８ビットの絶対値と中下位８ビットの絶対値と最下位８ビットの絶対値とを加算し、その結果とレジスタＲｂの３２ビットとを加算し、得られた結果をレジスタＲｃに格納する。なお、本プロセッサ１は、条件フラグレジスタ（ＣＦＲ）３２のフラグＶＣ０〜ＶＣ３によって、レジスタＲａに格納された各バイトの絶対値を特定する。 [Instruction vabsum]
The instruction vabssum is a SIMD type instruction that adds the absolute value of each of four sets of SIMD data in units of bytes and adds the result to the other 4-byte data. For example,
vabsumb Rc, Ra, Rb
If so, the processor 1 uses the arithmetic logic / comparison unit 41 or the like to calculate the absolute value of the most significant 8 bits, the absolute value of the middle and upper 8 bits, the absolute value of the middle and lower 8 bits, and the least significant 8 bits of the register Ra. The absolute value is added, the result is added to 32 bits of the register Rb, and the obtained result is stored in the register Rc. The processor 1 specifies the absolute value of each byte stored in the register Ra by the flags VC0 to VC3 of the condition flag register (CFR) 32.

このような命令は、上記命令ｖａｓｕｂｂと組み合わせて使用することにより、複数組のデータに対して、各組の差分を算出した後に、それぞれの差分の絶対値を合計した値が得られ、画像処理の動き予測の差分絶対値和を算出する場合等に有効である。 Such an instruction is used in combination with the instruction vasubb, so that after calculating the difference of each set for a plurality of sets of data, a value obtained by summing the absolute values of the respective differences is obtained. This is effective when calculating the sum of absolute differences of motion prediction.

（６）その他マスク演算等に関する命令
次に、ＳＩＭＤ型の命令ではないが、特徴的な処理を行う命令について説明する。 (6) Other Instructions Related to Mask Operations, etc. Next, instructions that are not SIMD type instructions but perform characteristic processing will be described.

［命令ａｄｄｍｓｋ］
命令ａｄｄｍｓｋは、２つのオペランドの一方の一部のビット（上位）をマスクして加算する命令である。例えば、
ａｄｄｍｓｋＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、算術論理・比較演算器４１や変換器４７等により、条件フラグレジスタ（ＣＦＲ）３２のＢＰＯで指定された範囲（下位ビット）でのみレジスタＲａとレジスタＲｂに格納されたデータを加算してレジスタＲｃに格納するとともに、指定されていない範囲（上位ビット）についてはレジスタＲａの値をそのままレジスタＲｃに格納する。 [Instruction addmsk]
The instruction addmsk is an instruction that masks and adds some bits (upper order) of one of two operands. For example,
addmsk Rc, Ra, Rb
If so, the processor 1 is stored in the register Ra and the register Rb only by the arithmetic logic / comparison calculator 41, the converter 47, etc. within the range (lower bits) designated by the BPO of the condition flag register (CFR) 32. The added data is added and stored in the register Rc, and the value of the register Ra is stored in the register Rc as it is for the unspecified range (upper bit).

このような命令は、モジュロアドレッシング（ＤＳＰでは一般的）への対応として有効である。バタフライ演算の準備のため予め特定のパターンにデータを並べ替えるあめに必要である。 Such an instruction is effective as a response to modulo addressing (common in DSP). This is necessary for rearranging data into a specific pattern in advance for preparation of butterfly computation.

なお、本プロセッサ１は、減算命令（ｓｕｂｍｓｋ等）についても、この加算命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as the addition instruction for the subtraction instruction (submsk or the like).

［命令ｍｓｋｂｒｖｈ］
命令ｍｓｋｂｒｖｈは、２つのオペランドの一方の一部のビット（下位）をリバース（ビットの並びを逆順に）した後に、他方のオペランドとビット連結する命令である。例えば、
ｍｓｋｂｒｖｈＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、変換器４７等により、レジスタＲｂの下位１６ビットを逆順にした後に、条件フラグレジスタ（ＣＦＲ）３２のＢＰＯで指定されたビット位置で、レジスタＲａのデータとレジスタＲｂのデータとを連結し、結果をレジスタＲｃに格納する。このとき、レジスタＲｂの上位１６ビットのうち、ＢＰ０で指定された位置よりも下位部分が０にマスクされる。 [Instruction mskbrvh]
The instruction mskbrvh is an instruction that performs bit concatenation with the other operand after reversing a part (lower order) of one of the two operands (in reverse order of the bit sequence). For example,
mskbrvh Rc, Ra, Rb
If so, the processor 1 reverses the lower 16 bits of the register Rb with the converter 47 and the like, and then at the bit position designated by the BPO of the condition flag register (CFR) 32, the data of the register Ra and the register Rb And the result are stored in the register Rc. At this time, the lower part of the upper 16 bits of the register Rb is masked to 0 from the position specified by BP0.

このような命令は、リバースアドレッシングに対応し、バタフライ演算の準備のために予め特定のパターンにデータを並べ替えるのに必要である。 Such an instruction corresponds to reverse addressing and is necessary for rearranging data in a specific pattern in advance for preparation of butterfly computation.

なお、本プロセッサ１は、１６ビットを逆順にするだけでなく、１バイト等の領域を逆順にする命令（ｍｓｋｂｒｖｂ等）についても、この命令と同様の処理を実行する。 Note that the processor 1 executes the same processing as this instruction not only for the 16 bits in the reverse order but also for an instruction (such as mskbrvb) that reverses the area of 1 byte or the like.

［命令ｍｓｋ］
命令ｍｓｋは、オペランドを構成するビットのうち、指定された２つのビット位置で挟まれた領域をマスク（０に）する、又は、その領域の外をマスクする命令である。例えば、
ｍｓｋＲｃ，Ｒａ，Ｒｂ
であれば、プロセッサ１は、変換器４７等により、
（i）Ｒｂ［１２：８］≧Ｒｂ［４：０］である場合には、
レジスタＲａに格納された３２ビットのうち、レジスタＲｂの第０〜４の５ビットＲｂ［４：０］で指定されるビット位置からレジスタＲｂの第８〜１２の５ビットＲｂ［１２：８］で指定されるビット位置についてはそのままにし、他のビットについてはマスク（０に）して、レジスタＲｃに格納する。
（ii）Ｒｂ［１２：８］＜Ｒｂ［４：０］である場合には、
レジスタＲａに格納された３２ビットのうち、レジスタＲｂの第８〜１２の５ビットＲｂ［１２：８］で指定されるビット位置からレジスタＲｂの第０〜４の５ビットＲｂ［４：０］で指定されるビット位置についてはマスク（０に）し、他のビットについてはそのままにして、レジスタＲｃに格納する。 [Instruction msk]
The instruction msk is an instruction that masks (sets to 0) an area sandwiched between two designated bit positions among bits constituting an operand, or masks the outside of the area. For example,
msk Rc, Ra, Rb
If so, the processor 1 uses the converter 47 or the like to
(I) When Rb [12: 8] ≧ Rb [4: 0],
Of the 32 bits stored in the register Ra, the 8th to 12th 5 bits Rb [12: 8] of the register Rb from the bit position specified by the 0th to 4th 5 bits Rb [4: 0] of the register Rb. The bit position specified by is kept as it is, and the other bits are masked (set to 0) and stored in the register Rc.
(Ii) When Rb [12: 8] <Rb [4: 0],
Of the 32 bits stored in the register Ra, the 0th to 4th 5 bits Rb [4: 0] of the register Rb from the bit position specified by the 8th to 12th 5 bits Rb [12: 8] of the register Rb. The bit position designated by (1) is masked (set to 0), the other bits are left as they are, and stored in the register Rc.

このような命令は、ビットフィールドの抽出、挿入（組み立て）に使用することができる。ＶＬＤ／ＶＬＣをソフトで行う場合に使用することができる。 Such an instruction can be used for bit field extraction and insertion (assembly). It can be used when VLD / VLC is performed by software.

［命令ｂｓｅｑ］
命令ｂｓｅｑは、オペランドのＭＳＢの１ビット下から連続する符号ビットの数を数える命令である。例えば、
ｂｓｅｑＲｂ，Ｒａ
であれば、プロセッサ１は、変換器４７のBSEQブロック４７ｂにより、レジスタＲａの１ビット下から、連続する符号ビットの数を数えて結果をレジスタＲｂに格納する。なお、レジスタＲａの値が０である場合には、レジスタＲｂに０を格納する。 [Instruction bseq]
The instruction bseq is an instruction that counts the number of sign bits that are consecutive from one bit below the MSB of the operand. For example,
bseq Rb, Ra
If so, the processor 1 uses the BSEQ block 47b of the converter 47 to count the number of consecutive sign bits from one bit below the register Ra, and stores the result in the register Rb. When the value of the register Ra is 0, 0 is stored in the register Rb.

このような命令は、有効桁の検出に使用することができる。ダイナミックレンジを大きく取るため、一部浮動小数点演算を行う場合がある。例えば、配列内の最も有効桁の多いデータに合わせて全てのデータを正規化し、演算するといったことに使用することができる。 Such an instruction can be used to detect significant digits. In order to increase the dynamic range, some floating point operations may be performed. For example, it can be used for normalizing and calculating all data according to data having the most significant digits in the array.

［命令ｌｄｂｐ］
命令ｌｄｂｐは、メモリから２バイトのデータを符号拡張してレジスタにロードする命令である。例えば、
ｌｄｂｐＲｂ：Ｒｂ＋１，（Ｒａ，Ｄ９）
であれば、プロセッサ１は、Ｉ／Ｆ部５０等により、レジスタＲａの値にディスプレースメント値（Ｄ９）を加算したアドレスから２つのバイトデータを符号拡張してレジスタＲａとレジスタ（Ｒａ＋１）それぞれにロードする。 [Instruction ldbp]
The instruction ldbp is an instruction for sign-extending 2-byte data from the memory and loading the register. For example,
ldbp Rb: Rb + 1, (Ra, D9)
If so, the processor 1 sign-extends two byte data from the address obtained by adding the displacement value (D9) to the value of the register Ra by the I / F unit 50 or the like, and assigns it to each of the register Ra and the register (Ra + 1). Load it.

このような命令は、データ供給を高速にする。
なお、本プロセッサ１は、２つのレジスタへのロードだけでなく、１つのレジスタの上位ハーフワードと下位ハーフワードへのロード等（ｌｄｂｈ等）についても、このロード命令と同様の処理（符号拡張を伴うロード）を実行する。
［命令ｒｄｅ］
命令ｒｄｅは、外部レジスタの値を読み込み、読み込みの失敗時にエラー例外を発生する命令である。例えば、
ｒｄｅＣ０：Ｃ１，Ｒｂ，（Ｒａ，Ｄ５）
であれば、プロセッサ１は、Ｉ／Ｆ部５０等により、レジスタＲａの値にディスプレースメント値（Ｄ５）を加算した値を外部レジスタ番号とし、その外部レジスタ（拡張レジスタ部８０）の値をレジスタＲｂに読み込むとともに、読み込みの成功・失敗を条件フラグレジスタ（ＣＦＲ）３２のコンディションフラグＣ０及びＣ１に出力する。失敗時には拡張レジスタエラー例外を発生する。 Such instructions speed up data supply.
The processor 1 not only loads the two registers but also loads the upper halfword and the lower halfword of one register (ldbh, etc.) in the same manner as this load instruction (sign extension is performed). Load).
[Instruction rde]
The instruction rde is an instruction that reads the value of the external register and generates an error exception when reading fails. For example,
rde C0: C1, Rb, (Ra, D5)
If so, the processor 1 uses the I / F unit 50 or the like as a value obtained by adding the displacement value (D5) to the value of the register Ra as an external register number, and registers the value of the external register (extended register unit 80) as a register. In addition to reading into Rb, the success / failure of reading is output to the condition flags C0 and C1 of the condition flag register (CFR) 32. An extension register error exception occurs on failure.

このような命令は、ハードアクセラレータを制御するための命令として有効である。ハードがエラーを返したら例外が発生し、フラグにも反映される。 Such an instruction is effective as an instruction for controlling the hard accelerator. If hardware returns an error, an exception is raised and reflected in the flag.

なお、本プロセッサ１は、外部レジスタからの読み込みだけでなく、外部レジスタへの書き出し（命令ｗｔｅ）についても、この読み出し命令と同様の処理（フラグのセット、例外の発生）を実行する。 The processor 1 executes not only reading from the external register but also writing (instruction wete) to the external register in the same manner as the read instruction (setting a flag and generating an exception).

［命令ａｄｄａｒｖｗ］
命令ａｄｄａｒｖｗは、絶対値丸め（ゼロから離れる方向に丸め）の加算を行う命令である。例えば、
ａｄｄａｒｖｗＲｃ，Ｒｂ，Ｒａ
であれば、プロセッサ１は、算術論理・比較演算器４１等により、レジスタＲａの３２ビットとレジスタＲｂの３２ビットとを加算し、その結果が正であれば、丸めるビットを切り上げる処理をし、結果が負であれば、丸めるビットを切り下げる処理をする。具体的には、レジスタＲａとＲｂの値を加算し、レジスタＲａの値が正である場合に１を加算する。なお、絶対値丸めを行う場合、レジスタＲｂには丸めするビットより下位のビットを１で埋めた値をいれておく。 [Instruction addarrvw]
The instruction adddarww is an instruction that performs addition of absolute value rounding (rounding away from zero). For example,
addarrvw Rc, Rb, Ra
If so, the processor 1 adds 32 bits of the register Ra and 32 bits of the register Rb by the arithmetic logic / comparison unit 41 or the like, and if the result is positive, rounds up the bits to be rounded. If the result is negative, round down the bits to be rounded. Specifically, the values of the registers Ra and Rb are added, and 1 is added when the value of the register Ra is positive. In the case of performing absolute value rounding, a value obtained by padding bits lower than the bit to be rounded with 1 is stored in the register Rb.

このような命令は、絶対値丸め（ゼロから離れる方向に丸め）の加算ＩＤＣＴ（Inverse Discrete Cosine Transform）に有効である。 Such an instruction is effective for addition IDCT (Inverse Discrete Cosine Transform) of absolute value rounding (rounding away from zero).

本発明は、ＤＳＰやＣＰＵ等のプロセッサとして、特にＳＩＭＤ型の命令を実行するプロセッサとして、例えば、携帯電話、モバイルＡＶ機器、デジタルＴＶ、ＤＶＤ等に共通のコアプロセッサとして、利用することができる。 The present invention can be used as a processor such as a DSP or a CPU, particularly as a processor for executing SIMD instructions, for example, as a core processor common to a mobile phone, a mobile AV device, a digital TV, a DVD, or the like.

本発明に係るプロセッサの概略ブロック図である。1 is a schematic block diagram of a processor according to the present invention. 同プロセッサの算術論理・比較演算器の概略図を示す。A schematic diagram of an arithmetic logic / comparison arithmetic unit of the processor is shown. 同プロセッサのバレルシタの構成を示すブロック図である。It is a block diagram which shows the structure of the barrel cutter of the processor. 同プロセッサの変換器の構成を示すブロック図である。It is a block diagram which shows the structure of the converter of the processor. 同プロセッサの除算器の構成を示すブロック図である。It is a block diagram which shows the structure of the divider of the processor. 同プロセッサの乗算・積和演算器の構成を示すブロック図である。It is a block diagram which shows the structure of the multiplication / product-sum calculator of the processor. 同プロセッサの命令制御部の構成を示すブロック図である。It is a block diagram which shows the structure of the instruction control part of the processor. 同プロセッサの汎用レジスタ（Ｒ０〜Ｒ３１）の構造を示す図である。It is a figure which shows the structure of the general purpose register | resistor (R0-R31) of the processor. 同プロセッサのリンクレジスタ（ＬＲ）の構造を示す図である。It is a figure which shows the structure of the link register (LR) of the processor. 同プロセッサの分岐レジスタ（ＴＡＲ）の構造を示す図である。It is a figure which shows the structure of the branch register (TAR) of the processor. 同プロセッサのプログラム状態レジスタ（ＰＳＲ）の構造を示す図である。It is a figure which shows the structure of the program status register (PSR) of the processor. 同プロセッサの条件フラグレジスタ（ＣＦＲ）の構造を示す図である。It is a figure which shows the structure of the condition flag register (CFR) of the processor. 同プロセッサのアキュームレータ（Ｍ０，Ｍ１）の構造を示す図である。It is a figure which shows the structure of the accumulator (M0, M1) of the processor. 同プロセッサのプログラムカウンタ（ＰＣ）の構造を示す図である。It is a figure which shows the structure of the program counter (PC) of the processor. 同プロセッサのＰＣ退避用レジスタ（ＩＰＣ）の構造を示す図である。It is a figure which shows the structure of the register | resistor for PC saving (IPC) of the processor. 同プロセッサのＰＳＲ退避用レジスタ（ＩＰＳＲ）の構造を示す図である。It is a figure which shows the structure of the PSR save register (IPSR) of the processor. 同プロセッサのパイプライン動作を示すタイミング図である。FIG. 10 is a timing chart showing a pipeline operation of the processor. 同プロセッサによる命令実行時の各パイプライン動作を示すタイミング図である。FIG. 10 is a timing chart showing each pipeline operation when an instruction is executed by the processor. 同プロセッサの並列動作を示す図である。It is a figure which shows the parallel operation | movement of the processor. 同プロセッサが実行する命令のフォーマットを示す図である。It is a figure which shows the format of the command which the processor performs. カテゴリー「ALUadd（加算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "ALUadd (addition) system". カテゴリー「ALUsub（減算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "ALUsub (subtraction) system". カテゴリー「ALUlogic（論理演算）系ほか」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to a category "ALUlogic (logic operation) system etc.". カテゴリー「CMP（比較演算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "CMP (comparison operation) system". カテゴリー「mul（乗算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "mul (multiplication) system". カテゴリー「mac（積和演算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "mac (product-sum operation) system". カテゴリー「msu（積差演算）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "msu (product difference operation) system". カテゴリー「MEMｌd（メモリ読み出し）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "MEMld (memory read) system". カテゴリー「MEMstore（メモリ書き出し）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "MEMstore (memory writing) system". カテゴリー「BRA（分岐）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "BRA (branch) system". カテゴリー「BSasl（算術バレルシフト）系ほか」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "BSasl (arithmetic barrel shift) system etc.". カテゴリー「BSlsr（論理バレルシフト）系ほか」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "BSlsr (logical barrel shift) system etc.". カテゴリー「CNVvaln（算術変換）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "CNVvaln (arithmetic conversion) system". カテゴリー「CNV（一般変換）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "CNV (general conversion) system". カテゴリー「SATvlpk（飽和処理）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to the category "SATvlpk (saturation processing) system". カテゴリー「ETC（その他）系」に属する命令を説明する図である。It is a figure explaining the instruction which belongs to a category "ETC (others) system". 命令「ld Rb,(Ra,D10)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (Ra, D10)”. 命令「ld Rb3,(Ra3,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb3, (Ra3, D5)”. 命令「ld Rb,(GP,D13)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (GP, D13)”. 命令「ld Rb2,(GP,D6)」を説明する図である。FIG. 5 is a diagram for explaining an instruction “ld Rb2, (GP, D6)”. 命令「ld Rb2,(GP)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb2, (GP)”. 命令「ld Rb,(SP,D13)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (SP, D13)”. 命令「ld Rb2,(SP,D6)」を説明する図である。FIG. 11 is a diagram for explaining an instruction “ld Rb2, (SP, D6)”. 命令「ld Rb2,(SP)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb2, (SP)”. 命令「ld Rb,(Ra+)I10」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (Ra +) I10”. 命令「ld Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (Ra +)”. 命令「ld Rb2,(Ra2+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb2, (Ra2 +)”. 命令「ld Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb, (Ra)”. 命令「ld Rb2,(Ra2)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ld Rb2, (Ra2)”. 命令「ldh Rb,(Ra,D9)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb, (Ra, D9)”. 命令「ldh Rb3,(Ra3,D4)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldh Rb3, (Ra3, D4)”. 命令「ldh Rb,(GP,D12)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb, (GP, D12)”. 命令「ldh Rb2,(GP,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb2, (GP, D5)”. 命令「ldh Rb2,(GP)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldh Rb2, (GP)”. 命令「ldh Rb,(SP,D12)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb, (SP, D12)”. 命令「ldh Rb2,(SP,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb2, (SP, D5)”. 命令「ldh Rb2,(SP)」を説明する図である。FIG. 11 is a diagram for explaining an instruction “ldh Rb2, (SP)”. 命令「ldh Rb,(Ra+)I9」を説明する図である。FIG. 11 is a diagram for explaining an instruction “ldh Rb, (Ra +) I9”. 命令「ldh Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb, (Ra +)”. 命令「ldh Rb2,(Ra2+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb2, (Ra2 +)”. 命令「ldh Rb,(Ra)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldh Rb, (Ra)”. 命令「ldh Rb2,(Ra2)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldh Rb2, (Ra2)”. 命令「ldhu Rb,(Ra,D9)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhu Rb, (Ra, D9)”. 命令「ldhu Rb,(GP,D12)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhu Rb, (GP, D12)”. 命令「ldhu Rb,(SP,D12)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhu Rb, (SP, D12)”. 命令「ldhu Rb,(Ra+)I9」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhu Rb, (Ra +) I9”. 命令「ldhu Rb,(Ra+)」を説明する図である。FIG. 11 is a diagram for explaining an instruction “ldhu Rb, (Ra +)”. 命令「ldhu Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhu Rb, (Ra)”. 命令「ldb Rb,(Ra,D8)」を説明する図である。FIG. 5 is a diagram for explaining an instruction “ldb Rb, (Ra, D8)”. 命令「ldb Rb,(GP,D11)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldb Rb, (GP, D11)”. 命令「ldb Rb,(SP,D11)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldb Rb, (SP, D11)”. 命令「ldb Rb,(Ra+)I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldb Rb, (Ra +) I8”. 命令「ldb Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldb Rb, (Ra +)”. 命令「ldb Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldb Rb, (Ra)”. 命令「ldbu Rb,(Ra,D8)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbu Rb, (Ra, D8)”. 命令「ldbu Rb,(GP,D11)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldbu Rb, (GP, D11)”. 命令「ldbu Rb,(SP,D11)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbu Rb, (SP, D11)”. 命令「ldbu Rb,(Ra+)I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbu Rb, (Ra +) I8”. 命令「ldbu Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbu Rb, (Ra +)”. 命令「ldbu Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbu Rb, (Ra)”. 命令「ldp Rb:Rb+1,(Ra,D11)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (Ra, D11)”. 命令「ldp Rb:Rb+1,(GP,D14)」を説明する図である。FIG. 6 is a diagram illustrating an instruction “ldp Rb: Rb + 1, (GP, D14)”. 命令「ldp Rb:Rb+1,(SP,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (SP, D14)”. 命令「ldp Rb:Rb+1,(SP,D7)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (SP, D7)”. 命令「ldp Rb:Rb+1,(SP)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (SP)”. 命令「ldp Rb:Rb+1,(Ra+)I11」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (Ra +) I11”. 命令「ldp Rb2:Rb2+1,(Ra2+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb2: Rb2 + 1, (Ra2 +)”. 命令「ldp Rb:Rb+1,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (Ra +)”. 命令「ldp Rb:Rb+1,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp Rb: Rb + 1, (Ra)”. 命令「ldp LR:SVR,(Ra,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (Ra, D14)”. 命令「ldp LR:SVR,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (Ra)”. 命令「ldp LR:SVR,(GP,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (GP, D14)”. 命令「ldp LR:SVR,(SP,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (SP, D14)”. 命令「ldp LR:SVR,(SP,D7)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (SP, D7)”. 命令「ldp LR:SVR,(SP)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp LR: SVR, (SP)”. 命令「ldp TAR:UDR,(Ra,D11)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ldp TAR: UDR, (Ra, D11)”. 命令「ldp TAR:UDR,(GP,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp TAR: UDR, (GP, D14)”. 命令「ldp TAR:UDR,(SP,D14)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldp TAR: UDR, (SP, D14)”. 命令「ldhp Rb:Rb+1,(Ra,D10)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhp Rb: Rb + 1, (Ra, D10)”. 命令「ldhp Rb:Rb+1,(Ra+)I10」を説明する図である。FIG. 6 is a diagram illustrating an instruction “ldhp Rb: Rb + 1, (Ra +) I10”. 命令「ldhp Rb2:Rb2+1,(Ra2+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhp Rb2: Rb2 + 1, (Ra2 +)”. 命令「ldhp Rb:Rb+1, (Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhp Rb: Rb + 1, (Ra +)”. 命令「ldhp Rb:Rb+1,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldhp Rb: Rb + 1, (Ra)”. 命令「ldbp Rb:Rb+1,(Ra,D9)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbp Rb: Rb + 1, (Ra, D9)”. 命令「ldbp Rb:Rb+1,(Ra+)I9」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbp Rb: Rb + 1, (Ra +) I9”. 命令「ldbp Rb:Rb+1,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbp Rb: Rb + 1, (Ra +)”. 命令「ldbp Rb:Rb+1,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbp Rb: Rb + 1, (Ra)”. 命令「ldbh Rb,(Ra+)I7」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbh Rb, (Ra +) I7”. 命令「ldbh Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbh Rb, (Ra +)”. 命令「ldbh Rb,(Ra)」を説明する図である。FIG. 6 is a diagram illustrating an instruction “ldbh Rb, (Ra)”. 命令「ldbuh Rb,(Ra+)I7」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbuh Rb, (Ra +) I7”. 命令「ldbuh Rb,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbuh Rb, (Ra +)”. 命令「ldbuh Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbuh Rb, (Ra)”. 命令「ldbhp Rb:Rb+1,(Ra+)I7」を説明する図である。FIG. 11 is a diagram for explaining an instruction “ldbhp Rb: Rb + 1, (Ra +) I7”. 命令「ldbhp Rb:Rb+1,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbhp Rb: Rb + 1, (Ra +)”. 命令「ldbhp Rb:Rb+1,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbhp Rb: Rb + 1, (Ra)”. 命令「ldbuhp Rb:Rb+1,(Ra+)I7」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbuhp Rb: Rb + 1, (Ra +) I7”. 命令「ldbuhp Rb:Rb+1,(Ra+)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldbuhp Rb: Rb + 1, (Ra +)”. 命令「ldbuhp Rb:Rb+1,(Ra)」を説明する図である。FIG. 6 is a diagram illustrating an instruction “ldbuhp Rb: Rb + 1, (Ra)”. 命令「st (Ra,D10),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “st (Ra, D10), Rb”. 命令「st (Ra3,D5),Rb3」を説明する図である。FIG. 5 is a diagram for explaining an instruction “st (Ra3, D5), Rb3”. 命令「st (GP,D13),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (GP, D13), Rb”. 命令「st (GP,D6),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (GP, D6), Rb2”. 命令「st (GP),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “st (GP), Rb2”. 命令「st (SP,D13),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (SP, D13), Rb”. 命令「st (SP,D6),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (SP, D6), Rb2”. 命令「st (SP),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “st (SP), Rb2”. 命令「st (Ra+)I10,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (Ra +) I10, Rb”. 命令「st (Ra+),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (Ra +), Rb”. 命令「st (Ra2+),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (Ra2 +), Rb2”. 命令「st (Ra),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (Ra), Rb”. 命令「st (Ra2),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “st (Ra2), Rb2”. 命令「sth (Ra,D9),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (Ra, D9), Rb”. 命令「sth (Ra3,D4),Rb3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (Ra3, D4), Rb3”. 命令「sth (GP,D12),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (GP, D12), Rb”. 命令「sth (GP,D5),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (GP, D5), Rb2”. 命令「sth (GP),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (GP), Rb2”. 命令「sth (SP,D12),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (SP, D12), Rb”. 命令「sth (SP,D5),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (SP, D5), Rb2”. 命令「sth (SP),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (SP), Rb2”. 命令「sth (Ra+)I9,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (Ra +) I9, Rb”. 命令「sth (Ra+),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (Ra +), Rb”. 命令「sth (Ra2+),Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sth (Ra2 +), Rb2”. 命令「sth (Ra),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sth (Ra), Rb”. 命令「sth (Ra2),Rb2」を説明する図である。FIG. 5 is a diagram for explaining an instruction “sth (Ra2), Rb2”. 命令「stb (Ra,D8),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stb (Ra, D8), Rb”. 命令「stb (GP,D11),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stb (GP, D11), Rb”. 命令「stb (SP,D11),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “stb (SP, D11), Rb”. 命令「stb (Ra+)I8,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stb (Ra +) I8, Rb”. 命令「stb (Ra+),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stb (Ra +), Rb”. 命令「stb (Ra),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “stb (Ra), Rb”. 命令「stp (Ra,D11),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra, D11), Rb: Rb + 1”. 命令「stp (GP,D14),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (GP, D14), Rb: Rb + 1”. 命令「stp (SP,D14),Rb:Rb+1」を説明する図である。FIG. 10 is a diagram for explaining an instruction “stp (SP, D14), Rb: Rb + 1”. 命令「stp (SP,D7),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP, D7), Rb: Rb + 1”. 命令「stp (SP),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP), Rb: Rb + 1”. 命令「stp (Ra+)I11,Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra +) I11, Rb: Rb + 1”. 命令「stp (Ra+),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra +), Rb: Rb + 1”. 命令「stp (Ra2+),Rb2:Rb2+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra2 +), Rb2: Rb2 + 1”. 命令「stp (Ra),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram illustrating an instruction “stp (Ra), Rb: Rb + 1”. 命令「stp (Ra,D11),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra, D11), LR: SVR”. 命令「stp (Ra),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra), LR: SVR”. 命令「stp (GP,D14),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (GP, D14), LR: SVR”. 命令「stp (SP,D14),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP, D14), LR: SVR”. 命令「stp (SP,D7),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP, D7), LR: SVR”. 命令「stp (SP),LR:SVR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP), LR: SVR”. 命令「stp (Ra,D11),TAR:UDR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (Ra, D11), TAR: UDR”. 命令「stp (GP,D14),TAR:UDR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (GP, D14), TAR: UDR”. 命令「stp (SP,D14),TAR:UDR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stp (SP, D14), TAR: UDR”. 命令「sthp (Ra,D10),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sthp (Ra, D10), Rb: Rb + 1”. 命令「sthp (Ra+)I10,Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sthp (Ra +) I10, Rb: Rb + 1”. 命令「sthp (Ra+),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sthp (Ra +), Rb: Rb + 1”. 命令「sthp (Ra2+),Rb2:Rb2+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sthp (Ra2 +), Rb2: Rb2 + 1”. 命令「sthp (Ra),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sthp (Ra), Rb: Rb + 1”. 命令「stbp (Ra,D9),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbp (Ra, D9), Rb: Rb + 1”. 命令「stbp (Ra+)I9,Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbp (Ra +) I9, Rb: Rb + 1”. 命令「stbp (Ra+),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbp (Ra +), Rb: Rb + 1”. 命令「stbp (Ra),Rb:Rb+1」を説明する図である。FIG. 4 is a diagram for explaining an instruction “stbp (Ra), Rb: Rb + 1”. 命令「stbh (Ra+)I7,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbh (Ra +) I7, Rb”. 命令「stbh (Ra+),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “stbh (Ra +), Rb”. 命令「stbh (Ra),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbh (Ra), Rb”. 命令「stbhp (Ra+)I7,Rb:Rb+1」を説明する図である。FIG. 6 is a diagram illustrating an instruction “stbhp (Ra +) I7, Rb: Rb + 1”. 命令「stbhp (Ra+),Rb:Rb+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “stbhp (Ra +), Rb: Rb + 1”. 命令「stbhp (Ra),Rb:Rb+1」を説明する図である。FIG. 5 is a diagram illustrating an instruction “stbhp (Ra), Rb: Rb + 1”. 命令「dpref (Ra,D8)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “dpref (Ra, D8)”. 命令「ldstb Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ldstb Rb, (Ra)”. 命令「rd C0:C1,Rb,(D11)」を説明する図である。FIG. 10 is a diagram for explaining an instruction “rd C0: C1, Rb, (D11)”. 命令「rd C0:C1,Rb,(Ra,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rd C0: C1, Rb, (Ra, D5)”. 命令「rd C0:C1,Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rd C0: C1, Rb, (Ra)”. 命令「rd C0:C1,Rb2,(Ra2)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rd C0: C1, Rb2, (Ra2)”. 命令「rd C2:C3,Rb,(Ra,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rd C2: C3, Rb, (Ra, D5)”. 命令「rd C2:C3,Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rd C2: C3, Rb, (Ra)”. 命令「rde C0:C1,Rb,(Ra,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rde C0: C1, Rb, (Ra, D5)”. 命令「rde C0:C1,Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rde C0: C1, Rb, (Ra)”. 命令「rde C2:C3,Rb,(Ra,D5)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rde C2: C3, Rb, (Ra, D5)”. 命令「rde C2:C3,Rb,(Ra)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rde C2: C3, Rb, (Ra)”. 命令「wt C0:C1,(D11),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “wt C0: C1, (D11), Rb”. 命令「wt C0:C1,(Ra,D5),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wt C0: C1, (Ra, D5), Rb”. 命令「wt C0:C1,(Ra),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wt C0: C1, (Ra), Rb”. 命令「wt C0:C1,(Ra2),Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “wt C0: C1, (Ra2), Rb2”. 命令「wt C2:C3,(Ra,D5),Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “wt C2: C3, (Ra, D5), Rb”. 命令「wt C2:C3,(Ra),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wt C2: C3, (Ra), Rb”. 命令「wte C0:C1,(Ra,D5),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wte C0: C1, (Ra, D5), Rb”. 命令「wte C0:C1,(Ra),Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “wte C0: C1, (Ra), Rb”. 命令「wte C2:C3,(Ra,D5),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wte C2: C3, (Ra, D5), Rb”. 命令「wte C2:C3,(Ra),Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “wte C2: C3, (Ra), Rb”. 命令「br D20」を説明する図である。It is a figure explaining instruction “br D20”. 命令「br D9」を説明する図である。FIG. 11 is a diagram for explaining an instruction “br D9”. 命令「brl D20」を説明する図である。FIG. 10 is a diagram for explaining an instruction “brl D20”. 命令「call D20」を説明する図である。FIG. 10 is a diagram for explaining an instruction “call D20”. 命令「brl D9」を説明する図である。FIG. 11 is a diagram for explaining an instruction “brl D9”. 命令「call D9」を説明する図である。FIG. 10 is a diagram for explaining an instruction “call D9”. 命令「jmp LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmp LR”. 命令「jmp TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmp TAR”. 命令「jmpl LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpl LR”. 命令「call LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “call LR”. 命令「jmpl TAR」を説明する図である。FIG. 6 is a diagram illustrating an instruction “jmpl TAR”. 命令「call TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “call TAR”. 命令「jmpr LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpr LR”. 命令「ret」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ret”. 命令「jmpf LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpf LR”. 命令「jmpf C6,C2:C4,TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpf C6, C2: C4, TAR”. 命令「jmpf Cm,TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpf Cm, TAR”. 命令「jmpf TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jmpf TAR”. 命令「jloop C6,TAR,Ra,I8」を説明する図である。FIG. 6 is a diagram illustrating an instruction “jloop C6, TAR, Ra, I8”. 命令「jloop C6,TAR,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, TAR, Ra”. 命令「jloop C6,TAR,Ra2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, TAR, Ra2”. 命令「jloop C6,TAR,Ra2,-1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, TAR, Ra2, −1”. 命令「jloop C6,Cm,TAR,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, Cm, TAR, Ra, I8”. 命令「jloop C6,Cm,TAR,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, Cm, TAR, Ra”. 命令「jloop C6,Cm,TAR,Ra2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “jloop C6, Cm, TAR, Ra2”. 命令「jloop C6,Cm,TAR,Ra2,-1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, Cm, TAR, Ra2, −1”. 命令「jloop C6,C2:C4,TAR,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, C2: C4, TAR, Ra, I8”. 命令「jloop C6,C2:C4,TAR,Ra」を説明する図である。It is a figure explaining the instruction “jloop C6, C2: C4, TAR, Ra”. 命令「jloop C6,C2:C4,TAR,Ra2」を説明する図である。FIG. 10 is a diagram for explaining an instruction “jloop C6, C2: C4, TAR, Ra2”. 命令「jloop C6,C2:C4,TAR,Ra2,-1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C6, C2: C4, TAR, Ra2, −1”. 命令「jloop C5,LR,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C5, LR, Ra, I8”. 命令「jloop C5,LR,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “jloop C5, LR, Ra”. 命令「settar D9」を説明する図である。It is a figure explaining instruction “settar D9”. 命令「settar C6,Cm,D9」を説明する図である。It is a figure explaining instruction “settar C6, Cm, D9”. 命令「settar C6,D9」を説明する図である。FIG. 7 is a diagram for explaining an instruction “settar C6, D9”. 命令「settar C6,C2:C4,D9」を説明する図である。FIG. 6 is a diagram illustrating an instruction “settar C6, C2: C4, D9”. 命令「settar C6,C4,D9」を説明する図である。FIG. 6 is a diagram for explaining an instruction “settar C6, C4, D9”. 命令「setlr D9」を説明する図である。FIG. 10 is a diagram for explaining an instruction “setlr D9”. 命令「setlr C5,D9」を説明する図である。FIG. 7 is a diagram for explaining an instruction “setlr C5, D9”. 命令「setbb TAR」を説明する図である。FIG. 10 is a diagram for explaining an instruction “setbb TAR”. 命令「setbb LR」を説明する図である。FIG. 10 is a diagram for explaining an instruction “setbb LR”. 命令「intd」を説明する図である。FIG. 10 is a diagram for explaining an instruction “intd”. 命令「inte」を説明する図である。FIG. 10 is a diagram for explaining an instruction “inte”. 命令「vmpswd」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmpswd”. 命令「vmpswe」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vmpswe”. 命令「vmpsleep」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmpsleep”. 命令「vmpwait」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmpwait”. 命令「vmpsus」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vmpsus”. 命令「rti」を説明する図である。FIG. 10 is a diagram for explaining an instruction “rti”. 命令「piNl(pi0l,pi1l,pi2l,pi3l,pi4l,pi5l,pi6l,pi7l)」を説明する図である。FIG. 6 is a diagram illustrating an instruction “piNl (pi0l, pi1l, pi2l, pi3l, pi4l, pi5l, pi6l, pi7l)”. 命令「piN(pi0,pi1,pi2,pi3,pi4,pi5,pi6,pi7)」を説明する図である。FIG. 7 is a diagram for explaining an instruction “piN (pi0, pi1, pi2, pi3, pi4, pi5, pi6, pi7)”. 命令「scN(sc0,sc1,sc2,sc3,sc4,sc5,sc6,sc7)」を説明する図である。FIG. 6 is a diagram for explaining an instruction “scN (sc0, sc1, sc2, sc3, sc4, sc5, sc6, sc7)”. 命令「add Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “add Rc, Ra, Rb”. 命令「add Rc3,Ra3,Rb3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “add Rc3, Ra3, Rb3”. 命令「add Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “add Ra2, Rb2”. 命令「add Rb,Ra,I12」を説明する図である。FIG. 6 is a diagram illustrating an instruction “add Rb, Ra, I12”. 命令「add Ra2,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “add Ra2, I5”. 命令「add SP,I19」を説明する図である。FIG. 11 is a diagram for explaining an instruction “add SP, I19”. 命令「add SP,I11」を説明する図である。FIG. 10 is a diagram for explaining an instruction “add SP, I11”. 命令「addu Rb,GP,I13」を説明する図である。FIG. 6 is a diagram illustrating an instruction “addu Rb, GP, I13”. 命令「addu Rb,SP,I13」を説明する図である。FIG. 10 is a diagram for explaining an instruction “addu Rb, SP, I13”. 命令「addu Ra3,SP,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addu Ra3, SP, I6”. 命令「addvw Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addvw Rc, Ra, Rb”. 命令「addvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addvh Rc, Ra, Rb”. 命令「addc Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addc Rc, Ra, Rb”. 命令「adds Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “adds Rc, Ra, Rb”. 命令「addsr Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addsr Rc, Ra, Rb”. 命令「s1add Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “s1add Rc, Ra, Rb”. 命令「s1add Rc3,Ra3,Rb3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “s1add Rc3, Ra3, Rb3”. 命令「s2add Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “s2add Rc, Ra, Rb”. 命令「s2add Rc3,Ra3,Rb3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “s2add Rc3, Ra3, Rb3”. 命令「addmsk Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addmsk Rc, Ra, Rb”. 命令「addarvw Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “addarvw Rc, Ra, Rb”. 命令「sub Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sub Rc, Rb, Ra”. 命令「sub Rc3,Rb3,Ra3」を説明する図である。FIG. 6 is a diagram illustrating an instruction “sub Rc3, Rb3, Ra3”. 命令「sub Rb2,Ra2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sub Rb2, Ra2”. 命令「sub Rb,Ra,I12」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sub Rb, Ra, I12”. 命令「sub Ra2,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sub Ra2, I5”. 命令「sub SP,I19」を説明する図である。FIG. 10 is a diagram for explaining an instruction “sub SP, I19”. 命令「sub SP,I11」を説明する図である。FIG. 10 is a diagram for explaining an instruction “sub SP, I11”. 命令「subc Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “subc Rc, Rb, Ra”. 命令「subvw Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “subvw Rc, Rb, Ra”. 命令「subvh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “subvh Rc, Rb, Ra”. 命令「subs Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “subs Rc, Rb, Ra”. 命令「submsk Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “submsk Rc, Rb, Ra”. 命令「rsub Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rsub Rb, Ra, I8”. 命令「rsub Ra2,I4」を説明する図である。FIG. 6 is a diagram illustrating an instruction “rsub Ra2, I4”. 命令「rsub Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rsub Ra2, Rb2”. 命令「neg Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “neg Rb, Ra”. 命令「neg Ra2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “neg Ra2”. 命令「negvh Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “negvh Rb, Ra”. 命令「negvw Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “negvw Rb, Ra”. 命令「abs Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “abs Rb, Ra”. 命令「absvw Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “absvw Rb, Ra”. 命令「absvh Rb,Ra」を説明する図である。FIG. 5 is a diagram for explaining an instruction “absvh Rb, Ra”. 命令「max Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “max Rc, Ra, Rb”. 命令「min Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “min Rc, Ra, Rb”. 命令「and Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “and Rc, Ra, Rb”. 命令「and Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “and Ra2, Rb2”. 命令「and Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “and Rb, Ra, I8”. 命令「andn Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “andn Rc, Ra, Rb”. 命令「andn Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “andn Ra2, Rb2”. 命令「andn Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “andn Rb, Ra, I8”. 命令「or Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “or Rc, Ra, Rb”. 命令「or Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “or Ra2, Rb2”. 命令「or Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “or Rb, Ra, I8”. 命令「xor Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “xor Rc, Ra, Rb”. 命令「xor Ra2,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “xor Ra2, Rb2”. 命令「xor Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “xor Rb, Ra, I8”. 命令「not Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “not Rb, Ra”. 命令「not Ra2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “not Ra2”. 命令「cmpCC Cm,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCC Cm, Ra, Rb”. 命令「cmpCC C6,Ra2,Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “cmpCC C6, Ra2, Rb2”. 命令「cmpCC」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCC”. 命令「cmpCC C6,Ra2,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCC C6, Ra2, I4”. 命令「cmpCC」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCC”. 命令「cmpCC」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCC”. 命令「cmpCCn Cm,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCn Cm, Ra, Rb, Cn”. 命令「cmpCCn Cm,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCn Cm, Ra, I5, Cn”. 命令「cmpCCn Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCn Cm: Cm + 1, Ra, Rb, Cn”. 命令「cmpCCn Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCn Cm: Cm + 1, Ra, I5, Cn”. 命令「cmpCCa Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCa Cm: Cm + 1, Ra, Rb, Cn”. 命令「cmpCCa Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCa Cm: Cm + 1, Ra, I5, Cn”. 命令「cmpCCo Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCo Cm: Cm + 1, Ra, Rb, Cn”. 命令「cmpCCo Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “cmpCCo Cm: Cm + 1, Ra, I5, Cn”. 命令「tstz Cm,Ra,Rb」を説明する図である。FIG. 5 is a diagram for explaining an instruction “tstz Cm, Ra, Rb”. 命令「tstz C6,Ra2,Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “tstz C6, Ra2, Rb2”. 命令「tstz Cm,Ra,I5」を説明する図である。It is a figure explaining instruction “tstz Cm, Ra, I5”. 命令「tstz C6,Ra2,I4」を説明する図である。FIG. 6 is a diagram illustrating an instruction “tstz C6, Ra2, I4”. 命令「tstz Cm:Cm+1,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstz Cm: Cm + 1, Ra, Rb”. 命令「tstz Cm:Cm+1,Ra,I5」を説明する図である。FIG. 10 is a diagram for explaining an instruction “tstz Cm: Cm + 1, Ra, I5”. 命令「tstzn Cm,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzn Cm, Ra, Rb, Cn”. 命令「tstzn Cm,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzn Cm, Ra, I5, Cn”. 命令「tstzn Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzn Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstzn Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzn Cm: Cm + 1, Ra, I5, Cn”. 命令「tstza Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstza Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstza Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 5 is a diagram for explaining an instruction “tstza Cm: Cm + 1, Ra, I5, Cn”. 命令「tstzo Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzo Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstzo Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstzo Cm: Cm + 1, Ra, I5, Cn”. 命令「tstn Cm,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstn Cm, Ra, Rb”. 命令「tstn C6,Ra2,Rb2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “tstn C6, Ra2, Rb2”. 命令「tstn Cm,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstn Cm, Ra, I5”. 命令「tstn C6,Ra2,I4」を説明する図である。FIG. 6 is a diagram illustrating an instruction “tstn C6, Ra2, I4”. 命令「tstn Cm:Cm+1,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstn Cm: Cm + 1, Ra, Rb”. 命令「tstn Cm:Cm+1,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstn Cm: Cm + 1, Ra, I5”. 命令「tstnn Cm,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstnn Cm, Ra, Rb, Cn”. 命令「tstnn Cm,Ra,I5,Cn」を説明する図である。FIG. 10 is a diagram for explaining an instruction “tstnn Cm, Ra, I5, Cn”. 命令「tstnn Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstnn Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstnn Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstnn Cm: Cm + 1, Ra, I5, Cn”. 命令「tstna Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstna Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstna Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstna Cm: Cm + 1, Ra, I5, Cn”. 命令「tstno Cm:Cm+1,Ra,Rb,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstno Cm: Cm + 1, Ra, Rb, Cn”. 命令「tstno Cm:Cm+1,Ra,I5,Cn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “tstno Cm: Cm + 1, Ra, I5, Cn”. 命令「mov Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, Ra”. 命令「mov Ra2,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Ra2, Rb”. 命令「mov Ra,I16」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Ra, I16”. 命令「mov Ra2,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Ra2, I8”. 命令「mov Rb,TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, TAR”. 命令「mov Rb2,TAR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb2, TAR”. 命令「mov Rb,LR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, LR”. 命令「mov Rb2,LR」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov Rb2, LR”. 命令「mov Rb,SVR」を説明する図である。It is a figure explaining instruction “mov Rb, SVR”. 命令「mov Rb,PSR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, PSR”. 命令「mov Rb,CFR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, CFR”. 命令「mov Rb,MH0」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, MH0”. 命令「mov Rb2,MH0」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb2, MH0”. 命令「mov Rb,MH1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, MH1”. 命令「mov Rb2,MH1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb2, MH1”. 命令「mov Rb,ML0」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, ML0”. 命令「mov Rb,ML1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, ML1”. 命令「mov Rb,IPC」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov Rb, IPC”. 命令「mov Rb,IPSR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, IPSR”. 命令「mov Rb,PC」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, PC”. 命令「mov Rb,EPC」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, EPC”. 命令「mov Rb,EPSR」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, EPSR”. 命令「mov Rb,PSR0」を説明する図である。FIG. 6 is a diagram illustrating an instruction “mov Rb, PSR0”. 命令「mov Rb,PSR1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, PSR1”. 命令「mov Rb,PSR2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “mov Rb, PSR2”. 命令「mov Rb,PSR3」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov Rb, PSR3”. 命令「mov Rb,CFR0」を説明する図である。FIG. 10 is a diagram illustrating an instruction “mov Rb, CFR0”. 命令「mov Rb,CFR1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov Rb, CFR1”. 命令「mov Rb,CFR2」を説明する図である。FIG. 6 is a diagram illustrating an instruction “mov Rb, CFR2”. 命令「mov Rb,CFR3」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov Rb, CFR3”. 命令「mov LR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov LR, Rb”. 命令「mov LR,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov LR, Rb2”. 命令「mov TAR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov TAR, Rb”. 命令「mov TAR,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov TAR, Rb2”. 命令「mov SVR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov SVR, Rb”. 命令「mov PSR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov PSR, Rb”. 命令「mov CFR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov CFR, Rb”. 命令「mov MH0,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov MH0, Rb”. 命令「mov MH0,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov MH0, Rb2”. 命令「mov MH1,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov MH1, Rb”. 命令「mov MH1,Rb2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov MH1, Rb2”. 命令「mov ML0,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov ML0, Rb”. 命令「mov ML1,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov ML1, Rb”. 命令「mov IPC,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov IPC, Rb”. 命令「mov IPSR,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov IPSR, Rb”. 命令「mov EPC,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov EPC, Rb”. 命令「mov EPSR,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov EPSR, Rb”. 命令「mov PSR0,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov PSR0, Rb”. 命令「mov PSR1,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov PSR1, Rb”. 命令「mov PSR2,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov PSR2, Rb”. 命令「mov PSR3,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov PSR3, Rb”. 命令「mov CFR0,Rb」を説明する図である。FIG. 10 is a diagram illustrating an instruction “mov CFR0, Rb”. 命令「mov CFR1,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov CFR1, Rb”. 命令「mov CFR2,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mov CFR2, Rb”. 命令「mov CFR3,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mov CFR3, Rb”. 命令「mvclovs Cm:Cm+1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mvclovs Cm: Cm + 1”. 命令「movcf Ci,Cj,Cm,Cn」を説明する図である。It is a figure explaining the instruction “movcf Ci, Cj, Cm, Cn”. 命令「mvclcas Cm:Cm+1」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mvclcas Cm: Cm + 1”. 命令「sethi Ra,I16」を説明する図である。FIG. 11 is a diagram for explaining an instruction “sethi Ra, I16”. 命令「setlo Ra,I16」を説明する図である。FIG. 6 is a diagram for explaining an instruction “setlo Ra, I16”. 命令「vcchk」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcchk”. 命令「nop」を説明する図である。FIG. 10 is a diagram for explaining an instruction “nop”. 命令「asl Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asl Rc, Ra, Rb”. 命令「asl Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asl Rb, Ra, I5”. 命令「asl Ra2,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asl Ra2, I4”. 命令「aslvw Rc,Ra,Rb」を説明する図である。FIG. 7 is a diagram for explaining an instruction “aslvw Rc, Ra, Rb”. 命令「aslvw Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslvw Rb, Ra, I5”. 命令「aslvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslvh Rc, Ra, Rb”. 命令「aslvh Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslvh Rb, Ra, I5”. 命令「asr Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asr Rc, Ra, Rb”. 命令「asr Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asr Rb, Ra, I5”. 命令「asr Ra2,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asr Ra2, I4”. 命令「asrvw Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “asrvw Rc, Ra, Rb”. 命令「asrvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asrvh Rc, Ra, Rb”. 命令「lsl Rc,Ra,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “lsl Rc, Ra, Rb”. 命令「lsl Rc,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsl Rc, Ra, I5”. 命令「lsl Ra2,I4」を説明する図である。FIG. 6 is a diagram illustrating an instruction “lsl Ra2, I4”. 命令「lsr Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsr Rc, Ra, Rb”. 命令「lsr Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsr Rb, Ra, I5”. 命令「rol Rc,Ra,Rb」を説明する図である。It is a figure explaining command "rol Rc, Ra, Rb". 命令「rol Rb,Ra,I5」を説明する図である。It is a figure explaining command "rol Rb, Ra, I5". 命令「ror Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “ror Rb, Ra, I5”. 命令「aslp Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslp Mm, Ra, Mn, Rb”. 命令「aslp Mm,Rb,Mn,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslp Mm, Rb, Mn, I6”. 命令「aslp Mm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslp Mm, Rc, MHn, Ra, Rb”. 命令「aslp Mm,Rb,MHn,Ra,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslp Mm, Rb, MHn, Ra, I6”. 命令「aslpvw Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslpvw Mm, Ra, Mn, Rb”. 命令「aslpvw Mm,Rb,Mn,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “aslpvw Mm, Rb, Mn, I6”. 命令「asrp Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asrp Mm, Ra, Mn, Rb”. 命令「asrp Mm,Rb,Mn,I6」を説明する図である。FIG. 6 is a diagram illustrating an instruction “asrp Mm, Rb, Mn, I6”. 命令「asrp Mm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asrp Mm, Rc, MHn, Ra, Rb”. 命令「asrp Mm,Rb,MHn,Ra,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “asrp Mm, Rb, MHn, Ra, I6”. 命令「asrpvw Mm,Ra,Mn,Rb」を説明する図である。FIG. 5 is a diagram for explaining an instruction “asrpvw Mm, Ra, Mn, Rb”. 命令「lslp Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lslp Mm, Ra, Mn, Rb”. 命令「lslp Mm,Rb,Mn,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lslp Mm, Rb, Mn, I6”. 命令「lslp Mm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lslp Mm, Rc, MHn, Ra, Rb”. 命令「lslp Mm,Rb,MHn,Ra,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lslp Mm, Rb, MHn, Ra, I6”. 命令「lsrp Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsrp Mm, Ra, Mn, Rb”. 命令「lsrp Mm,Rb,Mn,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsrp Mm, Rb, Mn, I6”. 命令「lsrp Mm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsrp Mm, Rc, MHn, Ra, Rb”. 命令「lsrp Mm,Rb,MHn,Ra,I6」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lsrp Mm, Rb, MHn, Ra, I6”. 命令「extr Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “extr Rc, Ra, Rb”. 命令「extr Rb,Ra,Ib5,Ia5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “extr Rb, Ra, Ib5, Ia5”. 命令「ext Rb,Ra,I5」を説明する図である。FIG. 10 is a diagram for explaining an instruction “ext Rb, Ra, I5”. 命令「exth Ra2」を説明する図である。FIG. 10 is a diagram for explaining an instruction “exth Ra2”. 命令「extb Ra2」を説明する図である。FIG. 6 is a diagram for explaining an instruction “extb Ra2”. 命令「extru Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “extru Rc, Ra, Rb”. 命令「extru Rb,Ra,Ib5,Ia5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “extru Rb, Ra, Ib5, Ia5”. 命令「extu Rb,Ra,I5」を説明する図である。FIG. 6 is a diagram illustrating an instruction “extu Rb, Ra, I5”. 命令「exthu Ra2」を説明する図である。It is a figure explaining instruction "exthu Ra2". 命令「extbu Ra2」を説明する図である。It is a figure explaining instruction “extbu Ra2”. 命令「mskgen Rc,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mskgen Rc, Rb”. 命令「mskgen Rb,Ib5,Ia5」を説明する図である。FIG. 6 is a diagram illustrating an instruction “mskgen Rb, Ib5, Ia5”. 命令「msk Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “msk Rc, Ra, Rb”. 命令「msk Rb,Ra,Ib5,Ia5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “msk Rb, Ra, Ib5, Ia5”. 命令「satw Mm,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “satw Mm, Rb, Mn”. 命令「sath Rb,Ra」を説明する図である。FIG. 10 is a diagram for explaining an instruction “sath Rb, Ra”. 命令「sat12 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sat12 Rb, Ra”. 命令「sat9 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “sat9 Rb, Ra”. 命令「satb Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “satb Rb, Ra”. 命令「satbu Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “satbu Rb, Ra”. 命令「extw Mm,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “extw Mm, Rb, Ra”. 命令「vintllh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vintllh Rc, Ra, Rb”. 命令「vintlhh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vintlhh Rc, Ra, Rb”. 命令「vintllb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vintllb Rc, Ra, Rb”. 命令「vintlhb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vintlhb Rc, Ra, Rb”. 命令「valn Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “valn Rc, Ra, Rb”. 命令「valn1 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valn1 Rc, Ra, Rb”. 命令「valn2 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valn2 Rc, Ra, Rb”. 命令「valn3 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valn3 Rc, Ra, Rb”. 命令「valnvc1 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valnvc1 Rc, Ra, Rb”. 命令「valnvc2 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valnvc2 Rc, Ra, Rb”. 命令「valnvc3 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valnvc3 Rc, Ra, Rb”. 命令「valnvc4 Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “valnvc4 Rc, Ra, Rb”. 命令「vxchngh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxchngh Rb, Ra”. 命令「byterev Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “byterev Rb, Ra”. 命令「vstovb Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vstovb Rb, Ra”. 命令「vstovh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vstovh Rb, Ra”. 命令「vlunpkh Rb:Rb+1,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlunpkh Rb: Rb + 1, Ra”. 命令「vlunpkhu Rb:Rb+1,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlunpkhu Rb: Rb + 1, Ra”. 命令「vlunpkb Rb:Rb+1,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlunpkb Rb: Rb + 1, Ra”. 命令「vlunpkbu Rb:Rb+1,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlunpkbu Rb: Rb + 1, Ra”. 命令「vhunpkh Rb:Rb+1,Ra」を説明する図である。It is a figure explaining instruction “vhunpkh Rb: Rb + 1, Ra”. 命令「vhunpkb Rb:Rb+1,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhunpkb Rb: Rb + 1, Ra”. 命令「vunpk1 Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vunpk1 Rb, Mn”. 命令「vunpk2 Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vunpk2 Rb, Mn”. 命令「vlpkh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlpkh Rc, Rb, Ra”. 命令「vlpkhu Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlpkhu Rc, Rb, Ra”. 命令「vlpkb Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlpkb Rc, Rb, Ra”. 命令「vlpkbu Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlpkbu Rc, Rb, Ra”. 命令「vhpkh Rc,Ra,Rb」を説明する図である。It is a figure explaining instruction “vhpkh Rc, Ra, Rb”. 命令「vhpkb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhpkb Rc, Ra, Rb”. 命令「vexth Mm,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vexth Mm, Rb, Ra”. 命令「bseq0 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “bseq0 Rb, Ra”. 命令「bseq1 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “bseq1 Rb, Ra”. 命令「bseq Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “bseq Rb, Ra”. 命令「bcnt1 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “bcnt1 Rb, Ra”. 命令「rndvh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “rndvh Rb, Ra”. 命令「mskbrvb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mskbrvb Rc, Ra, Rb”. 命令「mskbrvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mskbrvh Rc, Ra, Rb”. 命令「movp Rc:Rc+1,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “movp Rc: Rc + 1, Ra, Rb”. 命令「hmul Mm,Rc,Ra,Rb」を説明する図である。It is a figure explaining the instruction “hmul Mm, Rc, Ra, Rb”. 命令「lmul Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “lmul Mm, Rc, Ra, Rb”. 命令「fmulhh Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmulhh Mm, Rc, Ra, Rb”. 命令「fmulhhr Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmulhhr Mm, Rc, Ra, Rb”. 命令「fmulhw Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “fmulhw Mm, Rc, Ra, Rb”. 命令「fmulhww Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmulhww Mm, Rc, Ra, Rb”. 命令「mul Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mul Mm, Rc, Ra, Rb”. 命令「mul Mm,Rb,Ra,I8」を説明する図である。FIG. 7 is a diagram for explaining an instruction “mul Mm, Rb, Ra, I8”. 命令「mulu Mm,Rc,Ra,Rb」を説明する図である。It is a figure explaining instruction "mulu Mm, Rc, Ra, Rb". 命令「mulu Mm,Rb,Ra,I8」を説明する図である。FIG. 10 is a diagram for explaining an instruction “mulu Mm, Rb, Ra, I8”. 命令「fmulww Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmulww Mm, Rc, Ra, Rb”. 命令「hmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “hmac Mm, Rc, Ra, Rb, Mn”. 命令「hmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “hmac M0, Rc, Ra, Rb, Rx”. 命令「lmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lmac Mm, Rc, Ra, Rb, Mn”. 命令「lmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “lmac M0, Rc, Ra, Rb, Rx”. 命令「fmachh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachh Mm, Rc, Ra, Rb, Mn”. 命令「fmachh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachh M0, Rc, Ra, Rb, Rx”. 命令「fmachhr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachhr Mm, Rc, Ra, Rb, Mn”. 命令「fmachhr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachhr M0, Rc, Ra, Rb, Rx”. 命令「fmachw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachw Mm, Rc, Ra, Rb, Mn”. 命令「fmachw M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachw M0, Rc, Ra, Rb, Rx”. 命令「fmachww Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachww Mm, Rc, Ra, Rb, Mn”. 命令「fmachww M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmachww M0, Rc, Ra, Rb, Rx”. 命令「mac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mac Mm, Rc, Ra, Rb, Mn”. 命令「mac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “mac M0, Rc, Ra, Rb, Rx”. 命令「fmacww Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmacww Mm, Rc, Ra, Rb, Mn”. 命令「fmacww M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmacww M0, Rc, Ra, Rb, Rx”. 命令「hmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “hmsu Mm, Rc, Ra, Rb, Mn”. 命令「hmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram illustrating an instruction “hmsu M0, Rc, Ra, Rb, Rx”. 命令「lmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “lmsu Mm, Rc, Ra, Rb, Mn”. 命令「lmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 4 is a diagram for explaining an instruction “lmsu M0, Rc, Ra, Rb, Rx”. 命令「fmsuhh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhh Mm, Rc, Ra, Rb, Mn”. 命令「fmsuhh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhh M0, Rc, Ra, Rb, Rx”. 命令「fmsuhhr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhhr Mm, Rc, Ra, Rb, Mn”. 命令「fmsuhhr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhhr M0, Rc, Ra, Rb, Rx”. 命令「fmsuhw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhw Mm, Rc, Ra, Rb, Mn”. 命令「fmsuhw M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhw M0, Rc, Ra, Rb, Rx”. 命令「fmsuhww Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhww Mm, Rc, Ra, Rb, Mn”. 命令「fmsuhww M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuhww M0, Rc, Ra, Rb, Rx”. 命令「msu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “msu Mm, Rc, Ra, Rb, Mn”. 命令「msu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “msu M0, Rc, Ra, Rb, Rx”. 命令「fmsuww Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuww Mm, Rc, Ra, Rb, Mn”. 命令「fmsuww M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “fmsuww M0, Rc, Ra, Rb, Rx”. 命令「div MHm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “div MHm, Rc, MHn, Ra, Rb”. 命令「divu MHm,Rc,MHn,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “divu MHm, Rc, MHn, Ra, Rb”. 命令「dbgm0」を説明する図である。FIG. 6 is a diagram for explaining an instruction “dbgm0”. 命令「dbgm1」を説明する図である。FIG. 6 is a diagram for explaining an instruction “dbgm1”. 命令「dbgm2 I15」を説明する図である。FIG. 10 is a diagram for explaining an instruction “dbgm2 I15”. 命令「dbgm3 I15」を説明する図である。It is a figure explaining instruction "dbgm3 I15". 命令「vaddh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddh Rc, Ra, Rb”. 命令「vxaddh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxaddh Rc, Ra, Rb”. 命令「vhaddh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhaddh Rc, Ra, Rb”. 命令「vladdh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vladdh Rc, Ra, Rb”. 命令「vaddhvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddhvh Rc, Ra, Rb”. 命令「vxaddhvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxaddhvh Rc, Ra, Rb”. 命令「vhaddhvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhaddhvh Rc, Ra, Rb”. 命令「vladdhvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vladdhvh Rc, Ra, Rb”. 命令「vsaddh Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsaddh Rb, Ra, I8”. 命令「vaddsh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddsh Rc, Ra, Rb”. 命令「vaddsrh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vaddsrh Rc, Ra, Rb”. 命令「vaddhvc Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddhvc Rc, Ra, Rb”. 命令「vaddrhvc Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddrhvc Rc, Ra, Rb”. 命令「vaddb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddb Rc, Ra, Rb”. 命令「vsaddb Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsaddb Rb, Ra, I8”. 命令「vaddsb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vaddsb Rc, Ra, Rb”. 命令「vaddsrb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaddsrb Rc, Ra, Rb”. 命令「vsubh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsubh Rc, Rb, Ra”. 命令「vxsubh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxsubh Rc, Rb, Ra”. 命令「vhsubh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vhsubh Rc, Rb, Ra”. 命令「vlsubh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlsubh Rc, Rb, Ra”. 命令「vsubhvh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsubhvh Rc, Rb, Ra”. 命令「vxsubhvh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vxsubhvh Rc, Rb, Ra”. 命令「vhsubhvh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhsubhvh Rc, Rb, Ra”. 命令「vlsubhvh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlsubhvh Rc, Rb, Ra”. 命令「vssubh Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vssubh Rb, Ra, I8”. 命令「vsubb Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsubb Rc, Rb, Ra”. 命令「vssubb Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vssubb Rb, Ra, I8”. 命令「vsubsh Rc,Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsubsh Rc, Rb, Ra”. 命令「vsrsubh Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsrsubh Rb, Ra, I8”. 命令「vsrsubb Rb,Ra,I8」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsrsubb Rb, Ra, I8”. 命令「vsumh Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsumh Rb, Ra”. 命令「vsumh2 Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsumh2 Rb, Ra”. 命令「vsumrh2 Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsumrh2 Rb, Ra”. 命令「vnegh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vnegh Rb, Ra”. 命令「vneghvh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vneghvh Rb, Ra”. 命令「vnegb Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vnegb Rb, Ra”. 命令「vabshvh Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vabshvh Rb, Ra”. 命令「vasubb Rc,Rb,Ra」を説明する図である。It is a figure explaining instruction "vasubb Rc, Rb, Ra". 命令「vsgnh Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsgnh Rb, Ra”. 命令「vmaxh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmaxh Rc, Ra, Rb”. 命令「vmaxb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmaxb Rc, Ra, Rb”. 命令「vminh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vminh Rc, Ra, Rb”. 命令「vminb Rc,Ra,Rb」を説明する図である。FIG. 5 is a diagram for explaining an instruction “vminb Rc, Ra, Rb”. 命令「vsel Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsel Rc, Ra, Rb”. 命令「vmovt Rb,Ra」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vmovt Rb, Ra”. 命令「vscmpeqb Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpeqb Ra, I5”. 命令「vscmpneb Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpneb Ra, I5”. 命令「vscmpgtb Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpgtb Ra, I5”. 命令「vscmpleb Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpleb Ra, I5”. 命令「vscmpgeb Ra,I5」を説明する図である。It is a figure explaining instruction “vscmpgeb Ra, I5”. 命令「vscmpltb Ra,I5」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vscmpltb Ra, I5”. 命令「vscmpeqh Ra,I5」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vscmpeqh Ra, I5”. 命令「vscmpneh Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpneh Ra, I5”. 命令「vscmpgth Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmpgth Ra, I5”. 命令「vscmpleh Ra,I5」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vscmpleh Ra, I5”. 命令「vscmpgeh Ra,I5」を説明する図である。It is a figure explaining instruction “vscmpgeh Ra, I5”. 命令「vscmplth Ra,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vscmplth Ra, I5”. 命令「vcmpeqh Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpeqh Ra, Rb”. 命令「vcmpneh Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpneh Ra, Rb”. 命令「vcmpgth Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpgth Ra, Rb”. 命令「vcmpleh Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpleh Ra, Rb”. 命令「vcmpgeh Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpgeh Ra, Rb”. 命令「vcmplth Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmplth Ra, Rb”. 命令「vcmpeqb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpeqb Ra, Rb”. 命令「vcmpneb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpneb Ra, Rb”. 命令「vcmpgtb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpgtb Ra, Rb”. 命令「vcmpleb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpleb Ra, Rb”. 命令「vcmpgeb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpgeb Ra, Rb”. 命令「vcmpltb Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vcmpltb Ra, Rb”. 命令「vaslh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslh Rc, Ra, Rb”. 命令「vaslh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslh Rb, Ra, I4”. 命令「vaslvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslvh Rc, Ra, Rb”. 命令「vaslvh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslvh Rb, Ra, I4”. 命令「vasrh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrh Rb, Ra, I4”. 命令「vasrvh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrvh Rc, Ra, Rb”. 命令「vlslh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vlslh Rc, Ra, Rb”. 命令「vlslh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlslh Rb, Ra, I4”. 命令「vlsrh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsrh Rc, Ra, Rb”. 命令「vlsrh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsrh Rb, Ra, I4”. 命令「vrolh Rc,Ra,Rb」を説明する図である。FIG. 5 is a diagram for explaining an instruction “vrolh Rc, Ra, Rb”. 命令「vrolh Rb,Ra,I4」を説明する図である。It is a figure explaining command "vrolh Rb, Ra, I4". 命令「vrorh Rb,Ra,I4」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vrorh Rb, Ra, I4”. 命令「vasrh Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrh Rc, Ra, Rb”. 命令「vaslb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslb Rc, Ra, Rb”. 命令「vaslb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslb Rb, Ra, I3”. 命令「vasrb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrb Rc, Ra, Rb”. 命令「vasrb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrb Rb, Ra, I3”. 命令「vlslb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlslb Rc, Ra, Rb”. 命令「vlslb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlslb Rb, Ra, I3”. 命令「vlsrb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsrb Rc, Ra, Rb”. 命令「vlsrb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsrb Rb, Ra, I3”. 命令「vrolb Rc,Ra,Rb」を説明する図である。FIG. 5 is a diagram for explaining an instruction “vrolb Rc, Ra, Rb”. 命令「vrolb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vrolb Rb, Ra, I3”. 命令「vrorb Rb,Ra,I3」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vrorb Rb, Ra, I3”. 命令「vasl Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasl Mm, Ra, Mn, Rb”. 命令「vasl Mm,Rb,Mn,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasl Mm, Rb, Mn, I5”. 命令「vaslvw Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslvw Mm, Ra, Mn, Rb”. 命令「vaslvw Mm,Rb,Mn,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vaslvw Mm, Rb, Mn, I5”. 命令「vasr Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasr Mm, Ra, Mn, Rb”. 命令「vasr Mm,Rb,Mn,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasr Mm, Rb, Mn, I5”. 命令「vasrvw Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vasrvw Mm, Ra, Mn, Rb”. 命令「vlsl Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsl Mm, Ra, Mn, Rb”. 命令「vlsl Mm,Rb,Mn,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsl Mm, Rb, Mn, I5”. 命令「vlsr Mm,Ra,Mn,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsr Mm, Ra, Mn, Rb”. 命令「vlsr Mm,Rb,Mn,I5」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlsr Mm, Rb, Mn, I5”. 命令「vsath Mm,Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vsath Mm, Rb, Mn”. 命令「vsath12 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsath12 Rb, Ra”. 命令「vsath9 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsath9 Rb, Ra”. 命令「vsath8 Rb,Ra」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vsath8 Rb, Ra”. 命令「vsath8u Rb,Ra」を説明する図である。It is a figure explaining instruction “vsath8u Rb, Ra”. 命令「vrndvh Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vrndvh Rb, Mn”. 命令「vabssumb Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vabssumb Rc, Ra, Rb”. 命令「vmul Mm,Rc,Ra,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “vmul Mm, Rc, Ra, Rb”. 命令「vxmul Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vxmul Mm, Rc, Ra, Rb”. 命令「vhmul Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vhmul Mm, Rc, Ra, Rb”. 命令「vlmul Mm,Rc,Ra,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “vlmul Mm, Rc, Ra, Rb”. 命令「vfmulh Mm,Rc,Ra,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “vfmulh Mm, Rc, Ra, Rb”. 命令「vxfmulh Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vxfmulh Mm, Rc, Ra, Rb”. 命令「vhfmulh Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmulh Mm, Rc, Ra, Rb”. 命令「vlfmulh Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmulh Mm, Rc, Ra, Rb”. 命令「vfmulhr Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vfmulhr Mm, Rc, Ra, Rb”. 命令「vxfmulhr Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmulhr Mm, Rc, Ra, Rb”. 命令「vhfmulhr Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vhfmulhr Mm, Rc, Ra, Rb”. 命令「vlfmulhr Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmulhr Mm, Rc, Ra, Rb”. 命令「vfmulw Mm,Rc,Ra,Rb」を説明する図である。FIG. 10 is a diagram for explaining an instruction “vfmulw Mm, Rc, Ra, Rb”. 命令「vxfmulw Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmulw Mm, Rc, Ra, Rb”. 命令「vhfmulw Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmulw Mm, Rc, Ra, Rb”. 命令「vlfmulw Mm,Rc,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmulw Mm, Rc, Ra, Rb”. 命令「vpfmulhww Mm,Rc:Rc+1,Ra,Rb」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vpfmulhww Mm, Rc: Rc + 1, Ra, Rb”. 命令「vmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmac Mm, Rc, Ra, Rb, Mn”. 命令「vmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmac M0, Rc, Ra, Rb, Rx”. 命令「vxmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxmac Mm, Rc, Ra, Rb, Mn”. 命令「vxmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxmac M0, Rc, Ra, Rb, Rx”. 命令「vhmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 5 is a diagram for explaining an instruction “vhmac Mm, Rc, Ra, Rb, Mn”. 命令「vhmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhmac M0, Rc, Ra, Rb, Rx”. 命令「vlmac Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlmac Mm, Rc, Ra, Rb, Mn”. 命令「vlmac M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlmac M0, Rc, Ra, Rb, Rx”. 命令「vfmach Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vfmach Mm, Rc, Ra, Rb, Mn”. 命令「vfmach M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmach M0, Rc, Ra, Rb, Rx”. 命令「vxfmach Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vxfmach Mm, Rc, Ra, Rb, Mn”. 命令「vxfmach M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vxfmach M0, Rc, Ra, Rb, Rx”. 命令「vhfmach Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram illustrating an instruction “vhfmach Mm, Rc, Ra, Rb, Mn”. 命令「vhfmach M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmach M0, Rc, Ra, Rb, Rx”. 命令「vlfmach Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmach Mm, Rc, Ra, Rb, Mn”. 命令「vlfmach M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmach M0, Rc, Ra, Rb, Rx”. 命令「vfmachr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmachr Mm, Rc, Ra, Rb, Mn”. 命令「vfmachr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmachr M0, Rc, Ra, Rb, Rx”. 命令「vxfmachr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmachr Mm, Rc, Ra, Rb, Mn”. 命令「vxfmachr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmachr M0, Rc, Ra, Rb, Rx”. 命令「vhfmachr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmachr Mm, Rc, Ra, Rb, Mn”. 命令「vhfmachr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmachr M0, Rc, Ra, Rb, Rx”. 命令「vlfmachr Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmachr Mm, Rc, Ra, Rb, Mn”. 命令「vlfmachr M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmachr M0, Rc, Ra, Rb, Rx”. 命令「vfmacw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmacw Mm, Rc, Ra, Rb, Mn”. 命令「vxfmacw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmacw Mm, Rc, Ra, Rb, Mn”. 命令「vhfmacw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmacw Mm, Rc, Ra, Rb, Mn”. 命令「vlfmacw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 5 is a diagram for explaining an instruction “vlfmacw Mm, Rc, Ra, Rb, Mn”. 命令「vpfmachww Mm,Rc:Rc+1,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vpfmachww Mm, Rc: Rc + 1, Ra, Rb, Mn”. 命令「vmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmsu Mm, Rc, Ra, Rb, Mn”. 命令「vmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vmsu M0, Rc, Ra, Rb, Rx”. 命令「vxmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxmsu Mm, Rc, Ra, Rb, Mn”. 命令「vxmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxmsu M0, Rc, Ra, Rb, Rx”. 命令「vhmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhmsu Mm, Rc, Ra, Rb, Mn”. 命令「vhmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhmsu M0, Rc, Ra, Rb, Rx”. 命令「vlmsu Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlmsu Mm, Rc, Ra, Rb, Mn”. 命令「vlmsu M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlmsu M0, Rc, Ra, Rb, Rx”. 命令「vfmsuh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmsuh Mm, Rc, Ra, Rb, Mn”. 命令「vfmsuh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmsuh M0, Rc, Ra, Rb, Rx”. 命令「vxfmsuh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 7 is a diagram for explaining an instruction “vxfmsuh Mm, Rc, Ra, Rb, Mn”. 命令「vxfmsuh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmsuh M0, Rc, Ra, Rb, Rx”. 命令「vhfmsuh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmsuh Mm, Rc, Ra, Rb, Mn”. 命令「vhfmsuh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmsuh M0, Rc, Ra, Rb, Rx”. 命令「vlfmsuh Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmsuh Mm, Rc, Ra, Rb, Mn”. 命令「vlfmsuh M0,Rc,Ra,Rb,Rx」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmsuh M0, Rc, Ra, Rb, Rx”. 命令「vfmsuw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vfmsuw Mm, Rc, Ra, Rb, Mn”. 命令「vxfmsuw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vxfmsuw Mm, Rc, Ra, Rb, Mn”. 命令「vhfmsuw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vhfmsuw Mm, Rc, Ra, Rb, Mn”. 命令「vlfmsuw Mm,Rc,Ra,Rb,Mn」を説明する図である。FIG. 6 is a diagram for explaining an instruction “vlfmsuw Mm, Rc, Ra, Rb, Mn”.

Explanation of symbols

１プロセッサ
１０命令制御部
１０ａ命令キャッシュ
１０ｂアドレス管理部
１０ｃ〜１０ｅ命令バッファ
１０ｆジャンプバッファ
１０ｇローテーション部
２０デコード部
３０レジスタファイル
３０ａ汎用レジスタ（Ｒ０〜Ｒ３１）
３０ｂアキュームレータ（ＭＨ，ＭＬ）
３０ｃリンクレジスタ（ＬＲ）
３０ｄ分岐レジスタ（ＴＡＲ）
３１プログラム状態レジスタ（ＰＳＲ）
３２条件フラグレジスタ（ＣＦＲ）
３３プログラムカウンタ（ＰＣ）
３４ＰＣ退避用レジスタ（ＩＰＣ）
３５ＰＳＲ退避用レジスタ（ＩＰＳＲ）
４０演算部
４１〜４３算術論理・比較演算器
４１ａＡＬＵ部
４１ｂ飽和処理部
４１ｃフラグ部
４４積和演算器
４４ａ、４４ｂ乗算器
４４ｃ〜４４ｅ加算器
４４ｆセレクタ
４４ｇ飽和処理部
４５バレルシフタ
４５ａ、４５ｂセレクタ
４５ｃ上位バレルシフタ
４５ｄ下位バレルシフタ
４５ｅ飽和処理部
４６除算器
４７変換器
４７ａ SATブロック
４７ｂ BSEQブロック
４７ｃ MSKGENブロック
４７ｄ VSUMBブロック
４７ｅ BCNTブロック
４７ｆ ILブロック
５０Ｉ／Ｆ部
６０命令メモリ部
７０データメモリ部
８０拡張レジスタ部
９０Ｉ／Ｏインターフェース部 DESCRIPTION OF SYMBOLS 1 Processor 10 Instruction control part 10a Instruction cache 10b Address management part 10c-10e Instruction buffer 10f Jump buffer 10g Rotation part 20 Decoding part 30 Register file 30a General-purpose register (R0-R31)
30b Accumulator (MH, ML)
30c Link register (LR)
30d Branch register (TAR)
31 Program status register (PSR)
32 Condition flag register (CFR)
33 Program counter (PC)
34 PC save register (IPC)
35 PSR save register (IPSR)
40 arithmetic units 41 to 43 arithmetic logic / comparison arithmetic unit 41a ALU unit 41b saturation processing unit 41c flag unit 44 product-sum arithmetic unit 44a, 44b multiplier 44c-44e adder 44f selector 44g saturation processing unit 45 barrel shifter 45a, 45b selector 45c Upper barrel shifter 45d Lower barrel shifter 45e Saturation processing section 46 Divider 47 Converter 47a SAT block 47b BSEQ block 47c MSKGEN block 47d VSUMB block 47e BCNT block 47f IL block 50 I / F section 60 Instruction memory section 70 Data memory section 80 Extension register section 90 I / O interface

Claims

A processor that executes SIMD type instructions that operate on a plurality of data with one instruction,
Decoding means for decoding instructions;
An execution means for executing an instruction based on the result of decoding by the decoding means;
The execution means includes an instruction code designating an operation type, a first operand designating a first data group consisting of a sequence of n (≧ 2) data, and a second data group consisting of a sequence of n data. When a SIMD type instruction including a second operand to be specified is decoded by the decoding means, the i-th data and the second data group in the sequence of n data constituting the first data group are Performing an operation specified by the instruction code on each of the n sets of j-th data in an array of n data constituting the data;
The processor is characterized in that i = 1, 2,..., N, and j = constant.

N is 2;
The first data group includes first data and second data,
The second data group includes first data and second data,
The execution means performs the calculation on a set of the first data of the first data group and the first data of the second data group, and the second data and second of the first data group. The processor according to claim 1, wherein the calculation is performed on a set of the data group and the first data.

N is 2;
The first data group includes first data and second data,
The second data group includes first data and second data,
The execution means performs the calculation on a set of the first data of the first data group and the second data of the second data group, and the second data and the second data of the first data group. The processor according to claim 1, wherein the calculation is performed on a set of the data group and the second data.

N is 2;
The first data group includes first data and second data,
The second data group includes first data and second data,
The execution means includes
When the first instruction is decoded by the decoding means, the calculation is performed on a set of the first data of the first data group and the first data of the second data group, and the first data Performing the operation on the set of the second data of the group and the first data of the second data group,
When the second instruction is decoded by the decoding means, the calculation is performed on a set of the first data of the first data group and the second data of the second data group, and the first data The processor according to claim 1, wherein the calculation is performed on a set of second data of the group and second data of the second data group.

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
The said execution means stores the low-order part of each result obtained by performing the said calculation with respect to said two sets in said 3rd data. The one of Claims 2-4 characterized by the above-mentioned. Processor.

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
The said execution means stores the high-order part of each result obtained by performing the said calculation with respect to said two sets in said 3rd data. The one of Claims 2-4 characterized by the above-mentioned. Processor.

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
5. The method according to claim 2, wherein the execution unit stores one of two results obtained by performing the calculation on the two sets in the third data. The processor described.

N is 4;
The first data group includes a sequence of first to fourth data,
The second data group consists of a sequence of first to fourth data,
The execution means includes a set of the first data of the first data group and the first data of the second data group, the second data of the first data group and the first data of the second data group, A set of the third data of the first data group and the first data of the second data group, and the fourth data of the first data group and the first data of the second data group, The processor according to claim 1, wherein the calculation is performed on a set of

N is 4;
The first data group includes a sequence of first to fourth data,
The second data group consists of a sequence of first to fourth data,
The execution means includes a set of first data of the first data group and second data of the second data group, second data of the first data group, and second data of the second data group, A set of the third data of the first data group and the second data of the second data group, and the fourth data of the first data group and the second data of the second data group, The processor according to claim 1, wherein the calculation is performed on a set of

N is 4;
The first data group includes a sequence of first to fourth data,
The second data group consists of a sequence of first to fourth data,
The execution means includes a set of first data of the first data group and third data of the second data group, second data of the first data group, and third data of the second data group, A set of the third data of the first data group and the third data of the second data group, and the fourth data of the first data group and the third data of the second data group, The processor according to claim 1, wherein the calculation is performed on a set of

N is 4;
The first data group includes a sequence of first to fourth data,
The second data group consists of a sequence of first to fourth data,
The execution means includes a set of the first data of the first data group and the fourth data of the second data group, the second data of the first data group and the fourth data of the second data group, A set of the third data of the first data group and the fourth data of the second data group, and the fourth data of the first data group and the fourth data of the second data group, The processor according to claim 1, wherein the calculation is performed on a set of

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
The said execution means stores the low-order part of each result obtained by performing the said calculation with respect to the said 4 groups in said 3rd data. The one of Claims 8-11 characterized by the above-mentioned. Processor.

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
The said execution means stores the high-order part of each result obtained by performing the said calculation with respect to the said 4 groups in said 3rd data. The one of Claims 8-11 characterized by the above-mentioned. Processor.

The type of operation specified by the instruction code is multiplication, sum of products or product difference,
The instruction includes a third operand designating third data for storing a result of the operation,
The said execution means stores two of the four results obtained by performing the said calculation with respect to the said four groups in said 3rd data. Any one of Claims 8-11 characterized by the above-mentioned. The processor according to item.