JP2010033323A

JP2010033323A - Multiprocessor, debugging device for debugging the same, and debug method for debugging the multiprocessor

Info

Publication number: JP2010033323A
Application number: JP2008194830A
Authority: JP
Inventors: Kazuhiko Iwanaga; 和彦岩永
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2008-07-29
Filing date: 2008-07-29
Publication date: 2010-02-12

Abstract

<P>PROBLEM TO BE SOLVED: To provide a multiprocessor, a debugging device, and a debug method for executing steps in order of a program even when there are a plurality of CPU cores in one chip. <P>SOLUTION: In the multiprocessor 1 incorporating a CPU 2a, CPU 2b and CPU 2c, respective commands of the CPU 2a, CPU 2b and CPU 2c are decoded by a decoder 21 so as to transit them to a monitor state when operating steps, a monitor transitable signal is output when a command is transitable to a monitor state, and the respective CPU are transited to the monitor state when all the CPU output monitor transitable signals. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

複数のＣＰＵ（Central Processing Unit）をコアとして内蔵したマルチプロセッサ及びそれをデバッグするデバッグ装置並びに前記マルチプロセッサをデバッグするデバッグ方法に関する。 The present invention relates to a multiprocessor including a plurality of CPUs (Central Processing Units) as a core, a debugging apparatus for debugging the multiprocessor, and a debugging method for debugging the multiprocessor.

近年、ＬＳＩ（Large Scale Integration）の処理能力を向上させるために、１チップ内に複数のＣＰＵコアを内蔵するマルチプロセッサ構成を取るものが多くなってきた。複数のＣＰＵコアを含むプロセッサのデバッグにおいては、複数のＣＰＵコアが同期して動作しているため、それぞれのＣＰＵコアで実行しているプログラムの時系列に従ってデバッグを行うことが必要である。デバッグの際にはプログラマーの意図通り動作していないＣＰＵコアがある場合には、対象となるＣＰＵコアのプログラム上にブレークポイントを設定し、そこで各ＣＰＵコアをプログラム実行状態からモニター状態と呼ばれるデバッグモードに移行させ、その近傍で１命令ずつステップ実行動作を行わせながらレジスタやメモリに格納されている値をチェックしていくということが行われている。 In recent years, in order to improve the processing capability of LSI (Large Scale Integration), many have adopted a multiprocessor configuration in which a plurality of CPU cores are built in one chip. In debugging a processor including a plurality of CPU cores, since the plurality of CPU cores operate synchronously, it is necessary to perform debugging in accordance with a time series of programs executed in the respective CPU cores. When there is a CPU core that does not operate as intended by the programmer during debugging, a breakpoint is set on the program of the target CPU core, and each CPU core is debugged from the program execution state to the monitor state. The mode is shifted to the mode, and the value stored in the register or the memory is checked while performing the step execution operation one instruction at a time in the vicinity thereof.

複数のコアを同時にモニター状態へ移行させることに関しては、特許文献１ないし特許文献５などに開示されている。 Patent Documents 1 to 5 disclose the transition of a plurality of cores to the monitor state at the same time.

また、マルチプロセッサ構成において、命令を１命令ずつ実行していくステップ実行については特許文献６に開示されている。特許文献６には、複数のＣＰＵコアの動作周波数が異なる場合に、周波数に反比例させた命令数を実行させることで複数のＣＰＵコアを同時にステップ実行可能としていることが記載されている。
特開２００５−１２２３７５号公報特開２００７−１４１２００号公報特許第２７７４７７０号公報特開平１０−１９８５７８号公報特開平４−１１４２４１号公報特開２００６−５３８３５号公報 Further, step execution in which instructions are executed one by one in a multiprocessor configuration is disclosed in Patent Document 6. Patent Document 6 describes that when the operating frequencies of a plurality of CPU cores are different, a plurality of CPU cores can be stepped simultaneously by executing the number of instructions inversely proportional to the frequency.
JP 2005-122375 A JP 2007-141200 A Japanese Patent No. 2774770 Japanese Patent Laid-Open No. 10-198578 JP-A-4-114241 JP 2006-53835 A

しかしながら、ＲＩＳＣ（Reduced Instruction Set Computer）プロセッサにおいては遅延分岐や遅延ロードというパイプライン構成があり、この遅延分岐や遅延ロードはデスティネーションレジスタの更新が１サイクル以上遅延するために、ステップ実行している命令が遅延分岐もしくは遅延ロードを伴う命令である場合はプログラム通りにレジスタの値が更新しなくなるという問題がある。 However, a RISC (Reduced Instruction Set Computer) processor has a pipeline structure such as a delay branch and a delay load, and this delay branch and delay load are executed in steps because the destination register update is delayed by one cycle or more. If the instruction is an instruction with a delayed branch or delayed load, there is a problem that the value of the register is not updated according to the program.

上述した問題を図７ないし図９を参照して詳細に説明する。図７ないし図９は、最も代表的なプロセッサのパイプラインの流れを示している。つまり、命令フェッチ（Instruction Fetch）、命令デコード（Instruction Decode）およびレジスタ読み出し（Register Read）、演算（Execute）、メモリアクセス（Memory Access）、レジスタ書き込み（Write Back）の５段階で構成されている。 The above problem will be described in detail with reference to FIGS. 7 to 9 show the pipeline flow of the most typical processor. That is, it is composed of five stages of instruction fetch (Instruction Fetch), instruction decode (Instruction Decode) and register read (Register Read), operation (Execute), memory access (Memory Access), and register write (Write Back).

図７ないし図９のパイプラインを持つプロセッサにおいて、例えば、以下のような連続する３つの命令列がある場合を考える。
命令１：ＬＤ［ａｄｄ］，Ｒ０（メモリのアドレスａｄｄからデータをロードし、Ｒ０レジスタに格納する）
命令２：ＡＤＤＲ０，Ｒ１，Ｒ２（Ｒ０レジスタとＲ１レジスタの和をＲ２レジスタに格納する）
命令３：ＡＤＤＲ０，Ｒ３，Ｒ４（Ｒ０レジスタとＲ３レジスタの和をＲ４レジスタに格納する） In the processor having the pipeline of FIGS. 7 to 9, for example, consider a case where there are three consecutive instruction sequences as follows.
Instruction 1: LD [add], R0 (load data from memory address add and store in R0 register)
Instruction 2: ADD R0, R1, R2 (store the sum of the R0 and R1 registers in the R2 register)
Instruction 3: ADD R0, R3, R4 (the sum of the R0 and R3 registers is stored in the R4 register)

上述した命令列は、図７に示すように実行していくことになるが、命令１でＲ０レジスタに値が格納されるよりも前に命令２、命令３でＲ０レジスタの読み出しを行うため、データハザードと呼ばれるパイプライン障害が発生する。命令３ではメモリから読み出した値を直接ＡＬＵに供給するForwardingと呼ばれるバイパス経路を開いて対応するが、命令２は絶対に間に合わないため更新前の値をＲ０レジスタの値として演算を行うことになる。つまり命令１（ＬＤ命令）はデスティネーションレジスタの更新が１サイクルの遅延を持っている命令であると言える。ＲＩＳＣプロセッサでは動作速度の向上のために、この状況をプログラムのスケジューリングに委ね、ハードウェアでは解決しないということが行われている。これを遅延ロードと呼ぶ。つまりこの場合プログラマーは、命令２は命令１によって更新される前のＲ０レジスタの値を使って演算を行うことを期待してプログラムを作成しているということになる。 The above-described instruction sequence is executed as shown in FIG. 7, but since the value is stored in the R0 register by the instruction 1, the R2 register is read by the instruction 2 and the instruction 3. A pipeline failure called a data hazard occurs. The instruction 3 responds by opening a bypass path called Forwarding that directly supplies the value read from the memory to the ALU. However, since the instruction 2 is not in time, the value before the update is calculated as the value of the R0 register. . That is, it can be said that the instruction 1 (LD instruction) is an instruction in which the update of the destination register has a delay of one cycle. In the RISC processor, in order to improve the operation speed, this situation is left to the scheduling of a program, and it is not solved by hardware. This is called lazy loading. In other words, in this case, the programmer is creating a program in the expectation that the instruction 2 performs an operation using the value of the R0 register before being updated by the instruction 1.

ここで命令１をステップ実行する場合を説明する。ステップ実行は、対象となる命令（この場合命令１）の次の命令（命令２）をデバッグ装置のホストコンピュータ上に退避しておき、これをソフトウェアブレーク命令（ＥＢＲＫ命令）に差し替えることで実現されるが、ステップ実行の場合は図８に示すように命令１はパイプラインの最後まで実行されて停止するために、次の命令２をステップ実行する際にはＲ０レジスタの値が更新されてしまい、プログラム通りにデバッグ出来ないことになる。遅延分岐の場合も同様にステップ実行がプログラム通りに行われない。 Here, the case where the instruction 1 is executed in steps will be described. Step execution is realized by saving the instruction (instruction 2) next to the target instruction (in this case, instruction 1) on the host computer of the debugging device and replacing it with a software break instruction (EBRK instruction). However, in the case of step execution, as shown in FIG. 8, instruction 1 is executed until the end of the pipeline and stops. Therefore, when the next instruction 2 is executed stepwise, the value of the R0 register is updated. It will not be possible to debug according to the program. Similarly, in the case of delayed branching, step execution is not performed according to the program.

このような問題を回避するためにチップの中に１つしかＣＰＵコアがないプロセッサでは、ステップ実行の際に命令２をＥＢＲＫ命令に差し替えるのではなく、図９のように命令３をＥＢＲＫ命令に差し替えることでこの問題を回避している。 In order to avoid such a problem, a processor having only one CPU core in the chip does not replace instruction 2 with an EBRK instruction during step execution, but replaces instruction 3 with an EBRK instruction as shown in FIG. By replacing it, this problem is avoided.

しかし１つのチップの中に複数のＣＰＵコアがあるようなマルチプロセッサにおいて、各コアが独立してこの回避策を取ってしまうと、遅延ロード、遅延分岐を含む命令を実行しているＣＰＵコアと、そうでないＣＰＵコアとで同時にステップ実行を行うことが出来ないという問題があった。 However, in a multiprocessor having a plurality of CPU cores in one chip, if each core takes this workaround independently, a CPU core executing an instruction including a delay load and a delay branch There was a problem that step execution could not be performed simultaneously with other CPU cores.

本発明はかかる問題を解決することを目的としている。 The present invention aims to solve such problems.

すなわち、本発明は、１つのチップの中に複数のＣＰＵコアがある場合でも、各ＣＰＵコアが同期してプログラムどおりの動作となるステップ実行を行うことができるマルチプロセッサ及びそれをデバッグするデバッグ装置並びに前記マルチプロセッサをデバッグするデバッグ方法を提供することを目的としている。 That is, the present invention relates to a multiprocessor capable of performing step execution in which each CPU core operates in synchronization with a program even when there are a plurality of CPU cores in one chip, and a debugging device for debugging the same. It is another object of the present invention to provide a debugging method for debugging the multiprocessor.

請求項１に記載された発明は、ブレーク命令によって実行状態からモニター状態に遷移し、同一周波数で動作するＣＰＵを複数有するマルチプロセッサにおいて、前記複数のＣＰＵそれぞれで実行している命令を監視する監視手段と、前記ＣＰＵに前記ブレーク命令が入力されている際に、前記監視手段が全ての前記ＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、前記複数のＣＰＵを前記モニター状態に遷移させる遷移手段と、を有していることを特徴とするマルチプロセッサである。 According to the first aspect of the present invention, in a multiprocessor having a plurality of CPUs operating at the same frequency by transitioning from an execution state to a monitor state by a break instruction, monitoring that monitors instructions executed by each of the plurality of CPUs And when the monitoring means detects that all of the CPUs are executing instructions that can transition to the monitor state when the break instruction is input to the CPU, the CPUs And a transition means for transitioning to the monitor state.

請求項２に記載された発明は、請求項１に記載された発明において、前記遷移手段が、前記ＣＰＵに含まれていることを特徴とするものである。 The invention described in claim 2 is the invention described in claim 1, wherein the transition means is included in the CPU.

請求項３に記載された発明は、請求項１または２に記載された発明において、前記複数のＣＰＵのうちから前記モニター状態に遷移するＣＰＵを選択する選択手段を有していることを特徴とするものである。 The invention described in claim 3 is characterized in that in the invention described in claim 1 or 2, it further comprises selection means for selecting a CPU that transitions to the monitor state from the plurality of CPUs. To do.

請求項４に記載された発明は、ブレーク命令によって実行状態からモニター状態に遷移し、同一周波数で動作するＣＰＵを複数有するマルチプロセッサと、前記マルチプロセッサをデバッグするデバッグ手段と、を有するデバッグ装置において、前記複数のＣＰＵそれぞれで実行している命令を監視する監視手段と、前記ＣＰＵに前記ブレーク命令が入力されている際に、前記監視手段が全ての前記ＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、前記複数のＣＰＵを前記モニター状態に遷移させる遷移手段と、を有していることを特徴とするデバッグ装置である。 According to a fourth aspect of the present invention, there is provided a debugging apparatus comprising: a multiprocessor having a plurality of CPUs that transition from an execution state to a monitor state by a break instruction and operating at the same frequency; and a debugging unit that debugs the multiprocessor. Monitoring means for monitoring instructions executed by each of the plurality of CPUs; and when the break instruction is input to the CPU, the monitoring means provides instructions that allow all of the CPUs to transition to a monitor state. And a transition unit configured to transition the plurality of CPUs to the monitor state when the execution is detected.

請求項５に記載された発明は、ブレーク命令によって実行状態からモニター状態に遷移し、同一周波数で動作するＣＰＵを複数有するマルチプロセッサをデバッグするデバッグ方法において、前記複数のＣＰＵそれぞれで実行している命令を監視し、前記ＣＰＵに前記ブレーク命令が入力されている際に、全ての前記ＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、前記複数のＣＰＵを前記モニター状態に遷移させることを特徴とするデバッグ方法である。 According to a fifth aspect of the present invention, in a debugging method for debugging a multiprocessor having a plurality of CPUs operating at the same frequency by transitioning from an execution state to a monitoring state by a break instruction, each of the plurality of CPUs executes the debugging method. The instruction is monitored, and when the break instruction is input to the CPU, if it is detected that all the CPUs are executing an instruction that can transition to the monitor state, the plurality of CPUs are monitored. This is a debugging method characterized by transitioning to a state.

請求項１に記載の発明によれば、監視手段で複数のＣＰＵそれぞれで実行している命令を監視し、ＣＰＵにブレーク命令が入力されている際に、監視手段が全てのＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、複数のＣＰＵをモニター状態に遷移させているので、複数のＣＰＵの中でステップ実行中の命令が遅延分岐や遅延ロードなどモニター状態に移行することでプログラム通りの動きを再現できないような命令であるものが１つでもある場合には、モニター状態に移行せずに次の命令を実行するようにすることで、全てのＣＰＵが同期してモニター状態に遷移することができるので、プログラム通りの動きを再現できるステップ実行をすることができる。また、ハードウェアで制御しているので、ステップ実行時のデバッグ用ホストコンピュータの処理の負荷を軽減することができる。 According to the first aspect of the present invention, the monitoring unit monitors the instruction executed by each of the plurality of CPUs, and when the break instruction is input to the CPU, the monitoring unit sets all the CPUs to the monitor state. If it is detected that a transitionable instruction is being executed, multiple CPUs are transitioned to the monitor state, so that the instruction being stepped in the multiple CPUs is placed in the monitor state such as delayed branch or delayed load. If there is even one command that cannot be reproduced as programmed by shifting, all the CPUs are synchronized by executing the next command without shifting to the monitor state. Thus, it is possible to perform step execution that can reproduce the movement according to the program. In addition, since it is controlled by hardware, the processing load of the debugging host computer during step execution can be reduced.

請求項２に記載の発明によれば、遷移手段が、ＣＰＵに含まれているので、監視手段と遷移手段との間の配線が短くなり高速に動作させる際に有利となる。 According to the second aspect of the present invention, since the transition means is included in the CPU, the wiring between the monitoring means and the transition means is shortened, which is advantageous when operating at high speed.

請求項３に記載の発明によれば、複数のＣＰＵのうちからモニター状態に遷移するＣＰＵを選択する選択手段を有しているので、他のＣＰＵとの依存関係の無い物を除外することができ、それによってステップ実行を行う際の１ステップの命令数を少なくすることができる。 According to the third aspect of the present invention, since there is a selection means for selecting a CPU that transitions to a monitor state from among a plurality of CPUs, it is possible to exclude an object having no dependency relationship with other CPUs. Thus, the number of instructions in one step when performing step execution can be reduced.

請求項４に記載の発明によれば、監視手段で複数のＣＰＵそれぞれで実行している命令を監視し、ＣＰＵにブレーク命令が入力されている際に、監視手段が全てのＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、複数のＣＰＵをモニター状態に遷移させているので、複数のＣＰＵの中でステップ実行中の命令が遅延分岐や遅延ロードなどモニター状態に移行することでプログラム通りの動きを再現できないような命令であるものが１つでもある場合には、モニター状態に移行せずに次の命令を実行するようにすることで、全てのＣＰＵが同期してモニター状態に遷移することができるので、プログラム通りの動きを再現できるステップ実行をすることができる。 According to the fourth aspect of the present invention, the monitoring unit monitors the instruction executed by each of the plurality of CPUs, and when the break instruction is input to the CPU, the monitoring unit sets all the CPUs to the monitor state. If it is detected that a transitionable instruction is being executed, multiple CPUs are transitioned to the monitor state, so that the instruction being stepped in the multiple CPUs is placed in the monitor state such as delayed branch or delayed load. If there is even one command that cannot be reproduced as programmed by shifting, all the CPUs are synchronized by executing the next command without shifting to the monitor state. Thus, it is possible to perform step execution that can reproduce the movement according to the program.

請求項５に記載の発明によれば、複数のＣＰＵそれぞれで実行している命令を監視し、ＣＰＵにブレーク命令が入力されている際に、全てのＣＰＵがモニター状態に遷移可能な命令を実行していることを検出した場合は、複数のＣＰＵをモニター状態に遷移させているので、複数のＣＰＵの中でステップ実行中の命令が遅延分岐や遅延ロードなどモニター状態に移行することでプログラム通りの動きを再現できないような命令であるものが１つでもある場合には、モニター状態に移行せずに次の命令を実行するようにすることで、全てのＣＰＵが同期してモニター状態に遷移することができるので、プログラム通りの動きを再現できるステップ実行をすることができる。 According to the fifth aspect of the present invention, the instruction executed by each of the plurality of CPUs is monitored, and when a break instruction is input to the CPU, all the CPUs execute an instruction that can transit to the monitor state. If it is detected that a plurality of CPUs are in the monitor state, the instructions being executed in steps in the plurality of CPUs are shifted to the monitor state such as delayed branching and delay load, and the program is executed. If there is at least one command that cannot reproduce the movement of the CPU, the next command is executed without shifting to the monitor state, so that all CPUs shift to the monitor state synchronously. Therefore, it is possible to perform step execution that can reproduce the motion as programmed.

［第１実施形態］
以下、本発明の第１の実施形態を、図１ないし図３を参照して説明する。図１は、本発明の第１の実施形態にかかるマルチプロセッサのブロック図である。図２は、図１に示したマルチプロセッサの変形例を示したブロック図である。図３は、図１に示したマルチプロセッサのチップレイアウトの例を示した説明図である。 [First Embodiment]
A first embodiment of the present invention will be described below with reference to FIGS. FIG. 1 is a block diagram of a multiprocessor according to the first embodiment of the present invention. FIG. 2 is a block diagram showing a modification of the multiprocessor shown in FIG. FIG. 3 is an explanatory diagram showing an example of the chip layout of the multiprocessor shown in FIG.

図１に示したマルチプロセッサ１は、ＣＰＵ２ａ、２ｂ、２ｃと、ブレーク生成装置３と、を備え、１チップのＬＳＩで構成されている。 The multiprocessor 1 shown in FIG. 1 includes CPUs 2a, 2b, and 2c and a break generation device 3, and is configured by a one-chip LSI.

ＣＰＵ（ＣＰＵコア、コアＣＰＵ或いは単にコアともいう）２ａは、デコーダ２１、ＡＮＤ回路２２、２３と、を備えている。また、ＣＰＵ２ａは、図示しないが、プログラム格納用のプログラムＲＡＭ、演算データ格納用のデータＲＡＭ、プログラムのアドレスを保持するプログラムカウンタ、プログラムＲＡＭから読み出した命令を格納する命令レジスタ、演算回路および各種レジスタも備えている。なお、ＣＰＵ２ｂ、２ｃも同様の構成である。また、ＣＰＵ２ａ、２ｂ、２ｃは同一周波数のクロックで動作する。 The CPU (also referred to as CPU core, core CPU, or simply core) 2 a includes a decoder 21 and AND circuits 22 and 23. Although not shown, the CPU 2a includes a program RAM for storing programs, a data RAM for storing operation data, a program counter for holding program addresses, an instruction register for storing instructions read from the program RAM, an arithmetic circuit, and various registers. It also has. The CPUs 2b and 2c have the same configuration. The CPUs 2a, 2b and 2c operate with clocks having the same frequency.

監視手段としてのデコーダ２１は、命令レジスタに格納された命令をデコードするデコーダであり、本実施形態では、モニター状態に遷移できる命令である場合はモニター遷移移行可能信号を出力する。モニター状態とは、プログラムを実行している状態である実行状態から遷移する状態であり、この状態でプログラムのステップ実行やデータメモリなどの読み出しなどのデバッグを行うことができる。また、モニター状態に遷移できる命令とは、ＣＰＵ２ａ、２ｂ、２ｃに実装されている命令のうち遅延ロード、遅延分岐を伴う命令以外の命令を示す。 The decoder 21 serving as a monitoring unit is a decoder that decodes an instruction stored in the instruction register. In this embodiment, the monitor 21 outputs a monitor transition enable signal when the instruction can be changed to the monitor state. The monitor state is a state in which a transition is made from an execution state that is a state in which a program is being executed. In this state, debugging such as step execution of a program and reading of a data memory or the like can be performed. In addition, the instruction that can transit to the monitor state indicates an instruction other than an instruction accompanied by a delay load and a delay branch among instructions installed in the CPUs 2a, 2b, and 2c.

遷移手段としてのＡＮＤ回路２２は、ＣＰＵ２ａ、２ｂ、２ｃのモニター遷移可能信号の論理積をとって、全てのモニター遷移可能信号がアクティブ（例えばハイレベル）になるとアクティブレベルの信号（例えばハイレベル）を出力する。 The AND circuit 22 as a transition means takes the logical product of the monitor transition enable signals of the CPUs 2a, 2b, and 2c, and when all the monitor transition enable signals become active (for example, high level), an active level signal (for example, high level) Is output.

遷移手段としてのＡＮＤ回路２３は、後述するブレーク生成装置３から出力されるハードウェアブレーク信号とＡＮＤ回路２２の出力との論理積をとって、双方ともアクティブ（例えばハイレベル）になると出力であるモニター状態移行信号をアクティブ（例えばハイレベル）とする。 The AND circuit 23 as a transition means takes a logical product of a hardware break signal output from a break generation device 3 to be described later and the output of the AND circuit 22, and outputs both when active (for example, high level). The monitor state transition signal is activated (for example, high level).

ブレーク生成装置３は、マルチプロセッサ１の外部などからの信号などによってハードウェアブレーク信号を生成し出力する。 The break generation device 3 generates and outputs a hardware break signal by a signal from the outside of the multiprocessor 1 or the like.

次に、図１に示したマルチプロセッサ１におけるステップ動作を説明する。説明を平易にするためにＣＰＵ２ａ、２ｂをステップ実行する場合について説明することとし、一例として以下のプログラムでそれぞれ命令１をステップ実行する場合を考える。 Next, the step operation in the multiprocessor 1 shown in FIG. 1 will be described. In order to simplify the explanation, a case where the CPUs 2a and 2b are step-executed will be described. As an example, a case where the instruction 1 is step-executed by the following program will be considered.

ＣＰＵ２ａのプログラム
命令１：ＬＤ［ａｄｄ］，Ｒ０（メモリのアドレスａｄｄからデータをロードし、Ｒ０レジスタに格納する）
命令２：ＡＤＤＲ０，Ｒ１，Ｒ２（Ｒ０レジスタとＲ１レジスタの和をＲ２レジスタに格納する）
命令３：ＡＤＤＲ０，Ｒ３，Ｒ４（Ｒ０レジスタとＲ３レジスタの和をＲ４レジスタに格納する）
命令４：ＮＯＰ（何もしない） CPU2a program instruction 1: LD [add], R0 (load data from memory address add and store in R0 register)
Instruction 2: ADD R0, R1, R2 (store the sum of the R0 and R1 registers in the R2 register)
Instruction 3: ADD R0, R3, R4 (the sum of the R0 and R3 registers is stored in the R4 register)
Command 4: NOP (do nothing)

ＣＰＵ２ｂのプログラム
命令１：ＬＤ［ａｄｄ］，Ｒ０（メモリのアドレスａｄｄからデータをロードし、Ｒ０レジスタに格納する）
命令２：ＬＤ［ａｄｄ］，Ｒ１（メモリのアドレスａｄｄからデータをロードし、Ｒ１レジスタに格納する）
命令３：ＡＤＤＲ１，Ｒ６，Ｒ７（Ｒ１レジスタとＲ６レジスタの和をＲ７レジスタに格納する）
命令４：ＡＤＤＲ１，Ｒ９，Ｒ１０（Ｒ１レジスタとＲ９レジスタの和をＲ１０レジスタに格納する） CPU2b program instruction 1: LD [add], R0 (load data from memory address add and store in R0 register)
Instruction 2: LD [add], R1 (load data from memory address add and store in R1 register)
Instruction 3: ADD R1, R6, R7 (the sum of the R1 register and R6 register is stored in the R7 register)
Instruction 4: ADD R1, R9, R10 (store the sum of the R1 and R9 registers in the R10 register)

まず命令１をそれぞれのＣＰＵのデコーダ２１でデコードすると、ＣＰＵ２ａ、ＣＰＵ２ｂともに遅延ロード命令であるためモニター遷移可能信号はアクティブとならない。次に命令２をそれぞれのＣＰＵのデコーダ２１でデコードするとＣＰＵ２ａは遅延ロードでも遅延分岐でもないのでＣＰＵ２ａのモニター遷移可能信号はアクティブとなるが、ＣＰＵ２ｂが遅延ロード命令であるためＣＰＵ２ｂのモニター遷移可能信号はアクティブとならない。従ってＡＮＤ回路２２の出力はアクティブとならない。命令１をステップ実行する場合この命令２のタイミングでハードウェアブレーク信号がアクティブとなる。次の命令３をそれぞれのＣＰＵのデコーダ２１でデコードすると、ＣＰＵ２ａ、ＣＰＵ２ｂともに遅延ロードでも遅延分岐でもないため、それぞれのモニター遷移可能信号がアクティブとなってＡＮＤ回路２２がアクティブとなり、ハードウェアブレーク信号がアクティブであるためＡＮＤ回路２３がアクティブとなって命令４のタイミングでＣＰＵ２ａ、ＣＰＵ２ｂともにブレーク（停止）する。 First, when the instruction 1 is decoded by the decoder 21 of each CPU, the monitor transition enable signal is not activated because both the CPUs 2a and 2b are delayed load instructions. Next, when the instruction 2 is decoded by the decoder 21 of each CPU, since the CPU 2a is neither delayed load nor delayed branch, the monitor transition enable signal of the CPU 2a becomes active, but since the CPU 2b is a delay load instruction, the monitor transition enable signal of the CPU 2b Is not active. Therefore, the output of the AND circuit 22 is not active. When stepping instruction 1, the hardware break signal becomes active at the timing of instruction 2. When the next instruction 3 is decoded by the decoder 21 of each CPU, neither the CPU 2a nor the CPU 2b is delayed loading nor delayed branching, so that each monitor transition enable signal becomes active, the AND circuit 22 becomes active, and the hardware break signal Is active, the AND circuit 23 becomes active, and both the CPU 2a and CPU 2b break (stop) at the timing of the instruction 4.

こうするとＣＰＵ２ａもＣＰＵ２ｂも命令１〜３を実行した後にモニター状態に遷移するため、プログラム通りにステップ実行を行うことができる。 In this way, since both the CPU 2a and the CPU 2b execute the instructions 1 to 3 and transition to the monitor state, step execution can be performed according to the program.

本実施形態のように、ＣＰＵをブレークさせるための信号がハードウェアとして装備されており、その信号をプロセッサに印加することによって実行状態からモニター状態へ移行することをハードウェアブレークといい、ハードウェアブレーク信号が特許請求の範囲のブレーク命令に相当する。 As in this embodiment, a signal for breaking the CPU is provided as hardware, and the transition from the execution state to the monitoring state by applying the signal to the processor is called a hardware break. The break signal corresponds to a break instruction in the claims.

本実施形態によれば、ＣＰＵ２ａ、ＣＰＵ２ｂ、ＣＰＵ２ｃを内蔵したマルチプロセッサ１において、ステップ動作時にモニター状態に遷移させるために、ＣＰＵ２ａ、ＣＰＵ２ｂ、ＣＰＵ２ｃそれぞれの命令をデコーダ２１でデコードして、モニター状態に遷移可能な命令の場合はモニター遷移可能信号を出力し、全てのＣＰＵがモニター遷移可能信号を出力した場合に、各ＣＰＵをモニター状態に遷移させているので、プログラム通りの動きを再現できるステップ実行をすることができる。 According to the present embodiment, in the multiprocessor 1 incorporating the CPU 2a, CPU 2b, and CPU 2c, in order to shift to the monitor state during the step operation, each instruction of the CPU 2a, CPU 2b, and CPU 2c is decoded by the decoder 21 to enter the monitor state. In the case of a transitionable instruction, a monitor transition enable signal is output, and when all CPUs output a monitor transition enable signal, each CPU is transitioned to the monitor state, so step execution that can reproduce the behavior as programmed Can do.

また、マルチプロセッサ１内部でモニター状態への遷移をハード的に制御しているので、ステップ実行時のデバッグ用ホストコンピュータの処理の負荷を軽減することができる。さらに、ハードウェアブレークを用いているので、各ＣＰＵのプログラムメモリからのデータを読み出して各ＣＰＵのプログラムメモリの書き換えといった作業が不要となるためステップ実行が高速化できる。 In addition, since the transition to the monitor state is controlled by hardware in the multiprocessor 1, it is possible to reduce the processing load of the debugging host computer at the time of step execution. Furthermore, since a hardware break is used, the step execution can be speeded up because the operation of reading data from the program memory of each CPU and rewriting the program memory of each CPU becomes unnecessary.

なお、ＡＮＤ回路２２およびＡＮＤ回路２３は、ＣＰＵ２ａ、２ｂ、２ｃがそれぞれ内蔵せずに、図２に示すようにブレーク生成装置３で一括して論理積をとってモニター遷移信号を生成してＣＰＵ２ａ、２ｂ、２ｃに出力する構成としてもよい。 Note that the AND circuit 22 and the AND circuit 23 are not built in the CPUs 2a, 2b, and 2c, respectively, and as shown in FIG. 2b and 2c may be output.

このように構成することで、ブレーク生成装置３で一括して論理積をとってモニター遷移信号を生成してＣＰＵ２ａ、２ｂ、２ｃに出力しているので、ＣＰＵ２ａ、２ｂ、２ｃに対する回路変更が最小限で済む。 With this configuration, the break generation device 3 collectively performs logical product and generates monitor transition signals and outputs them to the CPUs 2a, 2b, and 2c, so that circuit changes to the CPUs 2a, 2b, and 2c are minimized. It ’s all you need.

ただし、図２よりも図１の構成のほうが、各ＣＰＵとＡＮＤ回路との間の配線が短くなるために高速に動作させる際に有利である。一般的にＬＳＩのチップレイアウトの際には図３に示すようにＣＰＵ２ａ、２ｂ、２ｃは並べて配置されることが多い。一方、ブレーク生成装置３のようなデバッグ回路は、通常動作に関わらないことや外部との通信を行うことからＣＰＵ２ａ、２ｂ、２ｃとは離れた外周に配置される可能性が大きい。この場合、各ＣＰＵとブレーク生成装置３との間の配線が長くなり、図２のように各ＣＰＵから信号を取り出してからまた各ＣＰＵに信号を出力する構成では、図１と比較して回路の高速化が困難である。従って、ハードウェアで各ＣＰＵをブレークさせる際には図１の構成の方が好ましい。 However, the configuration shown in FIG. 1 is more advantageous than the configuration shown in FIG. 2 because the wiring between each CPU and the AND circuit is shortened. In general, in the LSI chip layout, the CPUs 2a, 2b, and 2c are often arranged side by side as shown in FIG. On the other hand, a debug circuit such as the break generating device 3 is not likely to be related to normal operation or communicates with the outside, so it is highly likely that the debug circuit is arranged on the outer periphery away from the CPUs 2a, 2b, and 2c. In this case, the wiring between each CPU and the break generating device 3 becomes long, and in the configuration in which a signal is taken out from each CPU as shown in FIG. It is difficult to increase the speed. Therefore, the configuration shown in FIG. 1 is preferable when breaking each CPU with hardware.

［第２実施形態］
次に、本発明の第２の実施形態を図４を参照して説明する。なお、前述した第１の実施形態と同一部分には、同一符号を付して説明を省略する。図４は、本発明の第２の実施形態にかかるマルチプロセッサのブロック図である。 [Second Embodiment]
Next, a second embodiment of the present invention will be described with reference to FIG. Note that the same parts as those in the first embodiment described above are denoted by the same reference numerals and description thereof is omitted. FIG. 4 is a block diagram of a multiprocessor according to the second embodiment of the present invention.

本実施形態では、選択手段としてのブレーク生成装置３が、ハードウェアブレーク信号に加えて、ＣＰＵ２ａマスク信号、ＣＰＵ２ｂマスク信号、ＣＰＵ２ｃマスク信号を出力している。 In the present embodiment, the break generation device 3 as selection means outputs a CPU 2a mask signal, a CPU 2b mask signal, and a CPU 2c mask signal in addition to the hardware break signal.

さらに、ＣＰＵ２ａでは、ＣＰＵ２ｂマスク信号とＣＰＵ２ｂが出力したモニター遷移可能信号とがＯＲ回路２４によって論理和をとってＡＮＤ回路２２に入力され、ＣＰＵ２ｃマスク信号とＣＰＵ２ｃが出力したモニター遷移可能信号とがＯＲ回路２５によって論理和をとってＡＮＤ回路２２に入力されている。ＣＰＵ２ｂでは、ＣＰＵ２ａマスク信号とＣＰＵ２ａが出力したモニター遷移可能信号とがＯＲ回路２４によって論理和をとってＡＮＤ回路２２に入力され、ＣＰＵ２ｃマスク信号とＣＰＵ２ｃが出力したモニター遷移可能信号とがＯＲ回路２５によって論理和をとってＡＮＤ回路２２に入力されている。ＣＰＵ２ｃでは、ＣＰＵ２ａマスク信号とＣＰＵ２ａが出力したモニター遷移可能信号とがＯＲ回路２４によって論理和をとってＡＮＤ回路２２に入力され、ＣＰＵ２ｂマスク信号とＣＰＵ２ｂが出力したモニター遷移可能信号とがＯＲ回路２５によって論理和をとってＡＮＤ回路２２に入力されている。 Further, in the CPU 2a, the CPU 2b mask signal and the monitor transition enable signal output from the CPU 2b are ORed by the OR circuit 24 and input to the AND circuit 22, and the CPU 2c mask signal and the monitor transition enable signal output from the CPU 2c are ORed. A logical sum is taken by the circuit 25 and inputted to the AND circuit 22. In the CPU 2b, the CPU 2a mask signal and the monitor transition enable signal output by the CPU 2a are ORed by the OR circuit 24 and input to the AND circuit 22, and the CPU 2c mask signal and the monitor transition enable signal output by the CPU 2c are OR circuit 25. Is logically summed and input to the AND circuit 22. In the CPU 2c, the CPU 2a mask signal and the monitor transition enable signal output from the CPU 2a are ORed by the OR circuit 24 and input to the AND circuit 22, and the CPU 2b mask signal and the monitor transition enable signal output from the CPU 2b are OR circuit 25. Is logically summed and input to the AND circuit 22.

このような構成にすることで同期して動かなくても構わないＣＰＵを除外してステップ実行が可能となる。例えば、ＣＰＵ２ａを除外する場合、ＣＰＵ２ａマスク信号をアクティブ（ハイレベル）にする。そうすると、ＣＰＵ２ｂとＣＰＵ２ｃではＯＲ回路２４によってＡＮＤ回路２２の入力が常にアクティブ（ハイレベル）となるので、ＣＰＵ２ａの状態に関わらずＣＰＵ２ｂと２ｃとでステップ実行を行うことができる。すなわち、ブレーク生成装置３が出力するＣＰＵ２ａマスク信号、ＣＰＵ２ｂマスク信号、ＣＰＵ２ｃマスク信号によってモニター状態に遷移するＣＰＵを選択している。 By adopting such a configuration, it is possible to perform step execution by excluding CPUs that do not have to operate synchronously. For example, when excluding the CPU 2a, the CPU 2a mask signal is made active (high level). Then, since the input of the AND circuit 22 is always active (high level) by the OR circuit 24 in the CPUs 2b and 2c, the CPUs 2b and 2c can perform step execution regardless of the state of the CPU 2a. In other words, the CPU that changes to the monitor state is selected by the CPU 2a mask signal, the CPU 2b mask signal, and the CPU 2c mask signal output from the break generation device 3.

本実施形態によれば、ブレーク生成装置３が各ＣＰＵに対してマスク信号（ＣＰＵ２ａマスク信号、ＣＰＵ２ｂマスク信号、ＣＰＵ２ｃマスク信号）を出力して、各ＣＰＵでは他のＣＰＵからのモニター遷移可能信号とマスク信号の論理和を取った上で、自身の信号と論理積を取るように構成しているので、他のＣＰＵとの依存関係の無い物を除外することができ、それによって除外したＣＰＵがモニター状態に遷移できない命令を実行中であってもモニター状態にさせることができ、全てのＣＰＵを同期してステップ実行する場合と比較してステップ実行を行う際の１ステップの命令数を少なくすることができる。 According to the present embodiment, the break generation device 3 outputs a mask signal (CPU 2a mask signal, CPU 2b mask signal, CPU 2c mask signal) to each CPU, and each CPU receives a monitor transition enable signal from another CPU. Since it is configured to take the logical product of its own signal after taking the logical sum of the mask signal, it is possible to exclude those that do not depend on other CPUs, Even if an instruction that cannot be shifted to the monitor state is being executed, the monitor state can be set, and the number of instructions in one step when performing step execution is reduced as compared with the case where all CPUs execute step by step synchronously. be able to.

［第３の実施形態］
次に、本発明の第３の実施形態を図５および図６を参照して説明する。なお、前述した第１、第２の実施形態と同一部分には、同一符号を付して説明を省略する。図５は、本発明の第３の実施形態にかかるデバッグ装置のブロック図である。図６は、図５に示したデバッグ装置のステップ実行動作を示すフローチャートである。 [Third Embodiment]
Next, a third embodiment of the present invention will be described with reference to FIGS. The same parts as those in the first and second embodiments described above are denoted by the same reference numerals and description thereof is omitted. FIG. 5 is a block diagram of a debugging device according to the third embodiment of the present invention. FIG. 6 is a flowchart showing the step execution operation of the debugging apparatus shown in FIG.

図５に示したデバッグ装置１０は、マルチプロセッサ１´と、ブレーク生成装置３´と、を備えている。 The debugging device 10 shown in FIG. 5 includes a multiprocessor 1 ′ and a break generation device 3 ′.

マルチプロセッサ１´は、ＣＰＵ２ａ´、２ｂ´、２ｃ´を備えている。ＣＰＵ２ａ´、２ｂ´、２ｃ´は、デコーダ２１や、図示しないプログラム格納用のプログラムＲＡＭ、演算データ格納用のデータＲＡＭ、プログラムのアドレスを保持するプログラムカウンタ、プログラムＲＡＭから読み出した命令を格納する命令レジスタ、演算回路および各種レジスタを備えている点は第１、第２の実施形態のＣＰＵ２ａ、２ｂ、２ｃと同様であるが、ＡＮＤ回路２２および２３が削除されている点が異なる。 The multiprocessor 1 ′ includes CPUs 2a ′, 2b ′, and 2c ′. The CPUs 2a ′, 2b ′, and 2c ′ are a decoder 21, a program RAM (not shown) for storing a program, a data RAM for storing operation data, a program counter that holds a program address, and an instruction that stores an instruction read from the program RAM. The point that a register, an arithmetic circuit, and various registers are provided is the same as the CPUs 2a, 2b, and 2c of the first and second embodiments, except that the AND circuits 22 and 23 are deleted.

さらに、マルチプロセッサ１´は第１、第２の実施形態において、内蔵されていたブレーク生成装置３が削除され、マルチプロセッサ１´外部にブレーク生成装置３´として設けられている。この監視手段、遷移手段としてのブレーク生成装置は、例えばデバッグ用のコンピュータやワークステーションなどで構成してもよい。 Furthermore, in the first and second embodiments, the built-in break generation device 3 is deleted from the multiprocessor 1 ′, and the multiprocessor 1 ′ is provided outside the multiprocessor 1 ′ as a break generation device 3 ′. The break generation device as the monitoring means and the transition means may be constituted by, for example, a debugging computer or a workstation.

次に、上述した構成のデバッグ装置１０において、ステップ実行を行う際の動作を図６のフローチャートを参照して説明する。図６に示したフローチャートはブレーク実行装置３´にて実行される。 Next, the operation when performing step execution in the debug device 10 having the above-described configuration will be described with reference to the flowchart of FIG. The flowchart shown in FIG. 6 is executed by the break execution device 3 ′.

まず、ステップＳ１において、検査対象となる命令を各ＣＰＵのプログラムカウンタ位置の命令に設定しステップＳ２に進む。本ステップでは、各ＣＰＵのプログラムをプログラムメモリからブレーク生成装置３´に読み出して、ステップ実行する命令をプログラムカウンタに設定された状態としている。 First, in step S1, an instruction to be inspected is set as an instruction at the program counter position of each CPU, and the process proceeds to step S2. In this step, the program of each CPU is read from the program memory to the break generation device 3 ', and the instruction to be executed in step is set in the program counter.

次に、ステップＳ２において、全ＣＰＵにおいて対象命令をチェックしてステップＳ３に進む。本ステップにおいては、モニター状態に遷移できる命令であるか否かをチェックしている。 Next, in step S2, the target instruction is checked in all the CPUs, and the process proceeds to step S3. In this step, it is checked whether or not the command is capable of transitioning to the monitor state.

次に、ステップＳ３において、全てのＣＰＵがモニター状態に遷移可能か否かを判断して遷移可能である場合（ＹＥＳの場合）はステップＳ５に進み、遷移可能でない場合（ＮＯの場合）はステップＳ４に進む。つまり、ステップＳ２でチェックした命令がモニター状態に遷移可能か否かを判断している。 Next, in step S3, it is determined whether or not all CPUs can transition to the monitor state, and if transition is possible (in the case of YES), the process proceeds to step S5. Proceed to S4. That is, it is determined whether or not the instruction checked in step S2 can transition to the monitor state.

次に、ステップＳ４において、全てのＣＰＵがモニター状態に遷移可能な命令でなかったため、対象命令を１つ次の命令に進めてステップＳ２に戻る。 Next, in step S4, since all the CPUs are not instructions that can make a transition to the monitor state, the target instruction is advanced to the next instruction and the process returns to step S2.

次に、ステップＳ５において、全てのＣＰＵがモニター状態に遷移可能な命令であったため、対象命令の１つ次の命令を退避してステップＳ６に進む。 Next, in step S5, since all the CPUs are instructions that can transition to the monitor state, the instruction next to the target instruction is saved and the process proceeds to step S6.

次に、ステップＳ６において、対象命令の１つ次の命令をソフトウェアブレーク命令（ＥＢＲＫ命令）に修正しステップＳ７に進む。本ステップにおいて、ステップＳ５で退避した命令に代えてソフトウェアブレーク命令に差し替える。 Next, in step S6, the instruction next to the target instruction is corrected to a software break instruction (EBRK instruction), and the process proceeds to step S7. In this step, a software break instruction is substituted for the instruction saved in step S5.

次に、ステップＳ７において、全てのＣＰＵをプログラム実行状態にする。本ステップにおいて、修正したプログラムを各ＣＰＵのプログラムメモリに書き込んでプログラムを実行させている。そして、実行されたプログラムはＥＢＲＫ命令をデコーダ２１がデコードすることでモニター遷移可能信号を出力してＣＰＵを停止させ、以後モニター状態に遷移してステップ実行を行うことができる。 Next, in step S7, all the CPUs are set to the program execution state. In this step, the modified program is written in the program memory of each CPU and the program is executed. The executed program causes the decoder 21 to decode the EBRK instruction to output a monitor transition enable signal to stop the CPU, and then transitions to the monitor state to perform step execution.

上述したフローチャートの動作例を第１の実施形態で例示したプログラムでそれぞれ命令１をステップ実行する場合で説明する。まず命令１をそれぞれ検査すると、ＣＰＵ２ａ´、ＣＰＵ２ｂ´ともに遅延ロード命令であるためスキップする。次に命令２をそれぞれ検査するとＣＰＵ２ａ´は遅延ロードでも遅延分岐でもないが、ＣＰＵ２ｂ´が遅延ロード命令であるためスキップする。次の命令３をそれぞれ検査すると、ＣＰＵ２ａ´、ＣＰＵ２ｂ´ともに遅延ロードでも遅延分岐でもないため、それぞれ次の命令４をソフトウェアブレーク命令（ＥＢＲＫ命令）に差し替える。 An operation example of the above-described flowchart will be described in the case where the instruction 1 is stepped by the program exemplified in the first embodiment. First, when each of the instructions 1 is inspected, both CPU 2a 'and CPU 2b' are skipped because they are delayed load instructions. Next, when each of the instructions 2 is inspected, the CPU 2a 'is neither a delay load nor a delay branch, but skips because the CPU 2b' is a delay load instruction. When each of the next instructions 3 is inspected, neither the CPU 2a 'nor the CPU 2b' is a delayed load or delayed branch, so the next instruction 4 is replaced with a software break instruction (EBRK instruction).

本実施形態のように、ブレークしたい箇所の命令を、ＥＢＲＫ命令に置き換えて、モニター状態への移行をプロセッサの命令デコーダに委ねて、モニターモードへ移行することをソフトウェアブレークといい、このＥＢＲＫ命令が特許請求の範囲のブレーク命令に相当する。 As in this embodiment, replacing an instruction at a place where a break is desired with an EBRK instruction, leaving the transition to the monitor state to the instruction decoder of the processor and shifting to the monitor mode is called a software break, and this EBRK instruction This corresponds to the break instruction in the claims.

本実施例によれば、デバッグ装置１０において、ステップ動作時にモニター状態に遷移させるために、ブレーク生成装置３´でマルチプロセッサ１´内のＣＰＵ２ａ´、２ｂ´、２ｃ´のプログラムをそれぞれ読み込んで、実行するべき命令を検査し、全てのＣＰＵでその命令が遅延ロード、遅延分岐命令ではなければ次の命令を退避し、ソフトウェアブレーク命令に差し替えた後、全てのＣＰＵを同時にプログラム実行状態にする。もしも１つでも遅延ロードや遅延分岐命令がある場合には全てのＣＰＵでその命令をスキップして次の命令を検査するということを遅延ロード、遅延分岐命令が出なくなるまで繰り返しているので、プログラム通りの動きを再現できるステップ実行をすることができる。 According to the present embodiment, the debug device 10 reads the programs of the CPUs 2 a ′, 2 b ′, and 2 c ′ in the multiprocessor 1 ′ with the break generation device 3 ′ in order to transit to the monitor state during the step operation. The instruction to be executed is inspected, and if the instruction is not a delayed load or delayed branch instruction in all the CPUs, the next instruction is saved and replaced with a software break instruction. If there is at least one delayed load or delayed branch instruction, all CPUs skip the instruction and check the next instruction until the delay load and delayed branch instructions are not issued. Step execution that can reproduce the movement of the street can be performed.

また、ブレーク生成装置３´でソフトウェアブレーク命令に差し替えることでステップ実行を行わせているので、マルチプロセッサ１´に追加回路等が不要となる。 In addition, since step execution is performed by replacing the software break instruction with the break generation device 3 ', an additional circuit or the like is not required for the multiprocessor 1'.

また、上述した第３の実施形態では、遅延ロードである場合には直後の命令をＥＢＲＫ命令に差し替えることが出来ないとして説明を行ったが、遅延ロードであってもデスティネーションが直後の命令のソースと異なる場合には、直後の命令をＥＢＲＫ命令に差し替えても問題がないため、検査対象の命令だけではなく、直後の命令も考慮して判断を行うようにすればステップ実行の際の命令実行数を少なくすることが出来る。 Further, in the third embodiment described above, it has been described that the instruction immediately after cannot be replaced with the EBRK instruction in the case of a delay load. If it is different from the source, there is no problem even if the immediately following instruction is replaced with the EBRK instruction. Therefore, if the determination is made considering not only the instruction to be inspected but also the immediately following instruction, the instruction at the time of step execution The number of executions can be reduced.

なお、本発明は上記実施形態に限定されるものではない。即ち、本発明の骨子を逸脱しない範囲で種々変形して実施することができる。 The present invention is not limited to the above embodiment. That is, various modifications can be made without departing from the scope of the present invention.

本発明の第１の実施形態にかかるマルチプロセッサのブロック図である。1 is a block diagram of a multiprocessor according to a first embodiment of the present invention. FIG. 図１に示したマルチプロセッサの変形例を示したブロック図である。FIG. 6 is a block diagram showing a modification of the multiprocessor shown in FIG. 1. 図１に示したマルチプロセッサのチップレイアウトの例を示した説明図である。FIG. 2 is an explanatory diagram showing an example of a chip layout of the multiprocessor shown in FIG. 1. 本発明の第２の実施形態にかかるマルチプロセッサのブロック図である。It is a block diagram of the multiprocessor concerning the 2nd Embodiment of this invention. 本発明の第３の実施形態にかかるデバッグ装置のブロック図である。It is a block diagram of the debugging apparatus concerning the 3rd Embodiment of this invention. 図５に示したデバッグ装置のステップ実行動作を示すフローチャートである。6 is a flowchart showing a step execution operation of the debugging apparatus shown in FIG. 5. 遅延ロード命令のパイプラインの例を示す説明図である。It is explanatory drawing which shows the example of the pipeline of a delay load instruction. 遅延ロード命令をステップ実行させた場合のパイプラインの例を示す説明図である。It is explanatory drawing which shows the example of the pipeline at the time of making the delay load instruction execute step. 遅延ロード命令時の問題点を除いたステップ実行のパイプラインの例を示す説明図である。It is explanatory drawing which shows the example of the pipeline of step execution except the problem at the time of a delay load instruction.

Explanation of symbols

１マルチプロセッサ
２ａ、２ｂ、２ｃＣＰＵ
２１デコーダ（監視手段）
２２、２３ＡＮＤ回路（遷移手段）
２４、２５ＯＲ回路（選択手段）
３ブレーク生成装置（選択手段）
１０デバッグ装置（選択手段）
１´ マルチプロセッサ
２ａ´、２ｂ´、２ｃ´ ＣＰＵ
３´ ブレーク生成装置（監視手段、遷移手段） 1 Multiprocessor 2a, 2b, 2c CPU
21 Decoder (monitoring means)
22, 23 AND circuit (transition means)
24, 25 OR circuit (selection means)
3 Break generation device (selection means)
10 Debugging device (selection means)
1 'multiprocessor 2a', 2b ', 2c' CPU
3 'Break generation device (monitoring means, transition means)

Claims

In a multiprocessor having a plurality of CPUs that transition from an execution state to a monitor state by a break instruction and operate at the same frequency,
Monitoring means for monitoring instructions executed by each of the plurality of CPUs;
When the break instruction is input to the CPU, if the monitoring unit detects that all the CPUs are executing an instruction that can transition to the monitor state, the plurality of CPUs are placed in the monitor state. Transition means for transitioning to
A multiprocessor characterized by comprising:

The multiprocessor according to claim 1, wherein the transition unit is included in the CPU.

The multiprocessor according to claim 1, further comprising a selection unit that selects a CPU that transitions to the monitor state from the plurality of CPUs.

In a debugging device having a multiprocessor having a plurality of CPUs that transition from an execution state to a monitor state by a break instruction and operate at the same frequency, and debugging means for debugging the multiprocessor,
Monitoring means for monitoring instructions executed by each of the plurality of CPUs;
When the break instruction is input to the CPU, if the monitoring unit detects that all the CPUs are executing an instruction that can transition to the monitor state, the plurality of CPUs are placed in the monitor state. Transition means for transitioning to
A debugging device characterized by comprising:

In a debugging method for debugging a multiprocessor having a plurality of CPUs operating at the same frequency by transitioning from an execution state to a monitor state by a break instruction,
An instruction executed by each of the plurality of CPUs is monitored, and when the break instruction is input to the CPU, it is detected that all the CPUs are executing an instruction that can transition to a monitor state. In this case, the debugging method is characterized by causing the plurality of CPUs to transition to the monitor state.