WO2013145529A1 - 演算処理装置、その演算処理方法、及び演算処理プログラムが格納された記憶媒体 - Google Patents
演算処理装置、その演算処理方法、及び演算処理プログラムが格納された記憶媒体 Download PDFInfo
- Publication number
- WO2013145529A1 WO2013145529A1 PCT/JP2013/000762 JP2013000762W WO2013145529A1 WO 2013145529 A1 WO2013145529 A1 WO 2013145529A1 JP 2013000762 W JP2013000762 W JP 2013000762W WO 2013145529 A1 WO2013145529 A1 WO 2013145529A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- address
- arithmetic processing
- storage means
- processing
- stored
- Prior art date
Links
- 238000003672 processing method Methods 0.000 title claims description 20
- 238000000034 method Methods 0.000 claims abstract description 22
- 238000004364 calculation method Methods 0.000 claims description 19
- 230000006870 function Effects 0.000 claims description 10
- 230000004044 response Effects 0.000 claims description 6
- 230000000717 retained effect Effects 0.000 abstract description 4
- 238000010586 diagram Methods 0.000 description 6
- 230000009291 secondary effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L61/00—Network arrangements, protocols or services for addressing or naming
- H04L61/58—Caching of addresses or names
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/3005—Arrangements for executing specific machine instructions to perform operations for flow control
- G06F9/30054—Unconditional branch instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
- G06F9/3009—Thread control instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/32—Address formation of the next instruction, e.g. by incrementing the instruction counter
- G06F9/322—Address formation of the next instruction, e.g. by incrementing the instruction counter for non-sequential address
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/32—Address formation of the next instruction, e.g. by incrementing the instruction counter
- G06F9/322—Address formation of the next instruction, e.g. by incrementing the instruction counter for non-sequential address
- G06F9/323—Address formation of the next instruction, e.g. by incrementing the instruction counter for non-sequential address for indirect branch instructions
Definitions
- the present invention relates to an arithmetic processing device, an arithmetic processing method, and an arithmetic processing program for switching the type of processing for a plurality of users or tasks using parameters existing for each user or task, for example.
- Servers and base stations generally process data transmitted from a large number of users at once, and various processing methods for efficiently processing such a plurality of data have been proposed. (For example, refer to Patent Document 1).
- the processing content may be changed according to the parameters of the user data. In this case, switching control between users is necessary, and the instruction execution time increases.
- the present invention has been made to solve such problems, and an arithmetic processing device, an arithmetic processing method, and an arithmetic processing device that suppress the switching overhead between the processes and improve the overall arithmetic performance.
- the main purpose is to provide a processing program.
- a first storage means for storing a plurality of processing contents in association with addresses, and addresses of the plurality of processing contents stored in the first storage means, respectively.
- a second storage means for storing; a holding means for temporarily holding the address; a reading means for sequentially reading the addresses stored in the second storage means and causing the holding means to output; and an output from the holding means
- Execution means for reading out and executing the processing content corresponding to the address that has been read from the first storage means, and the holding means is read by the reading means when the address is not held Temporarily holds and outputs the address, and when the address is held, waits for the execution of the processing content of the execution means to end, and outputs the held address It is an arithmetic processing unit according to claim.
- a first storage unit that stores a plurality of processing contents in association with addresses, and addresses of the plurality of processing contents stored in the first storage unit Second storage means for storing the address respectively, holding means for temporarily holding the address
- An arithmetic processing method for an arithmetic processing apparatus comprising: a step of sequentially reading out the addresses stored in the second storage means and outputting the addresses to the holding means; and a process corresponding to the addresses output from the holding means Reading the contents from the first storage means and executing the contents; and when the holding means does not hold the address, temporarily holds and outputs the read address; and And a step of waiting for completion of execution of the processing contents and outputting the held address when the processing content is held.
- a first storage unit that stores a plurality of processing contents in association with addresses, and addresses of the plurality of processing contents stored in the first storage unit Second storage means for storing the address respectively, holding means for temporarily holding the address
- An arithmetic processing program for an arithmetic processing device comprising: processing for sequentially reading the addresses stored in the second storage means and outputting the addresses to the holding means; and processing corresponding to the addresses output from the holding means Processing to read and execute the contents from the first storage means, and when the holding means does not hold the address, temporarily holds and outputs the read address, and And an arithmetic processing program for an arithmetic processing device, wherein when the processing is held, the computer is caused to execute a process of waiting for completion of execution of the processing content and outputting the stored address.
- Good when the processing is held, the computer is caused to execute a process of waiting for completion of execution of the processing content and outputting the stored address.
- an arithmetic processing device it is possible to provide an arithmetic processing device, an arithmetic processing method thereof, and an arithmetic processing program that suppress the overhead of switching between processes and improve the overall arithmetic performance.
- FIG. 5 It is a figure which shows an example of a structure of the user table which concerns on Embodiment 5 of this invention, a jump table, FF, a COMB part, and an instruction memory. It is a functional block diagram of the arithmetic processing unit which concerns on embodiment of this invention.
- FIG. 11 is a functional block diagram of the arithmetic processing unit according to the present embodiment.
- the arithmetic processing unit 60 stores a first storage unit 61 that stores a plurality of processing contents in association with addresses, and a plurality of processing content addresses stored in the first storage unit 61, respectively.
- And executing means 65 for causing the processing contents corresponding to the address to be read from the first storage means 61 and executed.
- the holding unit 63 temporarily holds and outputs the address read by the reading unit 64 when the address is not held, and waits for the completion of execution of the processing content of the execution unit 65 when the address is held. , Output the stored address. Thereby, the overhead of switching between processes can be suppressed, and the overall calculation performance can be improved.
- FIG. 1 is a block diagram showing a schematic system configuration of an arithmetic processing apparatus according to Embodiment 1 of the present invention.
- the arithmetic processing apparatus 1 includes an instruction memory 10, a decode unit 11, a register file 12, an ALU 13, a program counter 14, a user table 15, a user table control unit 16, and a jump table 17. And a FIFO 18.
- the instruction memory 10 is a specific example of the first storage means, and stores a plurality of instructions (processing contents) executed by the user in association with addresses.
- the instruction memory 10 is connected to the program counter 14 and the decode unit 11 respectively.
- the decode unit 11 is a specific example of execution means, and reads an instruction corresponding to an address output from the FIFO 18 via the program counter 14 to the instruction memory 10 and causes the ALU 13 to execute the instruction.
- the decode unit 11 analyzes the instruction output from the instruction memory 10, and generates and outputs control signals for controlling the register file 12, the ALU 13, the user table control unit 15, and the FIFO 18 connected to the subsequent stage. .
- the decode unit 11 is connected to a register file 12, an ALU 13, a user table control unit 15, a FIFO 18, and a program counter 14, respectively.
- the register file 12 stores data necessary for operations executed by the ALU 13.
- the register file 12 is connected to the decode unit 11 and the ALU 13 respectively.
- the ALU 13 is an arithmetic unit that performs actual arithmetic processing using data from the register file 12 in accordance with a control signal from the decode unit 11.
- the ALU 13 is connected to the decode unit 11, the register file 12, and the program counter 13, respectively.
- the register file 12 and the ALU 13 are specific examples of calculation execution means. Note that the register file 12 and the ALU 13 are not the essence of the present invention, and thus any configuration can be applied without being limited to the above configuration.
- the program counter 14 is a specific example of the program counter means, and generates an address for reading an instruction stored in the instruction memory 10. Further, the program counter 14 increments and outputs the address held internally in the next cycle, outputs the same address in the next cycle, outputs the output value from the ALU 13 as an address, A function of updating and outputting an address generated by itself based on an output from the FIFO 18.
- the program counter 14 is connected to the instruction memory 10, the ALU 13, the FIFO 18, and the decode unit 11, respectively.
- the user table 15 is a specific example of user table means, and stores a processing index designated for each user, that is, index information for designating processing contents to be executed by the user.
- the user table 15 is connected to the user table control unit 16 and the jump table 17 respectively.
- the user table control unit 16 is a specific example of the user table control means, and causes the user's processing index stored in the user table 15 to be output to the jump table 17 in accordance with the parameter of the control signal output from the decoding unit 11.
- the user table control unit 16 is connected to the decoding unit 11 and the user table 15, respectively.
- the jump table 17 is a specific example of the second storage means, and the head address (hereinafter referred to as the head address) for the area of the instruction memory 10 in which the processing contents for each user are stored is stored for each user processing (processing Stored in each index).
- the jump table 17 outputs the head address to the FIFO 18 based on the processing index output from the user table.
- the jump table 17 is connected to the user table 15 and the FIFO 18 respectively.
- the FIFO 18 is a specific example of holding means, and temporarily holds the head address of each user process read from the jump table 17.
- the FIFO 18 is connected to the user table 17, the program counter 14, and the decoding unit 11, respectively.
- the FIFO 18 When the first address from the jump table 17 is not held (Empty state), the FIFO 18 temporarily holds the first address read from the jump table 17 and outputs it in the next cycle. On the other hand, when the FIFO 18 holds the head address from the jump table 17 (when it is not in the Empty state), the FIFO 18 waits for the end of the processing of the subprogram and outputs the held head address.
- the instruction memory 10 includes at least two types of operations necessary for executing the arithmetic processing specified for each user as shown in the table on the right side of FIG. Assume that a subprogram (processing contents) and a special instruction USER_JUMP (instruction information) shown on the left side are stored in association with the subprogram. Further, it is assumed that a NEXT_USER instruction indicating the end of the subprogram always exists at the end of the subprogram.
- the special instruction USER_JUMP includes, for example, an operation code (USER_JUMP) 21 as shown in FIG. 2, GO TO22 indicating information on a location (address) where an instruction to be read after processing for the desired number of users is completed, NUM_USER23 indicating the number of users to be processed.
- an operation code (USER_JUMP) 21 as shown in FIG. 2
- GO TO22 indicating information on a location (address) where an instruction to be read after processing for the desired number of users is completed
- NUM_USER23 indicating the number of users to be processed.
- the jump table 17 stores a plurality of subprogram start addresses, as shown in the substantially central table of FIG. Finally, as shown in the table on the left side of FIG. 3, the user table 15 stores processing indexes that specify which subprograms each user executes as many as the number of users.
- the instruction memory 10 reads the stored special instruction USER_JUMP to the decode unit 11.
- the decode unit 11 interprets the special instruction USER_JUMP read from the instruction memory 10, generates a control signal based on the interpreted special instruction USER_JUMP, and outputs the generated control signal to the user table control unit 16.
- the control signal includes, for example, a flag for starting user data processing and a value indicating the number of users.
- the user table control unit 16 When the user table control unit 16 receives the control signal from the decoding unit 11, for example, the user table control unit 16 instructs the user table 15 to read the processing index for the first user from the top.
- the processing index read from the user table 15 is output to the jump table 17 as an address for the jump table 17. Then, the jump table 17 outputs the first address of the stored subprogram to be executed as the first user process to the FIFO 18.
- the FIFO 18 outputs the head address output from the jump table 17 to the program counter 14 in the next cycle.
- the program counter 14 updates the address to the instruction memory 10 to the head address received from the FIFO 18 and outputs it to the instruction memory 10.
- the instruction memory 10 receives the head address from the program counter 14, reads the instruction of the subprogram specified by the head address (for example, the head instruction of the 32-point FFT operation in FIG. 3), and outputs it to the decode unit 11. To do.
- the decode unit 11 interprets a subprogram instruction from the instruction memory 10, generates a control signal for controlling the register file 12 and the ALU 13, and outputs the control signal to the register file 12 and the ALU 13.
- the register file 12 and the ALU 13 execute a subprogram in response to a control signal from the decode unit 11.
- the FIFO 18 temporarily holds the head address from the jump table 17.
- the decode unit 11 detects the NEXT_USER instruction indicating the end of the subprogram during the processing of the subprogram, the decode unit 11 programs the start address of the subprogram corresponding to the next user temporarily held in the FIFO 18. Control to output to the counter 14.
- the instruction memory 10 decodes the head instruction of the subprogram corresponding to the next user to the decoding unit 11. Read to.
- the decode unit 11 interprets the instruction read by the instruction memory 10, generates a control signal for controlling the register file 12 and the ALU 13, and outputs the control signal to the register file 12 and the ALU 13.
- the register file 12 and the ALU 13 execute the next user subprogram (for example, the first instruction of the 64-point FFT operation in FIG. 3). The above-described processing is repeated for the specified number of users or tasks.
- the arithmetic processing apparatus 1 it is possible to generate the start address of the subprogram of each process using the user table 15 and the jump table 17 continuously for the number of users.
- the FIFO 18 temporarily stores the generated start address, and outputs the start address of the next instruction to be executed to the instruction memory 10 via the program counter 14 at the timing when the NEXT_USER instruction at which the execution of the subprogram ends is detected. To do. Thereby, the overhead of switching between users can be suppressed, and the overall calculation performance can be improved.
- FIG. 5 The arithmetic processing device 2 according to the second embodiment of the present invention will be described in detail with reference to FIGS.
- the argument of the special instruction USER_JUMP includes the GO TO value indicating the destination of the program counter value when the processing for all users is completed, and the NUM_USER indicating the number of users.
- FIG. 2 There are two types of values (FIG. 2).
- the argument of the special instruction USER_JUMP is processed in a larger range in addition to the GO TO value and the NUM_USER value. It differs from the arithmetic processing apparatus 1 according to the first embodiment in that it has an index (INDEX) value that can be distinguished.
- INDEX index
- the difference in the configuration of the arithmetic processing apparatus according to the second embodiment will be described in detail.
- the user table 15 is directly connected to the jump table 17, whereas the arithmetic processing apparatus 2 according to the second embodiment further stores an index value.
- a COMB unit 4A that generates a new index value and outputs it to the jump table 47.
- the FF 49 is a specific example of the third storage unit, and stores an index value together with a control signal for controlling the user table control unit 46 output from the decoding unit 41.
- the COMB unit 4A is a specific example of the index generation unit, and generates a new index by combining the index value stored in the FF 49 and the processing index for each user read from the user table 45, Output to the jump table 47.
- the special command USER_JUMP includes an operation code (USER_JUMP) 51, GO TO 52 indicating a location (address) where a command to be read after processing for the desired number of users is stored, and a user to be processed NUM_USER 53 indicating the number, and index (INDEX) information 54 that can specify processing in a unit in which all of the number of users are collected.
- USER_JUMP operation code
- GO TO 52 indicating a location (address) where a command to be read after processing for the desired number of users is stored
- NUM_USER 53 indicating the number
- index (INDEX) information 54 that can specify processing in a unit in which all of the number of users are collected.
- the jump table 47 stores a plurality of subprogram head addresses, as shown in the substantially central table of FIG. As shown in the table on the left side of FIG. 6, the user table 45 stores processing indexes for specifying the number of users for which subprograms each user executes.
- the instruction memory 40 reads the stored special instruction USER_JUMP to the decode unit 41.
- the decode unit 41 interprets the instruction read from the instruction memory 40, and outputs a control signal for controlling the user table control unit 46 and an index value to the user table control unit 46 and the FF 49, respectively.
- the control signal includes a flag for starting user processing and a value indicating the number of users.
- the user table control unit 46 when receiving the control signal from the decoding unit 41, causes the user table 45 to read the processing index for the first user from the top.
- the COMB 4A generates a new index value by combining the processing index read from the user table 45 and the index value stored in the FF 49, and outputs the new index value to the jump table 47. Then, the jump table 47 outputs the first address of the subprogram to be executed as the stored first user process to the FIFO 48.
- the head address output from the jump table 47 to the FIFO 48 is output to the program counter 44 in the next cycle.
- the program counter 44 updates the address for the instruction memory 40 to the head address from the FIFO 48 and outputs it to the instruction memory 40.
- the instruction memory 40 receives an address from the program counter 44, the instruction memory 40 reads the instruction of the subprogram specified by the address.
- the decode unit 41 interprets the instruction, generates a control signal for controlling the register file 42 and the ALU 43, and outputs the control signal to the register file 42 and the ALU 43. To do.
- the register file 42 and the ALU 43 receive the control signal from the decode unit 41, the register file 42 and the ALU 43 execute the subprogram.
- the start address output from the jump table 47 to the FIFO 48 is temporarily stored in the FIFO 48. Retained.
- the decode unit 41 detects a NEXT_USER instruction indicating the end of the subprogram during the execution of the subprogram, the decode unit 41 sets the start address of the subprogram corresponding to the next user temporarily held by the FIFO 48 to the program counter 44. Control to output to.
- the first address of the subprogram corresponding to the next user is output from the FIFO 48 to the program counter 44, and the first instruction of the subprogram corresponding to the next user is read from the instruction memory 40.
- the decode unit 41 interprets the instruction read from the instruction memory 40, generates a control signal for controlling the register file 42 and the ALU 43, and outputs the control signal to the register file 42 and the ALU 43.
- the register file 42 and the ALU 43 execute the next user subprogram. The above process is repeated for the number of specified users.
- the register file 42 and the ALU 43 are not the essence of the present invention. Therefore, the present invention is not limited to the above configuration, and any configuration can be applied.
- the start address of each user's subprogram can be generated continuously by the number of users using the user table 45 and the jump table 47.
- the FIFO 48 temporarily stores the generated start address, and outputs the start address of the next instruction to be executed to the instruction memory 40 via the program counter 44 at the timing when the NEXT_USER instruction at which the execution of the subprogram ends is detected. To do. Thereby, the overhead of switching between processes can be suppressed, and the overall calculation performance can be improved.
- Embodiment 3 The arithmetic processing apparatus according to the third embodiment of the present invention will be described in detail with reference to FIGS.
- the user table 15 is directly connected to the jump table 17.
- the arithmetic processing unit 3 according to the third embodiment as shown in FIG. 7, the arithmetic processing according to the first embodiment is further provided with a tALU 79 between the user table 75 and the jump table 77. Different from the device 1.
- the tALU 79 is a specific example of an arithmetic unit, and is an arithmetic unit that can perform arithmetic processing such as constant division, constant multiplication, and constant addition.
- the argument of the special instruction USER_JUMP indicates the GO ⁇ ⁇ TO value indicating the destination of the program counter value after the processing for all users is completed and the number of users in the arithmetic processing unit 1 according to the first embodiment.
- NUM_USER values there are two types of NUM_USER values, in the arithmetic processing unit 3 according to the third embodiment, as shown in FIG. 8, in addition to the GO_TO value and the NUM_USER value, tALU_mode that specifies the operation mode of tALU79. (One specific example of calculation mode information) 84 is added.
- the special instruction USER_JUMP includes an operation code (USER_JUMP) 81, a GO TO 82 indicating a location (address) where an instruction to be read after processing for the desired number of users is stored, NUM_USER 83 indicating the number of users to perform, and tALU_mode 84 designating the calculation mode of tALU 79.
- an operation code (USER_JUMP) 81
- a GO TO 82 indicating a location (address) where an instruction to be read after processing for the desired number of users is stored
- NUM_USER 83 indicating the number of users to perform
- tALU_mode 84 designating the calculation mode of tALU 79.
- the jump table 77 stores a plurality of subprogram head addresses, as in the first embodiment.
- the user table 75 stores as many processing indexes as the number of users that specify which subprograms each user executes.
- the arithmetic processing unit 3 operates as follows. First, the instruction memory 70 reads the stored special instruction USER_JUMP to the decode unit 71.
- the decode unit 71 interprets the instruction read from the instruction memory 70, and controls the user table control with a control signal for controlling the user table control unit 76 and a tALU_mode value specifying the tALU operation mode. Output to the unit 76 and the tALU 79, respectively.
- the control signal includes a flag for starting user processing and a value indicating the number of users.
- the user table control unit 76 When the user table control unit 76 receives the control signal from the decode unit 71, the user table control unit 76 causes the user table 75 to read the processing index for the first user from the top and output it to the tALU 79.
- TALU 79 performs the arithmetic processing specified by the previously input tALU_mode on the processing index.
- the tALU 79 outputs the calculation processing result to the jump table 77.
- the jump table 77 outputs the head address of the subprogram to be performed as the first stored user process to the FIFO 78.
- the FIFO 78 when the content of the FIFO 78 is empty (Empty state), the FIFO 78 outputs the head address output from the jump table 77 to the program counter 74 in the next cycle.
- the program counter 74 updates the address for the instruction memory 70 to the head address received from the FIFO 78, and outputs the updated head address to the instruction memory 70.
- the instruction memory 70 receives the head address from the program counter 74, the instruction memory 70 reads the instruction in the subprogram specified by the head address to the decode unit 71.
- the decode unit 71 interprets the instruction of the subprogram read from the instruction memory 70, generates a control signal for controlling the register file 72 and the ALU 73, and outputs the control signal to the register file 72 and the ALU 73.
- the register file 72 and the ALU 73 execute a subprogram in response to a control signal from the decode unit 71.
- the FIFO 78 temporarily holds the head address output from the jump table 77. To do.
- the decode unit 71 When the decode unit 71 detects the NEXT_USER instruction indicating the end of the subprogram during the execution of the subprogram processing, the decode unit 71 outputs the start address of the subprogram corresponding to the next user temporarily held in the FIFO 78 to the program counter 74. Control to do.
- the FIFO 78 outputs the top address of the subprogram corresponding to the next user to the program counter 74.
- the program counter 74 outputs the head address from the FIFO 78 to the instruction memory 70.
- the instruction memory 70 reads the first instruction of the subprogram corresponding to the next user to the decode unit 71.
- the decode unit 71 interprets the instruction read by the instruction memory 70, generates a control signal for controlling the register file 72 and the ALU 73, and outputs the control signal to the register file 72 and the ALU 73.
- the register file 72 and the ALU 73 execute the next user subprogram in response to a control signal from the decode unit 71. The above process is repeated for the number of specified users.
- the register file 72 and the ALU 73 are not the essence of the present invention. Therefore, the present invention is not limited to the above configuration, and any configuration can be applied.
- the start address of each user's subprogram can be generated continuously by the number of users using the user table 75 and the jump table 77.
- the FIFO 78 temporarily stores the generated start address, and outputs the start address of the next instruction to be executed to the instruction memory 70 via the program counter 74 at the timing when the NEXT_USER instruction at which the execution of the subprogram ends is detected. To do. Thereby, the overhead of switching between processes can be suppressed, and the overall calculation performance can be improved.
- a tALU 79 capable of performing arithmetic processing on the index value is provided.
- the index value can be obtained from the number of data given to the user without converting the parameter for each user stored in the user table 75 into the index value in advance. It is done.
- Embodiment 4 The arithmetic processing device 1 according to the fourth embodiment of the present invention will be described in detail with reference to FIG.
- the NEXT_USER instruction indicating the end of the program is inserted at the end of the subprogram in the instruction memory 10, and the switching control of the subprogram for each user is performed based on the instruction. Is going.
- the NEXT_USER indicating the end of the program is not inserted at the end of the subprogram, and the start of each subprogram is entered in the jump table 17. It differs from the arithmetic processing apparatus 1 according to the first embodiment in that the address and the end address are stored.
- the hardware configuration of the arithmetic processing apparatus 1 according to the fourth embodiment and the arithmetic processing apparatus 1 according to the first embodiment is substantially the same, and therefore, the same parts are denoted by the same reference numerals and are described in detail. Is omitted.
- the decoder unit 11 interprets the NEXT_USER instruction and controls the FIFO 18 and the program counter 14.
- the end address read from the jump table 17 is compared with the value of the program counter 14 (hereinafter referred to as the program counter value). If the program counter value is less than the end address, the program counter value is updated as it is. If the end address matches the program counter value, the next program counter value is updated with the output address from the FIFO 18. Note that, in the arithmetic processing apparatus 1 according to the fourth embodiment, other operations are substantially the same as those of the arithmetic processing apparatus 1 according to the first embodiment, and thus the description thereof is omitted.
- the arithmetic processing device 1 according to the fourth embodiment similarly to the arithmetic processing device 1 according to the first embodiment, the overhead of switching between the processes is suppressed, and the overall arithmetic performance is improved. be able to.
- FIG. 10 The arithmetic processing apparatus according to the fifth embodiment of the present invention will be described in detail with reference to FIG.
- a NEXT_USER instruction indicating the end of the program is inserted at the end of the subprogram in the instruction memory 40, and subprogram switching control for each user is performed based on the instruction. It is carried out.
- the NEXT_USER indicating the end of the program is not inserted at the end of the subprogram, and the start of each subprogram is inserted in the jump table. It differs from the arithmetic processing unit 2 according to the second embodiment in that the address and the end address are stored.
- the hardware configuration of the arithmetic processing device 2 according to the fifth embodiment and the arithmetic processing device 2 according to the second embodiment is substantially the same, and therefore, the same parts are denoted by the same reference numerals and detailed description is given. Is omitted.
- the decoder unit 41 interprets the NEXT_USER instruction and controls the FIFO 48 and the program counter 44.
- the end address read from the jump table 47 is compared with the program counter value. If the program counter value is less than the end address, the program counter value is updated as it is. If the end address matches the program counter value, the next program counter value is updated with the output address from the FIFO 48. Note that, in the arithmetic processing device 2 according to the fifth embodiment, other operations are substantially the same as those of the arithmetic processing device 2 according to the second embodiment, and thus the description thereof is omitted.
- the arithmetic processing device 2 according to the fifth embodiment similarly to the arithmetic processing device 2 according to the second embodiment, the overhead of switching between the processes is suppressed, and the overall arithmetic performance is improved. be able to.
- the present invention has been described as a hardware configuration, but the present invention is not limited to this.
- the present invention can be realized, for example, by causing the CPU to execute a computer program for the processing executed by the decoding units 11, 41, 71 and the user table control units 16, 46, 76.
- Non-transitory computer readable media include various types of tangible storage media (tangible storage medium). Examples of non-transitory computer readable media are magnetic recording media (eg flexible disks, magnetic tapes, hard disk drives), magneto-optical recording media (eg magneto-optical disks), CD-ROM, CD-R, CD-R / W Semiconductor memory (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable ROM), flash ROM, RAM).
- the program may also be supplied to the computer by various types of temporary computer-readable media. Examples of transitory computer readable media include electrical signals, optical signals, and electromagnetic waves.
- the temporary computer-readable medium can supply the program to the computer via a wired communication path such as an electric wire and an optical fiber, or a wireless communication path.
- First storage means for storing a plurality of processing contents in association with addresses
- Second storage means for storing addresses of the plurality of processing contents stored in the first storage means
- Holding means for temporarily holding the address
- Reading means for sequentially reading the addresses stored in the second storage means and outputting the addresses to the holding means
- Execution means for causing the processing content corresponding to the address output from the holding means to be read from the first storage means and executed;
- the holding means temporarily holds and outputs the address read by the reading means when the address is not held, and when the address is held, the holding means Wait for the end of execution and output the stored address.
- the first storage means includes information on an address in which a command to be read after processing for a desired number of users is completed and information indicating the number of users to be processed are associated with the processing contents. Instruction information is stored, An arithmetic processing apparatus characterized by that.
- (Appendix 3) An arithmetic processing device according to (Appendix 1) or (Appendix 2),
- the reading means includes user table means for storing index information for designating the processing content to be executed by the user, and user table control means for outputting the index information stored in the user table means to the second storage means.
- the second storage means outputs an address corresponding to the index information output from the user table means to the holding means; An arithmetic processing apparatus characterized by that.
- the first storage means stores address information storing instructions to be read after processing for the desired number of users, information indicating the number of users to be processed, and the number of users Instruction information is stored, including index information that can be processed in units that are all collected.
- a third storage means for storing a control signal and the index information for controlling the user table control means output from the execution means; Index generating means for generating new index information based on the index information stored in the user table section and the index information stored in the third storage means; The new index information generated by the index generation means is output to the second storage means.
- (Appendix 5) (Appendix 3) A calculation means for performing calculation processing on the index information stored in the user table means; The arithmetic processing apparatus, wherein the arithmetic means outputs the index information subjected to arithmetic processing to the second storage means.
- Appendix 6) (Appendix 5)
- the first storage means stores address information in which an instruction to be read after processing for a desired number of users is completed, information indicating the number of users to be processed, and the computing means Instruction information including at least operation mode information for determining the operation content of An arithmetic processing apparatus characterized by that.
- (Appendix 7) The arithmetic processing device according to any one of (Appendix 1) to (Appendix 7), Program counter means connected to the holding means; Calculation execution means for executing calculation processing in response to an instruction from the execution means; Further comprising The program counter means increments the address held internally in the next cycle and outputs it to the first memory means, the function to output the same address to the first memory means in the next cycle, and the calculation A function of outputting an output value from the execution unit as an address, and a function of updating and outputting an address generated based on the output value from the holding unit, An arithmetic processing apparatus characterized by that.
- First storage means for storing a plurality of processing contents in association with addresses
- Second storage means for storing addresses of the plurality of processing contents stored in the first storage means
- Holding means for temporarily holding the address
- An arithmetic processing method of an arithmetic processing device comprising: Sequentially reading the addresses stored in the second storage means and outputting the addresses to the holding means; Reading out the processing content corresponding to the address output from the holding unit from the first storage unit and executing the processing content; When the holding means does not hold the address, the read address is temporarily held and output, and when the address is held, the execution waits for the execution of the processing contents to be held.
- An arithmetic processing method for an arithmetic processing device comprising: (Appendix 9) (Appendix 8) An arithmetic processing method of the arithmetic processing device according to claim
- the first storage means includes information on an address in which a command to be read after processing for a desired number of users is completed and information indicating the number of users to be processed are associated with the processing contents. Instruction information is stored, An arithmetic processing method of an arithmetic processing device characterized by the above.
- the arithmetic processing method of the arithmetic processing device further comprising: (Appendix 11) (Additional remark 10) It is the arithmetic processing method of the arithmetic processing unit of description, Comprising: Corresponding to the processing contents, the first storage means stores address information storing instructions to be read after processing for the desired number of users, information indicating the number of users to be processed, and the number of users Instruction information is stored, including index information that can be processed in units that are all collected.
- the arithmetic processing unit further includes third storage means for storing the control signal and the index information output from the execution means, Generating new index information based on the stored index information and the index information stored in the third storage means; And a step of outputting the generated new index information to the second storage means.
- third storage means for storing the control signal and the index information output from the execution means, Generating new index information based on the stored index information and the index information stored in the third storage means; And a step of outputting the generated new index information to the second storage means.
- the first storage means stores address information in which an instruction to be read after processing for a desired number of users is completed, information indicating the number of users to be processed, and the computing means Instruction information including at least operation mode information for determining the operation content of An arithmetic processing method of an arithmetic processing device characterized by the above.
- the arithmetic processing apparatus further includes a program counter means connected to the holding means, and an arithmetic execution means for executing arithmetic processing in response to an instruction from the execution means,
- the program counter means increments the address held internally in the next cycle and outputs it to the first memory means, the function to output the same address to the first memory means in the next cycle, and the calculation A function of outputting an output value from the execution unit as an address, and a function of updating and outputting an address generated based on the output value from the holding unit,
- An arithmetic processing method of an arithmetic processing device characterized by the above.
- First storage means for storing a plurality of processing contents in association with addresses
- Second storage means for storing addresses of the plurality of processing contents stored in the first storage means
- Holding means for temporarily holding the address
- An arithmetic processing program of an arithmetic processing device comprising: A process of sequentially reading the addresses stored in the second storage means and outputting the addresses to the holding means; Processing to read out and execute the processing content corresponding to the address output from the holding unit from the first storage unit; When the holding means does not hold the address, the read address is temporarily held and output, and when the address is held, the execution waits for the execution of the processing contents to be held. Processing to output the address being An arithmetic processing program for an arithmetic processing device, characterized in that a computer is executed.
- the present invention can be used, for example, for an arithmetic processing device that switches a type of processing for a plurality of users or tasks using parameters that exist for each user or task.
- Arithmetic processing unit 10 40, 70 Instruction memory 11, 41, 71 Decoding unit 12, 42, 72 Register file 13, 43, 73 ALU 14, 44, 74 Program counter 15, 45, 75 User table 16, 46, 76 User table control unit 17, 47, 77 Jump table 18, 48, 78 FIFO 49 FF 4A COMB part 79 tALU
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Executing Machine-Instructions (AREA)
- Advance Control (AREA)
Abstract
Description
他方、上記目的を達成するための本発明の一態様は、複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理方法であって、前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させるステップと、前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させるステップと、前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力するステップと、を含む、ことを特徴とする演算処理装置の演算処理方法であってもよい。
さらに、上記目的を達成するための本発明の一態様は、複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理プログラムであって、前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させる処理と、前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させる処理と、前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力する処理と、をコンピュータに実行させる、ことを特徴とする演算処理装置の演算処理プログラムであってもよい。
図1は、本発明の実施の形態1に係る演算処理装置の概略的なシステム構成を示すブロック図である。本実施の形態に係る演算処理装置1は、命令メモリ10と、デコードユニット11と、レジスタファイル12と、ALU13と、プログラムカウンタ14と、ユーザテーブル15と、ユーザテーブル制御部16と、ジャンプテーブル17と、FIFO18と、を備えている。
まず、本動作の前提条件として、例えば、命令メモリ10には、図3の右側に位置するテーブルに示すような、ユーザ毎に指定された演算処理を実行するために必要な少なくとも2種類以上のサブプログラム(処理内容)と、そのサブプログラムに対応させて、その左側に示す特殊命令USER_JUMP(命令情報)と、が格納されているものとする。また、サブプログラムの最後には必ずサブプログラムの終端であることを示すNEXT_USER命令が存在しているものとする。
まず、命令メモリ10は、格納した特殊命令USER_JUMPをデコードユニット11に読み出す。
本発明の実施の形態2に係る演算処理装置2について、図4乃至6を参照して詳細に説明する。上記実施の形態1に係る演算処理装置1においては、特殊命令USER_JUMPの引数は、全ユーザ分の処理が終了したときのプログラムカウンタ値の移動先を示すGO TO値と、ユーザ数を示したNUM_USER値と、の2種類となっている(図2)。これに対し、本実施の形態2に係る演算処理装置2においては、図5に示すように、特殊命令USER_JUMPの引数が、GO TO値とNUM_USER値とに加えて、さらに大きな範囲の処理を大別できるインデックス(INDEX)値を有する点で、上記実施の形態1に係る演算処理装置1と相違する。
まず、命令メモリ40は、格納した特殊命令USER_JUMPをデコードユニット41に読み出す。デコードユニット41は、命令メモリ40から読み出された命令を解釈し、ユーザテーブル制御部46を制御するための制御信号と、インデックス値とを、ユーザテーブル制御部46及びFF49に夫々出力する。なお、上記制御信号には、ユーザ処理を開始するフラグと、ユーザ数を示す値が含まれる。
本発明の実施の形態3に係る演算処理装置について、図7及び図8を参照して詳細に説明する。上記実施の形態1に係る演算処理装置1においては、ユーザテーブル15が直接ジャンプテーブル17に接続している。これに対し、本実施の形態3に係る演算処理装置3においては、図7に示すように、ユーザテーブル75とジャンプテーブル77の間にtALU79を更に備える点で上記実施の形態1に係る演算処理装置1と相違する。tALU79は、演算手段の一具体例であり、例えば、定数割算、定数乗算、定数加算などの演算処理が可能な演算器である。
まず、本動作の前提条件として、命令メモリ70には、上記実施の形態1と同様に、ユーザ毎に指定された演算処理を実行するために必要な少なくとも2種類以上のサブプログラムと、特殊命令USER_JUMPと、が格納されているものとする。また、サブプログラムの最後には必ずサブプログラムの終端であることを示すNEXT_USER命令が存在しているものとする。さらに、特殊命令USER_JUMPは、図8に示すように、オペコード(USER_JUMP)81と、所望のユーザ数分の処理が終了した後に読み出す命令が格納されている場所(アドレス)を示すGO TO82と、処理するユーザ数を示すNUM_USER83と、tALU79の演算モードを指定するtALU_mode84と、で構成されている。
本発明の実施の形態4に係る演算処理装置1について、図9を参照して詳細に説明する。
本発明の実施の形態5に係る演算処理装置について、図10を参照して詳細に説明する。
上記実施の形態2に係る演算処理装置2においては、命令メモリ40内のサブプログラムの最後にプログラムの終端を示すNEXT_USER命令が挿入されており、その命令に基づいてユーザ毎のサブプログラムの切り替え制御を行っている。これに対し、本実施の形態5に係る演算処理装置2においては、図10に示すように、サブプログラムの最後にプログラムの終端を示すNEXT_USERを挿入せずに、ジャンプテーブルに各サブプログラムの先頭アドレスと終端アドレスを格納する点で、上記実施の形態2に係る演算処理装置2と相違する。
なお、本実施の形態5に係る演算処理装置2において、その他の動作については、上記実施の形態2に係る演算処理装置2と略同一であるため、説明を省略する。
複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させる読出手段と、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させる実行手段と、
を備え、
前記保持手段は、前記アドレスを保持していないとき、前記読出手段により読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記実行手段の前記処理内容の実行終了を待って、保持している前記アドレスを出力する、
ことを特徴とする演算処理装置。
(付記2)
(付記1)記載の演算処理装置であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、を含む命令情報が記憶されている、
ことを特徴とする演算処理装置。
(付記3)
(付記1)又は(付記2)記載の演算処理装置であって、
前記読出手段は、ユーザが実行する前記処理内容を指定するためのインデックス情報を記憶するユーザテーブル手段と、前記ユーザテーブル手段に記憶されたインデックス情報を前記第2記憶手段に出力させるユーザテーブル制御手段と、を有しており、
前記第2記憶手段は、前記ユーザテーブル手段から出力されたインデック情報に対応するアドレスを前記保持手段に出力する、
ことを特徴とする演算処理装置。
(付記4)
(付記3)記載の演算処理装置であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、ユーザ数分全てまとめた単位で処理指定が可能なインデックス情報と、を含む命令情報が記憶されており、
前記実行手段から出力される前記ユーザテーブル制御手段を制御するための制御信号及び前記インデックス情報を記憶する第3記憶手段と、
前記ユーザテーブル部に記憶されたインデックス情報と、前記第3記憶手段に記憶されたインデックス情報と、に基づいて、新たなインデックス情報を生成するインデックス生成手段と、を更に備え、
前記インデックス生成手段により生成された新たなインデックス情報は前記第2記憶手段に出力される、ことを特徴とする演算処理装置。
(付記5)
(付記3)記載の演算処理装置であって、
前記ユーザテーブル手段に記憶されたインデックス情報に対して演算処理を行う演算手段を更に備え、
前記演算手段は演算処理したインデックス情報を前記第2記憶手段に出力する、ことを特徴とする演算処理装置。
(付記6)
(付記5)記載の演算処理装置であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレス情報と、処理するユーザ数を示す情報と、前記演算手段の演算内容を決定する演算モード情報と、を少なくとも含む命令情報が記憶されている、
ことを特徴とする演算処理装置。
(付記7)
(付記1)乃至(付記7)のうちいずれか記載の演算処理装置であって、
前記保持手段に接続されたプログラムカウンタ手段と、
前記実行手段からの指示に応じて演算処理を実行する演算実行手段と、
を更に備え、
前記プログラムカウンタ手段は、次のサイクルで内部に保持したアドレスをインクリメントして前記第1記憶手段に出力する機能と、次のサイクルでも同じアドレスを前記第1記憶手段に出力する機能と、前記演算実行手段からの出力値をアドレスとして出力する機能と、前記保持手段からの出力値に基づいて生成するアドレスを更新し出力する機能と、を有している、
ことを特徴とする演算処理装置。
(付記8)
複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理方法であって、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させるステップと、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させるステップと、
前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力するステップと、
を含む、ことを特徴とする演算処理装置の演算処理方法。
(付記9)
(付記8)記載の演算処理装置の演算処理方法であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、を含む命令情報が記憶されている、
ことを特徴とする演算処理装置の演算処理方法。
(付記10)
(付記8)又は(付記9)記載の演算処理装置の演算処理方法であって、
ユーザが実行する前記処理内容を指定するためのインデックス情報を記憶するステップと、
前記記憶されたインデックス情報を前記第2記憶手段に出力させるステップと、
前記出力されたインデック情報に対応するアドレスを前記保持手段に出力するステップと、を更に含む
ことを特徴とする演算処理装置の演算処理方法。
(付記11)
(付記10)記載の演算処理装置の演算処理方法であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、ユーザ数分全てまとめた単位で処理指定が可能なインデックス情報と、を含む命令情報が記憶されており、
前記演算処理装置は、前記実行手段から出力される制御信号及び前記インデックス情報を記憶する第3記憶手段を更に備えており、
前記記憶されたインデックス情報と、前記第3記憶手段に記憶されたインデックス情報と、に基づいて、新たなインデックス情報を生成するステップと、
前記生成された新たなインデックス情報は前記第2記憶手段に出力されるステップと、を更に含む、ことを特徴とする演算処理装置の演算処理方法。
(付記12)
(付記10)記載の演算処理装置の演算処理方法であって、
前記記憶されたインデックス情報に対して演算処理を行うステップと、
前記演算処理したインデックス情報を前記第2記憶手段に出力するステップと、を更に含む、ことを特徴とする演算処理装置の演算処理方法。
(付記13)
(付記12)記載の演算処理装置の演算処理方法であって、
前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレス情報と、処理するユーザ数を示す情報と、前記演算手段の演算内容を決定する演算モード情報と、を少なくとも含む命令情報が記憶されている、
ことを特徴とする演算処理装置の演算処理方法。
(付記14)
(付記9)乃至(付記13)のうちいずれか記載の演算処理装置の演算処理方法であって、
前記演算処理装置は、前記保持手段に接続されたプログラムカウンタ手段と、前記実行手段からの指示に応じて演算処理を実行する演算実行手段と、を更に備え、
前記プログラムカウンタ手段は、次のサイクルで内部に保持したアドレスをインクリメントして前記第1記憶手段に出力する機能と、次のサイクルでも同じアドレスを前記第1記憶手段に出力する機能と、前記演算実行手段からの出力値をアドレスとして出力する機能と、前記保持手段からの出力値に基づいて生成するアドレスを更新し出力する機能と、を有している、
ことを特徴とする演算処理装置の演算処理方法。
(付記15)
複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理プログラムであって、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させる処理と、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させる処理と、
前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力する処理と、
をコンピュータに実行させる、ことを特徴とする演算処理装置の演算処理プログラム。
10、40、70 命令メモリ
11、41、71 デコードユニット
12、42、72 レジスタファイル
13、43、73 ALU
14、44、74 プログラムカウンタ
15、45、75 ユーザテーブル
16、46、76 ユーザテーブル制御部
17、47、77 ジャンプテーブル
18、48、78 FIFO
49 FF
4A COMB部
79 tALU
Claims (9)
- 複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させる読出手段と、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させる実行手段と、
を備え、
前記保持手段は、前記アドレスを保持していないとき、前記読出手段により読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記実行手段の前記処理内容の実行終了を待って、保持している前記アドレスを出力する、
ことを特徴とする演算処理装置。 - 前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、を含む命令情報が記憶されている、
ことを特徴とする請求項1記載の演算処理装置。 - 前記読出手段は、ユーザが実行する前記処理内容を指定するためのインデックス情報を記憶するユーザテーブル手段と、前記ユーザテーブル手段に記憶されたインデックス情報を前記第2記憶手段に出力させるユーザテーブル制御手段と、を有しており、
前記第2記憶手段は、前記ユーザテーブル手段から出力されたインデック情報に対応するアドレスを前記保持手段に出力する、
ことを特徴とする請求項1又は2記載の演算処理装置。 - 前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレスの情報と、処理するユーザ数を示す情報と、ユーザ数分全てまとめた単位で処理指定が可能なインデックス情報と、を含む命令情報が記憶されており、
前記実行手段から出力される前記ユーザテーブル制御手段を制御するための制御信号及び前記インデックス情報を記憶する第3記憶手段と、
前記ユーザテーブル部に記憶されたインデックス情報と、前記第3記憶手段に記憶されたインデックス情報と、に基づいて、新たなインデックス情報を生成するインデックス生成手段と、を更に備え、
前記インデックス生成手段により生成された新たなインデックス情報は前記第2記憶手段に出力される、ことを特徴とする請求項3記載の演算処理装置。 - 前記ユーザテーブル手段に記憶されたインデックス情報に対して演算処理を行う演算手段を更に備え、
前記演算手段は演算処理したインデックス情報を前記第2記憶手段に出力する、ことを特徴とする請求項3記載の演算処理装置。 - 前記第1記憶手段には、前記処理内容と対応させて、所望のユーザ数分の処理が終了した後に読み出す命令が格納されているアドレス情報と、処理するユーザ数を示す情報と、前記演算手段の演算内容を決定する演算モード情報と、を少なくとも含む命令情報が記憶されている、
ことを特徴とする請求項5記載の演算処理装置。 - 前記保持手段に接続されたプログラムカウンタ手段と、
前記実行手段からの指示に応じて演算処理を実行する演算実行手段と、
を更に備え、
前記プログラムカウンタ手段は、次のサイクルで内部に保持したアドレスをインクリメントして前記第1記憶手段に出力する機能と、次のサイクルでも同じアドレスを前記第1記憶手段に出力する機能と、前記演算実行手段からの出力値をアドレスとして出力する機能と、前記保持手段からの出力値に基づいて生成するアドレスを更新し出力する機能と、を有している、
ことを特徴とする請求項1乃至6のうちいずれか1項記載の演算処理装置。 - 複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理方法であって、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させるステップと、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させるステップと、
前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力するステップと、
を含む、ことを特徴とする演算処理装置の演算処理方法。 - 複数の処理内容をアドレスと対応させて記憶する第1記憶手段と、
前記第1記憶手段に記憶された前記複数の処理内容のアドレスを夫々記憶する第2記憶手段と、
前記アドレスを一時的に保持する保持手段と、
を備える演算処理装置の演算処理プログラムが格納された記憶媒体であって、
前記第2記憶手段に記憶された前記アドレスを順次読出し、前記保持手段に出力させる処理と、
前記保持手段から出力された前記アドレスに対応する処理内容を前記第1記憶手段から読み出させ、実行させる処理と、
前記保持手段が、前記アドレスを保持していないとき、前記読み出されたアドレスを一時的に保持して出力し、前記アドレスを保持しているとき、前記処理内容の実行終了を待って、保持している前記アドレスを出力する処理と、
をコンピュータに実行させる、ことを特徴とする演算処理装置の演算処理プログラムが格納された記憶媒体。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/386,248 US20150046563A1 (en) | 2012-03-30 | 2013-02-13 | Arithmetic processing device, its arithmetic processing method, and storage medium storing arithmetic processing program |
JP2014507360A JP6064993B2 (ja) | 2012-03-30 | 2013-02-13 | 演算処理装置、その演算処理方法、及び演算処理プログラム |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012-079359 | 2012-03-30 | ||
JP2012079359 | 2012-03-30 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013145529A1 true WO2013145529A1 (ja) | 2013-10-03 |
Family
ID=49258856
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/000762 WO2013145529A1 (ja) | 2012-03-30 | 2013-02-13 | 演算処理装置、その演算処理方法、及び演算処理プログラムが格納された記憶媒体 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20150046563A1 (ja) |
JP (1) | JP6064993B2 (ja) |
WO (1) | WO2013145529A1 (ja) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10853074B2 (en) * | 2014-05-01 | 2020-12-01 | Netronome Systems, Inc. | Table fetch processor instruction using table number to base address translation |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS57123454A (en) * | 1981-01-26 | 1982-07-31 | Fujitsu Ltd | Microprogram computer |
JPS6298427A (ja) * | 1985-10-25 | 1987-05-07 | Hitachi Ltd | 連想型マイクロ制御方式 |
JPS62271022A (ja) * | 1985-11-08 | 1987-11-25 | Matsushita Electric Ind Co Ltd | マイクロプログラム制御装置 |
JPH02183830A (ja) * | 1988-12-21 | 1990-07-18 | Internatl Business Mach Corp <Ibm> | マイクロプログラム変換機構を有するコンピュータ |
JPH02190930A (ja) * | 1988-12-29 | 1990-07-26 | Internatl Business Mach Corp <Ibm> | ソフトウエア命令実行装置 |
JPH05173768A (ja) * | 1991-12-20 | 1993-07-13 | Fuji Electric Co Ltd | テーブルの作成方法 |
JPH05241821A (ja) * | 1992-02-28 | 1993-09-21 | Nec Corp | データ処理装置 |
JPH11126161A (ja) * | 1997-10-23 | 1999-05-11 | Hitachi Ltd | 制御記憶の障害回避方法 |
JP2004538556A (ja) * | 2001-05-31 | 2004-12-24 | エイアールエム リミテッド | 複数命令セットシステムにおける非処理操作処理 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7509671B1 (en) * | 2001-06-20 | 2009-03-24 | Microstrategy Incorporated | Systems and methods for assigning priority to jobs in a reporting system |
US20040172631A1 (en) * | 2001-06-20 | 2004-09-02 | Howard James E | Concurrent-multitasking processor |
CA2443347A1 (en) * | 2003-09-29 | 2005-03-29 | Pleora Technologies Inc. | Massively reduced instruction set processor |
JP4425177B2 (ja) * | 2005-05-20 | 2010-03-03 | 株式会社ソニー・コンピュータエンタテインメント | グラフィックプロセッサ、情報処理装置 |
US8185899B2 (en) * | 2007-03-07 | 2012-05-22 | International Business Machines Corporation | Prediction based priority scheduling |
WO2010004245A1 (en) * | 2008-07-10 | 2010-01-14 | Cambridge Consultants Limited | Processor with push instruction |
US8966228B2 (en) * | 2009-03-20 | 2015-02-24 | Arm Limited | Instruction fetching following changes in program flow |
-
2013
- 2013-02-13 JP JP2014507360A patent/JP6064993B2/ja active Active
- 2013-02-13 US US14/386,248 patent/US20150046563A1/en not_active Abandoned
- 2013-02-13 WO PCT/JP2013/000762 patent/WO2013145529A1/ja active Application Filing
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS57123454A (en) * | 1981-01-26 | 1982-07-31 | Fujitsu Ltd | Microprogram computer |
JPS6298427A (ja) * | 1985-10-25 | 1987-05-07 | Hitachi Ltd | 連想型マイクロ制御方式 |
JPS62271022A (ja) * | 1985-11-08 | 1987-11-25 | Matsushita Electric Ind Co Ltd | マイクロプログラム制御装置 |
JPH02183830A (ja) * | 1988-12-21 | 1990-07-18 | Internatl Business Mach Corp <Ibm> | マイクロプログラム変換機構を有するコンピュータ |
JPH02190930A (ja) * | 1988-12-29 | 1990-07-26 | Internatl Business Mach Corp <Ibm> | ソフトウエア命令実行装置 |
JPH05173768A (ja) * | 1991-12-20 | 1993-07-13 | Fuji Electric Co Ltd | テーブルの作成方法 |
JPH05241821A (ja) * | 1992-02-28 | 1993-09-21 | Nec Corp | データ処理装置 |
JPH11126161A (ja) * | 1997-10-23 | 1999-05-11 | Hitachi Ltd | 制御記憶の障害回避方法 |
JP2004538556A (ja) * | 2001-05-31 | 2004-12-24 | エイアールエム リミテッド | 複数命令セットシステムにおける非処理操作処理 |
Also Published As
Publication number | Publication date |
---|---|
JP6064993B2 (ja) | 2017-01-25 |
US20150046563A1 (en) | 2015-02-12 |
JPWO2013145529A1 (ja) | 2015-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9021241B2 (en) | Combined branch target and predicate prediction for instruction blocks | |
US20140304493A1 (en) | Methods and systems for performing a binary translation | |
US9436469B2 (en) | Methods to optimize a program loop via vector instructions using a shuffle table and a mask store table | |
US20130318540A1 (en) | Data flow graph processing device, data flow graph processing method, and data flow graph processing program | |
JP2015133129A (ja) | 命令としてデータ値を評価するシステムおよび方法 | |
JP6064993B2 (ja) | 演算処理装置、その演算処理方法、及び演算処理プログラム | |
US20140052960A1 (en) | Apparatus and method for generating vliw, and processor and method for processing vliw | |
JP2001125792A (ja) | 最適化促進装置 | |
US20140013312A1 (en) | Source level debugging apparatus and method for a reconfigurable processor | |
JP2016062388A (ja) | 情報処理装置、情報処理方法及びそのプログラム | |
JP2008250838A (ja) | ソフトウェア生成装置、方法、およびプログラム | |
US10949209B2 (en) | Techniques for scheduling instructions in compiling source code | |
CN113031952A (zh) | 深度学习模型的执行代码的确定方法、装置及存储介质 | |
KR101265019B1 (ko) | 명령 실행 회로 | |
US20100153688A1 (en) | Apparatus and method for data process | |
US9928045B2 (en) | Information processing apparatus, compile method and non-transitory recording medium storing compile program | |
WO2020066375A1 (ja) | 情報処理装置、情報処理方法、プログラム | |
CN112214244A (zh) | 一种运算装置及其操作方法 | |
JP6191457B2 (ja) | 演算処理装置、演算処理方法 | |
JP3381253B2 (ja) | シミュレーション装置 | |
WO2018082344A1 (zh) | 一种软硬件协同分支指令预测方法及装置 | |
JP5304307B2 (ja) | ソートキー比較コード生成装置、ソート処理装置及びソートキー比較コード生成方法 | |
JPS5829051A (ja) | 演算処理装置 | |
JP2014081846A (ja) | Vliwプロセッサ | |
JPS63186333A (ja) | 局所的分岐命令に対する命令のスケジユ−リング処理方式 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13770015 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2014507360 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14386248 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13770015 Country of ref document: EP Kind code of ref document: A1 |