EP0227811A1 - Selective operation of processing elements in a single instruction, multiple data stream (simd) computer system - Google Patents

Selective operation of processing elements in a single instruction, multiple data stream (simd) computer system

Info

Publication number
EP0227811A1
EP0227811A1 EP19860904568 EP86904568A EP0227811A1 EP 0227811 A1 EP0227811 A1 EP 0227811A1 EP 19860904568 EP19860904568 EP 19860904568 EP 86904568 A EP86904568 A EP 86904568A EP 0227811 A1 EP0227811 A1 EP 0227811A1
Authority
EP
European Patent Office
Prior art keywords
instruction
test
processors
instructions
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP19860904568
Other languages
German (de)
English (en)
French (fr)
Inventor
Adam E. Levinthal
Thomas Porter
Thomas D. S. Duff
Loren C. Carpenter
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pixar
Original Assignee
Pixar
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pixar filed Critical Pixar
Publication of EP0227811A1 publication Critical patent/EP0227811A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • G06F15/8007Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors

Definitions

  • This invention relates generally to parallel data processing techniques and computer systems, and specifically to those of a type where each of a plurality of parallel processors simultaneously executes the same instruction on different data.
  • a computer is commonly termed a single instruction, multiple data stream (SIMD) processor.
  • SIMD single instruction, multiple data stream
  • program being executed on the parallel streams of data is an invariant series of statements. It is more common, however, that the controlling program includes condi ⁇ tional statements that depend for execution upon the data in each of the parallel processors. Since the data being processed in each stream will be different, provision must be made in this case for those processors whose data does not meet the condition of the program statement to be rendered non-operative during the time that the remaining processors are executing the particular statement. It is known that a WHILE-DO construct is the minimum needed to implement all possible flow control structures.
  • a common example of such a conditional program instruction is an "IF-THEN" statement: that is, the individual processors are all instructed to perform a certain manipulation of their individual data streams, but only "if" their data meets a certain condition expressed in the program instruction. Those processors whose data at that instant do not meet the condition do not execute that instruction.
  • An "IF-THEN" instruction is often augmented by an "ELSE” modifier; that is, those processors not executing the "IF-THEN" statement are subsequently instructed to execute a different operation on their data at the next instant while those processors who did execute the "IF-THEN" instruction are rendered inoperative.
  • each of the parallel processors has a separate control element, such as one bit of a control register, that enables the processor to execute a common, instruction given all processors when the element is in one state and disables the processor from executing that instruction when in its other state.
  • the state of each control element is set to control execution , of a particular statement dependent upon whether the data for that processor met the test of a previous instruction, such as an "IF-THEN" instruction.
  • subsequent complementary execution such as occurs in an "ELSE” instruction, the states of the control elements are reversed so that those processors who did not execute the first statement will execute the subsequent state ⁇ ment, and vice versa.
  • a memory device (a stack memory in a preferred embodiment) is provided to store the states of the individual control elements when the nested condi- tional statement occurs.
  • a memory device a stack memory in a preferred embodiment
  • Figure 1 illustrates in general block diagram form a SIMD processor
  • Figure 2 illustrates a first circuit embodiment
  • Figures 3 and 4 are tables which illustrate the operation of the system of Figure 1 when implemented with the control circuit of Figure 2;
  • Figure 5 illustrates a second circuit embodi ⁇ ment of the control circuits of the system of Figure 1;
  • Figures 6 and 7 are tables which illustrate the operation of the system of Figure 1 whem implemented with the control circuit of Figure 5; and Figure 8 provides logic details of another portion of the circuit of Figure 1.
  • Parallel processing is particularly adapted for a gra- phics application since high speed processing is a requirement and the same sequence of program instructions is executed simultaneously on all four data paths.
  • a control circuit is provided in association with each of them, such as a circuit 37 which controls operation of the processing element 11.
  • a line 39 carries a signal to the processing element 11 which controls whether it is enabled to execute an instruction on the bus 35. For example, a voltage in line 39 representative of a logical "1" will cause the processing element to execute the instruction, while a voltage representative of a logical "0" will disable the processing element during execution of that particular instruction by other of the processing elements.
  • One is an initial condition which is presented external of the circuits of Figure 1 in a set line 41.
  • Another piece of information is a status instruction in a bus 43 which specifies, for those processor instructions on bus 35 that may require less than all of the processing elements to execute the instruction, additional instructions for determining the state of the enable signal in the line 39.
  • a final piece of information is a true "1" or false "0" signal in a line 45 which gives the result of a test performed by the processing element 11 on its data in response to a current or immediately preceding instruc-
  • test result input received from its associated processing element can be different and thus result in some processors being enabled and others being disabled at a given instant in time.
  • the function of the control circuits in the system of Figure 1 is explained more fully with respect to its two preferred embodiments, one embodiment illustrated in Figures 2-4 and another in Figures 5-7. But before proceeding to those embodiments, some general items of the system of Figure 1 are first explained.
  • the pro ⁇ cessor instructions in the bus 35 and the status instructions 43 originate from a micro-programmed control unit such as micro-sequencer 47.
  • a micro-programmed control unit consists of the micro-program memory and the structure required to determine the address of the next microinstruction, specific implementations being well known.
  • a logic circuit 49 has as inputs the invidiual test result lines of each of the processing elements.
  • the logic circuit 49 generates a condition code in an output line 51 when the signals in the input test result lines are a particular one or more combinations.
  • the signal in the line 51 is connected to the condition code input of the micro-sequencer 47, thus enabling a change in the sequence of instructions in response to a particular combination of test result outputs.
  • Another input to the logic circuits 49 is by way of a line 53, an instruction field of the micro-sequencer 47.
  • each of the processing elements contains as primary components a 16-bit multiplier and a 16-bit arithmetic and logic unit (ALU).
  • ALU arithmetic and logic unit
  • a flip-flop circuit 61 has its output connected to the enable line 39.
  • An input line 63 is connected to an output of a four-position multiplexer 65.
  • the multiplexer 65 has four separate inputs 0-3.
  • the status instruction in the bus 43 selects which of the inputs 0-3 is connected to the output 63.
  • the 0 input of the multiplexer is connected directly to the output of the flip-flop 61, thereby allowing the current state of the flip-flop 61 to be held when the multiplexer 65 is switched to its 0 input.
  • the next status instruction selecting the 1 input of the multi ⁇ plexer 65, causes the test result of its associated processing element to be stored, as previously described, an operation that accompanies an IF instruction in the bus 35.
  • the status instruction 2 causes the flip-flop 61 to be set, a status instruction on bus 43 that accompanies an END IF instruction in the processing element instruction bus 35.
  • a status instruction 3 causes the flip- flop element 61 to change state in order to enable those processors previously disabled, and conversely to disable those processors previously enabled.
  • the status in ⁇ struction 3 is presented in the bus 43 simultaneously with the ELSE instruction in the bus 35.
  • Micro-code in the micro-sequencer 47 assures that the instructions in the buses 35 and 43 correspond according to the table of Figure 3 in accordance with other particular requirements of any application.
  • the table of Figure 4 better explains the operation of the circuit of Figure 1, when using a control circuit of Figure 2, by a specific example.
  • IF statement asking whether the data input to each processing element .(D ) is greater than 1.
  • line 2 of the table of Figure 4 it is assumed in the "test result" column that the first and third processing elements have passed the test, thus showing the logical "1" in their test result output lines 45, while the second and fourth processors have failed the test, and thus show a test result logical signal of "0".
  • each processor is executing the same . IF instruction, the results of the test performed by each can be different because the data being processed by each is generally different.
  • the status instruction on the bus 43 causes the multiplexer 65 of each of the control circuits of the system of Figure 1 to switch to its position 1 to receive the test results from their corresponding processors. These test results, whether a test pass "1" or fail "0", are then stored in the individual flip-flop elements.
  • the enable signal outputs of the four flip-flops are given as the enable signals in the table of Figure 4, referred to interchangably in this example as "run flags".
  • the runflags are causing those processing elements who pass the test to be enabled and those who did not to be disabled. Those which are enabled are then caused, as shown in the line 3 of the table of Figure 4, to execute a statement, in this example chosen to be to set the data output (D ) equal to 1 of the enabled processing elements.
  • the disabled processing elements do nothing at this time.
  • An ELSE instruction is next presented to all the processing elements for execution, which is to say that those processors who failed the IF test are now going to be called upon to do something different, as illus ⁇ trated in lines 4 and 5 of the. table of Figure 4.
  • the ELSE processor instruction is accompanied by the status instruction 3 which causes the control circuits, illus- trated in Figure 2, to all invert the states of their flip-flops. That can be seen by comparing the run flags of lines 3 and 4 of Figure 4, one being the complement of the other.
  • the logic circuits 49 of Figure 1 are useful for detecting conditions where, because of a particular combination of input data, certain instructions need not be executed. In such a case, the micro-sequencer 47 is then caused to skip the unexecutable instructions. Logic circuits 49 may be omitted in implementations where unexecuted instruction sequences may be allowed to occur. In the example of Figure 4, if the test results shown in line 2 had all been 0, then there is no need to execute the statement of line 3 since all processors would be disabled. For this particular example, therefore, the logic circuits 49 are designed to detect when all processor test results are false (0) and causes the condition code in the line 51 to change, with the resultant change of the instruction sequence issued by the micro-sequencer 47.
  • test re ⁇ sults are all true (1)
  • the instructions at lines 4 and 5 of Figure 4 do not need to be executed, so the condition code in the line 51 can cause that instruction sequence to be bypassed, as well.
  • a signal in line 53 functions to allow testing for any false (0) condition or any true (1) condition.
  • the ability is provided (in conjunction with the status instruction on the bus 43) for testing for any or all conditions true or false.
  • An OR gate 52 has as its inputs the test result lines from all of the processing elements.
  • the gate's output is one input of an exclusive OR gate 54, the select line 53 being the second input.
  • the output of the gate 54 is the condition code line 51.
  • the gate 54 operates to pass through the output of the gate 52 when the select line 53 is false (0), and to pass a complement of that output when the line 53 is true (1).
  • Certain applications will require the ability of the individual processing element control circuits to handle a set of instructions that is nested within an IF- THEN-ELSE series of instructions.
  • the run flags determined as the result of executing the IF instruction are stored while the nested set of instruc ⁇ tions is being executed. Once the nested instructions have been executed, the stored run flags are called out of memory so that the remainder of the IF-THEN-ELSE set of instructions can be executed.
  • the control circuit of Figure 5 allows such • nested program instruction operation.
  • a stacked memory 81 Added to the system circuit of Figure 1 is a stacked memory 81, and associated controlling decoder circuits 83.
  • the cir ⁇ cuits within the dotted outline of Figure 5 are not repeated within each of the four control circuits of Figure 1, but rather are shared by them.
  • the decoding circuits 83 respond to status instructions in the bus 43 to cause the current enable signals (run flags) of each of the control circuits to be stored in the stack memory 81 (a "push") through lines 85 or to be read from memory (a "pop") through lines 87.
  • stack memories read ("pop") the last written ("pushed") data.
  • a flip-flop 91 of the same type used in the embodiment of Figure 2 is employed, with this output being the enable signal, one bit of the four-bit run flag. Its input in a line 93 is also connected to an output of a multiplexer 95.
  • the multiplexer has five positions 0-4, one more than used in the embodiment of Figure 2. One of these inputs is selected at a time for connection to the input of the flip-flop 91 by the status instruction in the bus 43. The 0 input is connected directly to the flip-flop output, thus serving to hold the flip-flop in whatever state it is found when switched to that position.
  • Input 1 of the multiplexer receives the output of AND gate 97, having as one input the output of the flip-flop 91 and as the other input test result line 45 of its associated processor.
  • the status instruction 1 is also decoded by circuits 83 to store ("push") at the top of the stack memory 81 the output (run flags) of the flip-flops within the control circuits of Figure 1.
  • Multiplexer input 2 is connected to the set line 41.
  • Input number 3 is connected to the stack memory 81 for setting the flip-flops in accordance with what has previously been recorded at the top of the stack.
  • the decoding circuits 83 cause the top stack data of the memory 81 to pop when the status instruction 3 is received.
  • the last input of the multiplexer 95 receives the output of another AND gate 99 whose two inputs are connected to the stack memory output and the output of the flip-flop 91 through an inverter 101.
  • the result is to AND together the data stored at the top of the stack and a complement of the current run flags.
  • the control circuit of Figure 5 whose logical operation is shown in the table of Figure 6, is especially adapted for carrying out the sequence of operations given in Figure 7.
  • an IF-THEN-ELSE sequence of program instructions is executed at lines 1, 2, 3, 9, 10, 11, 17, 18, and 19. Nested inside the IF or ELSE portions of that set of instructions is yet another IF- THEN-ELSE series of instructions, at lines 4-8. Similarly, a second set of such statements is nested at lines 12-16 within the basic sequence of instructions.
  • a different test result is assumed, as shown in the "test result" column of Figure 7. These different test results cause different run flags for each of the three IF-THEN- ELSE series of instructions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Advance Control (AREA)
  • Multi Processors (AREA)
  • Image Processing (AREA)
EP19860904568 1985-06-24 1986-06-23 Selective operation of processing elements in a single instruction, multiple data stream (simd) computer system Pending EP0227811A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US74840985A 1985-06-24 1985-06-24
US748409 1985-06-24

Publications (1)

Publication Number Publication Date
EP0227811A1 true EP0227811A1 (en) 1987-07-08

Family

ID=25009327

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19860904568 Pending EP0227811A1 (en) 1985-06-24 1986-06-23 Selective operation of processing elements in a single instruction, multiple data stream (simd) computer system

Country Status (8)

Country Link
EP (1) EP0227811A1 (ja)
JP (1) JPS6254359A (ja)
AU (1) AU6128486A (ja)
CA (1) CA1267230A (ja)
DE (1) DE3620982A1 (ja)
FR (1) FR2583904A1 (ja)
GB (1) GB2177526B (ja)
WO (1) WO1987000318A1 (ja)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2211638A (en) * 1987-10-27 1989-07-05 Ibm Simd array processor
GB2201015B (en) * 1987-02-10 1990-10-10 Univ Southampton Parallel processor array and array element
US5262965A (en) * 1988-10-31 1993-11-16 Bts-Broadcast Television Systems, Inc. System and method for high speed computer graphics image computation using a parallel connected, asynchronous multiprocessor ring coupled to a synchronous special purpose video processing ring
JP2578286Y2 (ja) * 1992-03-18 1998-08-06 日本精工株式会社 等速ジョイント
FR2693576B1 (fr) * 1992-07-13 1994-09-30 Texas Instruments France Système multiprocesseur à contrôle local.
FR2735253B1 (fr) * 1995-06-08 1999-10-22 Hewlett Packard Co Synchronisation de donnees entre plusieurs dispositifs de restitution asynchrones de donnees
AU717336B2 (en) * 1997-04-30 2000-03-23 Canon Kabushiki Kaisha Graphics processor architecture
US6366999B1 (en) * 1998-01-28 2002-04-02 Bops, Inc. Methods and apparatus to support conditional execution in a VLIW-based array processor with subword execution
JPH11246581A (ja) * 1998-02-28 1999-09-14 Tonen Corp 亜鉛−モリブデン系ジチオカルバミン酸塩誘導体、その製造方法およびそれを含有する潤滑油組成物
US6732253B1 (en) 2000-11-13 2004-05-04 Chipwrights Design, Inc. Loop handling for single instruction multiple datapath processor architectures
US6931518B1 (en) 2000-11-28 2005-08-16 Chipwrights Design, Inc. Branching around conditional processing if states of all single instruction multiple datapaths are disabled and the computer program is non-deterministic
US7493607B2 (en) 2002-07-09 2009-02-17 Bluerisc Inc. Statically speculative compilation and execution
US20050114850A1 (en) 2003-10-29 2005-05-26 Saurabh Chheda Energy-focused re-compilation of executables and hardware mechanisms based on compiler-architecture interaction and compiler-inserted control
US7996671B2 (en) 2003-11-17 2011-08-09 Bluerisc Inc. Security of program executables and microprocessors based on compiler-architecture interaction
US8607209B2 (en) 2004-02-04 2013-12-10 Bluerisc Inc. Energy-focused compiler-assisted branch prediction
US7434028B2 (en) * 2004-12-15 2008-10-07 Intel Corporation Hardware stack having entries with a data portion and associated counter
US20080126766A1 (en) 2006-11-03 2008-05-29 Saurabh Chheda Securing microprocessors against information leakage and physical tampering
JP5452066B2 (ja) * 2009-04-24 2014-03-26 本田技研工業株式会社 並列計算装置
JP5358287B2 (ja) * 2009-05-19 2013-12-04 本田技研工業株式会社 並列計算装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4101960A (en) * 1977-03-29 1978-07-18 Burroughs Corporation Scientific processor
JPS56127266A (en) * 1980-03-10 1981-10-05 Ibm Method of executing and controlling command stream
US4435758A (en) * 1980-03-10 1984-03-06 International Business Machines Corporation Method for conditional branch execution in SIMD vector processors
US4574348A (en) * 1983-06-01 1986-03-04 The Boeing Company High speed digital signal processor architecture

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO8700318A1 *

Also Published As

Publication number Publication date
AU6128486A (en) 1987-01-30
WO1987000318A1 (en) 1987-01-15
DE3620982A1 (de) 1987-01-29
JPH031699B2 (ja) 1991-01-11
GB2177526B (en) 1990-02-14
FR2583904A1 (fr) 1986-12-26
GB2177526A (en) 1987-01-21
JPS6254359A (ja) 1987-03-10
GB8614907D0 (en) 1986-07-23
CA1267230A (en) 1990-03-27

Similar Documents

Publication Publication Date Title
US5045995A (en) Selective operation of processing elements in a single instruction multiple data stream (SIMD) computer system
EP0227811A1 (en) Selective operation of processing elements in a single instruction, multiple data stream (simd) computer system
US20080072011A1 (en) SIMD type microprocessor
JPH03147021A (ja) 命令指定方法及び命令実行方式
US20050081021A1 (en) Automatic register backup/restore system and method
JP3237858B2 (ja) 演算装置
US6026486A (en) General purpose processor having a variable bitwidth
US3997771A (en) Apparatus and method for performing an arithmetic operation and multibit shift
GB1529581A (en) Data processing apparatus
US5019969A (en) Computer system for directly transferring vactor elements from register to register using a single instruction
US6904510B1 (en) Data processor having a respective multiplexer for each particular field
US5034879A (en) Programmable data path width in a programmable unit having plural levels of subinstruction sets
US20030088826A1 (en) Method and apparatus for performing computations and operations on data using data steering
US6175912B1 (en) Accumulator read port arbitration logic
US3794979A (en) Microprogrammed control unit with means for reversing and complementing microinstructions
US5751999A (en) Processor and data memory for outputting and receiving data on different buses for storage in the same location
EP0542318B1 (en) Digital processor and method for controlling the same
US4300208A (en) Controlling which of two addresses is used by a microcode memory
US9250898B2 (en) VLIW processor, instruction structure, and instruction execution method
JP2812610B2 (ja) パイプライン制御方式
JPH09305401A (ja) コンピュータ及びコンパイラ
JPH0310129B2 (ja)
US20060271610A1 (en) Digital signal processor having reconfigurable data paths
SU758126A1 (ru) Устройство для сопряжения 1
JPH0628198A (ja) Riscマイクロプロセッサ優先ベクトル割り込みシステム

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE FR GB IT LI LU

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

RIN1 Information on inventor provided before grant (corrected)

Inventor name: CARPENTER, LOREN, C.

Inventor name: DUFF, THOMAS, D., S.

Inventor name: LEVINTHAL, ADAM, E.

Inventor name: PORTER, THOMAS