US20140173256A1 - Processor configured for operation with multiple operation codes per instruction - Google Patents
Processor configured for operation with multiple operation codes per instruction Download PDFInfo
- Publication number
- US20140173256A1 US20140173256A1 US14/186,707 US201414186707A US2014173256A1 US 20140173256 A1 US20140173256 A1 US 20140173256A1 US 201414186707 A US201414186707 A US 201414186707A US 2014173256 A1 US2014173256 A1 US 2014173256A1
- Authority
- US
- United States
- Prior art keywords
- instructions
- processor
- operation codes
- code
- instruction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30145—Instruction analysis, e.g. decoding, instruction word fields
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30145—Instruction analysis, e.g. decoding, instruction word fields
- G06F9/30156—Special purpose encoding of instructions, e.g. Gray coding
Definitions
- the present invention relates generally to microprocessors and other types of digital data processors, and more particularly to the manner in which operation codes are assigned to instructions executed by such processors.
- Digital data processors execute instructions that are represented by respective operation codes, also referred to herein as “opcodes.” Selection of an instruction set and assignment of opcodes to the respective instructions of that set are important issues in the design of a given processor.
- opcodes operation codes
- Selection of an instruction set and assignment of opcodes to the respective instructions of that set are important issues in the design of a given processor.
- the processor will not need to support that many instructions, and thus a designer has an ability to select which of the available opcodes will be assigned to the respective instructions.
- a problem with this conventional approach is that it is unduly rigid in its application, and accordingly does not achieve sufficient reduction of power consumption, particularly for processors that operate with a variety of different types of programs. More particularly, once the assignment of operation codes to respective instructions is determined, and the corresponding processor is designed and manufactured, there is insufficient opportunity to modify the initial assignment if the programs to be executed by the processor should happen to change.
- the present invention meets the above-identified need in one or more illustrative embodiments by providing a processor that is capable of handling instructions that each have multiple assigned operation codes.
- the processing circuitry is operative to retrieve operation codes from the memory circuitry and to decode the operation codes to produce corresponding instructions for execution by the processing circuitry. More particularly, the processing circuitry is configured to decode a first operation code to produce a given one of the instructions and to decode a second operation code different than the first operation code to also produce the given instruction.
- the same instruction is produced for execution by the processing circuitry regardless of whether the first operation code or the second operation code is decoded. Similar processing may be applied for an instruction having more than two assigned operation codes.
- the processing circuitry may, for example, generate any control and select signals, or other types of signals, that are required for execution of that instruction.
- Another aspect of the invention relates to a method of associating operation codes with instructions for execution in a processor.
- the method includes the steps of assigning the operation codes to the instructions in a manner that allows a given instruction to have multiple assigned operation codes, and selecting a particular one of the multiple assigned operation codes for use in executing a program containing the given instruction.
- the assigning step may be implemented in conjunction with design of the processor, and may further comprise the steps of determining frequency of occurrence of adjacent pairs of instructions in one or more programs likely to be run on the processor, and assigning the operation codes to the instructions based at least in part on the determined frequency of occurrence of the adjacent pairs of instructions.
- the selecting step may be implemented in conjunction with code generation for the program containing the given instruction, for example, in a code assembler.
- FIG. 1 is a block diagram of a system comprising components for designing a processor and assembling code for execution by that processor in accordance with an illustrative embodiment of the invention.
- FIG. 2 is a flow diagram of a processor design and code generation process that is implemented in the system of FIG. 1 .
- FIG. 3 is a block diagram of a processor configured for operation with multiple operation codes per instruction.
- FIG. 4 is a schematic diagram showing an example of an instruction decoder in the case of assignment of a single operation code to each of a plurality of instructions.
- FIG. 5 is a schematic diagram showing an example of an instruction decoder in the case of assignment of multiple operation codes to each of a plurality of instructions.
- the invention will be illustrated below in conjunction with an exemplary implementation of a processor design and code assembly system and an associated processor. It should be understood, however, that the invention is more generally applicable to any digital data processor in which it is desirable to provide reductions in power consumption through opcode assignment and corresponding dynamic opcode selection.
- processor as used herein is intended to include any device in which instructions retrieved from a memory or other storage element are executed using one or more execution units.
- Exemplary processors in accordance with the invention may therefore include, for example, microprocessors, digital signal processors (DSPs), microcontrollers, application-specific integrated circuits (ASICs), and computers, as well as other types of data processing devices.
- DSPs digital signal processors
- ASICs application-specific integrated circuits
- computers as well as other types of data processing devices.
- a given processor, or a particular portion thereof, may be implemented in the form of one or more integrated circuits.
- FIG. 1 shows a system 100 for designing a processor and assembling code for execution by that processor in an illustrative embodiment of the invention.
- the system 100 comprises a design terminal 102 and a code assembler 104 .
- the design terminal may be a computer or other data processing device, such as a computer-aided design (CAD) workstation used to design processors that execute instructions.
- the code assembler 104 may comprise a compiler or other element or device suitable for generating assembled code from a given software program, and may be configured, for example, to run on a common computer or other data processing platform with design terminal 102 . Alternatively, the code assembler may be implemented on a processing platform separate from the computer.
- the design terminal 102 is used to design a processor that executes programs comprising instructions of an instruction set.
- An example of such a processor will be described in greater detail below in conjunction with FIGS. 3 , 4 and 5 .
- a designer interacts with a graphical user interface of the design terminal in designing the processor, using well-known techniques.
- the output of the design process is a processor design 110 .
- This design will generally include a designation of an instruction set for the processor, as well as operation codes associated with the instructions of the instruction set.
- the design terminal in the present embodiment is also used to generate a software program 112 that includes instructions from the instruction set and is configured to cause the processor to perform certain operations.
- the software program may be produced using a different computer or other data processing device than the design terminal 102 used to produce processor design 110 .
- the process of converting a given program to a sequence of executable instructions is performed using a compiler, although other types of code assemblers may be used.
- the code assembler 104 receives the software program 112 and generates from that program a corresponding set of assembled code 114 .
- the instructions of the program are represented by opcodes, such that the instructions can be readily executed by processing circuitry of a processor defined by the processor design 110 .
- FIG. 2 illustrates the operation of the system 100 in the illustrative embodiment.
- the process as shown in the flow diagram of FIG. 2 includes an opcode assignment step 202 that is implemented in the design terminal 102 of system 100 , and a dynamic opcode selection step 204 that is implemented in the code assembler 104 of system 100 .
- opcodes are assigned to instructions, in conjunction with design of the processor, in a manner that allows a given single instruction to have multiple assigned opcodes.
- the processor may be designed to include processing circuitry configured to decode a first operation code to produce the given instruction and to decode a second operation code that is different than the first operation code to also produce the given instruction, such that the same instruction is produced for execution by the processing circuitry regardless of whether the first operation code or the second operation code is decoded.
- Processing circuitry of this type may be implemented in the form of an instruction decoder, a simplified example of which is shown in FIG. 5 , or using other types of conventional processor circuitry modified in a straightforward manner to implement one or more of the techniques disclosed herein.
- the dynamic opcode selection step 204 is performed in conjunction with code generation for a particular program to be executed by the processor.
- one of the multiple assigned opcodes for a given single instruction is selected for use when executing that program.
- the assembled code 114 will include the selected opcode, and that opcode will be executed when the assembled code is executed by a processor configured in accordance with the processor design 110 .
- This selection step is repeated for other instructions of the program that may have multiple assigned opcodes.
- the selection of a particular opcode for a given single instruction may be based upon one or more adjacent instructions in the program, as will be described in greater detail below.
- the illustrative embodiment assigns the opcodes in step 202 and dynamically selects from the assigned opcodes in step 204 in a manner which tends to reduce the power consumption of the processor. Generally, this is achieved by reducing the number of signal transitions that occur over sequences of executed instructions, as will be described in greater detail below.
- the power dissipated during such signal transitions is referred to herein as the dynamic power of the processor, and it is this dynamic power that may be significantly reduced using the techniques of the invention.
- the opcode assignment step 202 of FIG. 2 will now be described in greater detail.
- the opcode assignment may be implemented using the following process comprising Steps 1 through 5:
- the adjacent instruction pair that has the highest frequency of occurrence is determined and the opcode assignment is performed for that pair in such a way that the Hamming distance between the opcodes assigned to the respective instructions of the pair is as small as possible, where Hamming distance is defined as the number of bits that change from one opcode to the other. For example, the Hamming distance between 1100 and 0001 is three. Thus, the smallest possible Hamming distance between different opcodes will be one, also referred to herein as unit Hamming distance.
- Step 4 Repeat Step 4 until one or more opcodes have been assigned to each of the instructions under consideration.
- this pair is assigned opcodes that are at unit Hamming distance. For example, I2 may be assigned the opcode 0000 0000 and I5 may be assigned the opcode 0000 0001.
- the pair with the second highest frequency of occurrence is ⁇ I0,I1> and this pair gets the next best opcode assignment, which is also unit Hamming distance if possible.
- TO may be assigned the opcode 0000 0010 and I1 may be assigned the opcode 0000 0011.
- the third pair ⁇ I1,I6> in the order includes instruction I1 that has already been assigned an opcode. Therefore, a determination is made as to whether or not an opcode can be assigned to I6 that is unit Hamming distance from the opcode already assigned to I1. In the present case, such an opcode can be assigned, for example, 0000 0111, and so that opcode is assigned to I6. If this were not possible, a determination would be made as to whether the ⁇ I1,I6> pair occurs mostly with the ⁇ I0,I1> pair or is more distributed.
- the best possible code for I6 is chosen in conjunction with the I1 code that is assigned to the pair ⁇ I0,I1>. Otherwise, all the possible combinations may be examined and two or more opcodes that result in the best Hamming distance are assigned.
- the dynamic opcode selection step 204 of FIG. 2 will now be described in greater detail.
- one or more instructions may each have multiple assigned opcodes.
- the code assembler 104 needs to determine which particular opcode should be selected for use from the multiple opcodes assigned to a given instruction. It is this selected opcode that is used in the assembled code 114 that is executed by the processor.
- the code assembler in making this selection may, for example, look at the opcode assigned to the previous instruction and then select the opcode for the current instruction using a criterion such as minimum Hamming distance.
- the previous instructions examined in this selection process may comprise loops, branches, jumps or other similar instructions.
- the code assembler 104 has information regarding any multiple assigned opcodes. For a given one of these instructions having multiple assigned opcodes, the code assembler selects a particular one of the assigned opcodes so as to minimize Hamming distance relative to the opcode assigned to the previous instruction.
- a multiple-pass approach may be used to optimize the Hamming distance, particularly in the presence of loops. Thus, multiple iterations of a given selection process may be used in order to arrive at an optimal selection.
- FIG. 3 shows an example of a processor 300 that may be implemented in accordance with the processor design 110 so as to operate with instructions that each may have multiple assigned opcodes.
- the processor 300 in this example is configured as a typical CPU, and may be coupled via a bus interface unit 302 to an external instruction memory and an external data memory. The latter two elements are omitted from the figure for simplicity and clarity of illustration.
- the CPU may be viewed as an example of what is more generally referred to herein as “processing circuitry.” This term is also intended to include, by way of example, a portion of a CPU or other digital data processor element, or combinations of multiple such digital data processor elements.
- memory circuitry may refer to an instruction memory, a data memory, or both, as well as portions or combinations of these and other storage elements.
- Memory circuitry may be internal to the processing circuitry, external to the processing circuitry, or partially internal and partially external, as required to satisfy the needs of a given embodiment.
- the processor 300 further includes an instruction register 304 coupled to an instruction decoder 306 .
- the instruction decoder 306 is configured to decode multiple operation codes per instruction to produce instructions 320 .
- the produced instructions 320 may be in the form of, for example, any control and select signals, or other types of signals, that are required for execution of the instructions in processor 300 .
- the term “instruction” as used herein is therefore intended to be construed broadly, so as to encompass, without limitation, any control and select signals, or other types of signals, associated with initiating or otherwise controlling one or more corresponding execution operations in the processor.
- the instructions 320 are directed to an arithmetic and logical unit 308 and a control unit 310 .
- an address generator 312 is included in the processor 300 .
- Storage elements such as instruction register 304 , register file 314 and on-chip memory 316 are considered to be examples of “memory circuitry” as that term is generally used herein.
- Other elements of the processor 300 such as instruction decoder 306 , arithmetic and logical unit 308 , and control unit 310 , may be viewed as examples of “processing circuitry” as that term is generally used herein.
- Elements 302 , 304 , 308 , 310 , 312 , 314 and 316 in the present embodiment may be configured to operate in a well-known conventional manner, and are therefore not described in further detail herein.
- a given directional connection may be part of a bidirectional connection, with only one direction shown in the figure for simplicity of illustration.
- the directional connections shown may be replaced with bidirectional connections.
- the processor 300 as shown in the figure may be viewed as comprising a single integrated circuit or multiple integrated circuits.
- the processor 300 in the illustrative embodiment may be a pipelined processor in which execution of each instruction is divided into several stages.
- a pipelined processor generally includes a plurality of pipeline stages, such as a fetch stage, a decode stage, an execute stage and a writeback stage.
- the execute stage may comprise, for example, multiple specialized execution units, such as multipliers, adders or dividers, for performing integer or floating point arithmetic operations.
- the invention can be implemented using a wide variety of other types and configurations of processing circuitry and memory circuitry.
- the processor 300 may include additional or alternative elements not explicitly shown, but of a type typically found in a conventional implementation of such a processor. Numerous such additional or alternative elements will be very familiar to those skilled in the art, and therefore will not be described herein.
- FIG. 4 shows a simple example of one possible implementation of instruction decoder 306 for a conventional case in which only a single opcode is assigned to each instruction.
- there are four instructions denoted Inst 0 , Inst 1 , Inst 2 and Inst 3 , which have been assigned respective opcodes 0000, 0001, 0010 and 1011.
- Processing circuitry 400 of the instruction decoder comprises an inverter bank 402 having inputs that receive respective bits of an input opcode to be decoded.
- the bits of the input opcode are denoted Op 0 , Op 1 , Op 2 and Op 3 .
- the processing circuitry 400 further comprises output logic gates 404 - 0 , 404 - 1 , 404 - 2 and 404 - 3 associated with the respective instructions Inst 0 , Inst 1 , Inst 2 and Inst 3 .
- the logic gates 404 in this example are four-input AND gates, which receive inputs as shown in the figure. Depending on which of the four opcodes 0000, 0001, 0010 and 1011 is applied to the input of the inverter bank, a particular one of the logic gates 404 will have its output transition from a logic low level to a logic high level.
- logic gate 404 - 0 receives as its four inputs the outputs Op 0 -bar, Op 1 -bar, Op 2 -bar and Op 3 -bar of the inverter bank 400 .
- the Inst 0 output of AND gate 404 - 0 will transition from a logic low level to a logic high level, resulting of execution of the corresponding instruction in the processor 300 .
- FIG. 5 shows processing circuitry 500 of the instruction decoder 306 in the case of assignment of multiple operation codes to each of a plurality of instructions. More specifically, in this simple example, the instructions Inst 0 , Inst 1 , Inst 2 and Inst 3 of the previous example are assigned the respective opcodes 0000, 0001, 0010 and 1011, while instructions Inst 0 , Inst 1 and Inst 2 are also assigned respective second opcodes 0100, 1001 and 0110. As in FIG. 4 , only four instructions are shown in this particular example for simplicity of illustration, but other embodiments may of course involve an instruction set that includes many more than four instructions.
- the processing circuitry 500 of the instruction decoder comprises an inverter bank 502 having inputs that receive respective bits of an input opcode to be decoded.
- the bits of the input opcode are again denoted Op 0 , Op 1 , Op 2 and Op 3 .
- the processing circuitry 500 further comprises sets of logic circuitry 504 - 0 , 504 - 1 , 504 - 2 and 504 - 3 associated with the respective instructions Inst 0 , Inst 1 , Inst 2 and Inst 3 .
- the sets of logic circuitry 504 - 0 , 504 - 1 and 504 - 2 each comprise a pair of input NAND gates and an output NAND gate.
- Each of the input NAND gates in a given one of the sets corresponds to one of the two opcodes assigned to the instruction associated with that set.
- the final set of logic circuitry comprises only a single AND gate, since only a single opcode is assigned to the corresponding instruction.
- a particular one of the output gates will have its output transition from a logic low level to a logic high level.
- the output NAND gate of the set of logic circuitry 504 - 0 will transition from a logic low level to a logic high level if either of the two opcodes 0000 or 0100 assigned to instruction Inst 0 is presented at the input of the inverter bank 502 , resulting in execution of the corresponding instruction in the processor 300 . Accordingly, regardless of which of the two assigned opcodes is selected for use by the code assembler 104 , the resulting decoded instruction will be the same.
- FIGS. 1 , 2 , 3 and 5 provide a number of significant advantages over conventional arrangements. For example, dynamic selection of the particular opcode used for a given instruction allows the dynamic power of the processor to be substantially reduced. Also, assignment of multiple opcodes per instruction provides an opportunity to optimize the decoder logic circuitry in a way that further reduces power consumption. For example, if both of the opcodes 1110 and 1111 are assigned for a given single instruction, the corresponding portion of the decoder may only include one 3-input AND gate rather than a 4-input AND gate with an inverter.
- the particular number of instructions for which multiple opcodes are assigned can vary depending on the needs of a given embodiment. For example, use may be made of the well-known 80-20 principle, which states that only 20% of the instructions in a given instruction set account for 80% of the run time instruction execution. Thus, one may identify this 20% of the instructions, or other suitable subset of the available instructions, as those for which multiple opcodes will be assigned. It should also be noted that a given instruction set may be divided into different classes of instructions and for certain classes there may be insufficient options to allow multiple opcode assignment. In such cases, one can provide degeneracy in opcode assignment for only those classes with sufficient assignment options.
- a given embodiment of the present invention can be implemented as one or more integrated circuits.
- a plurality of identical die is typically formed in a repeated pattern on a surface of a wafer.
- Each die may include a device as described herein, and may include other structures or circuits.
- the individual die are cut or diced from the wafer, then packaged as an integrated circuit.
- One skilled in the art would know how to dice wafers and package die to produce integrated circuits. Integrated circuits so manufactured are considered part of this invention.
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Executing Machine-Instructions (AREA)
Abstract
Description
- This application is a Divisional of U.S. patent application Ser. No. 11/755,511, filed on May 30, 2007, the disclosure of which is fully incorporated herein by reference.
- The present invention relates generally to microprocessors and other types of digital data processors, and more particularly to the manner in which operation codes are assigned to instructions executed by such processors.
- Digital data processors execute instructions that are represented by respective operation codes, also referred to herein as “opcodes.” Selection of an instruction set and assignment of opcodes to the respective instructions of that set are important issues in the design of a given processor. By way of example, a 32-bit processor that utilizes 16 bits for the opcode can support up to 216=64K instructions. Typically, the processor will not need to support that many instructions, and thus a designer has an ability to select which of the available opcodes will be assigned to the respective instructions.
- In conventional practice, a single opcode is generally assigned to each instruction. An example of this approach is described in U.S. Patent Application Publication No. 2006/0101297, entitled “Reducing Power Consumption of a Microprocessor.” This approach determines the probability of occurrence of each instruction in an instruction set, and assigns an opcode to each instruction based on the probability of occurrence of that instruction in programs likely to be executed by the processor. The instructions having the highest probability of occurrence are assigned operation codes that require fewer signal transitions, such that the power consumption of the processor is reduced.
- A problem with this conventional approach is that it is unduly rigid in its application, and accordingly does not achieve sufficient reduction of power consumption, particularly for processors that operate with a variety of different types of programs. More particularly, once the assignment of operation codes to respective instructions is determined, and the corresponding processor is designed and manufactured, there is insufficient opportunity to modify the initial assignment if the programs to be executed by the processor should happen to change.
- Accordingly, a need exists for an improved approach to operation code assignment which provides enhanced flexibility and thereby additional opportunities for substantial reductions in processor power consumption.
- The present invention meets the above-identified need in one or more illustrative embodiments by providing a processor that is capable of handling instructions that each have multiple assigned operation codes.
- In accordance with one aspect of the invention, a processor configured to operate with multiple operation codes per instruction comprises memory circuitry and processing circuitry coupled to the memory circuitry. The processing circuitry is operative to retrieve operation codes from the memory circuitry and to decode the operation codes to produce corresponding instructions for execution by the processing circuitry. More particularly, the processing circuitry is configured to decode a first operation code to produce a given one of the instructions and to decode a second operation code different than the first operation code to also produce the given instruction. Thus, the same instruction is produced for execution by the processing circuitry regardless of whether the first operation code or the second operation code is decoded. Similar processing may be applied for an instruction having more than two assigned operation codes. In producing a given instruction, the processing circuitry may, for example, generate any control and select signals, or other types of signals, that are required for execution of that instruction.
- Another aspect of the invention relates to a method of associating operation codes with instructions for execution in a processor. The method includes the steps of assigning the operation codes to the instructions in a manner that allows a given instruction to have multiple assigned operation codes, and selecting a particular one of the multiple assigned operation codes for use in executing a program containing the given instruction. The assigning step may be implemented in conjunction with design of the processor, and may further comprise the steps of determining frequency of occurrence of adjacent pairs of instructions in one or more programs likely to be run on the processor, and assigning the operation codes to the instructions based at least in part on the determined frequency of occurrence of the adjacent pairs of instructions. The selecting step may be implemented in conjunction with code generation for the program containing the given instruction, for example, in a code assembler.
- The above-noted illustrative embodiments provide a number of significant advantages over conventional arrangements. For example, dynamic selection of the particular opcode used when executing a given instruction having multiple assigned opcodes allows the dynamic power of the processor to be substantially reduced. Also, assignment of multiple opcodes per instruction provides an opportunity to optimize the decoder logic circuitry in a way that further reduces power consumption.
-
FIG. 1 is a block diagram of a system comprising components for designing a processor and assembling code for execution by that processor in accordance with an illustrative embodiment of the invention. -
FIG. 2 is a flow diagram of a processor design and code generation process that is implemented in the system ofFIG. 1 . -
FIG. 3 is a block diagram of a processor configured for operation with multiple operation codes per instruction. -
FIG. 4 is a schematic diagram showing an example of an instruction decoder in the case of assignment of a single operation code to each of a plurality of instructions. -
FIG. 5 is a schematic diagram showing an example of an instruction decoder in the case of assignment of multiple operation codes to each of a plurality of instructions. - The invention will be illustrated below in conjunction with an exemplary implementation of a processor design and code assembly system and an associated processor. It should be understood, however, that the invention is more generally applicable to any digital data processor in which it is desirable to provide reductions in power consumption through opcode assignment and corresponding dynamic opcode selection.
- The term “processor” as used herein is intended to include any device in which instructions retrieved from a memory or other storage element are executed using one or more execution units. Exemplary processors in accordance with the invention may therefore include, for example, microprocessors, digital signal processors (DSPs), microcontrollers, application-specific integrated circuits (ASICs), and computers, as well as other types of data processing devices. A given processor, or a particular portion thereof, may be implemented in the form of one or more integrated circuits.
-
FIG. 1 shows asystem 100 for designing a processor and assembling code for execution by that processor in an illustrative embodiment of the invention. Thesystem 100 comprises adesign terminal 102 and acode assembler 104. The design terminal may be a computer or other data processing device, such as a computer-aided design (CAD) workstation used to design processors that execute instructions. Thecode assembler 104 may comprise a compiler or other element or device suitable for generating assembled code from a given software program, and may be configured, for example, to run on a common computer or other data processing platform withdesign terminal 102. Alternatively, the code assembler may be implemented on a processing platform separate from the computer. - The
design terminal 102 is used to design a processor that executes programs comprising instructions of an instruction set. An example of such a processor will be described in greater detail below in conjunction withFIGS. 3 , 4 and 5. Generally, a designer interacts with a graphical user interface of the design terminal in designing the processor, using well-known techniques. The output of the design process is aprocessor design 110. This design will generally include a designation of an instruction set for the processor, as well as operation codes associated with the instructions of the instruction set. The design terminal in the present embodiment is also used to generate asoftware program 112 that includes instructions from the instruction set and is configured to cause the processor to perform certain operations. In other embodiments, the software program may be produced using a different computer or other data processing device than thedesign terminal 102 used to produceprocessor design 110. Typically the process of converting a given program to a sequence of executable instructions is performed using a compiler, although other types of code assemblers may be used. - The
code assembler 104 receives thesoftware program 112 and generates from that program a corresponding set of assembledcode 114. In the assembled code, the instructions of the program are represented by opcodes, such that the instructions can be readily executed by processing circuitry of a processor defined by theprocessor design 110. -
FIG. 2 illustrates the operation of thesystem 100 in the illustrative embodiment. The process as shown in the flow diagram ofFIG. 2 includes anopcode assignment step 202 that is implemented in thedesign terminal 102 ofsystem 100, and a dynamicopcode selection step 204 that is implemented in thecode assembler 104 ofsystem 100. - In the
opcode assignment step 202, opcodes are assigned to instructions, in conjunction with design of the processor, in a manner that allows a given single instruction to have multiple assigned opcodes. By way of example, in generating theprocessor design 110 using thedesign terminal 102, the processor may be designed to include processing circuitry configured to decode a first operation code to produce the given instruction and to decode a second operation code that is different than the first operation code to also produce the given instruction, such that the same instruction is produced for execution by the processing circuitry regardless of whether the first operation code or the second operation code is decoded. Processing circuitry of this type may be implemented in the form of an instruction decoder, a simplified example of which is shown inFIG. 5 , or using other types of conventional processor circuitry modified in a straightforward manner to implement one or more of the techniques disclosed herein. - The dynamic
opcode selection step 204 is performed in conjunction with code generation for a particular program to be executed by the processor. In this step, one of the multiple assigned opcodes for a given single instruction is selected for use when executing that program. Thus, the assembledcode 114 will include the selected opcode, and that opcode will be executed when the assembled code is executed by a processor configured in accordance with theprocessor design 110. This selection step is repeated for other instructions of the program that may have multiple assigned opcodes. The selection of a particular opcode for a given single instruction may be based upon one or more adjacent instructions in the program, as will be described in greater detail below. - The illustrative embodiment assigns the opcodes in
step 202 and dynamically selects from the assigned opcodes instep 204 in a manner which tends to reduce the power consumption of the processor. Generally, this is achieved by reducing the number of signal transitions that occur over sequences of executed instructions, as will be described in greater detail below. The power dissipated during such signal transitions is referred to herein as the dynamic power of the processor, and it is this dynamic power that may be significantly reduced using the techniques of the invention. - The
opcode assignment step 202 ofFIG. 2 will now be described in greater detail. In the illustrative embodiment, the opcode assignment may be implemented using the following process comprising Steps 1 through 5: - 1. Identify a set of typical software programs that are likely to be run most often on the processor being designed.
- 2. Analyze these programs and extract frequency of occurrence of different adjacent instruction pairs.
- 3. The adjacent instruction pair that has the highest frequency of occurrence is determined and the opcode assignment is performed for that pair in such a way that the Hamming distance between the opcodes assigned to the respective instructions of the pair is as small as possible, where Hamming distance is defined as the number of bits that change from one opcode to the other. For example, the Hamming distance between 1100 and 0001 is three. Thus, the smallest possible Hamming distance between different opcodes will be one, also referred to herein as unit Hamming distance.
- 4. The adjacent instruction pair that has the next highest frequency of occurrence is then determined and the opcode assignment is attempted for that pair in the manner described in Step 3 above. It is possible that one of the instructions in this pair has already been assigned an opcode. If this is the case, do the following:
-
- A. If it is possible to assign an opcode to the other instruction in the current pair which is at the smallest Hamming distance with respect to the instruction that has already been assigned, then assign that opcode.
- B. If all the nearest opcodes are already assigned to other instructions, then determine if the current pair is distributed over all the programs or mostly occurring in conjunction with a particular pair that has already been assigned opcodes. If the current pair occurs mostly in conjunction with another pair, then use the opcode that is closest to the assigned opcode of the other pair. Otherwise, use the two best opcodes. This will lead to a situation where one instruction has multiple assigned opcodes, as described previously. Such a situation is also referred to herein as “degeneracy” of opcode assignment.
- 5. Repeat Step 4 until one or more opcodes have been assigned to each of the instructions under consideration.
- A particular example of the above-described opcode assignment process will now be described. Let the instruction sequence be I0,I1,I2,I3,I4,I5, etc. Further assume that based on the adjacency analysis, the adjacent pairs in order of decreasing frequency of occurrence are given by (<I2,I5>, <I0,I1>, <I1,I6>, . . . ).
- Since the pair <I2,I5> has the highest frequency of occurrence, this pair is assigned opcodes that are at unit Hamming distance. For example, I2 may be assigned the
opcode 0000 0000 and I5 may be assigned theopcode 0000 0001. - The pair with the second highest frequency of occurrence is <I0,I1> and this pair gets the next best opcode assignment, which is also unit Hamming distance if possible. For example, TO may be assigned the
opcode 0000 0010 and I1 may be assigned theopcode 0000 0011. - The third pair <I1,I6> in the order includes instruction I1 that has already been assigned an opcode. Therefore, a determination is made as to whether or not an opcode can be assigned to I6 that is unit Hamming distance from the opcode already assigned to I1. In the present case, such an opcode can be assigned, for example, 0000 0111, and so that opcode is assigned to I6. If this were not possible, a determination would be made as to whether the <I1,I6> pair occurs mostly with the <I0,I1> pair or is more distributed. If it occurs mostly with the <I0,I1> pair then the best possible code for I6 is chosen in conjunction with the I1 code that is assigned to the pair <I0,I1>. Otherwise, all the possible combinations may be examined and two or more opcodes that result in the best Hamming distance are assigned.
- It should be noted that the process described above for assignment of opcodes to instructions is presented by way of illustration only, and numerous alternative techniques may be used to assign opcodes to instructions in other embodiments.
- The dynamic
opcode selection step 204 ofFIG. 2 will now be described in greater detail. As described above, based on the opcode assignment process, one or more instructions may each have multiple assigned opcodes. Thus, at the time of code generation for a given program, thecode assembler 104 needs to determine which particular opcode should be selected for use from the multiple opcodes assigned to a given instruction. It is this selected opcode that is used in the assembledcode 114 that is executed by the processor. The code assembler in making this selection may, for example, look at the opcode assigned to the previous instruction and then select the opcode for the current instruction using a criterion such as minimum Hamming distance. The previous instructions examined in this selection process may comprise loops, branches, jumps or other similar instructions. - A particular example of the above-described dynamic opcode selection process will now be described. Let the instruction sequence again be I0,I1,I2,I3,I4,I5, etc. Further assume that the
code assembler 104 has information regarding any multiple assigned opcodes. For a given one of these instructions having multiple assigned opcodes, the code assembler selects a particular one of the assigned opcodes so as to minimize Hamming distance relative to the opcode assigned to the previous instruction. A multiple-pass approach may be used to optimize the Hamming distance, particularly in the presence of loops. Thus, multiple iterations of a given selection process may be used in order to arrive at an optimal selection. -
FIG. 3 shows an example of aprocessor 300 that may be implemented in accordance with theprocessor design 110 so as to operate with instructions that each may have multiple assigned opcodes. Theprocessor 300 in this example is configured as a typical CPU, and may be coupled via abus interface unit 302 to an external instruction memory and an external data memory. The latter two elements are omitted from the figure for simplicity and clarity of illustration. - The CPU may be viewed as an example of what is more generally referred to herein as “processing circuitry.” This term is also intended to include, by way of example, a portion of a CPU or other digital data processor element, or combinations of multiple such digital data processor elements.
- The term “memory circuitry” as used herein may refer to an instruction memory, a data memory, or both, as well as portions or combinations of these and other storage elements. Memory circuitry may be internal to the processing circuitry, external to the processing circuitry, or partially internal and partially external, as required to satisfy the needs of a given embodiment.
- The
processor 300 further includes aninstruction register 304 coupled to aninstruction decoder 306. As will be described in greater detail below in conjunction withFIG. 5 , theinstruction decoder 306 is configured to decode multiple operation codes per instruction to produceinstructions 320. The producedinstructions 320 may be in the form of, for example, any control and select signals, or other types of signals, that are required for execution of the instructions inprocessor 300. The term “instruction” as used herein is therefore intended to be construed broadly, so as to encompass, without limitation, any control and select signals, or other types of signals, associated with initiating or otherwise controlling one or more corresponding execution operations in the processor. - The
instructions 320 are directed to an arithmetic andlogical unit 308 and acontrol unit 310. Also included in theprocessor 300 is anaddress generator 312, aregister file 314 and on-chip memory 316. Storage elements such asinstruction register 304,register file 314 and on-chip memory 316 are considered to be examples of “memory circuitry” as that term is generally used herein. Other elements of theprocessor 300, such asinstruction decoder 306, arithmetic andlogical unit 308, andcontrol unit 310, may be viewed as examples of “processing circuitry” as that term is generally used herein.Elements - It should be noted that the particular interconnects among the elements shown in this figure are not intended to be limiting, and those skilled in the art will recognize that additional or alternative interconnects may be used. For example, a given directional connection may be part of a bidirectional connection, with only one direction shown in the figure for simplicity of illustration. In general, the directional connections shown may be replaced with bidirectional connections.
- The
processor 300 as shown in the figure may be viewed as comprising a single integrated circuit or multiple integrated circuits. - The
processor 300 in the illustrative embodiment may be a pipelined processor in which execution of each instruction is divided into several stages. A pipelined processor generally includes a plurality of pipeline stages, such as a fetch stage, a decode stage, an execute stage and a writeback stage. The execute stage may comprise, for example, multiple specialized execution units, such as multipliers, adders or dividers, for performing integer or floating point arithmetic operations. - It is to be appreciated that the invention can be implemented using a wide variety of other types and configurations of processing circuitry and memory circuitry. For example, the
processor 300 may include additional or alternative elements not explicitly shown, but of a type typically found in a conventional implementation of such a processor. Numerous such additional or alternative elements will be very familiar to those skilled in the art, and therefore will not be described herein. -
FIG. 4 shows a simple example of one possible implementation ofinstruction decoder 306 for a conventional case in which only a single opcode is assigned to each instruction. In this simple example, there are four instructions, denoted Inst0, Inst1, Inst2 and Inst3, which have been assignedrespective opcodes Processing circuitry 400 of the instruction decoder comprises aninverter bank 402 having inputs that receive respective bits of an input opcode to be decoded. The bits of the input opcode are denoted Op0, Op1, Op2 and Op3. Theprocessing circuitry 400 further comprises output logic gates 404-0, 404-1, 404-2 and 404-3 associated with the respective instructions Inst0, Inst1, Inst2 and Inst3. The logic gates 404 in this example are four-input AND gates, which receive inputs as shown in the figure. Depending on which of the fouropcodes inverter bank 400. Thus, if theopcode 0000 is applied to the inverter bank input, the Inst0 output of AND gate 404-0 will transition from a logic low level to a logic high level, resulting of execution of the corresponding instruction in theprocessor 300. - As noted above, in the
FIG. 4 arrangement, only a single opcode is assigned to a particular instruction. The illustrative embodiments of the present invention assign multiple opcodes to each of one or more instructions, as was previously described. -
FIG. 5 shows processing circuitry 500 of theinstruction decoder 306 in the case of assignment of multiple operation codes to each of a plurality of instructions. More specifically, in this simple example, the instructions Inst0, Inst1, Inst2 and Inst3 of the previous example are assigned therespective opcodes FIG. 4 , only four instructions are shown in this particular example for simplicity of illustration, but other embodiments may of course involve an instruction set that includes many more than four instructions. - The
processing circuitry 500 of the instruction decoder comprises aninverter bank 502 having inputs that receive respective bits of an input opcode to be decoded. The bits of the input opcode are again denoted Op0, Op1, Op2 and Op3. Theprocessing circuitry 500 further comprises sets of logic circuitry 504-0, 504-1, 504-2 and 504-3 associated with the respective instructions Inst0, Inst1, Inst2 and Inst3. The sets of logic circuitry 504-0, 504-1 and 504-2 each comprise a pair of input NAND gates and an output NAND gate. Each of the input NAND gates in a given one of the sets corresponds to one of the two opcodes assigned to the instruction associated with that set. The final set of logic circuitry comprises only a single AND gate, since only a single opcode is assigned to the corresponding instruction. - Again, depending on which of the assigned opcodes is applied to the input of the inverter bank, a particular one of the output gates will have its output transition from a logic low level to a logic high level. For example, the output NAND gate of the set of logic circuitry 504-0 will transition from a logic low level to a logic high level if either of the two
opcodes 0000 or 0100 assigned to instruction Inst0 is presented at the input of theinverter bank 502, resulting in execution of the corresponding instruction in theprocessor 300. Accordingly, regardless of which of the two assigned opcodes is selected for use by thecode assembler 104, the resulting decoded instruction will be the same. - Of course, numerous other arrangements of logic gates or other logic circuitry may be utilized to implement a given embodiment of an instruction decoder of the type described above.
- The illustrative embodiments of the invention as described above in conjunction with
FIGS. 1 , 2, 3 and 5 provide a number of significant advantages over conventional arrangements. For example, dynamic selection of the particular opcode used for a given instruction allows the dynamic power of the processor to be substantially reduced. Also, assignment of multiple opcodes per instruction provides an opportunity to optimize the decoder logic circuitry in a way that further reduces power consumption. For example, if both of the opcodes 1110 and 1111 are assigned for a given single instruction, the corresponding portion of the decoder may only include one 3-input AND gate rather than a 4-input AND gate with an inverter. In some embodiments, one may assign multiple opcodes to each of a number of instructions in order to both reduce dynamic power and optimize the design of the decoder logic circuitry. That is, both of these goals can be addressed in the assigning of multiple opcodes to each of a number of instructions. - The particular number of instructions for which multiple opcodes are assigned can vary depending on the needs of a given embodiment. For example, use may be made of the well-known 80-20 principle, which states that only 20% of the instructions in a given instruction set account for 80% of the run time instruction execution. Thus, one may identify this 20% of the instructions, or other suitable subset of the available instructions, as those for which multiple opcodes will be assigned. It should also be noted that a given instruction set may be divided into different classes of instructions and for certain classes there may be insufficient options to allow multiple opcode assignment. In such cases, one can provide degeneracy in opcode assignment for only those classes with sufficient assignment options.
- As indicated above, a given embodiment of the present invention can be implemented as one or more integrated circuits. In such an arrangement, a plurality of identical die is typically formed in a repeated pattern on a surface of a wafer. Each die may include a device as described herein, and may include other structures or circuits. The individual die are cut or diced from the wafer, then packaged as an integrated circuit. One skilled in the art would know how to dice wafers and package die to produce integrated circuits. Integrated circuits so manufactured are considered part of this invention.
- Again, it should be emphasized that the above-described embodiments of the invention are intended to be illustrative only. For example, although the illustrative embodiments disclosed herein assign two opcodes for each of a plurality of instructions, other embodiments can assign more than two opcodes to a given instruction. Also, the particular configuration of the processor and its instruction decoder or other associated elements can be varied in alternative embodiments. Other alternative embodiments of the invention may be configured utilizing different arrangements of memory circuitry and processing circuitry than those explicitly described above. These and numerous other alternative embodiments within the scope of the following claims will be apparent to those skilled in the art.
Claims (15)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/186,707 US20140173256A1 (en) | 2007-05-30 | 2014-02-21 | Processor configured for operation with multiple operation codes per instruction |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/755,511 US8700886B2 (en) | 2007-05-30 | 2007-05-30 | Processor configured for operation with multiple operation codes per instruction |
US14/186,707 US20140173256A1 (en) | 2007-05-30 | 2014-02-21 | Processor configured for operation with multiple operation codes per instruction |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/755,511 Division US8700886B2 (en) | 2007-05-30 | 2007-05-30 | Processor configured for operation with multiple operation codes per instruction |
Publications (1)
Publication Number | Publication Date |
---|---|
US20140173256A1 true US20140173256A1 (en) | 2014-06-19 |
Family
ID=40089592
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/755,511 Active 2032-06-13 US8700886B2 (en) | 2007-05-30 | 2007-05-30 | Processor configured for operation with multiple operation codes per instruction |
US14/186,707 Abandoned US20140173256A1 (en) | 2007-05-30 | 2014-02-21 | Processor configured for operation with multiple operation codes per instruction |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/755,511 Active 2032-06-13 US8700886B2 (en) | 2007-05-30 | 2007-05-30 | Processor configured for operation with multiple operation codes per instruction |
Country Status (1)
Country | Link |
---|---|
US (2) | US8700886B2 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9148129B2 (en) * | 2013-06-07 | 2015-09-29 | Finisar Corporation | Driver circuit with asymmetric boost |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4774688A (en) * | 1984-11-14 | 1988-09-27 | International Business Machines Corporation | Data processing system for determining min/max in a single operation cycle as a result of a single instruction |
US5101483A (en) * | 1988-01-27 | 1992-03-31 | Oki Electric Industry Co., Ltd. | Instruction decoder simplification by reuse of bits to produce the same control states for different instructions |
US5721945A (en) * | 1996-05-06 | 1998-02-24 | Advanced Micro Devices | Microprocessor configured to detect a DSP call instruction and to direct a DSP to execute a routine corresponding to the DSP call instruction |
US5784639A (en) * | 1995-12-29 | 1998-07-21 | Intel Corporation | Load buffer integrated dynamic decoding logic |
US6463339B1 (en) * | 1999-09-27 | 2002-10-08 | Rockwell Automation Technologies, Inc. | High reliability industrial controller using tandem independent programmable gate-arrays |
US6731559B2 (en) * | 2001-09-18 | 2004-05-04 | Kabushiki Kaisha Toshiba | Synchronous semiconductor memory device |
US20050010830A1 (en) * | 2001-08-10 | 2005-01-13 | Paul Webster | Power reduction in microprocessor systems |
US20060101297A1 (en) * | 2004-10-15 | 2006-05-11 | Genesis Microchip Inc. | Reducing power consumption of a microprocessor |
US20070300044A1 (en) * | 2006-06-27 | 2007-12-27 | Moyer William C | Method and apparatus for interfacing a processor and coprocessor |
-
2007
- 2007-05-30 US US11/755,511 patent/US8700886B2/en active Active
-
2014
- 2014-02-21 US US14/186,707 patent/US20140173256A1/en not_active Abandoned
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4774688A (en) * | 1984-11-14 | 1988-09-27 | International Business Machines Corporation | Data processing system for determining min/max in a single operation cycle as a result of a single instruction |
US5101483A (en) * | 1988-01-27 | 1992-03-31 | Oki Electric Industry Co., Ltd. | Instruction decoder simplification by reuse of bits to produce the same control states for different instructions |
US5784639A (en) * | 1995-12-29 | 1998-07-21 | Intel Corporation | Load buffer integrated dynamic decoding logic |
US5721945A (en) * | 1996-05-06 | 1998-02-24 | Advanced Micro Devices | Microprocessor configured to detect a DSP call instruction and to direct a DSP to execute a routine corresponding to the DSP call instruction |
US6463339B1 (en) * | 1999-09-27 | 2002-10-08 | Rockwell Automation Technologies, Inc. | High reliability industrial controller using tandem independent programmable gate-arrays |
US20050010830A1 (en) * | 2001-08-10 | 2005-01-13 | Paul Webster | Power reduction in microprocessor systems |
US6731559B2 (en) * | 2001-09-18 | 2004-05-04 | Kabushiki Kaisha Toshiba | Synchronous semiconductor memory device |
US20060101297A1 (en) * | 2004-10-15 | 2006-05-11 | Genesis Microchip Inc. | Reducing power consumption of a microprocessor |
US20070300044A1 (en) * | 2006-06-27 | 2007-12-27 | Moyer William C | Method and apparatus for interfacing a processor and coprocessor |
Also Published As
Publication number | Publication date |
---|---|
US8700886B2 (en) | 2014-04-15 |
US20080301410A1 (en) | 2008-12-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11347509B2 (en) | Encoding and decoding variable length instructions | |
US7774768B2 (en) | Method and apparatus for processor code optimization using code compression | |
Gordon-Ross et al. | Exploiting fixed programs in embedded systems: A loop cache example | |
JP2009535750A (en) | Method and apparatus for implementing a polymorphic branch predictor | |
US20190102197A1 (en) | System and method for merging divide and multiply-subtract operations | |
US20200278865A1 (en) | Hazard Mitigation for Lightweight Processor Cores | |
US20060015855A1 (en) | Systems and methods for replacing NOP instructions in a first program with instructions of a second program | |
US8700886B2 (en) | Processor configured for operation with multiple operation codes per instruction | |
US20020066082A1 (en) | Bus performance evaluation method for algorithm description | |
US20040236929A1 (en) | Logic circuit and program for executing thereon | |
US20220365755A1 (en) | Performing constant modulo arithmetic | |
CN111930504B (en) | Method and device for determining clock frequency of processor and computer readable storage medium | |
EP3705991A1 (en) | Geometric synthesis | |
US10803216B1 (en) | Combinational logic circuit optimization | |
US7457738B2 (en) | Method for decoding instruction in architectural simulator | |
US7191432B2 (en) | High frequency compound instruction mechanism and method for a compare operation in an arithmetic logic unit | |
US7058678B2 (en) | Fast forwarding ALU | |
Singh et al. | Performance Improvement in MIPS Pipeline Processor based on FPGA | |
US9880810B1 (en) | Universal shifter and rotator and methods for implementing a universal shifter and rotator | |
US8136061B2 (en) | Method of logic circuit synthesis and design using a dynamic circuit library | |
US11269651B2 (en) | Reusing adjacent SIMD unit for fast wide result generation | |
CN102681822A (en) | Realization device and method for buffer management of processor address | |
US8566772B2 (en) | Method, computer program and computing system for optimizing an architectural model of a microprocessor | |
US20040153485A1 (en) | Arithmetic apparatus and arithmetic method | |
Hao et al. | A high-performance framework for instruction-set simulator |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: DEUTSCHE BANK AG NEW YORK BRANCH, AS COLLATERAL AG Free format text: PATENT SECURITY AGREEMENT;ASSIGNORS:LSI CORPORATION;AGERE SYSTEMS LLC;REEL/FRAME:032856/0031 Effective date: 20140506 |
|
AS | Assignment |
Owner name: AVAGO TECHNOLOGIES GENERAL IP (SINGAPORE) PTE. LTD Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:AGERE SYSTEMS LLC;REEL/FRAME:035365/0634 Effective date: 20140804 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
AS | Assignment |
Owner name: LSI CORPORATION, CALIFORNIA Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENT RIGHTS (RELEASES RF 032856-0031);ASSIGNOR:DEUTSCHE BANK AG NEW YORK BRANCH, AS COLLATERAL AGENT;REEL/FRAME:037684/0039 Effective date: 20160201 Owner name: AGERE SYSTEMS LLC, PENNSYLVANIA Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENT RIGHTS (RELEASES RF 032856-0031);ASSIGNOR:DEUTSCHE BANK AG NEW YORK BRANCH, AS COLLATERAL AGENT;REEL/FRAME:037684/0039 Effective date: 20160201 |