EP1504342A2 - Method and arrangement for power efficient control of processors - Google Patents
Method and arrangement for power efficient control of processorsInfo
- Publication number
- EP1504342A2 EP1504342A2 EP03729889A EP03729889A EP1504342A2 EP 1504342 A2 EP1504342 A2 EP 1504342A2 EP 03729889 A EP03729889 A EP 03729889A EP 03729889 A EP03729889 A EP 03729889A EP 1504342 A2 EP1504342 A2 EP 1504342A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- slice
- processors
- vliw
- register
- pcu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 12
- 230000001276 controlling effect Effects 0.000 claims description 3
- 230000006870 function Effects 0.000 claims description 3
- 230000006978 adaptation Effects 0.000 abstract description 3
- 230000001419 dependent effect Effects 0.000 abstract 1
- 238000004364 calculation method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3853—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution of compound instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3867—Concurrent instruction execution, e.g. pipeline or look ahead using instruction pipelines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
Definitions
- the invention relates to a method for the functional control of the program and / or data flow in digital signal processors and processors, each with separate and separate modules for program and data flow control, which operate in parallel arithmetic units.
- DSP digital signal processors
- the results obtained can usually only be provided at different times or after a different number of processor cycles in the respective slice.
- the regime of command processing compliant with the other SIMD slices can either not be implemented at all or only with great effort.
- this high expenditure in hardware occurs as a strong processor and memory utilization, which reduces the processor performance.
- This reduction can e.g. be averted by a memory expansion, which means an increase in hardware expenditure.
- the task according to the invention thus consists in realizing a power-efficient individual adaptation of the signal processing for the SIMD instruction type used in the individual data paths and in particular in minimizing the occurrence of NOP instructions with which the VLIW architecture of the processor are supplied.
- the task according to the invention is achieved in that the SIMD implemented by the PCU Commands parallel signal processing of the processor in a respective data path (DP) of a first and second slice is individually controlled by a "single-slice half" state output by an SSM register bank for each slice.
- DP data path
- the controlling effect of the output "single slice half" state is achieved in that the bits of the SSM register bank assigned for the first and second slice switch the register clock supply via the respectively associated first and second gated clock cell ,
- the register file unit (RFU) and the memory access register of the processor remain functional.
- the SSM register bank of the PCU can be written to by the PCU at any time.
- the aim of this solution is to start the calculations in parallel in the slices of the data paths of the processor in accordance with the SIMD command type.
- the intermediate and / or final results are provided in the slices at different times in the pipeline control registers, accumulators or result registers of the associated data paths.
- a supplementary embodiment of the solution of the task according to the invention is that the clock supply for the VLIW unit is controlled by a software-related status output from the program flow of the processor in such a way that partial instruction words that are currently available in the VLIW unit are subsequently used in the latter Multiple use can be provided on the functional units.
- This solution according to the invention is advantageously effective if a necessary adaptation of the algorithm to the SIMD command type during signal processing requires that the data paths or the associated VLIW architecture of the processor with no-operation commands (NOP) or the like Instructions with a high repetition rate must be supplied.
- NOP no-operation commands
- the avoided generation of the same VLIW reduces the storage space consumption and keeps the processor's computing load low, so that the computing power is efficiently available for the important calculations.
- An advantageous variant of the additional embodiment of the solution according to the invention is that the generation of further VILWs in the VLIW unit is interrupted by the PCU being announced a VLIW-WAIT command via a distant signal line and this command to the PCU in the next cycle is applied, the PCU subsequently switching the clock supply for the VLIW unit by means of a “VLIW-WAIT” signal line and a third gated clock cell.
- debug routines can be implemented in software tests by setting and starting software break points in the program code.
- results are provided in the associated accumulator 8 at different times. In this case, one for the first and second slice 18; 19 assigned bit of the SSM register bank 13 set.
- the signal assignment of this bit is at the first and second slice 18; 19 respectively associated data path 14 via the first and / or second gated clock cell 3; 4 fed and controls the signal processing in the first and second slice 18; 19 individually, in that if there is a result in this slice, the clock supply at the associated input register and thus also the signal processing is prevented.
- the signal processing in the individual slices of the data paths 14 is thus advantageously adapted to the requirements of parallel processing of the SIMD commands.
- VLIW unit very long instruction word
- first gated clock cell second gated clock cell AGU address generating unit
- PCU processing controlling unit
- clock supply line accumulator further processing unit (with gated Clock cell)
- Register of the further processing unit RFU (register file unit)
- SIMD control bus SSM register bank (single slice mode)
- Data path SIMD data path control line Distortion signal line VLIW-WAIT signal line first slice second slice third gated Clock-cell
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Advance Control (AREA)
- Power Sources (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10221530 | 2002-05-14 | ||
DE10221530A DE10221530A1 (en) | 2002-05-14 | 2002-05-14 | Method and arrangement for the efficient control of processors |
PCT/DE2003/001540 WO2003096184A2 (en) | 2002-05-14 | 2003-05-13 | Method and arrangement for power efficient control of processors |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1504342A2 true EP1504342A2 (en) | 2005-02-09 |
Family
ID=29413830
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03729889A Withdrawn EP1504342A2 (en) | 2002-05-14 | 2003-05-13 | Method and arrangement for power efficient control of processors |
Country Status (6)
Country | Link |
---|---|
US (2) | US20070150701A1 (en) |
EP (1) | EP1504342A2 (en) |
JP (1) | JP4208149B2 (en) |
AU (1) | AU2003240421A1 (en) |
DE (1) | DE10221530A1 (en) |
WO (1) | WO2003096184A2 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7493475B2 (en) | 2006-11-15 | 2009-02-17 | Stmicroelectronics, Inc. | Instruction vector-mode processing in multi-lane processor by multiplex switch replicating instruction in one lane to select others along with updated operand address |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6044448A (en) * | 1997-12-16 | 2000-03-28 | S3 Incorporated | Processor having multiple datapath instances |
US6845445B2 (en) | 2000-05-12 | 2005-01-18 | Pts Corporation | Methods and apparatus for power control in a scalable array of processor elements |
US6839828B2 (en) * | 2001-08-14 | 2005-01-04 | International Business Machines Corporation | SIMD datapath coupled to scalar/vector/address/conditional data register file with selective subpath scalar processing mode |
-
2002
- 2002-05-14 DE DE10221530A patent/DE10221530A1/en not_active Ceased
-
2003
- 2003-05-13 AU AU2003240421A patent/AU2003240421A1/en not_active Abandoned
- 2003-05-13 US US10/511,575 patent/US20070150701A1/en not_active Abandoned
- 2003-05-13 WO PCT/DE2003/001540 patent/WO2003096184A2/en active Application Filing
- 2003-05-13 EP EP03729889A patent/EP1504342A2/en not_active Withdrawn
- 2003-05-13 JP JP2004504110A patent/JP4208149B2/en not_active Expired - Fee Related
-
2008
- 2008-05-05 US US12/151,202 patent/US20080215851A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of WO03096184A2 * |
Also Published As
Publication number | Publication date |
---|---|
AU2003240421A1 (en) | 2003-11-11 |
DE10221530A1 (en) | 2003-12-04 |
JP2005525637A (en) | 2005-08-25 |
AU2003240421A8 (en) | 2003-11-11 |
US20080215851A1 (en) | 2008-09-04 |
WO2003096184A2 (en) | 2003-11-20 |
US20070150701A1 (en) | 2007-06-28 |
WO2003096184A3 (en) | 2004-02-19 |
JP4208149B2 (en) | 2009-01-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE2251876C3 (en) | Electronic data processing system | |
DE1200581B (en) | Program interruption system for an electronic calculating machine | |
WO2003096586A2 (en) | Method for effecting the controlled shutdown of data processing units | |
EP1680737A1 (en) | Method and device for operand processing in a processor unit | |
EP2574997A1 (en) | Method for adjusting an operating status | |
DE102015114220A1 (en) | Machine tool with function for the safe control of peripheral devices | |
DE3852434T2 (en) | Virtual computer system. | |
EP1504342A2 (en) | Method and arrangement for power efficient control of processors | |
DE102017117144A1 (en) | simulation device | |
DE3101270C2 (en) | Computer arrangement for word processing with a device for expanding functions | |
DE60302103T2 (en) | Control device for a machine | |
DE10138918A1 (en) | Program controlled unit | |
DE19637369C2 (en) | Digital signal processor with multiplier and method | |
EP1179428B1 (en) | Method and device for working through process steps | |
DE2649932A1 (en) | CENTRAL CONTROL UNIT FOR PERIPHERAL UNITS OF A DATA PROCESSING SYSTEM | |
DE3603319C2 (en) | ||
DE102014210529A1 (en) | Method for operating a control unit | |
WO2013026483A1 (en) | Method for computer‑aided synchronization of a plurality of computation processes | |
DE102019110606A1 (en) | Method and arrangement for controlling clock signals | |
EP1915674B1 (en) | Method and device for controlling a computer system comprising at least two execution units and at least two groups of internal states | |
DE10132809A1 (en) | Automation system with a work drive unit | |
DE10311026A1 (en) | Process for changing the function parameters used in an industrial controller uses sampled values subjected to a weighting process | |
DE102017216454A1 (en) | A method of operating a processor unit having multiple processor cores and shared buffer memory area | |
DE3516880A1 (en) | Arithmetic processing unit | |
EP0629944A1 (en) | Data processing machine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20041008 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
DAX | Request for extension of the european patent (deleted) | ||
RBV | Designated contracting states (corrected) |
Designated state(s): DE FR GB |
|
17Q | First examination report despatched |
Effective date: 20070619 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: NXP SEMICONDUCTORS GERMANY GMBH |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: NXP B.V. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20111201 |