CN112740173A - 使用循环退出预测来加速或抑制处理器的循环模式 - Google Patents
使用循环退出预测来加速或抑制处理器的循环模式 Download PDFInfo
- Publication number
- CN112740173A CN112740173A CN201980061096.3A CN201980061096A CN112740173A CN 112740173 A CN112740173 A CN 112740173A CN 201980061096 A CN201980061096 A CN 201980061096A CN 112740173 A CN112740173 A CN 112740173A
- Authority
- CN
- China
- Prior art keywords
- loop
- instruction
- processor
- instructions
- mode
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3234—Power saving characterised by the action undertaken
- G06F1/3287—Power saving characterised by the action undertaken by switching off individual functional units in the computer system
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/3005—Arrangements for executing specific machine instructions to perform operations for flow control
- G06F9/30065—Loop control instructions; iterative instructions, e.g. LOOP, REPEAT
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3206—Monitoring of events, devices or parameters that trigger a change in power modality
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3234—Power saving characterised by the action undertaken
- G06F1/3243—Power saving in microcontroller unit
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3234—Power saving characterised by the action undertaken
- G06F1/3293—Power saving characterised by the action undertaken by switching to a less power-consuming processor, e.g. sub-CPU
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3234—Power saving characterised by the action undertaken
- G06F1/3296—Power saving characterised by the action undertaken by lowering the supply or operating voltage
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
- G06F9/30083—Power or thermal control instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30181—Instruction operation extension or modification
- G06F9/30189—Instruction operation extension or modification according to execution mode, e.g. mode flag
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/32—Address formation of the next instruction, e.g. by incrementing the instruction counter
- G06F9/322—Address formation of the next instruction, e.g. by incrementing the instruction counter for non-sequential address
- G06F9/325—Address formation of the next instruction, e.g. by incrementing the instruction counter for non-sequential address for loops, e.g. loop detection or loop counter
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3802—Instruction prefetching
- G06F9/3808—Instruction prefetching for instruction reuse, e.g. trace cache, branch target cache
- G06F9/381—Loop buffering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
- G06F9/3844—Speculative instruction execution using dynamic branch prediction, e.g. using branch history tables
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3867—Concurrent instruction execution, e.g. pipeline or look ahead using instruction pipelines
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/50—Reducing energy consumption in communication networks in wire-line communication networks, e.g. low power modes or reduced link rate
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
- Microcomputers (AREA)
- Power Sources (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/134,440 US10915322B2 (en) | 2018-09-18 | 2018-09-18 | Using loop exit prediction to accelerate or suppress loop mode of a processor |
| US16/134,440 | 2018-09-18 | ||
| PCT/US2019/048487 WO2020060734A1 (en) | 2018-09-18 | 2019-08-28 | Using loop exit prediction to accelerate or suppress loop mode of a processor |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN112740173A true CN112740173A (zh) | 2021-04-30 |
Family
ID=69772505
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201980061096.3A Pending CN112740173A (zh) | 2018-09-18 | 2019-08-28 | 使用循环退出预测来加速或抑制处理器的循环模式 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US10915322B2 (https=) |
| EP (1) | EP3853716A4 (https=) |
| JP (1) | JP7301955B2 (https=) |
| KR (1) | KR102556897B1 (https=) |
| CN (1) | CN112740173A (https=) |
| WO (1) | WO2020060734A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117170747A (zh) * | 2023-08-28 | 2023-12-05 | 海光信息技术股份有限公司 | 程序与指令处理、训练与预测方法与装置、处理器 |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10884751B2 (en) | 2018-07-13 | 2021-01-05 | Advanced Micro Devices, Inc. | Method and apparatus for virtualizing the micro-op cache |
| US11294681B2 (en) * | 2019-05-31 | 2022-04-05 | Texas Instruments Incorporated | Processing device with a microbranch target buffer for branch prediction using loop iteration count |
| US11256318B2 (en) * | 2019-08-09 | 2022-02-22 | Intel Corporation | Techniques for memory access in a reduced power state |
| US20210200550A1 (en) * | 2019-12-28 | 2021-07-01 | Intel Corporation | Loop exit predictor |
| US11520590B2 (en) * | 2020-09-02 | 2022-12-06 | Microsoft Technology Licensing, Llc | Detecting a repetitive pattern in an instruction pipeline of a processor to reduce repeated fetching |
| US20220283811A1 (en) * | 2021-03-03 | 2022-09-08 | Microsoft Technology Licensing, Llc | Loop buffering employing loop characteristic prediction in a processor for optimizing loop buffer performance |
| US12288067B2 (en) * | 2022-06-23 | 2025-04-29 | Arm Limited | Prediction of number of iterations of a fetching process |
| US12373215B2 (en) * | 2022-07-25 | 2025-07-29 | Apple Inc. | Using a next fetch predictor circuit with short branches and return fetch groups |
| US20240112050A1 (en) * | 2022-09-29 | 2024-04-04 | Nvidia Corporation | Identifying idle-cores in data centers using machine-learning (ml) |
| US12541371B2 (en) | 2023-08-23 | 2026-02-03 | Arm Limited | Predicting behaviour of control flow instructions using prediction entry types |
| US12411692B2 (en) * | 2023-09-07 | 2025-09-09 | Arm Limited | Storage of prediction-related data |
| US12517732B2 (en) | 2024-03-22 | 2026-01-06 | Tenstorrent USA, Inc. | Processor with one or more progressive conservative execution modes |
| US12450060B1 (en) * | 2024-08-28 | 2025-10-21 | Qualcomm Incorporated | Sharing loop cache instances among multiple threads in processor devices |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110107071A1 (en) * | 2009-11-04 | 2011-05-05 | Jacob Yaakov Jeffrey Allan Alon | System and method for using a branch mis-prediction buffer |
| US20120117362A1 (en) * | 2010-11-10 | 2012-05-10 | Bhargava Ravindra N | Replay of detected patterns in predicted instructions |
| US20130339700A1 (en) * | 2012-06-15 | 2013-12-19 | Conrado Blasco-Allue | Loop buffer learning |
| US20150293577A1 (en) * | 2014-04-11 | 2015-10-15 | Apple Inc. | Instruction loop buffer with tiered power savings |
| US20160179549A1 (en) * | 2014-12-23 | 2016-06-23 | Intel Corporation | Instruction and Logic for Loop Stream Detection |
| US9459871B2 (en) * | 2012-12-31 | 2016-10-04 | Intel Corporation | System of improved loop detection and execution |
| US9710276B2 (en) * | 2012-11-09 | 2017-07-18 | Advanced Micro Devices, Inc. | Execution of instruction loops using an instruction buffer |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6578138B1 (en) | 1999-12-30 | 2003-06-10 | Intel Corporation | System and method for unrolling loops in a trace cache |
| JP5043560B2 (ja) | 2007-08-24 | 2012-10-10 | パナソニック株式会社 | プログラム実行制御装置 |
| US9116686B2 (en) * | 2012-04-02 | 2015-08-25 | Apple Inc. | Selective suppression of branch prediction in vector partitioning loops until dependency vector is available for predicate generating instruction |
| US9753733B2 (en) * | 2012-06-15 | 2017-09-05 | Apple Inc. | Methods, apparatus, and processors for packing multiple iterations of loop in a loop buffer |
| US9471322B2 (en) | 2014-02-12 | 2016-10-18 | Apple Inc. | Early loop buffer mode entry upon number of mispredictions of exit condition exceeding threshold |
| CN104298488B (zh) * | 2014-09-29 | 2018-02-23 | 上海兆芯集成电路有限公司 | 循环预测器指导的循环缓冲器 |
| US9875106B2 (en) | 2014-11-12 | 2018-01-23 | Mill Computing, Inc. | Computer processor employing instruction block exit prediction |
| JP2018005488A (ja) * | 2016-06-30 | 2018-01-11 | 富士通株式会社 | 演算処理装置及び演算処理装置の制御方法 |
-
2018
- 2018-09-18 US US16/134,440 patent/US10915322B2/en active Active
-
2019
- 2019-08-28 CN CN201980061096.3A patent/CN112740173A/zh active Pending
- 2019-08-28 WO PCT/US2019/048487 patent/WO2020060734A1/en not_active Ceased
- 2019-08-28 JP JP2021514963A patent/JP7301955B2/ja active Active
- 2019-08-28 EP EP19862627.7A patent/EP3853716A4/en active Pending
- 2019-08-28 KR KR1020217010368A patent/KR102556897B1/ko active Active
-
2021
- 2021-02-05 US US17/169,053 patent/US11256505B2/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110107071A1 (en) * | 2009-11-04 | 2011-05-05 | Jacob Yaakov Jeffrey Allan Alon | System and method for using a branch mis-prediction buffer |
| US20120117362A1 (en) * | 2010-11-10 | 2012-05-10 | Bhargava Ravindra N | Replay of detected patterns in predicted instructions |
| US20130339700A1 (en) * | 2012-06-15 | 2013-12-19 | Conrado Blasco-Allue | Loop buffer learning |
| US9557999B2 (en) * | 2012-06-15 | 2017-01-31 | Apple Inc. | Loop buffer learning |
| US9710276B2 (en) * | 2012-11-09 | 2017-07-18 | Advanced Micro Devices, Inc. | Execution of instruction loops using an instruction buffer |
| US9459871B2 (en) * | 2012-12-31 | 2016-10-04 | Intel Corporation | System of improved loop detection and execution |
| US20150293577A1 (en) * | 2014-04-11 | 2015-10-15 | Apple Inc. | Instruction loop buffer with tiered power savings |
| US20160179549A1 (en) * | 2014-12-23 | 2016-06-23 | Intel Corporation | Instruction and Logic for Loop Stream Detection |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117170747A (zh) * | 2023-08-28 | 2023-12-05 | 海光信息技术股份有限公司 | 程序与指令处理、训练与预测方法与装置、处理器 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP7301955B2 (ja) | 2023-07-03 |
| US11256505B2 (en) | 2022-02-22 |
| US20200089498A1 (en) | 2020-03-19 |
| EP3853716A4 (en) | 2022-06-15 |
| US20210191722A1 (en) | 2021-06-24 |
| JP2022500777A (ja) | 2022-01-04 |
| KR20210046806A (ko) | 2021-04-28 |
| WO2020060734A1 (en) | 2020-03-26 |
| KR102556897B1 (ko) | 2023-07-18 |
| US10915322B2 (en) | 2021-02-09 |
| EP3853716A1 (en) | 2021-07-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11256505B2 (en) | Using loop exit prediction to accelerate or suppress loop mode of a processor | |
| US9891923B2 (en) | Loop predictor-directed loop buffer | |
| KR100973951B1 (ko) | 오정렬 메모리 액세스 예측 | |
| US20100325395A1 (en) | Dependence prediction in a memory system | |
| US11861365B2 (en) | Macro-op fusion | |
| US9361111B2 (en) | Tracking speculative execution of instructions for a register renaming data store | |
| JP2022500777A5 (https=) | ||
| TW200939116A (en) | Method and apparatus for inhibiting fetch throttling when a processor encounters a low confidence branch instruction in an information handling system | |
| CN113168329A (zh) | 循环退出预测器 | |
| US20140195790A1 (en) | Processor with second jump execution unit for branch misprediction | |
| US10705851B2 (en) | Scheduling that determines whether to remove a dependent micro-instruction from a reservation station queue based on determining cache hit/miss status of one ore more load micro-instructions once a count reaches a predetermined value | |
| US10860327B2 (en) | Methods for scheduling that determine whether to remove a dependent micro-instruction from a reservation station queue based on determining a cache hit/miss status of a load micro-instruction once a count reaches a predetermined value and an apparatus using the same | |
| US10303482B2 (en) | Dynamic processor frequency selection | |
| US20040003215A1 (en) | Method and apparatus for executing low power validations for high confidence speculations | |
| US20130173885A1 (en) | Processor and Methods of Adjusting a Branch Misprediction Recovery Mode | |
| TWI401564B (zh) | 預測處理器組件懸置用之方法與處理器 | |
| US10613866B2 (en) | Method of detecting repetition of an out-of-order execution schedule, apparatus and computer-readable medium | |
| US11663007B2 (en) | Control of branch prediction for zero-overhead loop | |
| US11526360B2 (en) | Adaptive utilization mechanism for a first-line defense branch predictor | |
| JP2010152843A (ja) | 分岐予測の信頼度見積もり回路及びその方法 | |
| CN105094750B (zh) | 一种多线程处理器的返回地址预测方法和装置 | |
| JP2014059665A (ja) | マイクロコンピュータ及びマイクロコンピュータにおける命令処理方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |