JP5186488B2 - 多形的分岐予測子を実装するための方法および装置 - Google Patents
多形的分岐予測子を実装するための方法および装置 Download PDFInfo
- Publication number
- JP5186488B2 JP5186488B2 JP2009510049A JP2009510049A JP5186488B2 JP 5186488 B2 JP5186488 B2 JP 5186488B2 JP 2009510049 A JP2009510049 A JP 2009510049A JP 2009510049 A JP2009510049 A JP 2009510049A JP 5186488 B2 JP5186488 B2 JP 5186488B2
- Authority
- JP
- Japan
- Prior art keywords
- predictor
- bit
- branch
- prediction
- polymorphic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title description 59
- 238000003860 storage Methods 0.000 claims description 33
- 230000007246 mechanism Effects 0.000 claims description 29
- 238000005192 partition Methods 0.000 claims description 7
- 238000004458 analytical method Methods 0.000 claims description 5
- 230000002902 bimodal effect Effects 0.000 description 18
- 230000006870 function Effects 0.000 description 15
- 238000003491 array Methods 0.000 description 11
- 230000000875 corresponding effect Effects 0.000 description 11
- 238000012545 processing Methods 0.000 description 9
- 238000013461 design Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 230000006399 behavior Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 230000007704 transition Effects 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000003068 static effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 239000013256 coordination polymer Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000000919 ceramic Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 239000013067 intermediate product Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000003989 repetitive behavior Effects 0.000 description 1
- 208000013406 repetitive behavior Diseases 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3802—Instruction prefetching
- G06F9/3804—Instruction prefetching for branches, e.g. hedging, branch folding
- G06F9/3806—Instruction prefetching for branches, e.g. hedging, branch folding using address prediction, e.g. return stack, branch history buffer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
- G06F9/3844—Speculative instruction execution using dynamic branch prediction, e.g. using branch history tables
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline, look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
- G06F9/3848—Speculative instruction execution using hybrid branch prediction, e.g. selection between prediction techniques
Description
1993年の「Combining Branch Predictors」、WRL Technical Note 36
b0≦direction;
b1≦confidence when(mode=2bit)else
direction;we_b0≦2_bit_update when(mode=2bit)else
1_bit_update and address_bit=‘0’;we_b1
≦2_bit_update when(mode−2bit)else
1_bit_update and address_bit=‘1’;
b0≦direction when(mode=2bit OR address_bit=‘0’)else
biq_b0;b1≦confidence when(mode=2bit)else
direction when(address_bit=‘1’)else
biq_b1;we≦2_bit_update when(mode−2bit)else
1_bit_update;
b0≦biq_b0 when(mode=2bit OR address_bit=‘0’)else
biq_b1;b1≦biq_b1 when(mode=2bit)else
‘0’;1_bit_update≦b0/=direction;2bit_update≦b0/−direction OR b1/=confidence
Claims (8)
- 1ビット予測子を分岐予測に用いる第1の予測モードと、2ビット予測子を分岐予測に用いる第2の予測モードのいずれにも利用することができる多形的分岐予測子であって、 ストレージ・アレイと、第1の選択要素と、第2の選択要素を備え、
第1の予測モードの際には、
ストレージ・アレイは、ストレージ・アレイ内の1ビット予測子の位置を特定するのに要するアドレス・ビット群のうちの所定の1アドレス・ビット以外のアドレス・ビット群が入力され、入力されたアドレス・ビット群に応じてビットb0とビットb1を出力するものであり、
第1の選択要素は、ストレージ・アレイ内の1ビット予測子の位置を特定するのに要するアドレス・ビット群のうちの所定の1アドレス・ビットの値に応じて、ビットb0とビットb1のいずれか一方を1ビット予測子として出力するものであり、
第2の予測モードの際には、
ストレージ・アレイは、ストレージ・アレイ内の2ビット予測子の位置を特定するのに要するアドレス・ビット群が入力され、入力されたアドレス・ビット群に応じてビットb0とビットb1を出力するものであり、
第1の選択要素は、ビットb0を2ビット予測子のうちの一方のビットとして出力するものであり、
第2の選択要素は、ビットb1を2ビット予測子のうちの他方のビットとして出力するものである、
多形的分岐予測子。 - 動作時に分岐予測子の性能を最適化するための動的設定に応じて、予測モードを動的に選択するためのメカニズムをさらに備える、
請求項1に記載の多形的分岐予測子。 - メカニズムは、あらかじめ指定されたように、特定アプリケーションに対して、予測モードを選択するものである、
請求項2に記載の多形的分岐予測子。 - メカニズムは、アプリケーションまたはパーティションについて、以前にスケジューリングされた際に格納されていた構成情報に基づいて、予測モードを選択するものである、
請求項2に記載の多形的分岐予測子。 - メカニズムは、ランタイムの振る舞いの分析に基づいて、予測モードを選択するものである、
請求項2に記載の多形的分岐予測子。 - メカニズムは、コンパイラまたはランタイム・コンポーネントからの情報に基づいて、予測モードを選択するものである、
請求項2に記載の多形的分岐予測子。 - メカニズムは、アプリケーションのランタイム中に、予測モードの選択を更新するものである、
請求項2に記載の多形的分岐予測子。 - ストレージ・アレイから出力されたビットb0とビットb1を格納する分岐情報キューをさらに備え、
分岐情報キューから出力される情報は、ストレージ・アレイから前もって読み取られた情報を書き直すために使用されるものであり、
書き直される情報は、ストレージ・アレイからビットb0及びビットb1として分岐情報キューに入力されてから、ストレージ・アレイに書き直されるまで、分岐情報キューにおいて変更されないように維持されるものである、
請求項1乃至7のいずれかに記載の多形的分岐予測子。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/417,847 US7523298B2 (en) | 2006-05-04 | 2006-05-04 | Polymorphic branch predictor and method with selectable mode of prediction |
US11/417,847 | 2006-05-04 | ||
PCT/US2007/068027 WO2007131032A2 (en) | 2006-05-04 | 2007-05-02 | Methods and apparatus for implementing polymorphic branch predictors |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2009535750A JP2009535750A (ja) | 2009-10-01 |
JP5186488B2 true JP5186488B2 (ja) | 2013-04-17 |
Family
ID=38668509
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2009510049A Expired - Fee Related JP5186488B2 (ja) | 2006-05-04 | 2007-05-02 | 多形的分岐予測子を実装するための方法および装置 |
Country Status (5)
Country | Link |
---|---|
US (2) | US7523298B2 (ja) |
EP (1) | EP2024815B1 (ja) |
JP (1) | JP5186488B2 (ja) |
CN (1) | CN101427213B (ja) |
WO (1) | WO2007131032A2 (ja) |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080209190A1 (en) * | 2007-02-28 | 2008-08-28 | Advanced Micro Devices, Inc. | Parallel prediction of multiple branches |
US7962724B1 (en) * | 2007-09-28 | 2011-06-14 | Oracle America, Inc. | Branch loop performance enhancement |
US20100332812A1 (en) * | 2009-06-24 | 2010-12-30 | Doug Burger | Method, system and computer-accessible medium for low-power branch prediction |
US8612731B2 (en) | 2009-11-06 | 2013-12-17 | International Business Machines Corporation | Branch target buffer for emulation environments |
US9122486B2 (en) * | 2010-11-08 | 2015-09-01 | Qualcomm Incorporated | Bimodal branch predictor encoded in a branch instruction |
US9778934B2 (en) * | 2010-11-16 | 2017-10-03 | Advanced Micro Devices, Inc. | Power efficient pattern history table fetch in branch predictor |
US9495136B2 (en) * | 2011-01-28 | 2016-11-15 | International Business Machines Corporation | Using aliasing information for dynamic binary optimization |
WO2012127589A1 (ja) * | 2011-03-18 | 2012-09-27 | 富士通株式会社 | マルチコアプロセッサシステム、および分岐予測方法 |
WO2012127666A1 (ja) * | 2011-03-23 | 2012-09-27 | 富士通株式会社 | 演算処理装置、情報処理装置及び演算処理方法 |
US8959320B2 (en) * | 2011-12-07 | 2015-02-17 | Apple Inc. | Preventing update training of first predictor with mismatching second predictor for branch instructions with alternating pattern hysteresis |
US9032191B2 (en) * | 2012-01-23 | 2015-05-12 | International Business Machines Corporation | Virtualization support for branch prediction logic enable / disable at hypervisor and guest operating system levels |
CN102608518A (zh) * | 2012-02-29 | 2012-07-25 | 华为技术有限公司 | 一种芯片测试方法及装置 |
US20130283023A1 (en) * | 2012-04-18 | 2013-10-24 | Qualcomm Incorporated | Bimodal Compare Predictor Encoded In Each Compare Instruction |
US9229723B2 (en) * | 2012-06-11 | 2016-01-05 | International Business Machines Corporation | Global weak pattern history table filtering |
US9395984B2 (en) * | 2012-09-12 | 2016-07-19 | Qualcomm Incorporated | Swapping branch direction history(ies) in response to a branch prediction table swap instruction(s), and related systems and methods |
US9389868B2 (en) | 2012-11-01 | 2016-07-12 | International Business Machines Corporation | Confidence-driven selective predication of processor instructions |
US10795683B2 (en) | 2014-06-11 | 2020-10-06 | International Business Machines Corporation | Predicting indirect branches using problem branch filtering and pattern cache |
CN108062236A (zh) * | 2016-11-07 | 2018-05-22 | 杭州华为数字技术有限公司 | 一种软硬件协同分支指令预测方法及装置 |
US10747539B1 (en) | 2016-11-14 | 2020-08-18 | Apple Inc. | Scan-on-fill next fetch target prediction |
CN107688468B (zh) * | 2016-12-23 | 2020-05-15 | 北京国睿中数科技股份有限公司 | 推测执行处理器中分支指令与分支预测功能的验证方法 |
US10642621B2 (en) * | 2017-12-29 | 2020-05-05 | Intel Corporation | System, apparatus and method for controlling allocations into a branch prediction circuit of a processor |
US10740104B2 (en) * | 2018-08-16 | 2020-08-11 | International Business Machines Corporation | Tagging target branch predictors with context with index modification and late stop fetch on tag mismatch |
US11086629B2 (en) * | 2018-11-09 | 2021-08-10 | Arm Limited | Misprediction of predicted taken branches in a data processing apparatus |
US11163577B2 (en) | 2018-11-26 | 2021-11-02 | International Business Machines Corporation | Selectively supporting static branch prediction settings only in association with processor-designated types of instructions |
US10963260B2 (en) * | 2019-01-18 | 2021-03-30 | Arm Limited | Branch predictor |
CN110348250B (zh) * | 2019-06-26 | 2020-12-29 | 中国科学院信息工程研究所 | 多链式哈希栈的硬件开销优化方法及系统 |
CN111459549B (zh) * | 2020-04-07 | 2022-11-01 | 上海兆芯集成电路有限公司 | 具有高度领先分支预测器的微处理器 |
US11803386B2 (en) * | 2021-09-16 | 2023-10-31 | International Business Machines Corporation | Neuron cache-based hardware branch prediction |
US20230315469A1 (en) * | 2022-03-30 | 2023-10-05 | Advanced Micro Devices, Inc. | Hybrid parallelized tagged geometric (tage) branch prediction |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5163140A (en) * | 1990-02-26 | 1992-11-10 | Nexgen Microsystems | Two-level branch prediction cache |
JPH04245334A (ja) * | 1991-01-31 | 1992-09-01 | Hitachi Ltd | 情報処理装置の命令先読み制御方式 |
US5758142A (en) * | 1994-05-31 | 1998-05-26 | Digital Equipment Corporation | Trainable apparatus for predicting instruction outcomes in pipelined processors |
JPH10240526A (ja) * | 1997-02-27 | 1998-09-11 | Fujitsu Ltd | 分岐予測装置 |
US6550004B1 (en) * | 1999-11-05 | 2003-04-15 | Ip-First, Llc | Hybrid branch predictor with improved selector table update mechanism |
US7203825B2 (en) * | 2001-10-03 | 2007-04-10 | Intel Corporation | Sharing information to reduce redundancy in hybrid branch prediction |
US7120784B2 (en) * | 2003-04-28 | 2006-10-10 | International Business Machines Corporation | Thread-specific branch prediction by logically splitting branch history tables and predicted target address cache in a simultaneous multithreading processing environment |
KR100785723B1 (ko) * | 2004-04-21 | 2007-12-18 | 후지쯔 가부시끼가이샤 | 분기 예측 장치, 그 방법 및 프로세서 |
KR100630702B1 (ko) * | 2004-10-05 | 2006-10-02 | 삼성전자주식회사 | 명령어 캐쉬와 명령어 변환 참조 버퍼의 제어기, 및 그제어방법 |
-
2006
- 2006-05-04 US US11/417,847 patent/US7523298B2/en not_active Expired - Fee Related
-
2007
- 2007-05-02 EP EP07797318A patent/EP2024815B1/en not_active Not-in-force
- 2007-05-02 CN CN2007800138518A patent/CN101427213B/zh not_active Expired - Fee Related
- 2007-05-02 JP JP2009510049A patent/JP5186488B2/ja not_active Expired - Fee Related
- 2007-05-02 WO PCT/US2007/068027 patent/WO2007131032A2/en active Application Filing
-
2008
- 2008-08-19 US US12/194,145 patent/US20080307209A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
CN101427213B (zh) | 2010-08-25 |
EP2024815B1 (en) | 2012-11-28 |
EP2024815A4 (en) | 2009-07-22 |
WO2007131032A2 (en) | 2007-11-15 |
CN101427213A (zh) | 2009-05-06 |
US20080005542A1 (en) | 2008-01-03 |
US7523298B2 (en) | 2009-04-21 |
EP2024815A2 (en) | 2009-02-18 |
US20080307209A1 (en) | 2008-12-11 |
WO2007131032B1 (en) | 2008-10-30 |
JP2009535750A (ja) | 2009-10-01 |
WO2007131032A3 (en) | 2008-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5186488B2 (ja) | 多形的分岐予測子を実装するための方法および装置 | |
US8166281B2 (en) | Implementing instruction set architectures with non-contiguous register file specifiers | |
US8918623B2 (en) | Implementing instruction set architectures with non-contiguous register file specifiers | |
US8572358B2 (en) | Meta predictor restoration upon detecting misprediction | |
US10664280B2 (en) | Fetch ahead branch target buffer | |
EP2864868A1 (en) | Methods and apparatus to extend software branch target hints | |
JP2004326752A (ja) | 同時マルチスレッド化プロセッサ | |
US11074080B2 (en) | Apparatus and branch prediction circuitry having first and second branch prediction schemes, and method | |
KR100986375B1 (ko) | 피연산자의 빠른 조건부 선택 | |
US11294684B2 (en) | Indirect branch predictor for dynamic indirect branches | |
US7130991B1 (en) | Method and apparatus for loop detection utilizing multiple loop counters and a branch promotion scheme | |
TWI788912B (zh) | 可調整分支預測方法和微處理器 | |
US11379240B2 (en) | Indirect branch predictor based on register operands | |
CN113515311A (zh) | 微处理器和预取指调整方法 | |
US20180267803A1 (en) | Computer Processor Employing Phases of Operations Contained in Wide Instructions | |
TWI786691B (zh) | 微處理器和分支處理方法 | |
US10990403B1 (en) | Predicting an outcome of an instruction following a flush | |
US11157284B1 (en) | Predicting an outcome of an instruction following a flush | |
TW202344988A (zh) | 用於最佳化迴路重放性能的處理器中捕獲迴路的最佳化 | |
US20160239312A1 (en) | Computer Processor Employing Phases of Operations Contained in Wide Instructions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20100222 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20120918 |
|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20121017 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20121120 |
|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20121213 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20130108 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20130121 |
|
R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20160125 Year of fee payment: 3 |
|
LAPS | Cancellation because of no payment of annual fees |