JP2014142969A - Method, system and computer-accessible medium for providing a distributed predicate prediction - Google Patents
Method, system and computer-accessible medium for providing a distributed predicate prediction
- Publication number
- JP2014142969A (application JP2014095690A)
- Authority
- JP
- Japan
- Prior art keywords
- predicate
- predictor
- prediction
- computing system
- cores
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/80—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformation of program code
- G06F8/41—Compilation
- G06F8/44—Encoding
- G06F8/445—Exploiting fine grain parallelism, i.e. parallelism at instruction level
- G06F8/4451—Avoiding pipeline stalls
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3842—Speculative instruction execution
- G06F9/3844—Speculative instruction execution using dynamic branch prediction, e.g. using branch history tables
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3854—Instruction completion, e.g. retiring, committing or graduating
- G06F9/3858—Result writeback, i.e. updating the architectural state or memory
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
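[Solution] Examples of a system, method, and computer-accessible medium are provided for generating predicate predictions in a distributed multi-core architecture. With such a system, method, and computer-accessible medium, approximate predicate path information can be intelligently encoded on branch instructions. Using this statically generated information, distributed predicate predictors can generate dynamic predicate histories that facilitate accurate prediction of high-confidence predicates while minimizing communication between the cores.
[Selected drawing] FIG. 4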
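The flow summarized in the abstract can be illustrated with a minimal Python sketch: each core owns a private predictor, shifts compiler-encoded path bits from branch instructions into a block history, and records locally resolved predicate outcomes in a local history, so no predictor state is exchanged between cores. The class name `CorePredicatePredictor`, the 8-bit history width, the 256-entry table of 2-bit counters, and the XOR index hash are assumptions chosen for illustration; this record does not specify them.

```python
from dataclasses import dataclass, field
from typing import List

HISTORY_BITS = 8    # assumed width of each history register
TABLE_SIZE = 256    # assumed number of 2-bit counters per core
HISTORY_MASK = (1 << HISTORY_BITS) - 1


@dataclass
class CorePredicatePredictor:
    """Hypothetical per-core predictor; no state is shared between cores."""
    local_history: int = 0   # outcomes of predicates resolved on this core
    block_history: int = 0   # path bits read from fetched branch instructions
    counters: List[int] = field(default_factory=lambda: [1] * TABLE_SIZE)

    def _index(self, predicate_addr: int) -> int:
        # Assumed hash: XOR the address with both histories.
        return (predicate_addr ^ self.local_history ^ self.block_history) % TABLE_SIZE

    def observe_branch(self, path_bits: int, width: int) -> None:
        """Shift compiler-encoded path bits into the block history."""
        path_bits &= (1 << width) - 1
        self.block_history = ((self.block_history << width) | path_bits) & HISTORY_MASK

    def predict(self, predicate_addr: int) -> bool:
        """Predict true when the 2-bit counter is in its upper half."""
        return self.counters[self._index(predicate_addr)] >= 2

    def update(self, predicate_addr: int, outcome: bool) -> None:
        """Train the counter, then record the outcome in the local history."""
        i = self._index(predicate_addr)
        if outcome:
            self.counters[i] = min(3, self.counters[i] + 1)
        else:
            self.counters[i] = max(0, self.counters[i] - 1)
        self.local_history = ((self.local_history << 1) | int(outcome)) & HISTORY_MASK


# One predictor per core; each is trained only with what its own core sees.
cores = [CorePredicatePredictor() for _ in range(4)]
cores[0].observe_branch(path_bits=0b101, width=3)
for _ in range(12):
    cores[0].update(predicate_addr=0x40, outcome=True)
```

In this sketch only `cores[0]` has been trained, so it alone predicts the predicate at `0x40` as true; the other cores keep their initial state, which mirrors the goal of keeping inter-core communication low.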
Description
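This invention was made with government support under F33615-03-C-4106 awarded by the United States Air Force. The United States government has certain rights in the invention.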
Claims (20)
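- A computing system comprising a multi-core processor that includes a plurality of processor cores, each of the cores comprising a predicate predictor, wherein the predicate predictors are configured to generate a predicate prediction.
- The computing system of claim 1, wherein at least one of the cores executes an application program comprising a branch instruction having encoded predicate path information, and the predicate predictor is configured to generate the predicate prediction based on the predicate path information.
- The computing system of claim 2, wherein at least one of the cores is a compiler configured to encode the predicate path information based on the branch instruction.
- The computing system of claim 2, wherein the branch instruction has a block address that determines which of the cores of the multi-core processor is assigned to execute the branch instruction.
- The computing system of claim 1, wherein the multi-core processor comprises an explicit data graph execution microarchitecture.
- The computing system of claim 1, wherein the predicate predictor comprises a base predictor and a global history register.
- The computing system of claim 6, wherein the base predictor comprises a geometric history length predictor.
- The computing system of claim 6, wherein the global history register comprises a core-local predicate history register.
- The computing system of claim 6, wherein the global history register comprises a global block history register.
- The computing system of claim 6, wherein the global history register comprises a core-local predicate history register and a global block history register.
- The computing system of claim 1, wherein the predicate predictor is configured to generate a plurality of predicate predictions, and at least one of the cores is configured to obtain a confidence prediction indicating the accuracy of the predicate predictions and to determine, based on the confidence prediction, which predicate should be predicted next.
- A method for providing a predicate prediction in a multi-core processor, comprising:
providing a predicate predictor for each of a plurality of processor cores in the multi-core processor; and
generating the predicate prediction from a plurality of branch instructions using the predicate predictors.
- The method of claim 12, further comprising executing an application program by at least one of the cores, wherein the program includes one of a plurality of branch instructions having encoded predicate path information.
- The method of claim 13, further comprising encoding the predicate path information for the branch instruction using a compiler.
- The method of claim 13, wherein the program comprises a plurality of branch instructions, the method further comprising determining which processor core executes each branch instruction using a block address of that branch instruction.
- The method of claim 12, wherein the at least one predicate predictor includes a base predictor and a global history register.
- The method of claim 16, wherein the base predictor is a geometric history length predictor.
- The method of claim 16, wherein the global history register is a core-local predicate history register.
- The method of claim 16, wherein the global history register is a global block history register.
- A computer-accessible medium having stored thereon computer-executable instructions for providing a predicate prediction within a multi-core processor computing system, wherein a processing arrangement is configured to perform procedures when executing the instructions, the procedures comprising:
providing a predicate predictor for each of a plurality of processor cores of the multi-core processor, each of the processor cores comprising at least one predicate predictor; and
generating the predicate prediction using the predicate predictors.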
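Three elements recited in the claims lend themselves to a short illustration: selecting a core from a block address, the history lengths of a geometric history length predictor, and choosing the next predicate to predict from confidence values. The Python sketch below is one possible reading under stated assumptions. The function names, the 128-byte block size, the modulo interleaving, the growth ratio, and the 0.5 confidence threshold are all hypothetical and are not taken from the claims.

```python
from typing import Dict, List, Optional


def owner_core(block_address: int, num_cores: int, block_bytes: int = 128) -> int:
    """Assumed mapping: consecutive blocks are interleaved across the cores."""
    return (block_address // block_bytes) % num_cores


def geometric_history_lengths(shortest: int, ratio: float, tables: int) -> List[int]:
    """History lengths that grow geometrically; the first table uses no history."""
    return [0] + [round(shortest * ratio ** i) for i in range(tables - 1)]


def next_predicate(confidences: Dict[str, float], threshold: float = 0.5) -> Optional[str]:
    """Pick the most confident candidate, or None if none clears the threshold."""
    if not confidences:
        return None
    name, best = max(confidences.items(), key=lambda kv: kv[1])
    return name if best >= threshold else None
```

With these assumed parameters, blocks at `0x1000` and `0x1080` map to cores 0 and 1 of a four-core processor, and a five-table predictor with a shortest history of 2 and ratio 2.0 uses history lengths 0, 2, 4, 8, and 16. Returning `None` from `next_predicate` stands for the case where no prediction is confident enough and the core waits for the computed predicate value instead.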
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/556,440 US8433885B2 (en) | 2009-09-09 | 2009-09-09 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
US12/556,440 | 2009-09-09 |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012522834A Division JP2013500539A (ja) | 2009-09-09 | 2010-06-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2015096238A Division JP2015164068A (ja) | 2009-09-09 | 2015-05-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2014142969A (ja) | 2014-08-07 |
JP2014142969A5 (ja) | 2015-05-14 |
JP5747104B2 (ja) | 2015-07-08 |
Family
ID=43648555
Family Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012522834A Pending JP2013500539A (ja) | 2009-09-09 | 2010-06-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
JP2014095690A Active JP5747104B2 (ja) | 2009-09-09 | 2014-05-07 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
JP2015096238A Pending JP2015164068A (ja) | 2009-09-09 | 2015-05-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2012522834A Pending JP2013500539A (ja) | 2009-09-09 | 2010-06-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2015096238A Pending JP2015164068A (ja) | 2009-09-09 | 2015-05-11 | Method, system and computer-accessible medium for providing a distributed predicate prediction |
Country Status (6)
Country | Link |
---|---|
US (1) | US8433885B2 (ja) |
JP (3) | JP2013500539A (ja) |
KR (1) | KR101364314B1 (ja) |
CN (2) | CN102473086B (ja) |
DE (1) | DE112010003595B4 (ja) |
WO (1) | WO2011031361A1 (ja) |
Families Citing this family (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10698859B2 (en) | 2009-09-18 | 2020-06-30 | The Board Of Regents Of The University Of Texas System | Data multicasting with router replication and target instruction identification in a distributed multi-core processing architecture |
WO2011159309A1 (en) * | 2010-06-18 | 2011-12-22 | The Board Of Regents Of The University Of Texas System | Combined branch target and predicate prediction |
EP2645254A4 (en) * | 2010-11-25 | 2014-01-15 | Toyota Motor Co Ltd | PROCESSOR, ELECTRONIC CONTROL DEVICE, CREATION PROGRAM |
WO2012127589A1 (ja) * | 2011-03-18 | 2012-09-27 | 富士通株式会社 | マルチコアプロセッサシステム、および分岐予測方法 |
US9182991B2 (en) | 2012-02-06 | 2015-11-10 | International Business Machines Corporation | Multi-threaded processor instruction balancing through instruction uncertainty |
US9268569B2 (en) | 2012-02-24 | 2016-02-23 | Apple Inc. | Branch misprediction behavior suppression on zero predicate branch mispredict |
US9792252B2 (en) | 2013-05-31 | 2017-10-17 | Microsoft Technology Licensing, Llc | Incorporating a spatial array into one or more programmable processor cores |
US9507594B2 (en) * | 2013-07-02 | 2016-11-29 | Intel Corporation | Method and system of compiling program code into predicated instructions for execution on a processor without a program counter |
US20160232346A1 (en) * | 2015-02-05 | 2016-08-11 | Qualcomm Incorporated | Mechanism for tracking tainted data |
US9946549B2 (en) | 2015-03-04 | 2018-04-17 | Qualcomm Incorporated | Register renaming in block-based instruction set architecture |
US9916164B2 (en) * | 2015-06-11 | 2018-03-13 | Intel Corporation | Methods and apparatus to optimize instructions for execution by a processor |
US9940136B2 (en) | 2015-06-26 | 2018-04-10 | Microsoft Technology Licensing, Llc | Reuse of decoded instructions |
US10409599B2 (en) | 2015-06-26 | 2019-09-10 | Microsoft Technology Licensing, Llc | Decoding information about a group of instructions including a size of the group of instructions |
US10346168B2 (en) | 2015-06-26 | 2019-07-09 | Microsoft Technology Licensing, Llc | Decoupled processor instruction window and operand buffer |
US10409606B2 (en) | 2015-06-26 | 2019-09-10 | Microsoft Technology Licensing, Llc | Verifying branch targets |
US10175988B2 (en) | 2015-06-26 | 2019-01-08 | Microsoft Technology Licensing, Llc | Explicit instruction scheduler state information for a processor |
US11755484B2 (en) | 2015-06-26 | 2023-09-12 | Microsoft Technology Licensing, Llc | Instruction block allocation |
US9720693B2 (en) | 2015-06-26 | 2017-08-01 | Microsoft Technology Licensing, Llc | Bulk allocation of instruction blocks to a processor instruction window |
US10191747B2 (en) | 2015-06-26 | 2019-01-29 | Microsoft Technology Licensing, Llc | Locking operand values for groups of instructions executed atomically |
US20160378491A1 (en) * | 2015-06-26 | 2016-12-29 | Microsoft Technology Licensing, Llc | Determination of target location for transfer of processor control |
US10169044B2 (en) | 2015-06-26 | 2019-01-01 | Microsoft Technology Licensing, Llc | Processing an encoding format field to interpret header information regarding a group of instructions |
US9946548B2 (en) | 2015-06-26 | 2018-04-17 | Microsoft Technology Licensing, Llc | Age-based management of instruction blocks in a processor instruction window |
US9952867B2 (en) | 2015-06-26 | 2018-04-24 | Microsoft Technology Licensing, Llc | Mapping instruction blocks based on block size |
US20170083319A1 (en) * | 2015-09-19 | 2017-03-23 | Microsoft Technology Licensing, Llc | Generation and use of block branch metadata |
US10936316B2 (en) | 2015-09-19 | 2021-03-02 | Microsoft Technology Licensing, Llc | Dense read encoding for dataflow ISA |
US11681531B2 (en) | 2015-09-19 | 2023-06-20 | Microsoft Technology Licensing, Llc | Generation and use of memory access instruction order encodings |
US10776115B2 (en) | 2015-09-19 | 2020-09-15 | Microsoft Technology Licensing, Llc | Debug support for block-based processor |
US10198263B2 (en) | 2015-09-19 | 2019-02-05 | Microsoft Technology Licensing, Llc | Write nullification |
US10095519B2 (en) | 2015-09-19 | 2018-10-09 | Microsoft Technology Licensing, Llc | Instruction block address register |
US10061584B2 (en) | 2015-09-19 | 2018-08-28 | Microsoft Technology Licensing, Llc | Store nullification in the target field |
US10678544B2 (en) | 2015-09-19 | 2020-06-09 | Microsoft Technology Licensing, Llc | Initiating instruction block execution using a register access instruction |
US10719321B2 (en) | 2015-09-19 | 2020-07-21 | Microsoft Technology Licensing, Llc | Prefetching instruction blocks |
US10180840B2 (en) | 2015-09-19 | 2019-01-15 | Microsoft Technology Licensing, Llc | Dynamic generation of null instructions |
US10871967B2 (en) | 2015-09-19 | 2020-12-22 | Microsoft Technology Licensing, Llc | Register read/write ordering |
US10768936B2 (en) | 2015-09-19 | 2020-09-08 | Microsoft Technology Licensing, Llc | Block-based processor including topology and control registers to indicate resource sharing and size of logical processor |
US11126433B2 (en) | 2015-09-19 | 2021-09-21 | Microsoft Technology Licensing, Llc | Block-based processor core composition register |
US11016770B2 (en) | 2015-09-19 | 2021-05-25 | Microsoft Technology Licensing, Llc | Distinct system registers for logical processors |
US20170083341A1 (en) * | 2015-09-19 | 2017-03-23 | Microsoft Technology Licensing, Llc | Segmented instruction block |
US10452399B2 (en) | 2015-09-19 | 2019-10-22 | Microsoft Technology Licensing, Llc | Broadcast channel architectures for block-based processors |
US10031756B2 (en) | 2015-09-19 | 2018-07-24 | Microsoft Technology Licensing, Llc | Multi-nullification |
US11977891B2 (en) | 2015-09-19 | 2024-05-07 | Microsoft Technology Licensing, Llc | Implicit program order |
US11106467B2 (en) | 2016-04-28 | 2021-08-31 | Microsoft Technology Licensing, Llc | Incremental scheduler for out-of-order block ISA processors |
US20180081690A1 (en) * | 2016-09-21 | 2018-03-22 | Qualcomm Incorporated | Performing distributed branch prediction using fused processor cores in processor-based systems |
US11531552B2 (en) | 2017-02-06 | 2022-12-20 | Microsoft Technology Licensing, Llc | Executing multiple programs simultaneously on a processor core |
US10963379B2 (en) | 2018-01-30 | 2021-03-30 | Microsoft Technology Licensing, Llc | Coupling wide memory interface to wide write back paths |
US10824429B2 (en) | 2018-09-19 | 2020-11-03 | Microsoft Technology Licensing, Llc | Commit logic and precise exceptions in explicit dataflow graph execution architectures |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010032308A1 (en) * | 1998-08-04 | 2001-10-18 | Grochowski Edward T. | Method and apparatus for performing predicate prediction |
US20020174326A1 (en) * | 1998-08-04 | 2002-11-21 | Kling Ralph M. | Method and apparatus for performing predicate prediction |
US20050216714A1 (en) * | 2004-03-25 | 2005-09-29 | Intel Corporation | Method and apparatus for predicting confidence and value |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3499252B2 (ja) * | 1993-03-19 | 2004-02-23 | 株式会社ルネサステクノロジ | コンパイル装置及びデータ処理装置 |
US6178498B1 (en) * | 1997-12-18 | 2001-01-23 | Idea Corporation | Storing predicted branch target address in different storage according to importance hint in branch prediction instruction |
US6367004B1 (en) * | 1998-12-31 | 2002-04-02 | Intel Corporation | Method and apparatus for predicting a predicate based on historical information and the least significant bits of operands to be compared |
US6513109B1 (en) * | 1999-08-31 | 2003-01-28 | International Business Machines Corporation | Method and apparatus for implementing execution predicates in a computer processing system |
US6662294B1 (en) * | 2000-09-28 | 2003-12-09 | International Business Machines Corporation | Converting short branches to predicated instructions |
US20030023959A1 (en) * | 2001-02-07 | 2003-01-30 | Park Joseph C.H. | General and efficient method for transforming predicated execution to static speculation |
US7114059B2 (en) * | 2001-11-05 | 2006-09-26 | Intel Corporation | System and method to bypass execution of instructions involving unreliable data during speculative execution |
KR100528479B1 (ko) * | 2003-09-24 | 2005-11-15 | 삼성전자주식회사 | 전력 소모를 감소시키기 위한 분기 예측기 및 구현방법 |
US8607209B2 (en) * | 2004-02-04 | 2013-12-10 | Bluerisc Inc. | Energy-focused compiler-assisted branch prediction |
CN100552622C (zh) * | 2005-03-31 | 2009-10-21 | 松下电器产业株式会社 | 运算处理装置 |
US8904155B2 (en) * | 2006-03-17 | 2014-12-02 | Qualcomm Incorporated | Representing loop branches in a branch history register with multiple bits |
US7487340B2 (en) * | 2006-06-08 | 2009-02-03 | International Business Machines Corporation | Local and global branch prediction information storage |
US20070288733A1 (en) * | 2006-06-08 | 2007-12-13 | Luick David A | Early Conditional Branch Resolution |
US9946550B2 (en) * | 2007-09-17 | 2018-04-17 | International Business Machines Corporation | Techniques for predicated execution in an out-of-order processor |
US7870371B2 (en) * | 2007-12-17 | 2011-01-11 | Microsoft Corporation | Target-frequency based indirect jump prediction for high-performance processors |
US7818551B2 (en) * | 2007-12-31 | 2010-10-19 | Microsoft Corporation | Feedback mechanism for dynamic predication of indirect jumps |
- 2009-09-09 US US12/556,440 patent/US8433885B2/en active Active
- 2010-06-11 WO PCT/US2010/038350 patent/WO2011031361A1/en active Application Filing
- 2010-06-11 CN CN201080035509.XA patent/CN102473086B/zh active Active
- 2010-06-11 DE DE112010003595.4T patent/DE112010003595B4/de active Active
- 2010-06-11 JP JP2012522834A patent/JP2013500539A/ja active Pending
- 2010-06-11 KR KR1020127005879A patent/KR101364314B1/ko active IP Right Grant
- 2010-06-11 CN CN201510449244.2A patent/CN105183449B/zh active Active
- 2014-05-07 JP JP2014095690A patent/JP5747104B2/ja active Active
- 2015-05-11 JP JP2015096238A patent/JP2015164068A/ja active Pending
Non-Patent Citations (7)
Title |
---|
CSNG200000966016; 古関 聰 et al.: "Branch Handling Mechanism in the Extended VLIW Processor GIFT", Transactions of the Information Processing Society of Japan, Vol. 38, No. 12, 19971215, pp. 2576-2587, Information Processing Society of Japan * |
JPN6015005367; Nitya RANGANATHAN et al.: "Analysis of the TRIPS Prototype Block Predictor", ISPASS 2009. IEEE International Symposium on Performance Analysis of Systems and Software, 2009, 20090428, pp. 195-206, IEEE * |
JPN6015005370; Doug BURGER et al.: "Scaling to the End of Silicon with EDGE Architectures", Computer, Volume 37, Issue 7, 200407, pp. 44-55, IEEE * |
JPN6015005374; Karthikeyan SANKARALINGAM et al.: "Distributed Microarchitectural Protocols in the TRIPS Prototype Processor", MICRO-39. 39th Annual IEEE/ACM International Symposium on Microarchitecture, 2006, 20061213, pp. 480-491, IEEE * |
JPN6015005378; 古関 聰 et al.: "Branch Handling Mechanism in the Extended VLIW Processor GIFT", Transactions of the Information Processing Society of Japan, Vol. 38, No. 12, 19971215, pp. 2576-2587, Information Processing Society of Japan * |
JPN6015005380; Aaron SMITH et al.: "Compiling for EDGE Architectures", CGO 2006. International Symposium on Code Generation and Optimization, 2006, 20060329, pp. 1-11, IEEE * |
JPN6015005382; Aaron SMITH et al.: "Dataflow Predication", MICRO-39. 39th Annual IEEE/ACM International Symposium on Microarchitecture, 2006, 200612, pp. 89-102, IEEE * |
Also Published As
Publication number | Publication date |
---|---|
JP5747104B2 (ja) | 2015-07-08 |
KR101364314B1 (ko) | 2014-02-18 |
JP2015164068A (ja) | 2015-09-10 |
CN105183449A (zh) | 2015-12-23 |
DE112010003595T5 (de) | 2012-11-22 |
CN102473086B (zh) | 2015-08-19 |
DE112010003595B4 (de) | 2024-06-06 |
US20110060889A1 (en) | 2011-03-10 |
CN105183449B (zh) | 2018-12-18 |
CN102473086A (zh) | 2012-05-23 |
US8433885B2 (en) | 2013-04-30 |
JP2013500539A (ja) | 2013-01-07 |
WO2011031361A1 (en) | 2011-03-17 |
KR20120068855A (ko) | 2012-06-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5747104B2 (ja) | Method, system and computer-accessible medium for providing a distributed predicate prediction | |
US9965274B2 (en) | Computer processor employing bypass network using result tags for routing result operands | |
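JP5894120B2 (ja) | Zero cycle load | |
CN107111550B (zh) | Method and apparatus for hiding page miss translation latency for program fetches | |
JP2018060491A (ja) | Pipelined processor with a multi-issue microcode unit having a local branch decoder | |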
JP2014142969A5 (ja) | ||
Calder et al. | A comparative survey of load speculation architectures | |
RU2663362C1 (ru) | Instruction and logic for sorting and retiring store instructions | |
US10831505B2 (en) | Architecture and method for data parallel single program multiple data (SPMD) execution | |
US10915328B2 (en) | Apparatus and method for a high throughput parallel co-processor and interconnect with low offload latency | |
JP6457200B2 (ja) | Processing device | |
US11537397B2 (en) | Compiler-assisted inter-SIMD-group register sharing | |
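KR20140131472A (ko) | Reconfigurable processor having a constant storage register | |
KR20190033084A (ko) | Store and load tracking by bypassing load store units | |
CN111752616A (zh) | System, apparatus, and method for symbolic store address generation | |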
Wang et al. | Decoupled affine computation for SIMT GPUs | |
US20190079771A1 (en) | Lookahead out-of-order instruction fetch apparatus for microprocessors | |
Mittal | A survey of value prediction techniques for leveraging value locality | |
JP7046087B2 (ja) | Cache miss thread balancing | |
US11567771B2 (en) | Method and apparatus for back end gather/scatter memory coalescing | |
US20150007153A1 (en) | Partial commits in dynamic binary translation based systems | |
Sazeides | Modeling value speculation | |
US9342303B2 (en) | Modified execution using context sensitive auxiliary code | |
US11567767B2 (en) | Method and apparatus for front end gather/scatter memory coalescing | |
CN114691597A (zh) | Adaptive remote atomic operations |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2014-05-07 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2014-05-07 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2015-02-12 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2015-03-26 | A524 | Written submission of copy of amendment under article 19 pct | Free format text: JAPANESE INTERMEDIATE CODE: A524 |
 | TRDD | Decision of grant or rejection written | |
2015-04-13 | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
2015-05-11 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
 | R150 | Certificate of patent or registration of utility model | Ref document number: 5747104; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |