CN100422979C - 用于单指令、多数据执行引擎标志寄存器的评估单元 - Google Patents
用于单指令、多数据执行引擎标志寄存器的评估单元 Download PDFInfo
- Publication number
- CN100422979C CN100422979C CNB2005101347265A CN200510134726A CN100422979C CN 100422979 C CN100422979 C CN 100422979C CN B2005101347265 A CNB2005101347265 A CN B2005101347265A CN 200510134726 A CN200510134726 A CN 200510134726A CN 100422979 C CN100422979 C CN 100422979C
- Authority
- CN
- China
- Prior art keywords
- passage
- flag register
- simd
- group
- engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30094—Condition code generation, e.g. Carry, Zero flag
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30098—Register arrangements
- G06F9/30101—Special purpose registers
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3854—Instruction completion, e.g. retiring, committing or graduating
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3888—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple threads [SIMT] in parallel
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Executing Machine-Instructions (AREA)
- Advance Control (AREA)
- Image Processing (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/015,778 | 2004-12-17 | ||
| US11/015,778 US7219213B2 (en) | 2004-12-17 | 2004-12-17 | Flag bits evaluation for multiple vector SIMD channels execution |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1790310A CN1790310A (zh) | 2006-06-21 |
| CN100422979C true CN100422979C (zh) | 2008-10-01 |
Family
ID=36123387
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB2005101347265A Expired - Fee Related CN100422979C (zh) | 2004-12-17 | 2005-12-19 | 用于单指令、多数据执行引擎标志寄存器的评估单元 |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US7219213B2 (enExample) |
| JP (1) | JP4901754B2 (enExample) |
| KR (1) | KR100958964B1 (enExample) |
| CN (1) | CN100422979C (enExample) |
| DE (1) | DE112005003130B4 (enExample) |
| GB (1) | GB2436499B (enExample) |
| TW (1) | TWI297853B (enExample) |
| WO (1) | WO2006066262A2 (enExample) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2008071130A (ja) * | 2006-09-14 | 2008-03-27 | Ricoh Co Ltd | Simd型マイクロプロセッサ |
| EP2478440A1 (en) * | 2009-09-17 | 2012-07-25 | Nokia Corp. | Multi-channel cache memory |
| US8996845B2 (en) * | 2009-12-22 | 2015-03-31 | Intel Corporation | Vector compare-and-exchange operation |
| US10318291B2 (en) | 2011-11-30 | 2019-06-11 | Intel Corporation | Providing vector horizontal compare functionality within a vector register |
| WO2013081588A1 (en) | 2011-11-30 | 2013-06-06 | Intel Corporation | Instruction and logic to provide vector horizontal compare functionality |
| US10255228B2 (en) * | 2011-12-06 | 2019-04-09 | Nvidia Corporation | System and method for performing shaped memory access operations |
| US10042813B2 (en) * | 2014-12-15 | 2018-08-07 | Intel Corporation | SIMD K-nearest-neighbors implementation |
| GB2536069B (en) * | 2015-03-25 | 2017-08-30 | Imagination Tech Ltd | SIMD processing module |
| US20200341772A1 (en) * | 2019-04-29 | 2020-10-29 | DeGirum Corporation | Efficient Architectures For Deep Learning Algorithms |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0682309A2 (en) * | 1994-04-28 | 1995-11-15 | International Business Machines Corporation | Data processing system adapted for single instruction branching on multiple condition codes from plural processing elements |
| US20020083311A1 (en) * | 2000-12-27 | 2002-06-27 | Paver Nigel C. | Method and computer program for single instruction multiple data management |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2768803B2 (ja) * | 1990-04-26 | 1998-06-25 | 株式会社東芝 | 並列演算処理装置 |
| JP2793357B2 (ja) * | 1990-11-20 | 1998-09-03 | 株式会社東芝 | 並列演算装置 |
| JP2793342B2 (ja) * | 1990-08-09 | 1998-09-03 | 株式会社東芝 | 演算処理装置 |
| JP3442225B2 (ja) * | 1996-07-11 | 2003-09-02 | 株式会社日立製作所 | 演算処理装置 |
| US5805875A (en) * | 1996-09-13 | 1998-09-08 | International Computer Science Institute | Vector processing system with multi-operation, run-time configurable pipelines |
| JP3652518B2 (ja) | 1998-07-31 | 2005-05-25 | 株式会社リコー | Simd方式の演算器及び演算処理装置 |
| GB2382886B (en) * | 2001-10-31 | 2006-03-15 | Alphamosaic Ltd | Vector processing system |
| US6986023B2 (en) * | 2002-08-09 | 2006-01-10 | Intel Corporation | Conditional execution of coprocessor instruction based on main processor arithmetic flags |
| JP3958662B2 (ja) * | 2002-09-25 | 2007-08-15 | 松下電器産業株式会社 | プロセッサ |
-
2004
- 2004-12-17 US US11/015,778 patent/US7219213B2/en not_active Expired - Fee Related
-
2005
- 2005-12-15 GB GB0713878A patent/GB2436499B/en not_active Expired - Fee Related
- 2005-12-15 KR KR1020077014564A patent/KR100958964B1/ko not_active Expired - Fee Related
- 2005-12-15 DE DE112005003130T patent/DE112005003130B4/de not_active Expired - Fee Related
- 2005-12-15 JP JP2007547041A patent/JP4901754B2/ja not_active Expired - Fee Related
- 2005-12-15 WO PCT/US2005/046321 patent/WO2006066262A2/en not_active Ceased
- 2005-12-16 TW TW094144840A patent/TWI297853B/zh not_active IP Right Cessation
- 2005-12-19 CN CNB2005101347265A patent/CN100422979C/zh not_active Expired - Fee Related
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0682309A2 (en) * | 1994-04-28 | 1995-11-15 | International Business Machines Corporation | Data processing system adapted for single instruction branching on multiple condition codes from plural processing elements |
| US20020083311A1 (en) * | 2000-12-27 | 2002-06-27 | Paver Nigel C. | Method and computer program for single instruction multiple data management |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2008524723A (ja) | 2008-07-10 |
| JP4901754B2 (ja) | 2012-03-21 |
| DE112005003130B4 (de) | 2009-09-17 |
| GB2436499A (en) | 2007-09-26 |
| US20060149924A1 (en) | 2006-07-06 |
| WO2006066262A3 (en) | 2006-12-14 |
| US7219213B2 (en) | 2007-05-15 |
| CN1790310A (zh) | 2006-06-21 |
| GB0713878D0 (en) | 2007-08-29 |
| DE112005003130T5 (de) | 2007-11-22 |
| TW200636573A (en) | 2006-10-16 |
| WO2006066262A2 (en) | 2006-06-22 |
| KR20070089208A (ko) | 2007-08-30 |
| TWI297853B (en) | 2008-06-11 |
| KR100958964B1 (ko) | 2010-05-20 |
| GB2436499B (en) | 2009-07-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112861073B (zh) | 用于执行16位浮点向量点积指令的系统和方法 | |
| US7908461B2 (en) | Cellular engine for a data processing system | |
| US7042466B1 (en) | Efficient clip-testing in graphics acceleration | |
| KR101900236B1 (ko) | 데이터 처리장치 및 방법 | |
| JP5607832B2 (ja) | 汎用論理演算の方法および装置 | |
| CN107273095A (zh) | 用于对齐寄存器的系统、装置和方法 | |
| CN104185837A (zh) | 在不同的粒度等级下广播数据值的指令执行单元 | |
| CN104011667A (zh) | 用于滑动窗口数据访问的设备和方法 | |
| CN104137061A (zh) | 向量频率扩展指令 | |
| CN104094221A (zh) | 基于零的高效解压缩 | |
| CN100422979C (zh) | 用于单指令、多数据执行引擎标志寄存器的评估单元 | |
| EP3326060B1 (en) | Mixed-width simd operations having even-element and odd-element operations using register pair for wide data elements | |
| CN109992243A (zh) | 用于矩阵操作的系统、方法和装置 | |
| CN117130578A (zh) | 位矩阵乘法 | |
| CN109992303A (zh) | 用于将片寄存器对存储到存储器的系统和方法 | |
| TW201721409A (zh) | 向量運算指令 | |
| CN108292228B (zh) | 用于基于通道的步进收集的系统、设备和方法 | |
| US9830150B2 (en) | Multi-functional execution lane for image processor | |
| CN114675888A (zh) | 用于加载矩阵操作加速器瓦片的指令的装置、方法和系统 | |
| JP2001005675A (ja) | プログラム変換装置及びプロセッサ | |
| CN109298886A (zh) | Simd指令执行方法、装置及处理器 | |
| CN109992301A (zh) | 用于向量广播的系统、方法和装置 | |
| CN108268277A (zh) | 用于广播比较加法的系统、设备和方法 | |
| CN104133660A (zh) | 用于数据元素中的位填零的系统、设备和方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20081001 Termination date: 20181219 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |