TWI739754B - 向量運算指令 - Google Patents
向量運算指令 Download PDFInfo
- Publication number
- TWI739754B TWI739754B TW105122689A TW105122689A TWI739754B TW I739754 B TWI739754 B TW I739754B TW 105122689 A TW105122689 A TW 105122689A TW 105122689 A TW105122689 A TW 105122689A TW I739754 B TWI739754 B TW I739754B
- Authority
- TW
- Taiwan
- Prior art keywords
- vector
- source operand
- elements
- operand
- bit size
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
- G06F9/30014—Arithmetic instructions with variable precision
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
- G06F9/30038—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations using a mask
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Computational Mathematics (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Data Mining & Analysis (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Computing Systems (AREA)
- Complex Calculations (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB1513511.4A GB2540943B (en) | 2015-07-31 | 2015-07-31 | Vector arithmetic instruction |
| GB1513511.4 | 2015-07-31 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201721409A TW201721409A (zh) | 2017-06-16 |
| TWI739754B true TWI739754B (zh) | 2021-09-21 |
Family
ID=54062956
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW105122689A TWI739754B (zh) | 2015-07-31 | 2016-07-19 | 向量運算指令 |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US11003447B2 (enExample) |
| EP (1) | EP3329363B1 (enExample) |
| JP (1) | JP7071913B2 (enExample) |
| KR (1) | KR102584001B1 (enExample) |
| CN (1) | CN107851016B (enExample) |
| GB (1) | GB2540943B (enExample) |
| IL (1) | IL256663B (enExample) |
| TW (1) | TWI739754B (enExample) |
| WO (1) | WO2017021681A1 (enExample) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111651203B (zh) * | 2016-04-26 | 2024-05-07 | 中科寒武纪科技股份有限公司 | 一种用于执行向量四则运算的装置和方法 |
| EP3428792B1 (en) * | 2017-07-10 | 2022-05-04 | Arm Ltd | Testing bit values inside vector elements |
| JP6604393B2 (ja) * | 2018-03-08 | 2019-11-13 | 日本電気株式会社 | ベクトルプロセッサ、演算実行方法、プログラム |
| US10528346B2 (en) * | 2018-03-29 | 2020-01-07 | Intel Corporation | Instructions for fused multiply-add operations with variable precision input operands |
| US20210389948A1 (en) * | 2020-06-10 | 2021-12-16 | Arm Limited | Mixed-element-size instruction |
| US12182570B2 (en) * | 2021-06-25 | 2024-12-31 | Intel Corporation | Apparatuses, methods, and systems for a packed data convolution instruction with shift control and width control |
| CN114296798B (zh) * | 2021-12-10 | 2024-08-13 | 龙芯中科技术股份有限公司 | 向量移位方法、处理器及电子设备 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050125631A1 (en) * | 2003-12-09 | 2005-06-09 | Arm Limited | Data element size control within parallel lanes of processing |
| US20050240870A1 (en) * | 2004-03-30 | 2005-10-27 | Aldrich Bradley C | Residual addition for video software techniques |
| TW201337748A (zh) * | 2011-12-23 | 2013-09-16 | Intel Corp | 用以響應於單一指令而執行橫向加法或減法之系統、裝置及方法 |
| US20150082010A1 (en) * | 2013-09-16 | 2015-03-19 | Oracle International Corporation | Shift instruction with per-element shift counts and full-width sources |
| TW201528131A (zh) * | 2011-12-23 | 2015-07-16 | Intel Corp | 用以偵測向量暫存器內相等元素之裝置及方法 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6408320B1 (en) * | 1998-01-27 | 2002-06-18 | Texas Instruments Incorporated | Instruction set architecture with versatile adder carry control |
| US6282634B1 (en) * | 1998-05-27 | 2001-08-28 | Arm Limited | Apparatus and method for processing data having a mixed vector/scalar register file |
| ATE493703T1 (de) * | 2004-11-03 | 2011-01-15 | Koninkl Philips Electronics Nv | Programmierbare datenverarbeitungsschaltung, die simd-befehle unterstützt |
| US20080091924A1 (en) * | 2006-10-13 | 2008-04-17 | Jouppi Norman P | Vector processor and system for vector processing |
| GB2464292A (en) * | 2008-10-08 | 2010-04-14 | Advanced Risc Mach Ltd | SIMD processor circuit for performing iterative SIMD multiply-accumulate operations |
| GB2474901B (en) * | 2009-10-30 | 2015-01-07 | Advanced Risc Mach Ltd | Apparatus and method for performing multiply-accumulate operations |
| JP5699554B2 (ja) * | 2010-11-11 | 2015-04-15 | 富士通株式会社 | ベクトル処理回路、命令発行制御方法、及びプロセッサシステム |
| GB2488985A (en) * | 2011-03-08 | 2012-09-19 | Advanced Risc Mach Ltd | Mixed size data processing operation with integrated operand conversion instructions |
| CN104185837B (zh) * | 2011-12-23 | 2017-10-13 | 英特尔公司 | 在不同的粒度等级下广播数据值的指令执行单元 |
| CN104137055B (zh) * | 2011-12-29 | 2018-06-05 | 英特尔公司 | 点积处理器、方法、系统和指令 |
| DE112012007058T5 (de) * | 2012-12-19 | 2015-08-06 | Intel Corporation | Vektormaskengesteuertes Clock-Gating für Leistungseffizenz eines Prozessors |
| US9292298B2 (en) * | 2013-07-08 | 2016-03-22 | Arm Limited | Data processing apparatus having SIMD processing circuitry |
| US9552205B2 (en) * | 2013-09-27 | 2017-01-24 | Intel Corporation | Vector indexed memory access plus arithmetic and/or logical operation processors, methods, systems, and instructions |
| US10489155B2 (en) * | 2015-07-21 | 2019-11-26 | Qualcomm Incorporated | Mixed-width SIMD operations using even/odd register pairs for wide data elements |
| US10146535B2 (en) * | 2016-10-20 | 2018-12-04 | Intel Corporatoin | Systems, apparatuses, and methods for chained fused multiply add |
-
2015
- 2015-07-31 GB GB1513511.4A patent/GB2540943B/en active Active
-
2016
- 2016-06-23 WO PCT/GB2016/051868 patent/WO2017021681A1/en not_active Ceased
- 2016-06-23 CN CN201680043340.XA patent/CN107851016B/zh active Active
- 2016-06-23 US US15/743,745 patent/US11003447B2/en active Active
- 2016-06-23 JP JP2018503593A patent/JP7071913B2/ja active Active
- 2016-06-23 EP EP16732707.1A patent/EP3329363B1/en active Active
- 2016-06-23 KR KR1020187003580A patent/KR102584001B1/ko active Active
- 2016-07-19 TW TW105122689A patent/TWI739754B/zh active
-
2017
- 2017-12-31 IL IL256663A patent/IL256663B/en active IP Right Grant
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050125631A1 (en) * | 2003-12-09 | 2005-06-09 | Arm Limited | Data element size control within parallel lanes of processing |
| US20050240870A1 (en) * | 2004-03-30 | 2005-10-27 | Aldrich Bradley C | Residual addition for video software techniques |
| TW201337748A (zh) * | 2011-12-23 | 2013-09-16 | Intel Corp | 用以響應於單一指令而執行橫向加法或減法之系統、裝置及方法 |
| TW201528131A (zh) * | 2011-12-23 | 2015-07-16 | Intel Corp | 用以偵測向量暫存器內相等元素之裝置及方法 |
| US20150082010A1 (en) * | 2013-09-16 | 2015-03-19 | Oracle International Corporation | Shift instruction with per-element shift counts and full-width sources |
Also Published As
| Publication number | Publication date |
|---|---|
| GB201513511D0 (en) | 2015-09-16 |
| TW201721409A (zh) | 2017-06-16 |
| WO2017021681A1 (en) | 2017-02-09 |
| IL256663B (en) | 2020-02-27 |
| US20180203692A1 (en) | 2018-07-19 |
| EP3329363A1 (en) | 2018-06-06 |
| KR102584001B1 (ko) | 2023-10-04 |
| US11003447B2 (en) | 2021-05-11 |
| JP2018521423A (ja) | 2018-08-02 |
| EP3329363B1 (en) | 2020-10-14 |
| GB2540943A (en) | 2017-02-08 |
| CN107851016A (zh) | 2018-03-27 |
| GB2540943B (en) | 2018-04-11 |
| JP7071913B2 (ja) | 2022-05-19 |
| CN107851016B (zh) | 2022-05-17 |
| IL256663A (en) | 2018-02-28 |
| KR20180035211A (ko) | 2018-04-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI739754B (zh) | 向量運算指令 | |
| JP5897696B2 (ja) | データ処理装置および方法 | |
| JP6807383B2 (ja) | 転送プレフィックス命令 | |
| US9507595B2 (en) | Execution of multi-byte memory access instruction specifying endian mode that overrides current global endian mode | |
| JP6051458B2 (ja) | 複数のハッシュ動作を効率的に実行する方法および装置 | |
| KR101787615B1 (ko) | 단일 명령어에 응답하여 회전 및 xor을 수행하기 위한 시스템들, 장치들, 및 방법들 | |
| TWI603263B (zh) | 執行置換運算的處理器及具有該處理器的電腦系統 | |
| KR20170033890A (ko) | 비트 셔플 프로세서, 방법, 시스템, 및 명령어 | |
| CN107851013B (zh) | 数据处理装置和方法 | |
| KR20130064797A (ko) | 범용 논리 연산 방법 및 장치 | |
| TWI544412B (zh) | 用於產生抑制的位址軌跡之設備和方法 | |
| TWI721999B (zh) | 向量長度查詢指令 | |
| US11093243B2 (en) | Vector interleaving in a data processing apparatus | |
| KR101635856B1 (ko) | 데이터 요소에 있는 비트들의 제로화를 위한 시스템, 장치, 및 방법 |