CN107851016B - 向量算术指令 - Google Patents
向量算术指令 Download PDFInfo
- Publication number
- CN107851016B CN107851016B CN201680043340.XA CN201680043340A CN107851016B CN 107851016 B CN107851016 B CN 107851016B CN 201680043340 A CN201680043340 A CN 201680043340A CN 107851016 B CN107851016 B CN 107851016B
- Authority
- CN
- China
- Prior art keywords
- vector
- source operand
- operand
- bit size
- elements
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
- G06F9/30038—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations using a mask
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
- G06F9/30014—Arithmetic instructions with variable precision
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computing Systems (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Complex Calculations (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB1513511.4A GB2540943B (en) | 2015-07-31 | 2015-07-31 | Vector arithmetic instruction |
| GB1513511.4 | 2015-07-31 | ||
| PCT/GB2016/051868 WO2017021681A1 (en) | 2015-07-31 | 2016-06-23 | Vector arithmethic instruction |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN107851016A CN107851016A (zh) | 2018-03-27 |
| CN107851016B true CN107851016B (zh) | 2022-05-17 |
Family
ID=54062956
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201680043340.XA Active CN107851016B (zh) | 2015-07-31 | 2016-06-23 | 向量算术指令 |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US11003447B2 (enExample) |
| EP (1) | EP3329363B1 (enExample) |
| JP (1) | JP7071913B2 (enExample) |
| KR (1) | KR102584001B1 (enExample) |
| CN (1) | CN107851016B (enExample) |
| GB (1) | GB2540943B (enExample) |
| IL (1) | IL256663B (enExample) |
| TW (1) | TWI739754B (enExample) |
| WO (1) | WO2017021681A1 (enExample) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111651203B (zh) * | 2016-04-26 | 2024-05-07 | 中科寒武纪科技股份有限公司 | 一种用于执行向量四则运算的装置和方法 |
| EP3428792B1 (en) * | 2017-07-10 | 2022-05-04 | Arm Ltd | Testing bit values inside vector elements |
| JP6604393B2 (ja) * | 2018-03-08 | 2019-11-13 | 日本電気株式会社 | ベクトルプロセッサ、演算実行方法、プログラム |
| US10528346B2 (en) * | 2018-03-29 | 2020-01-07 | Intel Corporation | Instructions for fused multiply-add operations with variable precision input operands |
| US20210389948A1 (en) * | 2020-06-10 | 2021-12-16 | Arm Limited | Mixed-element-size instruction |
| US12182570B2 (en) * | 2021-06-25 | 2024-12-31 | Intel Corporation | Apparatuses, methods, and systems for a packed data convolution instruction with shift control and width control |
| CN114296798B (zh) * | 2021-12-10 | 2024-08-13 | 龙芯中科技术股份有限公司 | 向量移位方法、处理器及电子设备 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1303501A (zh) * | 1998-05-27 | 2001-07-11 | Arm有限公司 | 混合向量/标量寄存器文件 |
| CN104281437A (zh) * | 2013-07-08 | 2015-01-14 | Arm有限公司 | 具有单指令多数据处理电路的数据处理装置 |
| CN104813277A (zh) * | 2012-12-19 | 2015-07-29 | 英特尔公司 | 用于处理器的功率效率的向量掩码驱动时钟门控 |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6408320B1 (en) * | 1998-01-27 | 2002-06-18 | Texas Instruments Incorporated | Instruction set architecture with versatile adder carry control |
| GB2409068A (en) * | 2003-12-09 | 2005-06-15 | Advanced Risc Mach Ltd | Data element size control within parallel lanes of processing |
| US8082419B2 (en) * | 2004-03-30 | 2011-12-20 | Intel Corporation | Residual addition for video software techniques |
| ATE493703T1 (de) * | 2004-11-03 | 2011-01-15 | Koninkl Philips Electronics Nv | Programmierbare datenverarbeitungsschaltung, die simd-befehle unterstützt |
| US20080091924A1 (en) * | 2006-10-13 | 2008-04-17 | Jouppi Norman P | Vector processor and system for vector processing |
| GB2464292A (en) * | 2008-10-08 | 2010-04-14 | Advanced Risc Mach Ltd | SIMD processor circuit for performing iterative SIMD multiply-accumulate operations |
| GB2474901B (en) * | 2009-10-30 | 2015-01-07 | Advanced Risc Mach Ltd | Apparatus and method for performing multiply-accumulate operations |
| JP5699554B2 (ja) * | 2010-11-11 | 2015-04-15 | 富士通株式会社 | ベクトル処理回路、命令発行制御方法、及びプロセッサシステム |
| GB2488985A (en) * | 2011-03-08 | 2012-09-19 | Advanced Risc Mach Ltd | Mixed size data processing operation with integrated operand conversion instructions |
| WO2013095658A1 (en) | 2011-12-23 | 2013-06-27 | Intel Corporation | Systems, apparatuses, and methods for performing a horizontal add or subtract in response to a single instruction |
| CN104081336B (zh) | 2011-12-23 | 2018-10-23 | 英特尔公司 | 用于检测向量寄存器内的相同元素的装置和方法 |
| CN104185837B (zh) * | 2011-12-23 | 2017-10-13 | 英特尔公司 | 在不同的粒度等级下广播数据值的指令执行单元 |
| CN104137055B (zh) * | 2011-12-29 | 2018-06-05 | 英特尔公司 | 点积处理器、方法、系统和指令 |
| US9323524B2 (en) * | 2013-09-16 | 2016-04-26 | Oracle International Corporation | Shift instruction with per-element shift counts and full-width sources |
| US9552205B2 (en) * | 2013-09-27 | 2017-01-24 | Intel Corporation | Vector indexed memory access plus arithmetic and/or logical operation processors, methods, systems, and instructions |
| US10489155B2 (en) * | 2015-07-21 | 2019-11-26 | Qualcomm Incorporated | Mixed-width SIMD operations using even/odd register pairs for wide data elements |
| US10146535B2 (en) * | 2016-10-20 | 2018-12-04 | Intel Corporatoin | Systems, apparatuses, and methods for chained fused multiply add |
-
2015
- 2015-07-31 GB GB1513511.4A patent/GB2540943B/en active Active
-
2016
- 2016-06-23 WO PCT/GB2016/051868 patent/WO2017021681A1/en not_active Ceased
- 2016-06-23 CN CN201680043340.XA patent/CN107851016B/zh active Active
- 2016-06-23 US US15/743,745 patent/US11003447B2/en active Active
- 2016-06-23 JP JP2018503593A patent/JP7071913B2/ja active Active
- 2016-06-23 EP EP16732707.1A patent/EP3329363B1/en active Active
- 2016-06-23 KR KR1020187003580A patent/KR102584001B1/ko active Active
- 2016-07-19 TW TW105122689A patent/TWI739754B/zh active
-
2017
- 2017-12-31 IL IL256663A patent/IL256663B/en active IP Right Grant
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1303501A (zh) * | 1998-05-27 | 2001-07-11 | Arm有限公司 | 混合向量/标量寄存器文件 |
| CN104813277A (zh) * | 2012-12-19 | 2015-07-29 | 英特尔公司 | 用于处理器的功率效率的向量掩码驱动时钟门控 |
| CN104281437A (zh) * | 2013-07-08 | 2015-01-14 | Arm有限公司 | 具有单指令多数据处理电路的数据处理装置 |
Non-Patent Citations (1)
| Title |
|---|
| 一种针对多媒体扩展指令集和实际多媒体程序的自动向量化方法;姜伟华等;《计算机学报》;20050831;第28卷(第8期);第1255-1266页 * |
Also Published As
| Publication number | Publication date |
|---|---|
| GB201513511D0 (en) | 2015-09-16 |
| TW201721409A (zh) | 2017-06-16 |
| WO2017021681A1 (en) | 2017-02-09 |
| IL256663B (en) | 2020-02-27 |
| US20180203692A1 (en) | 2018-07-19 |
| EP3329363A1 (en) | 2018-06-06 |
| KR102584001B1 (ko) | 2023-10-04 |
| US11003447B2 (en) | 2021-05-11 |
| JP2018521423A (ja) | 2018-08-02 |
| EP3329363B1 (en) | 2020-10-14 |
| GB2540943A (en) | 2017-02-08 |
| CN107851016A (zh) | 2018-03-27 |
| TWI739754B (zh) | 2021-09-21 |
| GB2540943B (en) | 2018-04-11 |
| JP7071913B2 (ja) | 2022-05-19 |
| IL256663A (en) | 2018-02-28 |
| KR20180035211A (ko) | 2018-04-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107851016B (zh) | 向量算术指令 | |
| CN106575215B (zh) | 处理指令的系统、设备、方法、处理器、介质和电子设备 | |
| TWI512531B (zh) | 用以處理blake安全雜湊演算法的方法、設備、系統及製品 | |
| US9507595B2 (en) | Execution of multi-byte memory access instruction specifying endian mode that overrides current global endian mode | |
| CN107209661B (zh) | 用于提供simd sm4加密块密码功能性的指令和逻辑 | |
| CN107918546B (zh) | 利用经掩码的全寄存器访问实现部分寄存器访问的处理器、方法和系统 | |
| JP6466388B2 (ja) | 方法及び装置 | |
| CN114510272B (zh) | 用于散列指令的装置、方法和系统 | |
| US20160179523A1 (en) | Apparatus and method for vector broadcast and xorand logical instruction | |
| CN106030514B (zh) | 用于执行采用传播的被屏蔽源元素存储指令的处理器及其方法 | |
| TWI603262B (zh) | 緊縮有限脈衝響應(fir)濾波器處理器,方法,系統及指令 | |
| CN111433741A (zh) | 向量带进位加法指令 | |
| CN109582283B (zh) | 位矩阵乘法 | |
| CN108563465A (zh) | 用于响应于单个指令来执行循环和异或的系统、装置和方法 | |
| CN106796502A (zh) | 从3d坐标计算3d z曲线索引的机器级指令 | |
| CN107257955B (zh) | 快速向量动态存储器冲突检测 | |
| EP3394725B1 (en) | Adjoining data element pairwise swap processors, methods, systems, and instructions | |
| JP5732139B2 (ja) | データ要素の条件付き選択 | |
| JP5753603B2 (ja) | データ要素内のビットをゼロ化するためのシステム、装置、および方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |