GB2578972A8 - Instruction and logic to provide vector scatter-op and gather-op functionality - Google Patents
Instruction and logic to provide vector scatter-op and gather-op functionality Download PDFInfo
- Publication number
- GB2578972A8 GB2578972A8 GB1916688.3A GB201916688A GB2578972A8 GB 2578972 A8 GB2578972 A8 GB 2578972A8 GB 201916688 A GB201916688 A GB 201916688A GB 2578972 A8 GB2578972 A8 GB 2578972A8
- Authority
- GB
- United Kingdom
- Prior art keywords
- data elements
- source register
- instruction
- gather
- functionality
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30018—Bit or string instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/3004—Arrangements for executing specific machine instructions to perform operations on memory
- G06F9/30043—LOAD or STORE instructions; Clear instruction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
Abstract
Decoding an SIMD instruction comprising a first operation and a scatter operation. The SIMD instruction is to indicate a first source register with a first plurality of data elements, indicate a second source register comprising a second plurality of data element different from the first, and indicate a third source register which has a plurality of indices, each corresponding to the first plurality of elements. Also included is or more execution units to perform the first operation on the first and second data elements to form corresponding result data elements, and then perform a scatter operation to store each result data element in memory. The first operation may be binary, addition, multiplication or ternary. The first source register may comprise 512 bits and wherein the data elements of the first source register are one of 32 bit data elements and 64 bit data elements.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB1916688.3A GB2578972B (en) | 2011-09-26 | 2011-09-26 | Instruction and logic to provide vector scatter-op and gather-op functionality |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB201402493A GB2508533B (en) | 2011-09-26 | 2011-09-26 | Instruction and logic to provide vector scatter-op and gather-op functionality |
GB1916688.3A GB2578972B (en) | 2011-09-26 | 2011-09-26 | Instruction and logic to provide vector scatter-op and gather-op functionality |
Publications (4)
Publication Number | Publication Date |
---|---|
GB201916688D0 GB201916688D0 (en) | 2020-01-01 |
GB2578972A GB2578972A (en) | 2020-06-03 |
GB2578972A8 true GB2578972A8 (en) | 2020-07-08 |
GB2578972B GB2578972B (en) | 2020-10-07 |
Family
ID=69063429
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB1916688.3A Active GB2578972B (en) | 2011-09-26 | 2011-09-26 | Instruction and logic to provide vector scatter-op and gather-op functionality |
Country Status (1)
Country | Link |
---|---|
GB (1) | GB2578972B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111930471B (en) * | 2020-08-14 | 2023-05-26 | 中国科学院上海高等研究院 | Parallel simulation evaluation selection method based on GPU |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7984273B2 (en) * | 2007-12-31 | 2011-07-19 | Intel Corporation | System and method for using a mask register to track progress of gathering elements from memory |
-
2011
- 2011-09-26 GB GB1916688.3A patent/GB2578972B/en active Active
Also Published As
Publication number | Publication date |
---|---|
GB2578972A (en) | 2020-06-03 |
GB2578972B (en) | 2020-10-07 |
GB201916688D0 (en) | 2020-01-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2508533A (en) | Instruction and logic to provide vector scatter-op and gather-op functionality | |
MX346496B (en) | Instruction to compute the distance to a specified memory boundary. | |
GB2526485A (en) | Instruction for performing pseudorandom number seed operation | |
GB201317160D0 (en) | Systems,apparatuses and methods for blending two source operands into a single destination using a writemask | |
GB2511986A (en) | Performing arithmetic operations using both large and small floating point values | |
GB2503827A (en) | Systems, apparatuses, and methods for expanding a memory source into a destination register and compressing a source register into a destination memory locati | |
GB2520859A (en) | Instruction set for SHA1 round processing on 128-BIT data paths | |
WO2015187771A3 (en) | Apparatuses and methods for performing an exclusive or operation using sensing circuitry | |
WO2015187606A3 (en) | Apparatuses and methods for parity determination using sensing circuitry | |
GB2518104A (en) | Instruction for shifting bits left with pulling ones into less significant bits | |
GB2525356A (en) | Vector floating point test data class immediate instruction | |
GB2500337A (en) | Vector conflict instructions | |
GB2525357A (en) | Vector element rotate and insert under mask instruction | |
JP2015534169A5 (en) | ||
TW200746156A (en) | Method for generating soft bits in flash memories | |
IN2012DN02561A (en) | ||
WO2014167292A3 (en) | Memory with local-/global bit line architecture and additional capacitance for global bit line discharge in reading | |
BR112019012154A2 (en) | cd8a-binding fibronectin type iii domains | |
TW201612743A (en) | Bit group interleave processors, methods, systems, and instructions | |
WO2014133895A3 (en) | Vector register addressing and functions based on a scalar register data value | |
WO2017052811A3 (en) | Secure modular exponentiation processors, methods, systems, and instructions | |
WO2010148367A3 (en) | Searching regular expressions with virtualized massively parallel programmable hardware | |
WO2015044442A3 (en) | Method for generating a sequence of binary code words of a multi-bit code for a control signal for a consumer | |
GB2571218A (en) | Memory cell structure | |
CN106575219A8 (en) | Instruction and logic technology field for the vector format for handling computing |