BR112019000120A8 - Circuito de embaralhamento para embaralhar faixa em arquitetura simd - Google Patents
Circuito de embaralhamento para embaralhar faixa em arquitetura simdInfo
- Publication number
- BR112019000120A8 BR112019000120A8 BR112019000120A BR112019000120A BR112019000120A8 BR 112019000120 A8 BR112019000120 A8 BR 112019000120A8 BR 112019000120 A BR112019000120 A BR 112019000120A BR 112019000120 A BR112019000120 A BR 112019000120A BR 112019000120 A8 BR112019000120 A8 BR 112019000120A8
- Authority
- BR
- Brazil
- Prior art keywords
- circuit
- shuffleing
- shuffle
- belt
- simd architecture
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/80—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/38—Information transfer, e.g. on bus
- G06F13/40—Bus structure
- G06F13/4004—Coupling between buses
- G06F13/4009—Coupling between buses with data restructuring
- G06F13/4013—Coupling between buses with data restructuring with data re-ordering, e.g. Endian conversion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/80—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/80—Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8053—Vector processors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30032—Movement instructions, e.g. MOVE, SHIFT, ROTATE, SHUFFLE
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
- Image Processing (AREA)
- Time-Division Multiplex Systems (AREA)
- Synchronisation In Digital Transmission Systems (AREA)
Abstract
Técnicas são descritas para executar uma operação de embaralhamento. Ao invés de usar um barramento de cruzamento de todas as faixas para todas as faixas, um circuito de embaralhamento tendo uma barra de cruzamento menor é descrita. O circuito de embaralhamento executa a operação de embaralhar em pedaços por reordenar dados recebidos de faixas de processamento e transmitir os dados reordenados.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/209,057 US10592468B2 (en) | 2016-07-13 | 2016-07-13 | Shuffler circuit for lane shuffle in SIMD architecture |
US15/209,057 | 2016-07-13 | ||
PCT/US2017/033663 WO2018013219A1 (en) | 2016-07-13 | 2017-05-19 | Shuffler circuit for lane shuffle in simd architecture |
Publications (2)
Publication Number | Publication Date |
---|---|
BR112019000120A2 BR112019000120A2 (pt) | 2019-04-09 |
BR112019000120A8 true BR112019000120A8 (pt) | 2023-01-31 |
Family
ID=58779363
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112019000120A BR112019000120A8 (pt) | 2016-07-13 | 2017-05-19 | Circuito de embaralhamento para embaralhar faixa em arquitetura simd |
Country Status (7)
Country | Link |
---|---|
US (1) | US10592468B2 (pt) |
EP (1) | EP3485385B1 (pt) |
JP (1) | JP2019521445A (pt) |
KR (1) | KR102118836B1 (pt) |
CN (1) | CN109478175B (pt) |
BR (1) | BR112019000120A8 (pt) |
WO (1) | WO2018013219A1 (pt) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10957095B2 (en) * | 2018-08-06 | 2021-03-23 | Intel Corporation | Programmable ray tracing with hardware acceleration on a graphics processor |
US10963300B2 (en) * | 2018-12-06 | 2021-03-30 | Raytheon Company | Accelerating dataflow signal processing applications across heterogeneous CPU/GPU systems |
US11397624B2 (en) * | 2019-01-22 | 2022-07-26 | Arm Limited | Execution of cross-lane operations in data processing systems |
US11294672B2 (en) * | 2019-08-22 | 2022-04-05 | Apple Inc. | Routing circuitry for permutation of single-instruction multiple-data operands |
US11256518B2 (en) | 2019-10-09 | 2022-02-22 | Apple Inc. | Datapath circuitry for math operations using SIMD pipelines |
US20210349717A1 (en) * | 2020-05-05 | 2021-11-11 | Intel Corporation | Compaction of diverged lanes for efficient use of alus |
US20220197649A1 (en) * | 2020-12-22 | 2022-06-23 | Advanced Micro Devices, Inc. | General purpose register hierarchy system and method |
CN115061731B (zh) * | 2022-06-23 | 2023-05-23 | 摩尔线程智能科技(北京)有限责任公司 | 混洗电路和方法、以及芯片和集成电路装置 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2078912A1 (en) * | 1992-01-07 | 1993-07-08 | Robert Edward Cypher | Hierarchical interconnection networks for parallel processing |
US20040054877A1 (en) * | 2001-10-29 | 2004-03-18 | Macy William W. | Method and apparatus for shuffling data |
US7343389B2 (en) | 2002-05-02 | 2008-03-11 | Intel Corporation | Apparatus and method for SIMD modular multiplication |
US9557994B2 (en) * | 2004-07-13 | 2017-01-31 | Arm Limited | Data processing apparatus and method for performing N-way interleaving and de-interleaving operations where N is an odd plural number |
US7761694B2 (en) * | 2006-06-30 | 2010-07-20 | Intel Corporation | Execution unit for performing shuffle and other operations |
GB2444744B (en) | 2006-12-12 | 2011-05-25 | Advanced Risc Mach Ltd | Apparatus and method for performing re-arrangement operations on data |
US8078836B2 (en) | 2007-12-30 | 2011-12-13 | Intel Corporation | Vector shuffle instructions operating on multiple lanes each having a plurality of data elements using a common set of per-lane control bits |
US9436469B2 (en) * | 2011-12-15 | 2016-09-06 | Intel Corporation | Methods to optimize a program loop via vector instructions using a shuffle table and a mask store table |
US9336000B2 (en) * | 2011-12-23 | 2016-05-10 | Intel Corporation | Instruction execution unit that broadcasts data values at different levels of granularity |
US9218182B2 (en) | 2012-06-29 | 2015-12-22 | Intel Corporation | Systems, apparatuses, and methods for performing a shuffle and operation (shuffle-op) |
US9342479B2 (en) | 2012-08-23 | 2016-05-17 | Qualcomm Incorporated | Systems and methods of data extraction in a vector processor |
US20140149480A1 (en) | 2012-11-28 | 2014-05-29 | Nvidia Corporation | System, method, and computer program product for transposing a matrix |
US9823924B2 (en) * | 2013-01-23 | 2017-11-21 | International Business Machines Corporation | Vector element rotate and insert under mask instruction |
US9405539B2 (en) | 2013-07-31 | 2016-08-02 | Intel Corporation | Providing vector sub-byte decompression functionality |
-
2016
- 2016-07-13 US US15/209,057 patent/US10592468B2/en active Active
-
2017
- 2017-05-19 WO PCT/US2017/033663 patent/WO2018013219A1/en unknown
- 2017-05-19 CN CN201780042845.9A patent/CN109478175B/zh active Active
- 2017-05-19 BR BR112019000120A patent/BR112019000120A8/pt unknown
- 2017-05-19 JP JP2019500593A patent/JP2019521445A/ja active Pending
- 2017-05-19 EP EP17726175.7A patent/EP3485385B1/en active Active
- 2017-05-19 KR KR1020197000601A patent/KR102118836B1/ko active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
CN109478175B (zh) | 2022-07-12 |
KR20190028426A (ko) | 2019-03-18 |
US20180018299A1 (en) | 2018-01-18 |
CN109478175A (zh) | 2019-03-15 |
EP3485385B1 (en) | 2020-04-22 |
KR102118836B1 (ko) | 2020-06-03 |
US10592468B2 (en) | 2020-03-17 |
BR112019000120A2 (pt) | 2019-04-09 |
JP2019521445A (ja) | 2019-07-25 |
EP3485385A1 (en) | 2019-05-22 |
WO2018013219A1 (en) | 2018-01-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112019000120A8 (pt) | Circuito de embaralhamento para embaralhar faixa em arquitetura simd | |
PH12018501226A1 (en) | Methods for treating huntingtons disease | |
BR112016030424A2 (pt) | construções de anticorpo multiespecíficas | |
BR112018010430A2 (pt) | composições bacterianas projetadas | |
ES2871036T3 (es) | Método para el tratamiento de trastornos relacionados con ácidos biliares | |
EA201792651A1 (ru) | Аденовирусные полинуклеотиды и полипептиды | |
MX2016011922A (es) | Secuencia de arranque comun para utilidad de control capaz de ser iniciada en multiples arquitectura. | |
BR112016028066A2 (pt) | composição de vacina; idessuis; ridessuis; anticorpo; composição parenteral; uso de uma proteína idessuis ou ridessuis; e célula hospedeira | |
BR112018011975A2 (pt) | composições úteis no tratamento de atrofia muscular espinhal | |
EA201891399A1 (ru) | Бипиразолильные производные, пригодные для лечения аутоиммунных заболеваний | |
BR112018008819A2 (pt) | composições de cianoacrilato | |
CU24405B1 (es) | Proteínas de variantes de secuencias de aminoácidos de cry1da1 activas para lepidópteros | |
AR108683A1 (es) | Vectores aav recombinantes que expresan genes osteoprotectores que incluyen has2 y lubricina, de utilidad en el tratamiento de la osteoartritis y afecciones articulares relacionadas en mamíferos | |
MX2017000094A (es) | PRODUCTOS INMUNOGENICOS BASADOS EN SECUENCIAS DE AMINOACIDOS DE MUTEINA AMILOIDE ß (Aß) Y USOS DE LOS MISMOS. | |
EA201891440A1 (ru) | Составы/композиции, содержащие ингибитор btk | |
BR112014019976A2 (pt) | extração de solvente de produtos de algas | |
BR112017013664A2 (pt) | mirna de tipo natural, composição, métodos para inibir expressão de um gene alvo e para tratar uma doença, e, ácido nucleico de fita simples. | |
BR112018010255A2 (pt) | antibióticos macrocíclicos de amplo espectro | |
BR112017014107A2 (pt) | composição nutricional útil no tratamento de pacientes com ibd | |
PH12020551270A1 (en) | Botulinum neurotoxin biohybrid | |
EA201890861A1 (ru) | Композиции, содержащие лентивирусные векторы, экспрессирующие il-12, и способы их применения | |
BR112017008543A2 (pt) | polipeptídeo para desintoxicação enzimática de zearalenona, assim como polinucleotídeo isolado, assim como aditivo, uso e processo do mesmo | |
UY36196A (es) | Compuestos para usar en el tratamiento antihelmíntico | |
CY1124987T1 (el) | Μεθοδος και διαταξη για την επιταχυνση της εκτελεσης εφαρμογων | |
BR112018076287A2 (pt) | métodos e composições para o tratamento de doença celíaca, sensibilidade ao glúten não celíaca e doença celíaca refratária |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B350 | Update of information on the portal [chapter 15.35 patent gazette] | ||
B06W | Patent application suspended after preliminary examination (for patents with searches from other patent authorities) chapter 6.23 patent gazette] | ||
B15K | Others concerning applications: alteration of classification |
Free format text: AS CLASSIFICACOES ANTERIORES ERAM: G06F 13/40 , G06F 15/80 , G06F 9/38 Ipc: G06F 9/30 (2018.01), G06F 9/38 (2018.01), G06F 13/ |