BR112019000120A8 - Circuito de embaralhamento para embaralhar faixa em arquitetura simd - Google Patents

Circuito de embaralhamento para embaralhar faixa em arquitetura simd

Info

Publication number
BR112019000120A8
BR112019000120A8 BR112019000120A BR112019000120A BR112019000120A8 BR 112019000120 A8 BR112019000120 A8 BR 112019000120A8 BR 112019000120 A BR112019000120 A BR 112019000120A BR 112019000120 A BR112019000120 A BR 112019000120A BR 112019000120 A8 BR112019000120 A8 BR 112019000120A8
Authority
BR
Brazil
Prior art keywords
circuit
shuffleing
shuffle
belt
simd architecture
Prior art date
Application number
BR112019000120A
Other languages
English (en)
Other versions
BR112019000120A2 (pt
Inventor
Han Liang
Jin Xiangdong
Chen Lin
Du Yun
Vladimirovich Bourd Alexei
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of BR112019000120A2 publication Critical patent/BR112019000120A2/pt
Publication of BR112019000120A8 publication Critical patent/BR112019000120A8/pt

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • G06F15/8007Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/38Information transfer, e.g. on bus
    • G06F13/40Bus structure
    • G06F13/4004Coupling between buses
    • G06F13/4009Coupling between buses with data restructuring
    • G06F13/4013Coupling between buses with data restructuring with data re-ordering, e.g. Endian conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/80Architectures of general purpose stored program computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • G06F15/8053Vector processors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30007Arrangements for executing specific machine instructions to perform operations on data operands
    • G06F9/30032Movement instructions, e.g. MOVE, SHIFT, ROTATE, SHUFFLE
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30007Arrangements for executing specific machine instructions to perform operations on data operands
    • G06F9/30036Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline or look ahead
    • G06F9/3885Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
    • G06F9/3887Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Advance Control (AREA)
  • Executing Machine-Instructions (AREA)
  • Image Processing (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Synchronisation In Digital Transmission Systems (AREA)

Abstract

Técnicas são descritas para executar uma operação de embaralhamento. Ao invés de usar um barramento de cruzamento de todas as faixas para todas as faixas, um circuito de embaralhamento tendo uma barra de cruzamento menor é descrita. O circuito de embaralhamento executa a operação de embaralhar em pedaços por reordenar dados recebidos de faixas de processamento e transmitir os dados reordenados.
BR112019000120A 2016-07-13 2017-05-19 Circuito de embaralhamento para embaralhar faixa em arquitetura simd BR112019000120A8 (pt)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15/209,057 US10592468B2 (en) 2016-07-13 2016-07-13 Shuffler circuit for lane shuffle in SIMD architecture
US15/209,057 2016-07-13
PCT/US2017/033663 WO2018013219A1 (en) 2016-07-13 2017-05-19 Shuffler circuit for lane shuffle in simd architecture

Publications (2)

Publication Number Publication Date
BR112019000120A2 BR112019000120A2 (pt) 2019-04-09
BR112019000120A8 true BR112019000120A8 (pt) 2023-01-31

Family

ID=58779363

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112019000120A BR112019000120A8 (pt) 2016-07-13 2017-05-19 Circuito de embaralhamento para embaralhar faixa em arquitetura simd

Country Status (7)

Country Link
US (1) US10592468B2 (pt)
EP (1) EP3485385B1 (pt)
JP (1) JP2019521445A (pt)
KR (1) KR102118836B1 (pt)
CN (1) CN109478175B (pt)
BR (1) BR112019000120A8 (pt)
WO (1) WO2018013219A1 (pt)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10957095B2 (en) * 2018-08-06 2021-03-23 Intel Corporation Programmable ray tracing with hardware acceleration on a graphics processor
US10963300B2 (en) * 2018-12-06 2021-03-30 Raytheon Company Accelerating dataflow signal processing applications across heterogeneous CPU/GPU systems
US11397624B2 (en) * 2019-01-22 2022-07-26 Arm Limited Execution of cross-lane operations in data processing systems
US11294672B2 (en) * 2019-08-22 2022-04-05 Apple Inc. Routing circuitry for permutation of single-instruction multiple-data operands
US11256518B2 (en) 2019-10-09 2022-02-22 Apple Inc. Datapath circuitry for math operations using SIMD pipelines
US20210349717A1 (en) * 2020-05-05 2021-11-11 Intel Corporation Compaction of diverged lanes for efficient use of alus
US20220197649A1 (en) * 2020-12-22 2022-06-23 Advanced Micro Devices, Inc. General purpose register hierarchy system and method
CN115061731B (zh) * 2022-06-23 2023-05-23 摩尔线程智能科技(北京)有限责任公司 混洗电路和方法、以及芯片和集成电路装置

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2078912A1 (en) * 1992-01-07 1993-07-08 Robert Edward Cypher Hierarchical interconnection networks for parallel processing
US20040054877A1 (en) * 2001-10-29 2004-03-18 Macy William W. Method and apparatus for shuffling data
US7343389B2 (en) 2002-05-02 2008-03-11 Intel Corporation Apparatus and method for SIMD modular multiplication
US9557994B2 (en) * 2004-07-13 2017-01-31 Arm Limited Data processing apparatus and method for performing N-way interleaving and de-interleaving operations where N is an odd plural number
US7761694B2 (en) * 2006-06-30 2010-07-20 Intel Corporation Execution unit for performing shuffle and other operations
GB2444744B (en) 2006-12-12 2011-05-25 Advanced Risc Mach Ltd Apparatus and method for performing re-arrangement operations on data
US8078836B2 (en) 2007-12-30 2011-12-13 Intel Corporation Vector shuffle instructions operating on multiple lanes each having a plurality of data elements using a common set of per-lane control bits
US9436469B2 (en) * 2011-12-15 2016-09-06 Intel Corporation Methods to optimize a program loop via vector instructions using a shuffle table and a mask store table
US9336000B2 (en) * 2011-12-23 2016-05-10 Intel Corporation Instruction execution unit that broadcasts data values at different levels of granularity
US9218182B2 (en) 2012-06-29 2015-12-22 Intel Corporation Systems, apparatuses, and methods for performing a shuffle and operation (shuffle-op)
US9342479B2 (en) 2012-08-23 2016-05-17 Qualcomm Incorporated Systems and methods of data extraction in a vector processor
US20140149480A1 (en) 2012-11-28 2014-05-29 Nvidia Corporation System, method, and computer program product for transposing a matrix
US9823924B2 (en) * 2013-01-23 2017-11-21 International Business Machines Corporation Vector element rotate and insert under mask instruction
US9405539B2 (en) 2013-07-31 2016-08-02 Intel Corporation Providing vector sub-byte decompression functionality

Also Published As

Publication number Publication date
CN109478175B (zh) 2022-07-12
KR20190028426A (ko) 2019-03-18
US20180018299A1 (en) 2018-01-18
CN109478175A (zh) 2019-03-15
EP3485385B1 (en) 2020-04-22
KR102118836B1 (ko) 2020-06-03
US10592468B2 (en) 2020-03-17
BR112019000120A2 (pt) 2019-04-09
JP2019521445A (ja) 2019-07-25
EP3485385A1 (en) 2019-05-22
WO2018013219A1 (en) 2018-01-18

Similar Documents

Publication Publication Date Title
BR112019000120A8 (pt) Circuito de embaralhamento para embaralhar faixa em arquitetura simd
PH12018501226A1 (en) Methods for treating huntingtons disease
BR112016030424A2 (pt) construções de anticorpo multiespecíficas
BR112018010430A2 (pt) composições bacterianas projetadas
ES2871036T3 (es) Método para el tratamiento de trastornos relacionados con ácidos biliares
EA201792651A1 (ru) Аденовирусные полинуклеотиды и полипептиды
MX2016011922A (es) Secuencia de arranque comun para utilidad de control capaz de ser iniciada en multiples arquitectura.
BR112016028066A2 (pt) composição de vacina; idessuis; ridessuis; anticorpo; composição parenteral; uso de uma proteína idessuis ou ridessuis; e célula hospedeira
BR112018011975A2 (pt) composições úteis no tratamento de atrofia muscular espinhal
EA201891399A1 (ru) Бипиразолильные производные, пригодные для лечения аутоиммунных заболеваний
BR112018008819A2 (pt) composições de cianoacrilato
CU24405B1 (es) Proteínas de variantes de secuencias de aminoácidos de cry1da1 activas para lepidópteros
AR108683A1 (es) Vectores aav recombinantes que expresan genes osteoprotectores que incluyen has2 y lubricina, de utilidad en el tratamiento de la osteoartritis y afecciones articulares relacionadas en mamíferos
MX2017000094A (es) PRODUCTOS INMUNOGENICOS BASADOS EN SECUENCIAS DE AMINOACIDOS DE MUTEINA AMILOIDE ß (Aß) Y USOS DE LOS MISMOS.
EA201891440A1 (ru) Составы/композиции, содержащие ингибитор btk
BR112014019976A2 (pt) extração de solvente de produtos de algas
BR112017013664A2 (pt) mirna de tipo natural, composição, métodos para inibir expressão de um gene alvo e para tratar uma doença, e, ácido nucleico de fita simples.
BR112018010255A2 (pt) antibióticos macrocíclicos de amplo espectro
BR112017014107A2 (pt) composição nutricional útil no tratamento de pacientes com ibd
PH12020551270A1 (en) Botulinum neurotoxin biohybrid
EA201890861A1 (ru) Композиции, содержащие лентивирусные векторы, экспрессирующие il-12, и способы их применения
BR112017008543A2 (pt) polipeptídeo para desintoxicação enzimática de zearalenona, assim como polinucleotídeo isolado, assim como aditivo, uso e processo do mesmo
UY36196A (es) Compuestos para usar en el tratamiento antihelmíntico
CY1124987T1 (el) Μεθοδος και διαταξη για την επιταχυνση της εκτελεσης εφαρμογων
BR112018076287A2 (pt) métodos e composições para o tratamento de doença celíaca, sensibilidade ao glúten não celíaca e doença celíaca refratária

Legal Events

Date Code Title Description
B350 Update of information on the portal [chapter 15.35 patent gazette]
B06W Patent application suspended after preliminary examination (for patents with searches from other patent authorities) chapter 6.23 patent gazette]
B15K Others concerning applications: alteration of classification

Free format text: AS CLASSIFICACOES ANTERIORES ERAM: G06F 13/40 , G06F 15/80 , G06F 9/38

Ipc: G06F 9/30 (2018.01), G06F 9/38 (2018.01), G06F 13/