BR112016025511A2 - Techniques for serialized execution in a SIMD processing system - Google Patents
Techniques for serialized execution in a SIMD processing system
Info
- Publication number
- BR112016025511A2
- Authority
- BR
- Brazil
- Prior art keywords
- techniques
- processing system
- execution
- simd processing
- serialized execution
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
- G06F9/3009—Thread control instructions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30145—Instruction analysis, e.g. decoding, instruction word fields
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3836—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
- G06F9/3851—Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution from multiple instruction streams, e.g. multistreaming
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3888—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple threads [SIMT] in parallel
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Advance Control (AREA)
- Executing Machine-Instructions (AREA)
- Hardware Redundancy (AREA)
- Devices For Executing Special Programs (AREA)
Abstract
A SIMD processor may be configured to determine one or more active threads from a plurality of threads, select one active thread from the one or more active threads, and perform a divergent operation on the selected active thread. The divergent operation may be a serial operation.
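The three steps named in the abstract (determine the active threads, select one of them, perform the divergent operation on it) can be sketched in software as follows. This is only an illustrative model, not the patented hardware mechanism: the names `select_active_thread` and `run_serial_operation`, the boolean-list active mask, and the choice of the lowest-numbered active lane are all assumptions made for the example.

```python
from typing import Callable, List, Optional


def select_active_thread(active_mask: List[bool]) -> Optional[int]:
    """Return the index of one active thread, or None if none is active.

    Choosing the lowest-numbered active lane is an assumption of this
    sketch; the abstract only states that one active thread is selected.
    """
    for lane, is_active in enumerate(active_mask):
        if is_active:
            return lane
    return None


def run_serial_operation(
    active_mask: List[bool],
    lane_data: List[int],
    serial_op: Callable[[int], int],
) -> Optional[int]:
    """Apply serial_op to the data of exactly one selected active thread.

    All other lanes, active or not, are left untouched. Returns the
    selected lane index, or None when no thread is active.
    """
    selected = select_active_thread(active_mask)
    if selected is None:
        return None
    lane_data[selected] = serial_op(lane_data[selected])
    return selected


# Hypothetical four-lane example: lanes 1 and 3 are active.
mask = [False, True, False, True]
data = [10, 20, 30, 40]
selected = run_serial_operation(mask, data, lambda x: x + 1)
print(selected, data)
```

In this model only lane 1 is updated, even though lane 3 is also active, which is what makes the operation serial rather than data-parallel.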
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/268,215 US10133572B2 (en) | 2014-05-02 | 2014-05-02 | Techniques for serialized execution in a SIMD processing system |
PCT/US2015/025362 WO2015167777A1 (en) | 2014-05-02 | 2015-04-10 | Techniques for serialized execution in a simd processing system |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112016025511A2 (pt) | 2017-08-15 |
Family
ID=53039617
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112016025511A BR112016025511A2 (pt) | 2014-05-02 | 2015-04-10 | Techniques for serialized execution in a SIMD processing system |
Country Status (8)
Country | Link |
---|---|
US (1) | US10133572B2 (pt) |
EP (1) | EP3137988B1 (pt) |
JP (1) | JP2017515228A (pt) |
KR (1) | KR20160148673A (pt) |
CN (1) | CN106233248B (pt) |
BR (1) | BR112016025511A2 (pt) |
ES (1) | ES2834573T3 (pt) |
WO (1) | WO2015167777A1 (pt) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9898348B2 (en) | 2014-10-22 | 2018-02-20 | International Business Machines Corporation | Resource mapping in multi-threaded central processor units |
US9921838B2 (en) * | 2015-10-02 | 2018-03-20 | Mediatek Inc. | System and method for managing static divergence in a SIMD computing architecture |
EP3726732B1 (en) | 2016-04-19 | 2024-07-31 | Huawei Technologies Co., Ltd. | Vector processing for segmentation hash values calculation |
US10091904B2 (en) | 2016-07-22 | 2018-10-02 | Intel Corporation | Storage sled for data center |
US10565017B2 (en) * | 2016-09-23 | 2020-02-18 | Samsung Electronics Co., Ltd. | Multi-thread processor and controlling method thereof |
US10990409B2 (en) * | 2017-04-21 | 2021-04-27 | Intel Corporation | Control flow mechanism for execution of graphics processor instructions using active channel packing |
CN108549583B (zh) * | 2018-04-17 | 2021-05-07 | 致云科技有限公司 | Big data processing method, apparatus, server, and readable storage medium |
US12004257B2 (en) * | 2018-10-08 | 2024-06-04 | Interdigital Patent Holdings, Inc. | Device discovery and connectivity in a cellular network |
US20230097115A1 (en) * | 2021-09-27 | 2023-03-30 | Advanced Micro Devices, Inc. | Garbage collecting wavefront |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6947047B1 (en) | 2001-09-20 | 2005-09-20 | Nvidia Corporation | Method and system for programmable pipelined graphics processing with branching instructions |
US7895328B2 (en) | 2002-12-13 | 2011-02-22 | International Business Machines Corporation | System and method for context-based serialization of messages in a parallel execution environment |
WO2005072307A2 (en) | 2004-01-22 | 2005-08-11 | University Of Washington | Wavescalar architecture having a wave order memory |
US7590830B2 (en) * | 2004-05-28 | 2009-09-15 | Sun Microsystems, Inc. | Method and structure for concurrent branch prediction in a processor |
GB2437837A (en) | 2005-02-25 | 2007-11-07 | Clearspeed Technology Plc | Microprocessor architecture |
US7761697B1 (en) * | 2005-07-13 | 2010-07-20 | Nvidia Corporation | Processing an indirect branch instruction in a SIMD architecture |
US7634637B1 (en) | 2005-12-16 | 2009-12-15 | Nvidia Corporation | Execution of parallel groups of threads with per-instruction serialization |
US8176265B2 (en) | 2006-10-30 | 2012-05-08 | Nvidia Corporation | Shared single-access memory with management of multiple parallel requests |
US8312254B2 (en) * | 2008-03-24 | 2012-11-13 | Nvidia Corporation | Indirect function call instructions in a synchronous parallel thread processor |
US8850436B2 (en) | 2009-09-28 | 2014-09-30 | Nvidia Corporation | Opcode-specified predicatable warp post-synchronization |
US8782645B2 (en) * | 2011-05-11 | 2014-07-15 | Advanced Micro Devices, Inc. | Automatic load balancing for heterogeneous cores |
US8683468B2 (en) * | 2011-05-16 | 2014-03-25 | Advanced Micro Devices, Inc. | Automatic kernel migration for heterogeneous cores |
US10152329B2 (en) | 2012-02-09 | 2018-12-11 | Nvidia Corporation | Pre-scheduled replays of divergent operations |
US9256429B2 (en) | 2012-08-08 | 2016-02-09 | Qualcomm Incorporated | Selectively activating a resume check operation in a multi-threaded processing system |
US9229721B2 (en) | 2012-09-10 | 2016-01-05 | Qualcomm Incorporated | Executing subroutines in a multi-threaded processing system |
US10013290B2 (en) | 2012-09-10 | 2018-07-03 | Nvidia Corporation | System and method for synchronizing threads in a divergent region of code |
KR101603752B1 (ko) * | 2013-01-28 | 2016-03-28 | 삼성전자주식회사 | Processor supporting multiple modes and method for supporting multiple modes in the processor |
KR20150019349A (ko) * | 2013-08-13 | 2015-02-25 | 삼성전자주식회사 | Multi-thread execution processor and operating method thereof |
US9652284B2 (en) * | 2013-10-01 | 2017-05-16 | Qualcomm Incorporated | GPU divergence barrier |
2014
- 2014-05-02: US application US14/268,215, publication US10133572B2 (en), status: Active
2015
- 2015-04-10: KR application KR1020167033480A, publication KR20160148673A (ko), status: unknown
- 2015-04-10: BR application BR112016025511A, publication BR112016025511A2 (pt), status: IP Right Cessation (not active)
- 2015-04-10: EP application EP15719929.0A, publication EP3137988B1 (en), status: Active
- 2015-04-10: WO application PCT/US2015/025362, publication WO2015167777A1 (en), status: Application Filing
- 2015-04-10: ES application ES15719929T, publication ES2834573T3 (es), status: Active
- 2015-04-10: CN application CN201580021777.9A, publication CN106233248B (zh), status: Active
- 2015-04-10: JP application JP2016563817A, publication JP2017515228A (ja), status: Pending
Also Published As
Publication number | Publication date |
---|---|
KR20160148673A (ko) | 2016-12-26 |
EP3137988A1 (en) | 2017-03-08 |
US20150317157A1 (en) | 2015-11-05 |
EP3137988B1 (en) | 2020-09-02 |
ES2834573T3 (es) | 2021-06-17 |
US10133572B2 (en) | 2018-11-20 |
JP2017515228A (ja) | 2017-06-08 |
CN106233248A (zh) | 2016-12-14 |
CN106233248B (zh) | 2018-11-13 |
WO2015167777A1 (en) | 2015-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112016025511A2 (pt) | Techniques for serialized execution in a SIMD processing system | |
MX2016009060A (es) | Methods and systems for determining the risk of heart failure. | |
BR112016016831A8 (pt) | Computer-implemented method, system including memory and one or more processors, and non-transitory computer-readable medium | |
MX2016011921A (es) | Architectural mode configuration in a computing system. | |
GB2524617B (en) | Sort acceleration processors, methods, systems, and instructions | |
BR112017007160A2 (pt) | Titanium phosphinimide and titanium iminoimidazolidide catalyst systems with activator supports. | |
KR20180084732A (ko) | System and process for executing private programs on untrusted computers | |
BR112016026264A2 (pt) | System and method thereof for optimizing the boot time of computers with multiple CPUs. | |
MX2016015214A (es) | Apparatus and method. | |
MA40998A (fr) | Envenomation therapies and related pharmaceutical compositions, systems and kits | |
BR112017007654A2 (pt) | Systems and methods for imaging fluid samples | |
CL2016000021A1 (es) | Herbicidal combination comprising pelargonic acid and specific ALS inhibitors | |
BR112017009953B8 (pt) | Power production system and method for automated control of a power production system | |
AU2015364405A8 (en) | Methods for simultaneous source separation | |
CL2016000185A1 (es) | Robust hardware/software error recovery system | |
FI20155955L (fi) | Microbial fuel cell, its use, and microbial fuel cell system | |
BR112017008674A2 (pt) | Write request processing method, processor, and computer | |
MA49633A (fr) | Agents, uses and methods of treatment | |
FR3019557B1 (fr) | Incubation and detection device | |
CL2016003140A1 (es) | Gene expression system | |
SG11201703285WA (en) | Systems and computer implemented methods for monitoring an activity at one or more facilities | |
BR112017018847A2 (pt) | Design optimizer system and methods. | |
BR112018015084A2 (pt) | "Meter electronics for two or more meter assemblies, method of operating two or more meter assemblies, and system with meter electronics" | |
Dunlap et al. | Medical Students, Residents, Social Work and Nursing Students Knowledge of Substance Use Screening and Interventions | |
PAUL | An In-depth Description and Detailed Analysis of Implementing Alternative Methods of Financing Software |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | B08F | Application dismissed because of non-payment of annual fees [chapter 8.6 patent gazette] | Free format text: Regarding the 5th annuity. |
| | B08K | Patent lapsed as no evidence of payment of the annual fee has been furnished to INPI [chapter 8.11 patent gazette] | Free format text: Regarding order 8.6 published in RPI 2561 of 04/02/2020. |
| | B350 | Update of information on the portal [chapter 15.35 patent gazette] | |