KR101871961B1 - 작업항목 동기화를 위한 방법 및 시스템 - Google Patents

작업항목 동기화를 위한 방법 및 시스템 Download PDF

Info

Publication number
KR101871961B1
KR101871961B1 KR1020147012038A KR20147012038A KR101871961B1 KR 101871961 B1 KR101871961 B1 KR 101871961B1 KR 1020147012038 A KR1020147012038 A KR 1020147012038A KR 20147012038 A KR20147012038 A KR 20147012038A KR 101871961 B1 KR101871961 B1 KR 101871961B1
Authority
KR
South Korea
Prior art keywords
barrier
work
group
work item
work items
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020147012038A
Other languages
English (en)
Korean (ko)
Other versions
KR20140088550A (ko
Inventor
리 더블유. 호웨스
베네딕트 알. 가스터
마이클 씨. 호우스톤
마이클 맨터
마크 리어서
노먼 루빈
브라이언 디. 엠버링
Original Assignee
어드밴스드 마이크로 디바이시즈, 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 어드밴스드 마이크로 디바이시즈, 인코포레이티드 filed Critical 어드밴스드 마이크로 디바이시즈, 인코포레이티드
Publication of KR20140088550A publication Critical patent/KR20140088550A/ko
Application granted granted Critical
Publication of KR101871961B1 publication Critical patent/KR101871961B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/52Program synchronisation; Mutual exclusion, e.g. by means of semaphores
    • G06F9/524Deadlock detection or avoidance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30076Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
    • G06F9/30087Synchronisation or serialisation instructions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/52Program synchronisation; Mutual exclusion, e.g. by means of semaphores
    • G06F9/522Barrier synchronisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multi Processors (AREA)
KR1020147012038A 2011-11-03 2012-10-31 작업항목 동기화를 위한 방법 및 시스템 Active KR101871961B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/288,833 US8607247B2 (en) 2011-11-03 2011-11-03 Method and system for workitem synchronization
US13/288,833 2011-11-03
PCT/US2012/062768 WO2013066988A1 (en) 2011-11-03 2012-10-31 Method and system for workitem synchronization

Publications (2)

Publication Number Publication Date
KR20140088550A KR20140088550A (ko) 2014-07-10
KR101871961B1 true KR101871961B1 (ko) 2018-08-02

Family

ID=47172902

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020147012038A Active KR101871961B1 (ko) 2011-11-03 2012-10-31 작업항목 동기화를 위한 방법 및 시스템

Country Status (6)

Country Link
US (1) US8607247B2 (enExample)
EP (1) EP2774037B1 (enExample)
JP (1) JP5984952B2 (enExample)
KR (1) KR101871961B1 (enExample)
CN (1) CN103917959B (enExample)
WO (1) WO2013066988A1 (enExample)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007002855A2 (en) * 2005-06-29 2007-01-04 Neopath Networks, Inc. Parallel filesystem traversal for transparent mirroring of directories and files
US9092272B2 (en) * 2011-12-08 2015-07-28 International Business Machines Corporation Preparing parallel tasks to use a synchronization register
US10585801B2 (en) 2012-11-26 2020-03-10 Advanced Micro Devices, Inc. Prefetch kernels on a graphics processing unit
JP5994601B2 (ja) * 2012-11-27 2016-09-21 富士通株式会社 並列計算機、並列計算機の制御プログラム及び並列計算機の制御方法
US9304940B2 (en) * 2013-03-15 2016-04-05 Intel Corporation Processors, methods, and systems to relax synchronization of accesses to shared memory
US9697003B2 (en) 2013-06-07 2017-07-04 Advanced Micro Devices, Inc. Method and system for yield operation supporting thread-like behavior
US10402235B2 (en) * 2016-04-15 2019-09-03 Nec Corporation Fine-grain synchronization in data-parallel jobs for distributed machine learning
US10402234B2 (en) * 2016-04-15 2019-09-03 Nec Corporation Fine-grain synchronization in data-parallel jobs
US10223436B2 (en) * 2016-04-27 2019-03-05 Qualcomm Incorporated Inter-subgroup data sharing
US10929944B2 (en) 2016-11-23 2021-02-23 Advanced Micro Devices, Inc. Low power and low latency GPU coprocessor for persistent computing
US20180239532A1 (en) * 2017-02-23 2018-08-23 Western Digital Technologies, Inc. Techniques for performing a non-blocking control sync operation
US11353868B2 (en) * 2017-04-24 2022-06-07 Intel Corporation Barriers and synchronization for machine learning at autonomous machines
US11436186B2 (en) * 2017-06-22 2022-09-06 Icat Llc High throughput processors
GB2569271B (en) * 2017-10-20 2020-05-13 Graphcore Ltd Synchronization with a host processor
GB2569273B (en) * 2017-10-20 2020-01-01 Graphcore Ltd Synchronization in a multi-tile processing arrangement
GB2569274B (en) * 2017-10-20 2020-07-15 Graphcore Ltd Synchronization amongst processor tiles
GB2569098B (en) * 2017-10-20 2020-01-08 Graphcore Ltd Combining states of multiple threads in a multi-threaded processor
DE102018205392A1 (de) * 2018-04-10 2019-10-10 Robert Bosch Gmbh Verfahren und Vorrichtung zur Fehlerbehandlung in einer Kommunikation zwischen verteilten Software Komponenten
DE102018205390A1 (de) * 2018-04-10 2019-10-10 Robert Bosch Gmbh Verfahren und Vorrichtung zur Fehlerbehandlung in einer Kommunikation zwischen verteilten Software Komponenten
US10824481B2 (en) * 2018-11-13 2020-11-03 International Business Machines Corporation Partial synchronization between compute tasks based on threshold specification in a computing system
US11449339B2 (en) * 2019-09-27 2022-09-20 Red Hat, Inc. Memory barrier elision for multi-threaded workloads
US11803380B2 (en) * 2019-10-29 2023-10-31 Nvidia Corporation High performance synchronization mechanisms for coordinating operations on a computer system
CN112749019B (zh) * 2019-10-29 2025-08-29 辉达公司 用于协调计算机系统上的操作的高性能同步机制
US11409579B2 (en) * 2020-02-24 2022-08-09 Intel Corporation Multiple independent synchonization named barrier within a thread group
US11231881B2 (en) * 2020-04-02 2022-01-25 Dell Products L.P. Raid data storage device multi-step command coordination system
US12314760B2 (en) * 2021-09-27 2025-05-27 Advanced Micro Devices, Inc. Garbage collecting wavefront
US11816349B2 (en) 2021-11-03 2023-11-14 Western Digital Technologies, Inc. Reduce command latency using block pre-erase
US20230289242A1 (en) * 2022-03-10 2023-09-14 Nvidia Corporation Hardware accelerated synchronization with asynchronous transaction support
CN117472600A (zh) * 2022-05-26 2024-01-30 上海壁仞科技股份有限公司 指令执行方法、处理器和电子装置

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060212868A1 (en) 2005-03-15 2006-09-21 Koichi Takayama Synchronization method and program for a parallel computer
US20090037707A1 (en) 2007-08-01 2009-02-05 Blocksome Michael A Determining When a Set of Compute Nodes Participating in a Barrier Operation on a Parallel Computer are Ready to Exit the Barrier Operation

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930807A (en) * 1997-04-23 1999-07-27 Sun Microsystems Apparatus and method for fast filtering read and write barrier operations in garbage collection system
JP3810631B2 (ja) * 2000-11-28 2006-08-16 富士通株式会社 情報処理プログラムを記録した記録媒体
US7555607B2 (en) * 2005-11-10 2009-06-30 Hewlett-Packard Development Company, L.P. Program thread syncronization for instruction cachelines
US7587555B2 (en) * 2005-11-10 2009-09-08 Hewlett-Packard Development Company, L.P. Program thread synchronization
US7660961B2 (en) * 2007-04-03 2010-02-09 Sun Microsystems, Inc. Concurrent evacuation of the young generation
KR101458028B1 (ko) * 2007-05-30 2014-11-04 삼성전자 주식회사 병렬 처리 장치 및 방법
US8140773B2 (en) * 2007-06-27 2012-03-20 Bratin Saha Using ephemeral stores for fine-grained conflict detection in a hardware accelerated STM
US8719514B2 (en) * 2007-06-27 2014-05-06 Intel Corporation Software filtering in a transactional memory system
JP2009176116A (ja) * 2008-01-25 2009-08-06 Univ Waseda マルチプロセッサシステムおよびマルチプロセッサシステムの同期方法
US20100281082A1 (en) * 2009-04-30 2010-11-04 Tatu Ylonen Oy Ltd Subordinate Multiobjects
JP5304194B2 (ja) * 2008-11-19 2013-10-02 富士通株式会社 バリア同期装置、バリア同期システム及びバリア同期装置の制御方法
US8370577B2 (en) * 2009-06-26 2013-02-05 Microsoft Corporation Metaphysically addressed cache metadata
US8229907B2 (en) * 2009-06-30 2012-07-24 Microsoft Corporation Hardware accelerated transactional memory system with open nested transactions
US8402218B2 (en) * 2009-12-15 2013-03-19 Microsoft Corporation Efficient garbage collection and exception handling in a hardware accelerated transactional memory system
US8316194B2 (en) * 2009-12-15 2012-11-20 Intel Corporation Mechanisms to accelerate transactions using buffered stores
US8280866B2 (en) * 2010-04-12 2012-10-02 Clausal Computing Oy Monitoring writes using thread-local write barrier buffers and soft synchronization
US20110264880A1 (en) * 2010-04-23 2011-10-27 Tatu Ylonen Oy Ltd Object copying with re-copying concurrently written objects
US9069545B2 (en) * 2011-07-18 2015-06-30 International Business Machines Corporation Relaxation of synchronization for iterative convergent computations

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060212868A1 (en) 2005-03-15 2006-09-21 Koichi Takayama Synchronization method and program for a parallel computer
US20090037707A1 (en) 2007-08-01 2009-02-05 Blocksome Michael A Determining When a Set of Compute Nodes Participating in a Barrier Operation on a Parallel Computer are Ready to Exit the Barrier Operation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Shivali 외 2명. 'Distributed Generalized Dynamic Barrier Synchronization', International Conference on Distributed Computing and Networking, 2011, pp.143-154.
Xiao 외 1명. 'Inter-block GPU communication via fast barrier synchronization', 2010 IEEE International Symposium on Parallel & Distributed Processing, 2010, pp.1-12.

Also Published As

Publication number Publication date
JP5984952B2 (ja) 2016-09-06
WO2013066988A1 (en) 2013-05-10
CN103917959A (zh) 2014-07-09
JP2014532937A (ja) 2014-12-08
EP2774037B1 (en) 2019-09-25
CN103917959B (zh) 2017-11-14
EP2774037A1 (en) 2014-09-10
US20130117750A1 (en) 2013-05-09
KR20140088550A (ko) 2014-07-10
US8607247B2 (en) 2013-12-10

Similar Documents

Publication Publication Date Title
KR101871961B1 (ko) 작업항목 동기화를 위한 방법 및 시스템
US10467013B2 (en) Method and system for yield operation supporting thread-like behavior
US9424099B2 (en) Method and system for synchronization of workitems with divergent control flow
US11847508B2 (en) Convergence among concurrently executing threads
US20140157287A1 (en) Optimized Context Switching for Long-Running Processes
US11803380B2 (en) High performance synchronization mechanisms for coordinating operations on a computer system
EP2989540B1 (en) Controlling tasks performed by a computing system
US20170083373A1 (en) Technique for Computational Nested Parallelism
KR20220036950A (ko) 순수 함수 신경망 가속기 시스템 및 아키텍처
US9612863B2 (en) Hardware device for accelerating the execution of a systemC simulation in a dynamic manner during the simulation
CN112749019B (zh) 用于协调计算机系统上的操作的高性能同步机制
Skrzypczak et al. Efficient parallel implementation of crowd simulation using a hybrid CPU+ GPU high performance computing system
US20150379172A1 (en) Device and method for accelerating the update phase of a simulation kernel
Zheng et al. Hiwaylib: A software framework for enabling high performance communications for heterogeneous pipeline computations
US10564948B2 (en) Method and device for processing an irregular application
Lisper Towards parallel programming models for predictability
US10996960B1 (en) Iterating single instruction, multiple-data (SIMD) instructions
Ramamurthy Towards scalar synchronization in SIMT architectures
Ding et al. Turnip: A" nondeterministic" gpu runtime with cpu ram offload
US20150033242A1 (en) Method for Automatic Parallel Computing
JP6697457B2 (ja) プロセッサ・コアをスレッド・モードからレーン・モードに遷移させ、2つのモードの間のデータ転送を可能にすること
Frank SUDS: Automatic parallelization for Raw processors
Brady et al. Introduction to MPI
Cheramangalath et al. GPU Architecture and Programming Challenges
Wang et al. A Multiprocessor RTOS Design of uC/OS

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20140502

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20171025

Comment text: Request for Examination of Application

A302 Request for accelerated examination
PA0302 Request for accelerated examination

Patent event date: 20171212

Patent event code: PA03022R01D

Comment text: Request for Accelerated Examination

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20180321

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20180621

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20180622

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
PR1001 Payment of annual fee

Payment date: 20210517

Start annual number: 4

End annual number: 4

PR1001 Payment of annual fee

Payment date: 20220614

Start annual number: 5

End annual number: 5

PR1001 Payment of annual fee

Payment date: 20240613

Start annual number: 7

End annual number: 7

PR1001 Payment of annual fee

Payment date: 20250610

Start annual number: 8

End annual number: 8