KR101871961B1 - Method and system for workitem synchronization - Google Patents
Method and system for workitem synchronization
- Publication number
- KR101871961B1
- Authority
- KR
- South Korea
- Prior art keywords
- barrier
- work
- group
- work item
- work items
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Program synchronisation; Mutual exclusion, e.g. by means of semaphores
- G06F9/524—Deadlock detection or avoidance
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30076—Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
- G06F9/30087—Synchronisation or serialisation instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/52—Program synchronisation; Mutual exclusion, e.g. by means of semaphores
- G06F9/522—Barrier synchronisation
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Multi Processors (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/288,833 US8607247B2 (en) | 2011-11-03 | 2011-11-03 | Method and system for workitem synchronization |
| US13/288,833 | 2011-11-03 | ||
| PCT/US2012/062768 WO2013066988A1 (en) | 2011-11-03 | 2012-10-31 | Method and system for workitem synchronization |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20140088550A (ko) | 2014-07-10 |
| KR101871961B1 (ko) | 2018-08-02 |
Family
ID=47172902
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020147012038A (Active), KR101871961B1 (ko) | 2011-11-03 | 2012-10-31 | Method and system for workitem synchronization |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US8607247B2 |
| EP (1) | EP2774037B1 |
| JP (1) | JP5984952B2 |
| KR (1) | KR101871961B1 |
| CN (1) | CN103917959B |
| WO (1) | WO2013066988A1 |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2007002855A2 (en) * | 2005-06-29 | 2007-01-04 | Neopath Networks, Inc. | Parallel filesystem traversal for transparent mirroring of directories and files |
| US9092272B2 (en) * | 2011-12-08 | 2015-07-28 | International Business Machines Corporation | Preparing parallel tasks to use a synchronization register |
| US10585801B2 (en) | 2012-11-26 | 2020-03-10 | Advanced Micro Devices, Inc. | Prefetch kernels on a graphics processing unit |
| JP5994601B2 (ja) * | 2012-11-27 | 2016-09-21 | Fujitsu Limited | Parallel computer, control program for parallel computer, and control method for parallel computer |
| US9304940B2 (en) * | 2013-03-15 | 2016-04-05 | Intel Corporation | Processors, methods, and systems to relax synchronization of accesses to shared memory |
| US9697003B2 (en) | 2013-06-07 | 2017-07-04 | Advanced Micro Devices, Inc. | Method and system for yield operation supporting thread-like behavior |
| US10402235B2 (en) * | 2016-04-15 | 2019-09-03 | Nec Corporation | Fine-grain synchronization in data-parallel jobs for distributed machine learning |
| US10402234B2 (en) * | 2016-04-15 | 2019-09-03 | Nec Corporation | Fine-grain synchronization in data-parallel jobs |
| US10223436B2 (en) * | 2016-04-27 | 2019-03-05 | Qualcomm Incorporated | Inter-subgroup data sharing |
| US10929944B2 (en) | 2016-11-23 | 2021-02-23 | Advanced Micro Devices, Inc. | Low power and low latency GPU coprocessor for persistent computing |
| US20180239532A1 (en) * | 2017-02-23 | 2018-08-23 | Western Digital Technologies, Inc. | Techniques for performing a non-blocking control sync operation |
| US11353868B2 (en) * | 2017-04-24 | 2022-06-07 | Intel Corporation | Barriers and synchronization for machine learning at autonomous machines |
| US11436186B2 (en) * | 2017-06-22 | 2022-09-06 | Icat Llc | High throughput processors |
| GB2569271B (en) * | 2017-10-20 | 2020-05-13 | Graphcore Ltd | Synchronization with a host processor |
| GB2569273B (en) * | 2017-10-20 | 2020-01-01 | Graphcore Ltd | Synchronization in a multi-tile processing arrangement |
| GB2569274B (en) * | 2017-10-20 | 2020-07-15 | Graphcore Ltd | Synchronization amongst processor tiles |
| GB2569098B (en) * | 2017-10-20 | 2020-01-08 | Graphcore Ltd | Combining states of multiple threads in a multi-threaded processor |
| DE102018205392A1 (de) * | 2018-04-10 | 2019-10-10 | Robert Bosch Gmbh | Method and device for error handling in a communication between distributed software components |
| DE102018205390A1 (de) * | 2018-04-10 | 2019-10-10 | Robert Bosch Gmbh | Method and device for error handling in a communication between distributed software components |
| US10824481B2 (en) * | 2018-11-13 | 2020-11-03 | International Business Machines Corporation | Partial synchronization between compute tasks based on threshold specification in a computing system |
| US11449339B2 (en) * | 2019-09-27 | 2022-09-20 | Red Hat, Inc. | Memory barrier elision for multi-threaded workloads |
| US11803380B2 (en) * | 2019-10-29 | 2023-10-31 | Nvidia Corporation | High performance synchronization mechanisms for coordinating operations on a computer system |
| CN112749019B (zh) * | 2019-10-29 | 2025-08-29 | Nvidia Corporation | High performance synchronization mechanisms for coordinating operations on a computer system |
| US11409579B2 (en) * | 2020-02-24 | 2022-08-09 | Intel Corporation | Multiple independent synchonization named barrier within a thread group |
| US11231881B2 (en) * | 2020-04-02 | 2022-01-25 | Dell Products L.P. | Raid data storage device multi-step command coordination system |
| US12314760B2 (en) * | 2021-09-27 | 2025-05-27 | Advanced Micro Devices, Inc. | Garbage collecting wavefront |
| US11816349B2 (en) | 2021-11-03 | 2023-11-14 | Western Digital Technologies, Inc. | Reduce command latency using block pre-erase |
| US20230289242A1 (en) * | 2022-03-10 | 2023-09-14 | Nvidia Corporation | Hardware accelerated synchronization with asynchronous transaction support |
| CN117472600A (zh) * | 2022-05-26 | 2024-01-30 | Shanghai Biren Technology Co., Ltd. | Instruction execution method, processor, and electronic device |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060212868A1 (en) | 2005-03-15 | 2006-09-21 | Koichi Takayama | Synchronization method and program for a parallel computer |
| US20090037707A1 (en) | 2007-08-01 | 2009-02-05 | Blocksome Michael A | Determining When a Set of Compute Nodes Participating in a Barrier Operation on a Parallel Computer are Ready to Exit the Barrier Operation |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5930807A (en) * | 1997-04-23 | 1999-07-27 | Sun Microsystems | Apparatus and method for fast filtering read and write barrier operations in garbage collection system |
| JP3810631B2 (ja) * | 2000-11-28 | 2006-08-16 | Fujitsu Limited | Recording medium recording an information processing program |
| US7555607B2 (en) * | 2005-11-10 | 2009-06-30 | Hewlett-Packard Development Company, L.P. | Program thread syncronization for instruction cachelines |
| US7587555B2 (en) * | 2005-11-10 | 2009-09-08 | Hewlett-Packard Development Company, L.P. | Program thread synchronization |
| US7660961B2 (en) * | 2007-04-03 | 2010-02-09 | Sun Microsystems, Inc. | Concurrent evacuation of the young generation |
| KR101458028B1 (ko) * | 2007-05-30 | 2014-11-04 | Samsung Electronics Co., Ltd. | Parallel processing apparatus and method |
| US8140773B2 (en) * | 2007-06-27 | 2012-03-20 | Bratin Saha | Using ephemeral stores for fine-grained conflict detection in a hardware accelerated STM |
| US8719514B2 (en) * | 2007-06-27 | 2014-05-06 | Intel Corporation | Software filtering in a transactional memory system |
| JP2009176116A (ja) * | 2008-01-25 | 2009-08-06 | Univ Waseda | Multiprocessor system and synchronization method for multiprocessor system |
| US20100281082A1 (en) * | 2009-04-30 | 2010-11-04 | Tatu Ylonen Oy Ltd | Subordinate Multiobjects |
| JP5304194B2 (ja) * | 2008-11-19 | 2013-10-02 | Fujitsu Limited | Barrier synchronization device, barrier synchronization system, and control method for barrier synchronization device |
| US8370577B2 (en) * | 2009-06-26 | 2013-02-05 | Microsoft Corporation | Metaphysically addressed cache metadata |
| US8229907B2 (en) * | 2009-06-30 | 2012-07-24 | Microsoft Corporation | Hardware accelerated transactional memory system with open nested transactions |
| US8402218B2 (en) * | 2009-12-15 | 2013-03-19 | Microsoft Corporation | Efficient garbage collection and exception handling in a hardware accelerated transactional memory system |
| US8316194B2 (en) * | 2009-12-15 | 2012-11-20 | Intel Corporation | Mechanisms to accelerate transactions using buffered stores |
| US8280866B2 (en) * | 2010-04-12 | 2012-10-02 | Clausal Computing Oy | Monitoring writes using thread-local write barrier buffers and soft synchronization |
| US20110264880A1 (en) * | 2010-04-23 | 2011-10-27 | Tatu Ylonen Oy Ltd | Object copying with re-copying concurrently written objects |
| US9069545B2 (en) * | 2011-07-18 | 2015-06-30 | International Business Machines Corporation | Relaxation of synchronization for iterative convergent computations |
2011
- 2011-11-03: US, application US13/288,833, published as US8607247B2, status: Active
2012
- 2012-10-31: WO, application PCT/US2012/062768, published as WO2013066988A1, status: Ceased
- 2012-10-31: CN, application CN201280053875.7A, published as CN103917959B, status: Active
- 2012-10-31: KR, application KR1020147012038A, published as KR101871961B1, status: Active
- 2012-10-31: EP, application EP12784403.3A, published as EP2774037B1, status: Active
- 2012-10-31: JP, application JP2014540034A, published as JP5984952B2, status: Active
Non-Patent Citations (2)
| Title |
|---|
| Shivali et al. 'Distributed Generalized Dynamic Barrier Synchronization', International Conference on Distributed Computing and Networking, 2011, pp. 143-154. |
| Xiao et al. 'Inter-block GPU communication via fast barrier synchronization', 2010 IEEE International Symposium on Parallel & Distributed Processing, 2010, pp. 1-12. |
Also Published As
| Publication number | Publication date |
|---|---|
| JP5984952B2 (ja) | 2016-09-06 |
| WO2013066988A1 (en) | 2013-05-10 |
| CN103917959A (zh) | 2014-07-09 |
| JP2014532937A (ja) | 2014-12-08 |
| EP2774037B1 (en) | 2019-09-25 |
| CN103917959B (zh) | 2017-11-14 |
| EP2774037A1 (en) | 2014-09-10 |
| US20130117750A1 (en) | 2013-05-09 |
| KR20140088550A (ko) | 2014-07-10 |
| US8607247B2 (en) | 2013-12-10 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| KR101871961B1 (ko) | Method and system for workitem synchronization | |
| US10467013B2 (en) | Method and system for yield operation supporting thread-like behavior | |
| US9424099B2 (en) | Method and system for synchronization of workitems with divergent control flow | |
| US11847508B2 (en) | Convergence among concurrently executing threads | |
| US20140157287A1 (en) | Optimized Context Switching for Long-Running Processes | |
| US11803380B2 (en) | High performance synchronization mechanisms for coordinating operations on a computer system | |
| EP2989540B1 (en) | Controlling tasks performed by a computing system | |
| US20170083373A1 (en) | Technique for Computational Nested Parallelism | |
| KR20220036950A (ko) | Pure functional neural network accelerator system and architecture | |
| US9612863B2 (en) | Hardware device for accelerating the execution of a systemC simulation in a dynamic manner during the simulation | |
| CN112749019B (zh) | High performance synchronization mechanisms for coordinating operations on a computer system | |
| Skrzypczak et al. | Efficient parallel implementation of crowd simulation using a hybrid CPU+ GPU high performance computing system | |
| US20150379172A1 (en) | Device and method for accelerating the update phase of a simulation kernel | |
| Zheng et al. | Hiwaylib: A software framework for enabling high performance communications for heterogeneous pipeline computations | |
| US10564948B2 (en) | Method and device for processing an irregular application | |
| Lisper | Towards parallel programming models for predictability | |
| US10996960B1 (en) | Iterating single instruction, multiple-data (SIMD) instructions | |
| Ramamurthy | Towards scalar synchronization in SIMT architectures | |
| Ding et al. | Turnip: A "nondeterministic" gpu runtime with cpu ram offload | |
| US20150033242A1 (en) | Method for Automatic Parallel Computing | |
| JP6697457B2 (ja) | Transitioning a processor core from thread mode to lane mode and enabling data transfer between the two modes | |
| Frank | SUDS: Automatic parallelization for Raw processors | |
| Brady et al. | Introduction to MPI | |
| Cheramangalath et al. | GPU Architecture and Programming Challenges | |
| Wang et al. | A Multiprocessor RTOS Design of uC/OS |
Legal Events
| Code | Title | Description |
|---|---|---|
| PA0105 | International application | Patent event date: 20140502; Patent event code: PA01051R01D; Comment text: International Patent Application |
| PG1501 | Laying open of application | |
| A201 | Request for examination | |
| PA0201 | Request for examination | Patent event code: PA02012R01D; Patent event date: 20171025; Comment text: Request for Examination of Application |
| A302 | Request for accelerated examination | |
| PA0302 | Request for accelerated examination | Patent event date: 20171212; Patent event code: PA03022R01D; Comment text: Request for Accelerated Examination |
| E701 | Decision to grant or registration of patent right | |
| PE0701 | Decision of registration | Patent event code: PE07011S01D; Comment text: Decision to Grant Registration; Patent event date: 20180321 |
| GRNT | Written decision to grant | |
| PR0701 | Registration of establishment | Comment text: Registration of Establishment; Patent event date: 20180621; Patent event code: PR07011E01D |
| PR1002 | Payment of registration fee | Payment date: 20180622; Start annual number: 1; End annual number: 3 |
| PG1601 | Publication of registration | |
| PR1001 | Payment of annual fee | Payment date: 20210517; Start annual number: 4; End annual number: 4 |
| PR1001 | Payment of annual fee | Payment date: 20220614; Start annual number: 5; End annual number: 5 |
| PR1001 | Payment of annual fee | Payment date: 20240613; Start annual number: 7; End annual number: 7 |
| PR1001 | Payment of annual fee | Payment date: 20250610; Start annual number: 8; End annual number: 8 |