JP5984952B2 - 作業項目の同期のための方法及びシステム - Google Patents

作業項目の同期のための方法及びシステム Download PDF

Info

Publication number
JP5984952B2
JP5984952B2 JP2014540034A JP2014540034A JP5984952B2 JP 5984952 B2 JP5984952 B2 JP 5984952B2 JP 2014540034 A JP2014540034 A JP 2014540034A JP 2014540034 A JP2014540034 A JP 2014540034A JP 5984952 B2 JP5984952 B2 JP 5984952B2
Authority
JP
Japan
Prior art keywords
barrier
work
group
work items
work item
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2014540034A
Other languages
English (en)
Japanese (ja)
Other versions
JP2014532937A5 (https=
JP2014532937A (ja
Inventor
ダブリュ. ハウズ リー
ダブリュ. ハウズ リー
アール. ガスター ベネディクト
アール. ガスター ベネディクト
シー. ヒューストン マイケル
シー. ヒューストン マイケル
マントル マイケル
マントル マイケル
レザー マーク
レザー マーク
ルビン ノーマン
ルビン ノーマン
ディー. エンバーリング ブライアン
ディー. エンバーリング ブライアン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc filed Critical Advanced Micro Devices Inc
Publication of JP2014532937A publication Critical patent/JP2014532937A/ja
Publication of JP2014532937A5 publication Critical patent/JP2014532937A5/ja
Application granted granted Critical
Publication of JP5984952B2 publication Critical patent/JP5984952B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/52Program synchronisation; Mutual exclusion, e.g. by means of semaphores
    • G06F9/524Deadlock detection or avoidance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30076Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
    • G06F9/30087Synchronisation or serialisation instructions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/52Program synchronisation; Mutual exclusion, e.g. by means of semaphores
    • G06F9/522Barrier synchronisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multi Processors (AREA)
JP2014540034A 2011-11-03 2012-10-31 作業項目の同期のための方法及びシステム Active JP5984952B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/288,833 US8607247B2 (en) 2011-11-03 2011-11-03 Method and system for workitem synchronization
US13/288,833 2011-11-03
PCT/US2012/062768 WO2013066988A1 (en) 2011-11-03 2012-10-31 Method and system for workitem synchronization

Publications (3)

Publication Number Publication Date
JP2014532937A JP2014532937A (ja) 2014-12-08
JP2014532937A5 JP2014532937A5 (https=) 2015-12-17
JP5984952B2 true JP5984952B2 (ja) 2016-09-06

Family

ID=47172902

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2014540034A Active JP5984952B2 (ja) 2011-11-03 2012-10-31 作業項目の同期のための方法及びシステム

Country Status (6)

Country Link
US (1) US8607247B2 (https=)
EP (1) EP2774037B1 (https=)
JP (1) JP5984952B2 (https=)
KR (1) KR101871961B1 (https=)
CN (1) CN103917959B (https=)
WO (1) WO2013066988A1 (https=)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1900189B1 (en) * 2005-06-29 2018-04-18 Cisco Technology, Inc. Parallel filesystem traversal for transparent mirroring of directories and files
US9092272B2 (en) * 2011-12-08 2015-07-28 International Business Machines Corporation Preparing parallel tasks to use a synchronization register
US10585801B2 (en) 2012-11-26 2020-03-10 Advanced Micro Devices, Inc. Prefetch kernels on a graphics processing unit
JP5994601B2 (ja) * 2012-11-27 2016-09-21 富士通株式会社 並列計算機、並列計算機の制御プログラム及び並列計算機の制御方法
US9304940B2 (en) 2013-03-15 2016-04-05 Intel Corporation Processors, methods, and systems to relax synchronization of accesses to shared memory
US9697003B2 (en) 2013-06-07 2017-07-04 Advanced Micro Devices, Inc. Method and system for yield operation supporting thread-like behavior
US10402235B2 (en) * 2016-04-15 2019-09-03 Nec Corporation Fine-grain synchronization in data-parallel jobs for distributed machine learning
US10402234B2 (en) * 2016-04-15 2019-09-03 Nec Corporation Fine-grain synchronization in data-parallel jobs
US10223436B2 (en) * 2016-04-27 2019-03-05 Qualcomm Incorporated Inter-subgroup data sharing
US10929944B2 (en) * 2016-11-23 2021-02-23 Advanced Micro Devices, Inc. Low power and low latency GPU coprocessor for persistent computing
US20180239532A1 (en) * 2017-02-23 2018-08-23 Western Digital Technologies, Inc. Techniques for performing a non-blocking control sync operation
US11353868B2 (en) * 2017-04-24 2022-06-07 Intel Corporation Barriers and synchronization for machine learning at autonomous machines
KR20200031625A (ko) * 2017-06-22 2020-03-24 아이씨에이티 엘엘씨 고성능 프로세서
GB2569271B (en) * 2017-10-20 2020-05-13 Graphcore Ltd Synchronization with a host processor
GB2569098B (en) 2017-10-20 2020-01-08 Graphcore Ltd Combining states of multiple threads in a multi-threaded processor
GB2569274B (en) * 2017-10-20 2020-07-15 Graphcore Ltd Synchronization amongst processor tiles
GB2569273B (en) * 2017-10-20 2020-01-01 Graphcore Ltd Synchronization in a multi-tile processing arrangement
DE102018205392A1 (de) * 2018-04-10 2019-10-10 Robert Bosch Gmbh Verfahren und Vorrichtung zur Fehlerbehandlung in einer Kommunikation zwischen verteilten Software Komponenten
DE102018205390A1 (de) * 2018-04-10 2019-10-10 Robert Bosch Gmbh Verfahren und Vorrichtung zur Fehlerbehandlung in einer Kommunikation zwischen verteilten Software Komponenten
US10824481B2 (en) * 2018-11-13 2020-11-03 International Business Machines Corporation Partial synchronization between compute tasks based on threshold specification in a computing system
US11449339B2 (en) * 2019-09-27 2022-09-20 Red Hat, Inc. Memory barrier elision for multi-threaded workloads
CN112749019B (zh) * 2019-10-29 2025-08-29 辉达公司 用于协调计算机系统上的操作的高性能同步机制
US11080051B2 (en) * 2019-10-29 2021-08-03 Nvidia Corporation Techniques for efficiently transferring data to a processor
US11409579B2 (en) * 2020-02-24 2022-08-09 Intel Corporation Multiple independent synchonization named barrier within a thread group
US11231881B2 (en) * 2020-04-02 2022-01-25 Dell Products L.P. Raid data storage device multi-step command coordination system
US12314760B2 (en) * 2021-09-27 2025-05-27 Advanced Micro Devices, Inc. Garbage collecting wavefront
US11816349B2 (en) 2021-11-03 2023-11-14 Western Digital Technologies, Inc. Reduce command latency using block pre-erase
US12536056B2 (en) * 2022-03-10 2026-01-27 Nvidia Corporation Hardware accelerated synchronization with asynchronous transaction support
CN117472600A (zh) * 2022-05-26 2024-01-30 上海壁仞科技股份有限公司 指令执行方法、处理器和电子装置

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5930807A (en) * 1997-04-23 1999-07-27 Sun Microsystems Apparatus and method for fast filtering read and write barrier operations in garbage collection system
JP3810631B2 (ja) * 2000-11-28 2006-08-16 富士通株式会社 情報処理プログラムを記録した記録媒体
JP4448784B2 (ja) 2005-03-15 2010-04-14 株式会社日立製作所 並列計算機の同期方法及びプログラム
US7587555B2 (en) * 2005-11-10 2009-09-08 Hewlett-Packard Development Company, L.P. Program thread synchronization
US7555607B2 (en) * 2005-11-10 2009-06-30 Hewlett-Packard Development Company, L.P. Program thread syncronization for instruction cachelines
US7660961B2 (en) * 2007-04-03 2010-02-09 Sun Microsystems, Inc. Concurrent evacuation of the young generation
KR101458028B1 (ko) * 2007-05-30 2014-11-04 삼성전자 주식회사 병렬 처리 장치 및 방법
US8719514B2 (en) * 2007-06-27 2014-05-06 Intel Corporation Software filtering in a transactional memory system
US8140773B2 (en) * 2007-06-27 2012-03-20 Bratin Saha Using ephemeral stores for fine-grained conflict detection in a hardware accelerated STM
US8082424B2 (en) 2007-08-01 2011-12-20 International Business Machines Corporation Determining when a set of compute nodes participating in a barrier operation on a parallel computer are ready to exit the barrier operation
JP2009176116A (ja) * 2008-01-25 2009-08-06 Univ Waseda マルチプロセッサシステムおよびマルチプロセッサシステムの同期方法
US20100281082A1 (en) * 2009-04-30 2010-11-04 Tatu Ylonen Oy Ltd Subordinate Multiobjects
JP5304194B2 (ja) * 2008-11-19 2013-10-02 富士通株式会社 バリア同期装置、バリア同期システム及びバリア同期装置の制御方法
US8370577B2 (en) * 2009-06-26 2013-02-05 Microsoft Corporation Metaphysically addressed cache metadata
US8229907B2 (en) * 2009-06-30 2012-07-24 Microsoft Corporation Hardware accelerated transactional memory system with open nested transactions
US8316194B2 (en) * 2009-12-15 2012-11-20 Intel Corporation Mechanisms to accelerate transactions using buffered stores
US8402218B2 (en) * 2009-12-15 2013-03-19 Microsoft Corporation Efficient garbage collection and exception handling in a hardware accelerated transactional memory system
US8280866B2 (en) * 2010-04-12 2012-10-02 Clausal Computing Oy Monitoring writes using thread-local write barrier buffers and soft synchronization
US20110264880A1 (en) * 2010-04-23 2011-10-27 Tatu Ylonen Oy Ltd Object copying with re-copying concurrently written objects
US9069545B2 (en) * 2011-07-18 2015-06-30 International Business Machines Corporation Relaxation of synchronization for iterative convergent computations

Also Published As

Publication number Publication date
EP2774037A1 (en) 2014-09-10
WO2013066988A1 (en) 2013-05-10
EP2774037B1 (en) 2019-09-25
KR20140088550A (ko) 2014-07-10
KR101871961B1 (ko) 2018-08-02
JP2014532937A (ja) 2014-12-08
US8607247B2 (en) 2013-12-10
CN103917959A (zh) 2014-07-09
CN103917959B (zh) 2017-11-14
US20130117750A1 (en) 2013-05-09

Similar Documents

Publication Publication Date Title
JP5984952B2 (ja) 作業項目の同期のための方法及びシステム
US20230038061A1 (en) Convergence among concurrently executing threads
US10467013B2 (en) Method and system for yield operation supporting thread-like behavior
US9424099B2 (en) Method and system for synchronization of workitems with divergent control flow
US11803380B2 (en) High performance synchronization mechanisms for coordinating operations on a computer system
US10915364B2 (en) Technique for computational nested parallelism
US20140157287A1 (en) Optimized Context Switching for Long-Running Processes
US9928109B2 (en) Method and system for processing nested stream events
EP2989540B1 (en) Controlling tasks performed by a computing system
US10558418B2 (en) Monitor support on accelerated processing device
TW201331836A (zh) 推理執行和回復
JP2009230756A (ja) 同期並列スレッドプロセッサにおける間接的な関数呼び出し命令
KR101293701B1 (ko) 코어스 그레인드 재구성 어레이에서의 중첩 루프문 수행 장치 및 그 방법
WO2020005412A2 (en) Method and system for opportunistic load balancing in neural networks using metadata
Skrzypczak et al. Efficient parallel implementation of crowd simulation using a hybrid CPU+ GPU high performance computing system
Zheng et al. Hiwaylib: A software framework for enabling high performance communications for heterogeneous pipeline computations
US10564948B2 (en) Method and device for processing an irregular application
JP2024541294A (ja) アクセラレータ常駐ランタイム管理を介した高スケーラブルhpcアプリケーションにおけるレイテンシの低減
JP2022510805A (ja) レイトレーシングにおけるトライアングル及びボックスの交差テストのための統合されたデータパス
JP6697457B2 (ja) プロセッサ・コアをスレッド・モードからレーン・モードに遷移させ、2つのモードの間のデータ転送を可能にすること
US20250165292A1 (en) Data processor
JP2012141852A (ja) 論理計算システム、生成装置、生成方法及びプログラム
Cheramangalath et al. GPU Architecture and Programming Challenges
Chen Accelerating SRD Simulation on GPU.

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20151027

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20151027

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20151027

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20151218

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20160119

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160418

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20160705

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20160802

R150 Certificate of patent or registration of utility model

Ref document number: 5984952

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250