CN103608776B - 异构型处理设备上的动态工作划分 - Google Patents

异构型处理设备上的动态工作划分 Download PDF

Info

Publication number
CN103608776B
CN103608776B CN201180060199.1A CN201180060199A CN103608776B CN 103608776 B CN103608776 B CN 103608776B CN 201180060199 A CN201180060199 A CN 201180060199A CN 103608776 B CN103608776 B CN 103608776B
Authority
CN
China
Prior art keywords
processors
module
post
task
apd
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201180060199.1A
Other languages
English (en)
Chinese (zh)
Other versions
CN103608776A (zh
Inventor
本杰明·托马斯·桑德
迈克尔·休斯顿
牛顿·张
基思·洛韦里
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc filed Critical Advanced Micro Devices Inc
Publication of CN103608776A publication Critical patent/CN103608776A/zh
Application granted granted Critical
Publication of CN103608776B publication Critical patent/CN103608776B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5044Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering hardware capabilities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Advance Control (AREA)
  • Image Processing (AREA)
  • Image Generation (AREA)
  • Multi Processors (AREA)
CN201180060199.1A 2010-12-15 2011-12-09 异构型处理设备上的动态工作划分 Active CN103608776B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US42346510P 2010-12-15 2010-12-15
US61/423,465 2010-12-15
US13/287,418 US9645854B2 (en) 2010-12-15 2011-11-02 Dynamic work partitioning on heterogeneous processing devices
US13/287,418 2011-11-02
PCT/US2011/064172 WO2012082557A2 (en) 2010-12-15 2011-12-09 Dynamic work partitioning on heterogeneous processing devices

Publications (2)

Publication Number Publication Date
CN103608776A CN103608776A (zh) 2014-02-26
CN103608776B true CN103608776B (zh) 2017-09-05

Family

ID=46245295

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201180060199.1A Active CN103608776B (zh) 2010-12-15 2011-12-09 异构型处理设备上的动态工作划分

Country Status (6)

Country Link
US (1) US9645854B2 (enExample)
EP (1) EP2652617B1 (enExample)
JP (1) JP6373586B2 (enExample)
KR (1) KR101961396B1 (enExample)
CN (1) CN103608776B (enExample)
WO (1) WO2012082557A2 (enExample)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012052774A2 (en) * 2010-10-21 2012-04-26 Bluwireless Technology Limited Data processing units
US8789065B2 (en) 2012-06-08 2014-07-22 Throughputer, Inc. System and method for input data load adaptive parallel processing
US9448847B2 (en) 2011-07-15 2016-09-20 Throughputer, Inc. Concurrent program execution optimization
US20130141446A1 (en) * 2011-12-06 2013-06-06 Advanced Micro Devices, Inc. Method and Apparatus for Servicing Page Fault Exceptions
US20130145202A1 (en) * 2011-12-06 2013-06-06 Advanced Micro Devices, Inc. Handling Virtual-to-Physical Address Translation Failures
US8842122B2 (en) * 2011-12-15 2014-09-23 Qualcomm Incorporated Graphics processing unit with command processor
US9146777B2 (en) 2013-01-25 2015-09-29 Swarm Technology Llc Parallel processing with solidarity cells by proactively retrieving from a task pool a matching task for the solidarity cell to process
WO2014143067A1 (en) 2013-03-15 2014-09-18 Intel Corporation Work stealing in heterogeneous computing systems
US10360652B2 (en) * 2014-06-13 2019-07-23 Advanced Micro Devices, Inc. Wavefront resource virtualization
US9959142B2 (en) * 2014-06-17 2018-05-01 Mediatek Inc. Dynamic task scheduling method for dispatching sub-tasks to computing devices of heterogeneous computing system and related computer readable medium
US9678806B2 (en) * 2015-06-26 2017-06-13 Advanced Micro Devices, Inc. Method and apparatus for distributing processing core workloads among processing cores
US9703605B2 (en) * 2015-09-04 2017-07-11 Mediatek, Inc. Fine-grained heterogeneous computing
US10528613B2 (en) * 2015-11-23 2020-01-07 Advanced Micro Devices, Inc. Method and apparatus for performing a parallel search operation
US10223436B2 (en) * 2016-04-27 2019-03-05 Qualcomm Incorporated Inter-subgroup data sharing
US10725667B2 (en) 2017-01-19 2020-07-28 Seoul National University R&Db Foundation Method of transferring data in parallel system, and parallel system for performing the same
KR102066212B1 (ko) * 2017-01-19 2020-01-14 서울대학교산학협력단 병렬 시스템에서의 데이터 복사 방법 및 이를 수행하기 위한 병렬 시스템
US10990436B2 (en) * 2018-01-24 2021-04-27 Dell Products L.P. System and method to handle I/O page faults in an I/O memory management unit
US10908940B1 (en) * 2018-02-26 2021-02-02 Amazon Technologies, Inc. Dynamically managed virtual server system
US11720408B2 (en) * 2018-05-08 2023-08-08 Vmware, Inc. Method and system for assigning a virtual machine in virtual GPU enabled systems
US10963300B2 (en) * 2018-12-06 2021-03-30 Raytheon Company Accelerating dataflow signal processing applications across heterogeneous CPU/GPU systems
US11340942B2 (en) 2020-03-19 2022-05-24 Raytheon Company Cooperative work-stealing scheduler
KR102441045B1 (ko) 2020-12-14 2022-09-05 현대오토에버 주식회사 멀티 코어 구조의 전자 제어 유닛에서 수행되는 방법, 그리고 이를 구현하기 위한 장치
CN113535366B (zh) * 2021-08-31 2025-07-25 知见科技(江苏)有限公司 一种高性能分布式结合的多路视频实时处理方法

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5826081A (en) * 1996-05-06 1998-10-20 Sun Microsystems, Inc. Real time thread dispatcher for multiprocessor applications
US7233998B2 (en) * 2001-03-22 2007-06-19 Sony Computer Entertainment Inc. Computer architecture and software cells for broadband networks
JP4384828B2 (ja) 2001-11-22 2009-12-16 ユニヴァーシティ オブ ワシントン コプロセッサ装置およびデータ転送を容易にするための方法
US7159221B1 (en) 2002-08-30 2007-01-02 Unisys Corporation Computer OS dispatcher operation with user controllable dedication
US7167916B2 (en) * 2002-08-30 2007-01-23 Unisys Corporation Computer OS dispatcher operation with virtual switching queue and IP queues
US7015915B1 (en) * 2003-08-12 2006-03-21 Nvidia Corporation Programming multiple chips from a command buffer
US7650601B2 (en) * 2003-12-04 2010-01-19 International Business Machines Corporation Operating system kernel-assisted, self-balanced, access-protected library framework in a run-to-completion multi-processor environment
US7898545B1 (en) * 2004-12-14 2011-03-01 Nvidia Corporation Apparatus, system, and method for integrated heterogeneous processors
US8037474B2 (en) * 2005-09-27 2011-10-11 Sony Computer Entertainment Inc. Task manager with stored task definition having pointer to a memory address containing required code data related to the task for execution
US8149242B2 (en) * 2006-11-10 2012-04-03 Sony Computer Entertainment Inc. Graphics processing apparatus, graphics library module and graphics processing method
US8286196B2 (en) 2007-05-03 2012-10-09 Apple Inc. Parallel runtime execution on multiple processors
CN101706741B (zh) 2009-12-11 2012-10-24 中国人民解放军国防科学技术大学 一种基于负载平衡的cpu和gpu两级动态任务划分方法
US8819690B2 (en) * 2009-12-30 2014-08-26 International Business Machines Corporation System for reducing data transfer latency to a global queue by generating bit mask to identify selected processing nodes/units in multi-node data processing system

Also Published As

Publication number Publication date
WO2012082557A2 (en) 2012-06-21
EP2652617A4 (en) 2017-06-14
EP2652617B1 (en) 2019-10-09
US9645854B2 (en) 2017-05-09
JP6373586B2 (ja) 2018-08-15
JP2014508982A (ja) 2014-04-10
CN103608776A (zh) 2014-02-26
EP2652617A2 (en) 2013-10-23
WO2012082557A3 (en) 2013-12-27
KR20130127480A (ko) 2013-11-22
KR101961396B1 (ko) 2019-03-22
US20120192201A1 (en) 2012-07-26

Similar Documents

Publication Publication Date Title
CN103608776B (zh) 异构型处理设备上的动态工作划分
CN103262002B (zh) 优化系统调用请求通信
CN103262038B (zh) 图形计算进程调度
US8667201B2 (en) Computer system interrupt handling
EP2652614B1 (en) Graphics processing dispatch from user mode
US20120229481A1 (en) Accessibility of graphics processing compute resources
US10146575B2 (en) Heterogeneous enqueuing and dequeuing mechanism for task scheduling
US20120194526A1 (en) Task Scheduling
EP2663926B1 (en) Computer system interrupt handling
US20120194525A1 (en) Managed Task Scheduling on a Graphics Processing Device (APD)
US20130155074A1 (en) Syscall mechanism for processor to processor calls
US20130155079A1 (en) Saving and Restoring Shader Context State

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant