KR102592330B1 - OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치 - Google Patents

OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치 Download PDF

Info

Publication number
KR102592330B1
KR102592330B1 KR1020160180133A KR20160180133A KR102592330B1 KR 102592330 B1 KR102592330 B1 KR 102592330B1 KR 1020160180133 A KR1020160180133 A KR 1020160180133A KR 20160180133 A KR20160180133 A KR 20160180133A KR 102592330 B1 KR102592330 B1 KR 102592330B1
Authority
KR
South Korea
Prior art keywords
group
processing
core
control core
work
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020160180133A
Other languages
English (en)
Korean (ko)
Other versions
KR20180076051A (ko
Inventor
이강웅
오수림
조영현
유동훈
Original Assignee
삼성전자주식회사
서울대학교산학협력단
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 삼성전자주식회사, 서울대학교산학협력단 filed Critical 삼성전자주식회사
Priority to KR1020160180133A priority Critical patent/KR102592330B1/ko
Priority to US15/787,219 priority patent/US10503557B2/en
Priority to EP17202801.1A priority patent/EP3343370A1/en
Priority to CN201711216401.0A priority patent/CN108241508B/zh
Priority to JP2017238434A priority patent/JP6951962B2/ja
Publication of KR20180076051A publication Critical patent/KR20180076051A/ko
Application granted granted Critical
Publication of KR102592330B1 publication Critical patent/KR102592330B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5044Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering hardware capabilities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline or look ahead
    • G06F9/3836Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
    • G06F9/3851Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution from multiple instruction streams, e.g. multistreaming
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline or look ahead
    • G06F9/3885Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
    • G06F9/3887Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5066Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Stored Programmes (AREA)
  • Memory System Of A Hierarchy Structure (AREA)
KR1020160180133A 2016-12-27 2016-12-27 OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치 Active KR102592330B1 (ko)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020160180133A KR102592330B1 (ko) 2016-12-27 2016-12-27 OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치
US15/787,219 US10503557B2 (en) 2016-12-27 2017-10-18 Method of processing OpenCL kernel and computing device therefor
EP17202801.1A EP3343370A1 (en) 2016-12-27 2017-11-21 Method of processing opencl kernel and computing device therefor
CN201711216401.0A CN108241508B (zh) 2016-12-27 2017-11-28 处理OpenCL内核的方法及用于该方法的计算设备
JP2017238434A JP6951962B2 (ja) 2016-12-27 2017-12-13 OpenCLカーネルを処理する方法、及びそれを遂行するコンピューティング装置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020160180133A KR102592330B1 (ko) 2016-12-27 2016-12-27 OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치

Publications (2)

Publication Number Publication Date
KR20180076051A KR20180076051A (ko) 2018-07-05
KR102592330B1 true KR102592330B1 (ko) 2023-10-20

Family

ID=60473303

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020160180133A Active KR102592330B1 (ko) 2016-12-27 2016-12-27 OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치

Country Status (5)

Country Link
US (1) US10503557B2 (https=)
EP (1) EP3343370A1 (https=)
JP (1) JP6951962B2 (https=)
KR (1) KR102592330B1 (https=)
CN (1) CN108241508B (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111490946B (zh) * 2019-01-28 2023-08-11 阿里巴巴集团控股有限公司 基于OpenCL框架的FPGA连接实现方法及装置
KR20240136005A (ko) 2023-03-06 2024-09-13 한국전자통신연구원 병렬연산작업 오프로딩 장치 및 방법

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030195938A1 (en) 2000-06-26 2003-10-16 Howard Kevin David Parallel processing systems and method
US20090307699A1 (en) 2008-06-06 2009-12-10 Munshi Aaftab A Application programming interfaces for data parallel computing on multiple processors
KR101284195B1 (ko) 2012-01-09 2013-07-10 서울대학교산학협력단 개방형 범용 병렬 컴퓨팅 프레임워크 동적 작업 분배 장치
US20160147516A1 (en) 2014-11-24 2016-05-26 Mentor Graphics Corporation Execution of complex recursive algorithms

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS595755B2 (ja) 1977-08-02 1984-02-07 日本軽金属株式会社 太陽熱利用暖房方法
US7673011B2 (en) 2007-08-10 2010-03-02 International Business Machines Corporation Configuring compute nodes of a parallel computer in an operational group into a plurality of independent non-overlapping collective networks
US8255345B2 (en) 2009-05-15 2012-08-28 The Aerospace Corporation Systems and methods for parallel processing with infeasibility checking mechanism
US9354944B2 (en) * 2009-07-27 2016-05-31 Advanced Micro Devices, Inc. Mapping processing logic having data-parallel threads across processors
WO2011027382A1 (en) 2009-09-01 2011-03-10 Hitachi, Ltd. Request processing system provided with multi-core processor
KR101640848B1 (ko) 2009-12-28 2016-07-29 삼성전자주식회사 멀티코어 시스템 상에서 단위 작업을 할당하는 방법 및 그 장치
KR101613971B1 (ko) 2009-12-30 2016-04-21 삼성전자주식회사 프로그램 코드의 변환 방법
JP2012094072A (ja) 2010-10-28 2012-05-17 Toyota Motor Corp 情報処理装置
US8683243B2 (en) * 2011-03-11 2014-03-25 Intel Corporation Dynamic core selection for heterogeneous multi-core systems
WO2012157786A1 (ja) * 2011-05-19 2012-11-22 日本電気株式会社 並列処理装置、並列処理方法、最適化装置、最適化方法、および、コンピュータ・プログラム
US9092267B2 (en) * 2011-06-20 2015-07-28 Qualcomm Incorporated Memory sharing in graphics processing unit
US20120331278A1 (en) 2011-06-23 2012-12-27 Mauricio Breternitz Branch removal by data shuffling
US20130141443A1 (en) * 2011-12-01 2013-06-06 Michael L. Schmit Software libraries for heterogeneous parallel processing platforms
JP5238876B2 (ja) * 2011-12-27 2013-07-17 株式会社東芝 情報処理装置及び情報処理方法
KR20130093995A (ko) 2012-02-15 2013-08-23 한국전자통신연구원 계층적 멀티코어 프로세서의 성능 최적화 방법 및 이를 수행하는 멀티코어 프로세서 시스템
KR20140125893A (ko) 2013-01-28 2014-10-30 한국과학기술원 가상화된 매니코어 서버의 작업분배 시스템과 그 방법 및 기록매체
KR102062208B1 (ko) * 2013-05-03 2020-02-11 삼성전자주식회사 멀티스레드 프로그램 코드의 변환 장치 및 방법
JP6200824B2 (ja) * 2014-02-10 2017-09-20 ルネサスエレクトロニクス株式会社 演算制御装置及び演算制御方法並びにプログラム、OpenCLデバイス
CN104035751B (zh) * 2014-06-20 2016-10-12 深圳市腾讯计算机系统有限公司 基于多图形处理器的数据并行处理方法及装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030195938A1 (en) 2000-06-26 2003-10-16 Howard Kevin David Parallel processing systems and method
US20090307699A1 (en) 2008-06-06 2009-12-10 Munshi Aaftab A Application programming interfaces for data parallel computing on multiple processors
KR101284195B1 (ko) 2012-01-09 2013-07-10 서울대학교산학협력단 개방형 범용 병렬 컴퓨팅 프레임워크 동적 작업 분배 장치
US20160147516A1 (en) 2014-11-24 2016-05-26 Mentor Graphics Corporation Execution of complex recursive algorithms

Also Published As

Publication number Publication date
KR20180076051A (ko) 2018-07-05
JP6951962B2 (ja) 2021-10-20
CN108241508B (zh) 2023-06-13
EP3343370A1 (en) 2018-07-04
JP2018106709A (ja) 2018-07-05
US20180181443A1 (en) 2018-06-28
US10503557B2 (en) 2019-12-10
CN108241508A (zh) 2018-07-03

Similar Documents

Publication Publication Date Title
US10937125B2 (en) Resource-utilization-based workload re-allocation system
US10942716B1 (en) Dynamic computational acceleration using a heterogeneous hardware infrastructure
RU2571366C2 (ru) Виртуальная архитектура неоднородного доступа к памяти для виртуальных машин
US9483319B2 (en) Job scheduling apparatus and method therefor
TWI525540B (zh) 具有橫跨多個處理器之平行資料執行緒的映射處理邏輯
US9244629B2 (en) Method and system for asymmetrical processing with managed data affinity
US10025503B2 (en) Autonomous dynamic optimization of platform resources
US8413158B2 (en) Processor thread load balancing manager
JP2020537784A (ja) ニューラルネットワークアクセラレーションのための機械学習ランタイムライブラリ
KR102860332B1 (ko) 가속기, 가속기의 동작 방법 및 이를 포함한 가속기 시스템
US20100229175A1 (en) Moving Resources In a Computing Environment Having Multiple Logically-Partitioned Computer Systems
JP2004220608A (ja) スレッド型に基づくコンピュータ・リソースの動的割り付け
KR20160027541A (ko) 멀티-코어 프로세서를 포함하는 시스템 온 칩 및 그것의 쓰레드 스케줄링 방법
US20110161969A1 (en) Consolidating CPU - Cache - Memory Access Usage Metrics
KR20220049294A (ko) 스케줄러, 스케줄러의 동작 방법 및 이를 포함한 전자 장치
CN104156316B (zh) 一种Hadoop集群批处理作业的方法及系统
KR20190057558A (ko) 멀티 코어 제어 시스템
KR102592330B1 (ko) OpenCL 커널을 처리하는 방법과 이를 수행하는 컴퓨팅 장치
US10223260B2 (en) Compiler-generated memory mapping hints
KR101535792B1 (ko) 운영체제 구성 장치 및 방법
US8914778B2 (en) Data placement for execution of an executable
JP7397179B2 (ja) 階層化オブジェクトメモリ配置のためのランタイム装置の管理
Acevedo et al. A Critical Path File Location (CPFL) algorithm for data-aware multiworkflow scheduling on HPC clusters
US9032405B2 (en) Systems and method for assigning executable functions to available processors in a multiprocessing environment
KR101755154B1 (ko) 이종 연산 처리 장치에 대한 동적 작업 할당 방법 및 장치

Legal Events

Date Code Title Description
PA0109 Patent application

St.27 status event code: A-0-1-A10-A12-nap-PA0109

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

R15-X000 Change to inventor requested

St.27 status event code: A-3-3-R10-R15-oth-X000

R16-X000 Change to inventor recorded

St.27 status event code: A-3-3-R10-R16-oth-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

R15-X000 Change to inventor requested

St.27 status event code: A-3-3-R10-R15-oth-X000

R16-X000 Change to inventor recorded

St.27 status event code: A-3-3-R10-R16-oth-X000

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

PN2301 Change of applicant

St.27 status event code: A-3-3-R10-R13-asn-PN2301

St.27 status event code: A-3-3-R10-R11-asn-PN2301

PN2301 Change of applicant

St.27 status event code: A-3-3-R10-R13-asn-PN2301

St.27 status event code: A-3-3-R10-R11-asn-PN2301

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

A201 Request for examination
E13-X000 Pre-grant limitation requested

St.27 status event code: A-2-3-E10-E13-lim-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000

D13-X000 Search requested

St.27 status event code: A-1-2-D10-D13-srh-X000

D14-X000 Search report completed

St.27 status event code: A-1-2-D10-D14-srh-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U11-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000

R18 Changes to party contact information recorded

Free format text: ST27 STATUS EVENT CODE: A-5-5-R10-R18-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000