JP2013546105A5 - Google Patents

Publication number
JP2013546105A5
Authority
JP
Japan
Prior art keywords
simd
system call
work item
call request
wavefront
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2013544736A
Other languages
English (en)
Japanese (ja)
Other versions
JP6228459B2 (ja)
JP2013546105A (ja)
Filing date
Publication date
Priority claimed from US13/307,505 (US8752064B2)
Application filed
Publication of JP2013546105A
Publication of JP2013546105A5
Application granted
Publication of JP6228459B2
Active (current legal status)
Anticipated expiration

JP2013544736A 2010-12-14 2011-12-14 Optimizing communication of system call requests Active JP6228459B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US42295310P 2010-12-14 2010-12-14
US61/422,953 2010-12-14
US13/307,505 US8752064B2 (en) 2010-12-14 2011-11-30 Optimizing communication of system call requests
US13/307,505 2011-11-30
PCT/US2011/064859 WO2012082867A1 (en) 2010-12-14 2011-12-14 Optimizing communication of system call requests

Publications (3)

Publication Number Publication Date
JP2013546105A (ja) 2013-12-26
JP2013546105A5 2015-02-12
JP6228459B2 (ja) 2017-11-08

Family

ID=46245087

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2013544736A Active JP6228459B2 (ja) 2010-12-14 2011-12-14 Optimizing communication of system call requests

Country Status (6)

Country Link
US (1) US8752064B2
EP (1) EP2652575A4
JP (1) JP6228459B2
KR (1) KR101788267B1
CN (1) CN103262002B
WO (1) WO2012082867A1

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9513975B2 (en) * 2012-05-02 2016-12-06 Nvidia Corporation Technique for computational nested parallelism
US9038075B2 (en) * 2012-11-26 2015-05-19 Red Hat, Inc. Batch execution of system calls in an operating system
US10235732B2 (en) 2013-12-27 2019-03-19 Intel Corporation Scheduling and dispatch of GPGPU workloads
US11126559B2 (en) 2013-12-30 2021-09-21 Michael Henry Kass Translation look-aside buffer and prefetch indicator
US10216632B2 (en) 2013-12-30 2019-02-26 Michael Henry Kass Memory system cache eviction policies
US10002080B2 (en) * 2013-12-30 2018-06-19 Michael Henry Kass Memory system address modification policies
US10521390B2 (en) * 2016-11-17 2019-12-31 The United States Of America As Represented By The Secretary Of The Air Force Systems and method for mapping FIFOs to processor address space
US11093251B2 (en) 2017-10-31 2021-08-17 Micron Technology, Inc. System having a hybrid threading processor, a hybrid threading fabric having configurable computing elements, and a hybrid interconnection network
US11068305B2 (en) 2018-05-07 2021-07-20 Micron Technology, Inc. System call management in a user-mode, multi-threaded, self-scheduling processor
US11157286B2 (en) 2018-05-07 2021-10-26 Micron Technology, Inc. Non-cached loads and stores in a system having a multi-threaded, self-scheduling processor
US11132233B2 (en) 2018-05-07 2021-09-28 Micron Technology, Inc. Thread priority management in a multi-threaded, self-scheduling processor
US11126587B2 (en) 2018-05-07 2021-09-21 Micron Technology, Inc. Event messaging in a system having a self-scheduling processor and a hybrid threading fabric
US11074078B2 (en) 2018-05-07 2021-07-27 Micron Technology, Inc. Adjustment of load access size by a multi-threaded, self-scheduling processor to manage network congestion
US11513838B2 (en) 2018-05-07 2022-11-29 Micron Technology, Inc. Thread state monitoring in a system having a multi-threaded, self-scheduling processor
US11513837B2 (en) 2018-05-07 2022-11-29 Micron Technology, Inc. Thread commencement and completion using work descriptor packets in a system having a self-scheduling processor and a hybrid threading fabric
US11513840B2 (en) 2018-05-07 2022-11-29 Micron Technology, Inc. Thread creation on local or remote compute elements by a multi-threaded, self-scheduling processor
US11119782B2 (en) 2018-05-07 2021-09-14 Micron Technology, Inc. Thread commencement using a work descriptor packet in a self-scheduling processor
US11119972B2 (en) 2018-05-07 2021-09-14 Micron Technology, Inc. Multi-threaded, self-scheduling processor
US11513839B2 (en) 2018-05-07 2022-11-29 Micron Technology, Inc. Memory request size management in a multi-threaded, self-scheduling processor
CN110716750B (zh) * 2018-07-11 2025-05-30 Advanced Micro Devices, Inc. Method and system for partial wavefront merging
US11250107B2 (en) * 2019-07-15 2022-02-15 International Business Machines Corporation Method for interfacing with hardware accelerators
CN112230931B (zh) * 2020-10-22 2021-11-02 上海壁仞智能科技有限公司 Compiling method, device, and medium suitable for secondary offloading of a graphics processor

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001063416A1 (en) * 2000-02-24 2001-08-30 Bops Incorporated Methods and apparatus for scalable array processor interrupt detection and response
EP1880274A2 (en) 2005-04-22 2008-01-23 Altrix Logic, Inc. Array of data processing elements with variable precision interconnect
CN101446909B (zh) * 2007-11-30 2011-12-28 International Business Machines Corporation Method and system for managing task events
US8106914B2 (en) * 2007-12-07 2012-01-31 Nvidia Corporation Fused multiply-add functional unit
US8312254B2 (en) * 2008-03-24 2012-11-13 Nvidia Corporation Indirect function call instructions in a synchronous parallel thread processor
KR101474478B1 (ko) 2008-05-30 2014-12-19 Advanced Micro Devices, Inc. Local and global data sharing
US8904366B2 (en) 2009-05-15 2014-12-02 International Business Machines Corporation Use of vectorization instruction sets
US9195487B2 (en) * 2009-05-19 2015-11-24 Vmware, Inc. Interposition method suitable for hardware-assisted virtual machine
US8661435B2 (en) * 2010-09-21 2014-02-25 Unisys Corporation System and method for affinity dispatching for task management in an emulated multiprocessor environment
US8725989B2 (en) * 2010-12-09 2014-05-13 Intel Corporation Performing function calls using single instruction multiple data (SIMD) registers

Similar Documents

Publication Publication Date Title
JP2013546105A5
EP3451162B1 (en) Device and method for use in executing matrix multiplication operations
JP7561925B2 (ja) Dedicated neural network training chip
US11740935B2 (en) FPGA acceleration for serverless computing
JP7092801B2 (ja) Continuation analysis tasks for GPU task scheduling
EP3242210B1 (en) Work stealing in heterogeneous computing systems
JP6640243B2 (ja) Batch processing in a neural network processor
JP6464284B2 (ja) Method and apparatus for storing high-concurrency data
JP2009535702A5
CN110494848A (zh) Task processing method, device, and machine-readable storage medium
WO2016167980A3 (en) Virtual machine systems
EP3217406B1 (en) Memory management method and device, and memory controller
JP7377869B2 (ja) Pipelined matrix multiplication on a graphics processing unit
CN114930292A (zh) Cooperative work-stealing scheduler
CN111651203A (zh) Apparatus and method for performing the four basic vector arithmetic operations
KR20130080663A (ko) Method and apparatus for graphics processing using multi-threading
RU2014109364A (ru) Efficient provision of data from a virtualized data source
JP2014503899A5
EP2801913A1 (en) Memory control apparatus and method
CN106708473B (zh) Multi-warp instruction fetch circuit for a unified shader array
JPWO2022212383A5
Jihun et al. Real-Time GPU Task Monitoring and Node List Management Techniques for Container Deployment in a Cluster-Based Container Environment
KR20160144688A (ko) SMP virtual machine event router and method using a queue
Flajslik et al. On the Fence: An Offload Approach to Ordering One-Sided Communication
CN115878184A (zh) Method, storage medium, and device for moving multiple data based on one instruction