CN104395890B - 使用异构处理器为应用程序提供低潜伏时间的系统和方法 - Google Patents

使用异构处理器为应用程序提供低潜伏时间的系统和方法 Download PDF

Info

Publication number
CN104395890B
CN104395890B CN201380033791.1A CN201380033791A CN104395890B CN 104395890 B CN104395890 B CN 104395890B CN 201380033791 A CN201380033791 A CN 201380033791A CN 104395890 B CN104395890 B CN 104395890B
Authority
CN
China
Prior art keywords
memory
value
data
requests
processors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380033791.1A
Other languages
English (en)
Chinese (zh)
Other versions
CN104395890A (zh
Inventor
亚历山大·洛希夫斯基
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced Micro Devices Inc
Original Assignee
Advanced Micro Devices Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced Micro Devices Inc filed Critical Advanced Micro Devices Inc
Publication of CN104395890A publication Critical patent/CN104395890A/zh
Application granted granted Critical
Publication of CN104395890B publication Critical patent/CN104395890B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5033Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering data affinity
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/24569Query processing with adaptation to specific hardware, e.g. adapted for using GPUs or SSDs
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9014Indexing; Data structures therefor; Storage structures hash tables
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09GARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G5/00Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators
    • G09G5/36Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators characterised by the display of a graphic pattern, e.g. using an all-points-addressable [APA] memory
    • G09G5/363Graphics controllers
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/005General purpose rendering architectures

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Computer Graphics (AREA)
  • Computer Hardware Design (AREA)
  • Memory System Of A Hierarchy Structure (AREA)
  • Multi Processors (AREA)
  • Retry When Errors Occur (AREA)
CN201380033791.1A 2012-06-08 2013-06-07 使用异构处理器为应用程序提供低潜伏时间的系统和方法 Active CN104395890B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261657404P 2012-06-08 2012-06-08
US61/657,404 2012-06-08
PCT/US2013/044682 WO2013185015A2 (en) 2012-06-08 2013-06-07 System and method for providing low latency to applications using heterogeneous processors

Publications (2)

Publication Number Publication Date
CN104395890A CN104395890A (zh) 2015-03-04
CN104395890B true CN104395890B (zh) 2018-12-07

Family

ID=48670839

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380033791.1A Active CN104395890B (zh) 2012-06-08 2013-06-07 使用异构处理器为应用程序提供低潜伏时间的系统和方法

Country Status (6)

Country Link
US (1) US9495718B2 (enExample)
EP (1) EP2859448A2 (enExample)
JP (1) JP6170553B2 (enExample)
KR (1) KR102086019B1 (enExample)
CN (1) CN104395890B (enExample)
WO (1) WO2013185015A2 (enExample)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106164839B (zh) * 2014-02-04 2019-10-22 触觉实验室股份有限公司 以减小的等待时间提供对输入的视觉响应的方法
US9342384B1 (en) * 2014-12-18 2016-05-17 Intel Corporation Function callback mechanism between a central processing unit (CPU) and an auxiliary processor
US9711194B2 (en) * 2015-01-28 2017-07-18 Xilinx, Inc. Circuits for and methods of controlling the operation of a hybrid memory system
KR102352756B1 (ko) 2015-04-29 2022-01-17 삼성전자주식회사 애플리케이션 프로세서, 시스템 온 칩, 및 이를 포함하는 컴퓨팅 장치
KR101923210B1 (ko) * 2016-01-14 2018-11-28 서울대학교산학협력단 이종 멀티코어 프로세서를 활용한 암호화 처리 장치 및 암호화 처리 방법
US11513805B2 (en) * 2016-08-19 2022-11-29 Wisconsin Alumni Research Foundation Computer architecture with synergistic heterogeneous processors
US10929944B2 (en) 2016-11-23 2021-02-23 Advanced Micro Devices, Inc. Low power and low latency GPU coprocessor for persistent computing
US10795840B2 (en) 2018-11-12 2020-10-06 At&T Intellectual Property I, L.P. Persistent kernel for graphics processing unit direct memory access network packet processing
CN111447561B (zh) * 2020-03-16 2023-04-18 阿波罗智联(北京)科技有限公司 车辆的图像处理系统

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1777918A (zh) * 2003-02-18 2006-05-24 舍弗塞德集团有限公司 处理图像的设备及方法
US7554959B1 (en) * 1999-12-02 2009-06-30 Cisco Technology, Inc. Apparatus and method for cluster network device discovery
CN102156658A (zh) * 2010-02-26 2011-08-17 微软公司 对象的低等待时间呈现

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0285927A (ja) * 1988-09-22 1990-03-27 Hitachi Vlsi Eng Corp 記憶装置
FR2767939B1 (fr) * 1997-09-04 2001-11-02 Bull Sa Procede d'allocation de memoire dans un systeme de traitement de l'information multiprocesseur
US8484647B2 (en) * 2009-07-24 2013-07-09 Apple Inc. Selectively adjusting CPU wait mode based on estimation of remaining work before task completion on GPU
US8400458B2 (en) * 2009-09-09 2013-03-19 Hewlett-Packard Development Company, L.P. Method and system for blocking data on a GPU
US9645866B2 (en) * 2010-09-20 2017-05-09 Qualcomm Incorporated Inter-processor communication techniques in a multiple-processor computing platform
EP2652634A1 (en) * 2010-12-16 2013-10-23 Et International Inc. Distributed computing architecture

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7554959B1 (en) * 1999-12-02 2009-06-30 Cisco Technology, Inc. Apparatus and method for cluster network device discovery
CN1777918A (zh) * 2003-02-18 2006-05-24 舍弗塞德集团有限公司 处理图像的设备及方法
CN102156658A (zh) * 2010-02-26 2011-08-17 微软公司 对象的低等待时间呈现

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
GPU-to-CPU Callbacks;Jeff A.Stuart,Michael Cox,John D.Owens;《Euro-Par 2010 Parallel Processing Workshops》;20100930;摘要,正文第1部分,第3部分,第4部分,图1 *

Also Published As

Publication number Publication date
KR102086019B1 (ko) 2020-04-14
US20130328891A1 (en) 2013-12-12
WO2013185015A3 (en) 2014-01-30
CN104395890A (zh) 2015-03-04
JP6170553B2 (ja) 2017-07-26
WO2013185015A2 (en) 2013-12-12
EP2859448A2 (en) 2015-04-15
JP2015522878A (ja) 2015-08-06
KR20150024884A (ko) 2015-03-09
US9495718B2 (en) 2016-11-15

Similar Documents

Publication Publication Date Title
CN104395890B (zh) 使用异构处理器为应用程序提供低潜伏时间的系统和方法
JP6961686B2 (ja) トリガ動作を用いたgpuリモート通信
US10788992B2 (en) System and method for efficient access for remote storage devices
US7953915B2 (en) Interrupt dispatching method in multi-core environment and multi-core processor
CN110178118B (zh) 硬件实现的负载平衡
CN108694087A (zh) 用于最优系统级性能的网络接口卡中的动态负载均衡
CN105933408B (zh) 一种Redis通用中间件的实现方法及装置
US20160026605A1 (en) Registrationless transmit onload rdma
JP6308508B2 (ja) メムキャッシュドシステムのクライアント装置およびサーバ装置、データをキャッシュするメモリのための方法、コンピュータプログラム、並びにコンピュータ可読ストレージ媒体
US20110296437A1 (en) Method and apparatus for lockless communication between cores in a multi-core processor
TW202008172A (zh) 儲存系統
JP2017527027A (ja) ヘテロジニアスプロセッサシステムにおけるキャッシュ間のデータ移動
JP2018088041A (ja) 接続数制御プログラム、振り分け装置および接続数制御方法
JP6899907B2 (ja) データベースバウンドアプリケーション用にユーザインターフェースバックエンドクラスタをスケーリングするための技術
US20250060912A1 (en) Method of submitting work to fabric attached memory
US8327380B2 (en) Method and interprocess communication driver for managing requests of a database client to a database server
US20210297510A1 (en) Efficient packet processing for express data paths
CN115933973B (zh) 远程更新数据的方法、rdma系统及存储介质
US10459776B2 (en) Transmission of large messages in computer systems
JP4089506B2 (ja) ファイル共有システム及びサーバー並びにプログラム
TWI857607B (zh) 用於儲存操作資料傳輸之處理器及用於改良其性能之方法及電腦程式產品
US11362969B2 (en) Efficient packet re-transmission for express data paths
WO2017087001A1 (en) Distributed data shuffling

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant