JP7028745B2 - Heterogeneous accelerator for highly efficient learning systems - Google Patents
Heterogeneous accelerator for highly efficient learning systems
- Publication number
- JP7028745B2
- Authority
- JP
- Japan
- Prior art keywords
- processing unit
- reprogrammable
- task
- memory
- arithmetic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F15/78—Architectures of general purpose stored program computers comprising a single central processing unit
- G06F15/7867—Architectures of general purpose stored program computers comprising a single central processing unit with reconfigurable architecture
- G06F15/7885—Runtime interface, e.g. data exchange, runtime control
- G06F15/7889—Reconfigurable logic implemented as a co-processor
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/20—Handling requests for interconnection or transfer for access to input/output bus
- G06F13/28—Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access DMA, cycle steal
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/445—Program loading or initiating
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored program computers
- G06F2015/761—Indexing scheme relating to architectures of general purpose stored programme computers
- G06F2015/768—Gate array
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Hardware Design (AREA)
- Advance Control (AREA)
- Microcomputers (AREA)
- Memory System (AREA)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201762558745P | 2017-09-14 | 2017-09-14 | |
| US62/558,745 | 2017-09-14 | | |
| US15/825,047 US10474600B2 (en) | 2017-09-14 | 2017-11-28 | Heterogeneous accelerator for highly efficient learning systems |
| US15/825,047 | 2017-11-28 | | |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2019053734A (ja) | 2019-04-04 |
| JP2019053734A5 | 2021-09-02 |
| JP7028745B2 (ja) | 2022-03-02 |
Family
ID=65631148
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2018171047A (Active), JP7028745B2 (ja) | 2017-09-14 | 2018-09-13 | Heterogeneous accelerator for highly efficient learning systems |
Country Status (5)
| Country | Link |
|---|---|
| US (4) | US10474600B2 |
| JP (1) | JP7028745B2 |
| KR (1) | KR102689910B1 |
| CN (1) | CN109508316B |
| TW (1) | TWI754752B |
Families Citing this family (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10474600B2 (en) * | 2017-09-14 | 2019-11-12 | Samsung Electronics Co., Ltd. | Heterogeneous accelerator for highly efficient learning systems |
| US11367707B2 (en) * | 2018-09-26 | 2022-06-21 | Intel Corporation | Semiconductor package or structure with dual-sided interposers and memory |
| CN109785224B (zh) * | 2019-01-29 | 2021-09-17 | 华中科技大学 | FPGA-based graph data processing method and system |
| US11211378B2 (en) * | 2019-07-18 | 2021-12-28 | International Business Machines Corporation | Heterogeneous integration structure for artificial intelligence computing |
| KR102147912B1 (ko) * | 2019-08-13 | 2020-08-25 | 삼성전자주식회사 | Processor chip and control methods thereof |
| KR102818456B1 (ko) * | 2019-09-23 | 2025-06-10 | 삼성전자주식회사 | Solid state drive device and method of manufacturing the same |
| KR102848819B1 (ko) | 2019-10-10 | 2025-08-22 | 삼성전자주식회사 | Semiconductor memory device employing PIM and operating method thereof |
| US11769043B2 (en) | 2019-10-25 | 2023-09-26 | Samsung Electronics Co., Ltd. | Batch size pipelined PIM accelerator for vision inference on multiple images |
| US12379933B2 (en) | 2019-10-25 | 2025-08-05 | Samsung Electronics Co., Ltd. | Ultra pipelined accelerator for machine learning inference |
| CN114787830A (zh) * | 2019-12-20 | 2022-07-22 | 惠普发展公司,有限责任合伙企业 | Machine learning workload orchestration in heterogeneous clusters |
| US11520501B2 (en) * | 2019-12-20 | 2022-12-06 | Intel Corporation | Automated learning technology to partition computer applications for heterogeneous systems |
| CN115398448A (zh) * | 2019-12-27 | 2022-11-25 | 美光科技公司 | Neuromorphic memory device and method |
| US11315611B2 (en) | 2020-01-07 | 2022-04-26 | SK Hynix Inc. | Processing-in-memory (PIM) system and operating methods of the PIM system |
| US11385837B2 (en) | 2020-01-07 | 2022-07-12 | SK Hynix Inc. | Memory system |
| TWI868210B (zh) | 2020-01-07 | 2025-01-01 | 韓商愛思開海力士有限公司 | Processing-in-memory (PIM) system |
| US11748100B2 (en) * | 2020-03-19 | 2023-09-05 | Micron Technology, Inc. | Processing in memory methods for convolutional operations |
| TWI811620B (zh) * | 2020-03-24 | 2023-08-11 | 威盛電子股份有限公司 | Computing apparatus and data processing method |
| US11941433B2 (en) | 2020-03-24 | 2024-03-26 | Via Technologies Inc. | Computing apparatus and data processing method for offloading data processing of data processing task from at least one general purpose processor |
| CN115335908A (zh) * | 2020-03-30 | 2022-11-11 | 拉姆伯斯公司 | Stacked-die neural network with integrated high-bandwidth memory |
| US12462186B2 (en) | 2020-05-29 | 2025-11-04 | Advanced Micro Devices, Inc. | Stacked dies for machine learning accelerator |
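| CN111813526A (zh) * | 2020-07-10 | 2020-10-23 | 深圳致星科技有限公司 | Heterogeneous processing system, processor, and task processing method for federated learning |
| KR20220032366A (ko) | 2020-09-07 | 2022-03-15 | 삼성전자주식회사 | Memory device performing variable mode setting and operating method thereof |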
| WO2022139835A1 (en) * | 2020-12-23 | 2022-06-30 | Futurewei Technologies, Inc. | Server architecture with configurable universal expansion slots |
| CN115469800A (zh) | 2021-06-10 | 2022-12-13 | 三星电子株式会社 | Data processing system and method for accessing a heterogeneous memory system |
| US12142596B2 (en) * | 2022-02-25 | 2024-11-12 | Nanya Technology Corporation | Semiconductor structure and manufacturing method thereof |
| KR102816234B1 (ko) * | 2022-12-22 | 2025-06-02 | 연세대학교 산학협력단 | CPU-PIM task distribution method |
| US12417047B2 (en) | 2023-01-10 | 2025-09-16 | Google Llc | Heterogeneous ML accelerator cluster with flexible system resource balance |
| US20250231877A1 (en) * | 2024-01-12 | 2025-07-17 | Micron Technology, Inc. | Cache memories in vertically integrated memory systems and associated systems and methods |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003347470A (ja) | 2002-05-24 | 2003-12-05 | Fujitsu Ltd | Method of manufacturing a semiconductor device |
| JP2010080802A (ja) | 2008-09-29 | 2010-04-08 | Hitachi Ltd | Semiconductor device |
| JP2015533009A (ja) | 2012-09-25 | 2015-11-16 | インテル・コーポレーション | 3D memory configurable for performance and power |
| WO2016209406A1 (en) | 2015-06-26 | 2016-12-29 | Advanced Micro Devices, Inc. | Computer architecture using rapidly reconfigurable circuits and high-bandwidth memory interfaces |
Family Cites Families (108)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4951193A (en) * | 1986-09-05 | 1990-08-21 | Hitachi, Ltd. | Parallel computer with distributed shared memories and distributed task activating circuits |
| US5893154A (en) * | 1993-07-06 | 1999-04-06 | Intel Corporation | CPU write-back cache coherency mechanism that transfers data from a cache memory to a main memory before access of the main memory by an alternate bus master |
| US5524208A (en) * | 1994-06-09 | 1996-06-04 | Dell Usa, L.P. | Method and apparatus for performing cache snoop testing using DMA cycles in a computer system |
| US5918248A (en) * | 1996-12-30 | 1999-06-29 | Northern Telecom Limited | Shared memory control algorithm for mutual exclusion and rollback |
| US20030216874A1 (en) * | 2002-03-29 | 2003-11-20 | Henry Manus P. | Drive techniques for a digital flowmeter |
| US7028299B1 (en) * | 2000-06-30 | 2006-04-11 | Intel Corporation | Task-based multiprocessing system |
| US7155602B2 (en) * | 2001-04-30 | 2006-12-26 | Src Computers, Inc. | Interface for integrating reconfigurable processors into a general purpose computing system |
| US6794273B2 (en) | 2002-05-24 | 2004-09-21 | Fujitsu Limited | Semiconductor device and manufacturing method thereof |
| US7254812B1 (en) * | 2002-05-31 | 2007-08-07 | Advanced Micro Devices, Inc. | Multi-processor task scheduling |
| US8108656B2 (en) * | 2002-08-29 | 2012-01-31 | Qst Holdings, Llc | Task definition for specifying resource requirements |
| EP1443417A1 (en) * | 2003-01-31 | 2004-08-04 | STMicroelectronics S.r.l. | A reconfigurable signal processor with embedded flash memory device |
| GB2409066B (en) * | 2003-12-09 | 2006-09-27 | Advanced Risc Mach Ltd | A data processing apparatus and method for moving data between registers and memory |
| US7614053B2 (en) * | 2004-02-20 | 2009-11-03 | Sony Computer Entertainment Inc. | Methods and apparatus for task management in a multi-processor system |
| US7506297B2 (en) * | 2004-06-15 | 2009-03-17 | University Of North Carolina At Charlotte | Methodology for scheduling, partitioning and mapping computational tasks onto scalable, high performance, hybrid FPGA networks |
| US7743376B2 (en) * | 2004-09-13 | 2010-06-22 | Broadcom Corporation | Method and apparatus for managing tasks in a multiprocessor system |
| TWI251171B (en) * | 2004-09-21 | 2006-03-11 | Univ Tsinghua | Task scheduling method with low power consumption and a SOC using the method |
| US20060090092A1 (en) * | 2004-10-25 | 2006-04-27 | Verhulst Anton H | Clock timing adjustment |
| US20070038814A1 (en) * | 2005-08-10 | 2007-02-15 | International Business Machines Corporation | Systems and methods for selectively inclusive cache |
| GB0519981D0 (en) * | 2005-09-30 | 2005-11-09 | Ignios Ltd | Scheduling in a multicore architecture |
| US8412872B1 (en) | 2005-12-12 | 2013-04-02 | Nvidia Corporation | Configurable GPU and method for graphics processing using a configurable GPU |
| JP5089896B2 (ja) * | 2006-03-17 | 2012-12-05 | 株式会社日立製作所 | Storage system with microprocessor load-balancing function |
| JP4934356B2 (ja) * | 2006-06-20 | 2012-05-16 | 株式会社日立製作所 | Video processing engine and video processing system including the same |
| US8806228B2 (en) * | 2006-07-13 | 2014-08-12 | International Business Machines Corporation | Systems and methods for asymmetrical performance multi-processors |
| GB2443277B (en) * | 2006-10-24 | 2011-05-18 | Advanced Risc Mach Ltd | Performing diagnostics operations upon an asymmetric multiprocessor apparatus |
| JP2008158806A (ja) * | 2006-12-22 | 2008-07-10 | Matsushita Electric Ind Co Ltd | Program for a processor having multiple processor elements, and method and apparatus for generating the program |
| CN101821717A (zh) * | 2007-10-18 | 2010-09-01 | Nxp股份有限公司 | Circuit and method using cache coherency stress test control |
| US20120191982A1 (en) * | 2007-12-06 | 2012-07-26 | Levin Timothy Evert | Non-volatile storage of encrypted data |
| US8296743B2 (en) | 2007-12-17 | 2012-10-23 | Intel Corporation | Compiler and runtime for heterogeneous multiprocessor systems |
| US8041852B1 (en) * | 2008-12-09 | 2011-10-18 | Calos Fund Limited Liability Company | System and method for using a shared buffer construct in performance of concurrent data-driven tasks |
| US7996564B2 (en) * | 2009-04-16 | 2011-08-09 | International Business Machines Corporation | Remote asynchronous data mover |
| US8310492B2 (en) * | 2009-09-03 | 2012-11-13 | Ati Technologies Ulc | Hardware-based scheduling of GPU work |
| US8307198B2 (en) * | 2009-11-24 | 2012-11-06 | Advanced Micro Devices, Inc. | Distributed multi-core memory initialization |
| US8874943B2 (en) | 2010-05-20 | 2014-10-28 | Nec Laboratories America, Inc. | Energy efficient heterogeneous systems |
| CN103080903B (zh) * | 2010-08-27 | 2016-07-06 | 富士通株式会社 | Scheduler, multi-core processor system, and scheduling method |
| US20140068625A1 (en) * | 2010-10-21 | 2014-03-06 | Paul Winser | Data processing systems |
| US8996644B2 (en) | 2010-12-09 | 2015-03-31 | Solarflare Communications, Inc. | Encapsulated accelerator |
| US8745626B1 (en) * | 2012-12-17 | 2014-06-03 | Throughputer, Inc. | Scheduling application instances to configurable processing cores based on application requirements and resource specification |
| US9329843B2 (en) | 2011-08-02 | 2016-05-03 | International Business Machines Corporation | Communication stack for software-hardware co-execution on heterogeneous computing systems with processors and reconfigurable logic (FPGAs) |
| US8990518B2 (en) | 2011-08-04 | 2015-03-24 | Arm Limited | Methods of and apparatus for storing data in memory in data processing systems |
| US9846673B2 (en) * | 2011-11-04 | 2017-12-19 | Waseda University | Processor, accelerator, and direct memory access controller within a processor core that each reads/writes a local synchronization flag area for parallel execution |
| US8745352B2 (en) * | 2011-12-30 | 2014-06-03 | Sybase, Inc. | Optimized approach to parallelize writing to a shared memory resource |
| WO2013177765A1 (en) * | 2012-05-30 | 2013-12-05 | Intel Corporation | Runtime dispatching among heterogeneous group of processors |
| US20140040532A1 (en) | 2012-08-06 | 2014-02-06 | Advanced Micro Devices, Inc. | Stacked memory device with helper processor |
| KR101915198B1 (ko) * | 2012-08-10 | 2018-11-05 | 한화테크윈 주식회사 | Inter-processor message processing apparatus and method |
| US9304730B2 (en) | 2012-08-23 | 2016-04-05 | Microsoft Technology Licensing, Llc | Direct communication between GPU and FPGA components |
| US8943505B2 (en) * | 2012-08-24 | 2015-01-27 | National Instruments Corporation | Hardware assisted real-time scheduler using memory monitoring |
| US9430282B1 (en) * | 2012-10-02 | 2016-08-30 | Marvell International, Ltd. | Scheduling multiple tasks in distributed computing system to avoid result writing conflicts |
| US8996781B2 (en) * | 2012-11-06 | 2015-03-31 | OCZ Storage Solutions Inc. | Integrated storage/processing devices, systems and methods for performing big data analytics |
| US9110778B2 (en) * | 2012-11-08 | 2015-08-18 | International Business Machines Corporation | Address generation in an active memory device |
| KR102002826B1 (ko) * | 2012-12-04 | 2019-07-23 | 삼성전자 주식회사 | Storage device, flash memory, and operating method of the storage device |
| US10079044B2 (en) * | 2012-12-20 | 2018-09-18 | Advanced Micro Devices, Inc. | Processor with host and slave operating modes stacked with memory |
| US9135185B2 (en) | 2012-12-23 | 2015-09-15 | Advanced Micro Devices, Inc. | Die-stacked memory device providing data translation |
| US9658977B2 (en) * | 2013-03-15 | 2017-05-23 | Micron Technology, Inc. | High speed, parallel configuration of multiple field programmable gate arrays |
| US9135062B2 (en) * | 2013-04-09 | 2015-09-15 | National Instruments Corporation | Hardware assisted method and system for scheduling time critical tasks |
| US20140344827A1 (en) * | 2013-05-16 | 2014-11-20 | Nvidia Corporation | System, method, and computer program product for scheduling a task to be performed by at least one processor core |
| US9244629B2 (en) * | 2013-06-25 | 2016-01-26 | Advanced Micro Devices, Inc. | Method and system for asymmetrical processing with managed data affinity |
| US9424079B2 (en) | 2013-06-27 | 2016-08-23 | Microsoft Technology Licensing, Llc | Iteration support in a heterogeneous dataflow engine |
| US9600346B2 (en) * | 2013-07-10 | 2017-03-21 | International Business Machines Corporation | Thread scheduling across heterogeneous processing elements with resource mapping |
| US9934043B2 (en) | 2013-08-08 | 2018-04-03 | Linear Algebra Technologies Limited | Apparatus, systems, and methods for providing computational imaging pipeline |
| DE102013224702A1 (de) * | 2013-12-03 | 2015-06-03 | Robert Bosch Gmbh | Control unit for a motor vehicle |
| US9880971B2 (en) * | 2013-12-20 | 2018-01-30 | Rambus Inc. | Memory appliance for accessing memory |
| KR102205836B1 (ko) * | 2014-01-29 | 2021-01-21 | 삼성전자 주식회사 | Task scheduling method and apparatus |
| WO2015115950A1 (en) * | 2014-01-31 | 2015-08-06 | Telefonaktiebolaget L M Ericsson (Publ) | Scheduling in cellular communication systems |
| US9444827B2 (en) * | 2014-02-15 | 2016-09-13 | Micron Technology, Inc. | Multi-function, modular system for network security, secure communication, and malware protection |
| US20170016933A1 (en) * | 2014-03-10 | 2017-01-19 | Openiolabs Ltd | Scanning ion conductance microscopy |
| KR101887797B1 (ko) * | 2014-05-08 | 2018-09-10 | 마이크론 테크놀로지, 인크. | In-memory lightweight coherency |
| US20150378782A1 (en) * | 2014-06-25 | 2015-12-31 | Unisys Corporation | Scheduling of tasks on idle processors without context switching |
| KR102237373B1 (ko) * | 2014-07-02 | 2021-04-07 | 삼성전자 주식회사 | Task scheduling method of an electronic device and electronic device using the same |
| US9785481B2 (en) * | 2014-07-24 | 2017-10-10 | Qualcomm Innovation Center, Inc. | Power aware task scheduling on multi-processor systems |
| US10691663B2 (en) * | 2014-09-16 | 2020-06-23 | Sap Se | Database table copy |
| US9947386B2 (en) * | 2014-09-21 | 2018-04-17 | Advanced Micro Devices, Inc. | Thermal aware data placement and compute dispatch in a memory system |
| US9424092B2 (en) * | 2014-09-26 | 2016-08-23 | Microsoft Technology Licensing, Llc | Heterogeneous thread scheduling |
| US9836277B2 (en) * | 2014-10-01 | 2017-12-05 | Samsung Electronics Co., Ltd. | In-memory popcount support for real time analytics |
| US9489136B2 (en) * | 2014-10-27 | 2016-11-08 | Facebook, Inc. | Interrupt driven memory signaling |
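| CN105900064B (zh) * | 2014-11-19 | 2019-05-03 | 华为技术有限公司 | Method and apparatus for scheduling data flow tasks |
| CN104615488B (zh) * | 2015-01-16 | 2018-01-19 | 华为技术有限公司 | Method and apparatus for task scheduling on a heterogeneous multi-core reconfigurable computing platform |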
| US10528443B2 (en) * | 2015-01-30 | 2020-01-07 | Samsung Electronics Co., Ltd. | Validation of multiprocessor hardware component |
| GB2536211B (en) * | 2015-03-04 | 2021-06-16 | Advanced Risc Mach Ltd | An apparatus and method for executing a plurality of threads |
| US9542248B2 (en) | 2015-03-24 | 2017-01-10 | International Business Machines Corporation | Dispatching function calls across accelerator devices |
| JP6588230B2 (ja) | 2015-05-12 | 2019-10-09 | 愛知株式会社 | Stowable table |
| US9983857B2 (en) * | 2015-06-16 | 2018-05-29 | Architecture Technology Corporation | Dynamic computational acceleration using a heterogeneous hardware infrastructure |
| GB2539455A (en) * | 2015-06-16 | 2016-12-21 | Nordic Semiconductor Asa | Memory watch unit |
| US10540588B2 (en) | 2015-06-29 | 2020-01-21 | Microsoft Technology Licensing, Llc | Deep neural network processing on hardware accelerators with stacked memory |
| JP6415405B2 (ja) * | 2015-07-31 | 2018-10-31 | 本田技研工業株式会社 | Task control system |
| US10387314B2 (en) * | 2015-08-25 | 2019-08-20 | Oracle International Corporation | Reducing cache coherence directory bandwidth by aggregating victimization requests |
| US10838818B2 (en) * | 2015-09-18 | 2020-11-17 | Hewlett Packard Enterprise Development Lp | Memory persistence from a volatile memory to a non-volatile memory |
| US10031765B2 (en) * | 2015-09-24 | 2018-07-24 | Intel Corporation | Instruction and logic for programmable fabric hierarchy and cache |
| US10977092B2 (en) * | 2015-10-16 | 2021-04-13 | Qualcomm Incorporated | Method for efficient task scheduling in the presence of conflicts |
| US11036509B2 (en) | 2015-11-03 | 2021-06-15 | Intel Corporation | Enabling removal and reconstruction of flag operations in a processor |
| US9996268B2 (en) * | 2015-12-18 | 2018-06-12 | Toshiba Memory Corporation | Memory system and control method of the same |
| US11550632B2 (en) * | 2015-12-24 | 2023-01-10 | Intel Corporation | Facilitating efficient communication and data processing across clusters of computing machines in heterogeneous computing environment |
| JP2017135698A (ja) * | 2015-12-29 | 2017-08-03 | 株式会社半導体エネルギー研究所 | Semiconductor device, computer, and electronic device |
| US11079936B2 (en) * | 2016-03-01 | 2021-08-03 | Samsung Electronics Co., Ltd. | 3-D stacked memory with reconfigurable compute logic |
| US9977609B2 (en) * | 2016-03-07 | 2018-05-22 | Advanced Micro Devices, Inc. | Efficient accesses of data structures using processing near memory |
| US10083068B2 (en) * | 2016-03-29 | 2018-09-25 | Microsoft Technology Licensing, Llc | Fast transfer of workload between multiple processors |
| CN106156851B (zh) * | 2016-06-24 | 2019-04-05 | 科大讯飞股份有限公司 | Acceleration apparatus and method for deep learning services |
| US10802992B2 (en) * | 2016-08-12 | 2020-10-13 | Xilinx Technology Beijing Limited | Combining CPU and special accelerator for implementing an artificial neural network |
| US10152393B2 (en) * | 2016-08-28 | 2018-12-11 | Microsoft Technology Licensing, Llc | Out-of-band data recovery in computing systems |
| US10198349B2 (en) * | 2016-09-19 | 2019-02-05 | Advanced Micro Devices, Inc. | Programming in-memory accelerators to improve the efficiency of datacenter operations |
| US10416896B2 (en) * | 2016-10-14 | 2019-09-17 | Samsung Electronics Co., Ltd. | Memory module, memory device, and processing device having a processor mode, and memory system |
| US20180115496A1 (en) * | 2016-10-21 | 2018-04-26 | Advanced Micro Devices, Inc. | Mechanisms to improve data locality for distributed gpus |
| CN108022905A (zh) * | 2016-11-04 | 2018-05-11 | 超威半导体公司 | Interposer transmission lines using multiple metal layers |
| US20180173619A1 (en) * | 2016-12-21 | 2018-06-21 | Sandisk Technologies Llc | System and Method for Distributed Logical to Physical Address Mapping |
| US11119923B2 (en) * | 2017-02-23 | 2021-09-14 | Advanced Micro Devices, Inc. | Locality-aware and sharing-aware cache coherence for collections of processors |
| US11599777B2 (en) * | 2017-04-28 | 2023-03-07 | Intel Corporation | Scheduling configuration for deep learning networks |
| CN107102824B (zh) * | 2017-05-26 | 2019-08-30 | 华中科技大学 | Hadoop heterogeneous method and system based on storage and acceleration optimization |
| US10489195B2 (en) * | 2017-07-20 | 2019-11-26 | Cisco Technology, Inc. | FPGA acceleration for serverless computing |
| US10474600B2 (en) * | 2017-09-14 | 2019-11-12 | Samsung Electronics Co., Ltd. | Heterogeneous accelerator for highly efficient learning systems |
2017
- 2017-11-28: US application US15/825,047, patent US10474600B2, status: Active

2018
- 2018-05-22: TW application TW107117305A, patent TWI754752B, status: Active
- 2018-06-27: KR application KR1020180074070A, patent KR102689910B1, status: Active
- 2018-08-10: CN application CN201810909419.7A, patent CN109508316B, status: Active
- 2018-09-13: JP application JP2018171047A, patent JP7028745B2, status: Active

2019
- 2019-10-07: US application US16/595,452, patent US11226914B2, status: Active

2022
- 2022-01-17: US application US17/577,370, patent US11921656B2, status: Active

2024
- 2024-02-16: US application US18/444,619, publication US20240193111A1, status: Pending
Also Published As
| Publication number | Publication date |
|---|---|
| TW201915724A (zh) | 2019-04-16 |
| US11226914B2 (en) | 2022-01-18 |
| KR20190030579A (ko) | 2019-03-22 |
| US11921656B2 (en) | 2024-03-05 |
| CN109508316A (zh) | 2019-03-22 |
| US20220138132A1 (en) | 2022-05-05 |
| KR102689910B1 (ko) | 2024-07-31 |
| CN109508316B (zh) | 2023-08-18 |
| TWI754752B (zh) | 2022-02-11 |
| US20190079886A1 (en) | 2019-03-14 |
| US20200042477A1 (en) | 2020-02-06 |
| US10474600B2 (en) | 2019-11-12 |
| JP2019053734A (ja) | 2019-04-04 |
| US20240193111A1 (en) | 2024-06-13 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| JP7028745B2 (ja) | Heterogeneous accelerator for highly efficient learning systems | |
| CN1906587B (zh) | Method and apparatus for reducing power consumption in a multi-processor system | |
| US8914618B2 (en) | Instruction set architecture-based inter-sequencer communications with a heterogeneous resource | |
| US12249018B2 (en) | Game engine on a chip | |
| US10394604B2 (en) | Method for using local BMC to allocate shared GPU resources inside NVMe over fabrics system | |
| CN103262035B (zh) | Device discovery and topology reporting in a combined CPU/GPU architecture system | |
| US10242420B2 (en) | Preemptive context switching of processes on an accelerated processing device (APD) based on time quanta | |
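| CN110647495A (zh) | Embedded network-on-chip accessible to the programmable logic fabric of a programmable logic device in a multi-dimensional die system | |
| JP2014504416A (ja) | Device discovery and topology reporting in a combined CPU/GPU architecture system | |
| CN105573959A (zh) | Distributed computer architecture with integrated computing and storage | |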
| US20230195664A1 (en) | Software management of direct memory access commands | |
| JP5805783B2 (ja) | Computer system interrupt handling | |
| JP2014503898A (ja) | Method and system for synchronous operation of processing devices | |
| US7680972B2 (en) | Micro interrupt handler | |
| US20250348970A1 (en) | Dynamic dispatch for workgroup distribution | |
| US20220320042A1 (en) | Die stacking for modular parallel processors | |
| Lübbers et al. | Communication and Synchronization in Multithreaded Reconfigurable Computing Systems. | |
| US20250068464A1 (en) | Hierarchical work scheduling | |
| WO2009004628A2 (en) | Multi-core cpu |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 2021-07-21 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 2021-07-21 |
| | A871 | Explanation of circumstances concerning accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A871; Effective date: 2021-07-21 |
| | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 2021-10-05 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 2022-01-05 |
| | TRDD | Decision of grant or rejection written | |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 2022-01-18 |
| | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 2022-02-17 |
| | R150 | Certificate of patent or registration of utility model | Ref document number: 7028745; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |
| | R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |