US11853758B2 - Techniques for decoupled access-execute near-memory processing - Google Patents

Techniques for decoupled access-execute near-memory processing Download PDF

Info

Publication number
US11853758B2
US11853758B2 US16/585,521 US201916585521A US11853758B2 US 11853758 B2 US11853758 B2 US 11853758B2 US 201916585521 A US201916585521 A US 201916585521A US 11853758 B2 US11853758 B2 US 11853758B2
Authority
US
United States
Prior art keywords
memory
circuitry
access
compute
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US16/585,521
Other languages
English (en)
Other versions
US20200026513A1 (en
Inventor
Berkin Akin
Alaa R. Alameldeen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Priority to US16/585,521 priority Critical patent/US11853758B2/en
Assigned to INTEL CORPORATION reassignment INTEL CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AKIN, BERKIN, ALAMELDEEN, ALAA R.
Publication of US20200026513A1 publication Critical patent/US20200026513A1/en
Priority to EP20181133.8A priority patent/EP3798828A3/de
Priority to CN202010592587.5A priority patent/CN112579487A/zh
Priority to US18/388,797 priority patent/US20240078112A1/en
Application granted granted Critical
Publication of US11853758B2 publication Critical patent/US11853758B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/38Concurrent instruction execution, e.g. pipeline, look ahead
    • G06F9/3885Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units
    • G06F9/3889Concurrent instruction execution, e.g. pipeline, look ahead using a plurality of independent parallel functional units controlled by multiple instructions, e.g. MIMD, decoupled access or execute
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • G06F13/14Handling requests for interconnection or transfer
    • G06F13/16Handling requests for interconnection or transfer for access to memory bus
    • G06F13/1668Details of memory controller
    • G06F13/1689Synchronisation and timing concerns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/3004Arrangements for executing specific machine instructions to perform operations on memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/78Architectures of general purpose stored program computers comprising a single central processing unit
    • G06F15/7807System on chip, i.e. computer system on a single chip; System in package, i.e. computer system on one or more chips in a single package
    • G06F15/7821Tightly coupled to memory, e.g. computational memory, smart memory, processor in memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers
    • G06F15/78Architectures of general purpose stored program computers comprising a single central processing unit
    • G06F15/7807System on chip, i.e. computer system on a single chip; System in package, i.e. computer system on one or more chips in a single package
    • G06F15/7825Globally asynchronous, locally synchronous, e.g. network on chip
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • G06F3/0631Configuration or reconfiguration of storage systems by allocating resources to storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • G06F3/064Management of blocks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • G06F3/0644Management of space entities, e.g. partitions, extents, pools
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0658Controller construction arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/30003Arrangements for executing specific machine instructions
    • G06F9/30076Arrangements for executing specific machine instructions to perform miscellaneous control operations, e.g. NOP
    • G06F9/30087Synchronisation or serialisation instructions
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management
US16/585,521 2019-09-27 2019-09-27 Techniques for decoupled access-execute near-memory processing Active 2041-04-03 US11853758B2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US16/585,521 US11853758B2 (en) 2019-09-27 2019-09-27 Techniques for decoupled access-execute near-memory processing
EP20181133.8A EP3798828A3 (de) 2019-09-27 2020-06-19 Techniken für entkoppelte zugriffsausführung in der nahspeicherverarbeitung
CN202010592587.5A CN112579487A (zh) 2019-09-27 2020-06-24 用于去耦合的存取-执行近存储器处理的技术
US18/388,797 US20240078112A1 (en) 2019-09-27 2023-11-10 Techniques for decoupled access-execute near-memory processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US16/585,521 US11853758B2 (en) 2019-09-27 2019-09-27 Techniques for decoupled access-execute near-memory processing

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/388,797 Continuation US20240078112A1 (en) 2019-09-27 2023-11-10 Techniques for decoupled access-execute near-memory processing

Publications (2)

Publication Number Publication Date
US20200026513A1 US20200026513A1 (en) 2020-01-23
US11853758B2 true US11853758B2 (en) 2023-12-26

Family

ID=69161881

Family Applications (2)

Application Number Title Priority Date Filing Date
US16/585,521 Active 2041-04-03 US11853758B2 (en) 2019-09-27 2019-09-27 Techniques for decoupled access-execute near-memory processing
US18/388,797 Pending US20240078112A1 (en) 2019-09-27 2023-11-10 Techniques for decoupled access-execute near-memory processing

Family Applications After (1)

Application Number Title Priority Date Filing Date
US18/388,797 Pending US20240078112A1 (en) 2019-09-27 2023-11-10 Techniques for decoupled access-execute near-memory processing

Country Status (3)

Country Link
US (2) US11853758B2 (de)
EP (1) EP3798828A3 (de)
CN (1) CN112579487A (de)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2594498B (en) * 2020-04-30 2022-06-01 Advanced Risc Mach Ltd Instruction scheduling
US11720360B2 (en) * 2020-09-11 2023-08-08 Apple Inc. DSB operation with excluded region

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6145073A (en) * 1998-10-16 2000-11-07 Quintessence Architectures, Inc. Data flow integrated circuit architecture
US6728873B1 (en) * 2000-06-06 2004-04-27 International Business Machines Corporation System and method for providing multiprocessor speculation within a speculative branch path
US7334110B1 (en) * 2003-08-18 2008-02-19 Cray Inc. Decoupled scalar/vector computer architecture system and method
US7437521B1 (en) * 2003-08-18 2008-10-14 Cray Inc. Multistream processing memory-and barrier-synchronization method and apparatus
US20130305022A1 (en) * 2012-05-14 2013-11-14 International Business Machines Corporation Speeding Up Younger Store Instruction Execution after a Sync Instruction
US20150149718A1 (en) 2009-07-21 2015-05-28 Tadao Nakamura A lower energy consumption and high speed computer system and a marching main memory adapted for the computer system, without the memory bottleneck
US20160041906A1 (en) 2013-09-21 2016-02-11 Oracle International Corporation Sharding of in-memory objects across numa nodes
US9678673B2 (en) * 2013-03-28 2017-06-13 Hewlett Packard Enterprise Development Lp Coordinating replication of data stored in a non-volatile memory-based system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6145073A (en) * 1998-10-16 2000-11-07 Quintessence Architectures, Inc. Data flow integrated circuit architecture
US6728873B1 (en) * 2000-06-06 2004-04-27 International Business Machines Corporation System and method for providing multiprocessor speculation within a speculative branch path
US7334110B1 (en) * 2003-08-18 2008-02-19 Cray Inc. Decoupled scalar/vector computer architecture system and method
US7437521B1 (en) * 2003-08-18 2008-10-14 Cray Inc. Multistream processing memory-and barrier-synchronization method and apparatus
US20150149718A1 (en) 2009-07-21 2015-05-28 Tadao Nakamura A lower energy consumption and high speed computer system and a marching main memory adapted for the computer system, without the memory bottleneck
US20130305022A1 (en) * 2012-05-14 2013-11-14 International Business Machines Corporation Speeding Up Younger Store Instruction Execution after a Sync Instruction
US9678673B2 (en) * 2013-03-28 2017-06-13 Hewlett Packard Enterprise Development Lp Coordinating replication of data stored in a non-volatile memory-based system
US20160041906A1 (en) 2013-09-21 2016-02-11 Oracle International Corporation Sharding of in-memory objects across numa nodes

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
European Office Action for Patent Application No. 20181133.8, dated Dec. 9, 2021, 7 pages.
European Second Office Action for Patent Application No. 20181133.8, dated Apr. 5, 2023, 8 pages.
Extended European Search Report for Patent Application No. 20181133.8, dated Mar. 12, 2021, 11 pages.
Won W Ro et al., "Design and evaluation of a hierarchical decoupled architecture" , The Journal of Supercomputing, Kluwer Academic Publishers, BO, vol. 38, No. 3, Dec. 2, 2006, pp. 237-259.

Also Published As

Publication number Publication date
CN112579487A (zh) 2021-03-30
EP3798828A2 (de) 2021-03-31
US20200026513A1 (en) 2020-01-23
EP3798828A3 (de) 2021-04-14
US20240078112A1 (en) 2024-03-07

Similar Documents

Publication Publication Date Title
US20240078112A1 (en) Techniques for decoupled access-execute near-memory processing
US11604649B2 (en) Techniques for efficiently transferring data to a processor
KR101720676B1 (ko) 대용량 저장소 디바이스들을 관리하기 위한 장치 및 방법
US20160098200A1 (en) In-memory popcount support for real time analytics
CN107636618B (zh) 在具有嵌入式可编程数据检查的非易失性系统存储器计算系统中的即时重启
US20160283111A1 (en) Read operations in memory devices
CN107301455A (zh) 用于卷积神经网络的混合立方体存储系统及加速计算方法
US20200233803A1 (en) Efficient hardware architecture for accelerating grouped convolutions
WO2020103058A1 (zh) 可编程运算与控制芯片、设计方法及其装置
Chen et al. Accelerator-rich CMPs: From concept to real hardware
KR101743409B1 (ko) 메모리 관리
EP3709159B1 (de) Verarbeitung von anweisungen zur übersetzung von vektorisierten physikalischen gastadressen
JP2021072107A (ja) ストレージ周辺の階層的ソート加速のためのストレージシステム及び方法
US11625279B2 (en) Read-write page replication for multiple compute units
CN103914333B (zh) 基于片上网络互连的多核存储系统仿真器
TWI609265B (zh) 用於揮發性記憶體之以極性爲基的資料移轉功能
CN109983443A (zh) 实现分叉非易失性存储器快速驱动器的技术
Lu et al. A novel NVM memory file system for edge intelligence
US20240070107A1 (en) Memory device with embedded deep learning accelerator in multi-client environment
US20230334750A1 (en) Methods and hardware logic for loading ray tracing data into a shader processing unit of a graphics processing unit
CN114258534B (zh) 阶层式存储器系统
EP4187478A1 (de) Punktwolken-nachbarschafts-map und hash-map-beschleuniger
US20240020248A1 (en) Partial data handling hardware
WO2023225792A1 (en) Techniques to multiply memory access bandwidth using a plurality of links
US20230267992A1 (en) Keeper-free volatile memory system

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: INTEL CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:AKIN, BERKIN;ALAMELDEEN, ALAA R.;SIGNING DATES FROM 20190930 TO 20191002;REEL/FRAME:050730/0454

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STCV Information on status: appeal procedure

Free format text: NOTICE OF APPEAL FILED

STCV Information on status: appeal procedure

Free format text: APPEAL BRIEF (OR SUPPLEMENTAL BRIEF) ENTERED AND FORWARDED TO EXAMINER

STCV Information on status: appeal procedure

Free format text: EXAMINER'S ANSWER TO APPEAL BRIEF MAILED

STCV Information on status: appeal procedure

Free format text: ON APPEAL -- AWAITING DECISION BY THE BOARD OF APPEALS

STCV Information on status: appeal procedure

Free format text: BOARD OF APPEALS DECISION RENDERED

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE