US20160110136A1 - ISOLATED SHARED MEMORY ARCHITECTURE (iSMA) - Google Patents

ISOLATED SHARED MEMORY ARCHITECTURE (iSMA)

Info

Publication number
US20160110136A1
Authority
US
United States
Prior art keywords
memory
ismn
processing units
disaggregated
dram
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/975,273
Inventor
Nirmal Raj Saxena
Sreenivas Krishnan
David Wang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rambus Inc
Original Assignee
Inphi Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inphi Corp filed Critical Inphi Corp
Priority to US14/975,273 priority Critical patent/US20160110136A1/en
Publication of US20160110136A1 publication Critical patent/US20160110136A1/en
Assigned to RAMBUS INC. reassignment RAMBUS INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: INPHI CORPORATION
Abandoned legal-status Critical Current

Classifications

    • G06F3/0659 Command handling arrangements, e.g. command buffers, queues, command scheduling
    • G06F13/4022 Coupling between buses using switching circuits, e.g. switching matrix, connection or expansion network
    • G06F12/0813 Multiuser, multiprocessor or multiprocessing cache systems with a network or matrix configuration
    • G06F3/061 Improving I/O performance
    • G06F3/0613 Improving I/O performance in relation to throughput
    • G06F3/0635 Configuration or reconfiguration of storage systems by changing the path, e.g. traffic rerouting, path reconfiguration
    • G06F3/0644 Management of space entities, e.g. partitions, extents, pools
    • G06F3/0658 Controller construction arrangements
    • G06F3/067 Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
    • G06F3/0679 Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]
    • G06F3/0688 Non-volatile semiconductor memory arrays
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

Techniques for a massively parallel and memory centric computing system. The system has a plurality of processing units operably coupled to each other through one or more communication channels. Each of the plurality of processing units has an ISMn interface device. Each of the plurality of ISMn interface devices is coupled to an ISMe endpoint connected to each of the processing units. The system has a plurality of DRAM or Flash memories configured in a disaggregated architecture and one or more switch nodes operably coupling the plurality of DRAM or Flash memories in the disaggregated architecture. The system has a plurality of high speed optical cables configured to communicate at a transmission rate of 100 G or greater to facilitate communication from any one of the plurality of processing units to any one of the plurality of DRAM or Flash memories.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • The present application is a continuation of and claims priority to U.S. application Ser. No. 14/194,574, filed Feb. 28, 2014, which claims priority to and is a continuation-in-part of U.S. patent application Ser. No. 14/187,082, filed on Feb. 21, 2014, which is a non-provisional of U.S. Provisional Application No. 61/781,928, filed on Mar. 14, 2013, which are incorporated by reference in their entirety.
  • BACKGROUND OF THE INVENTION
  • The present invention is directed to computing systems and methods. These computing systems can be applied to communications networks and the like.
  • Over the last few decades, the use of communication networks has exploded. In the early days of the Internet, popular applications were limited to email, bulletin boards, and mostly informational, text-based web page surfing, and the amount of data transferred was usually relatively small. Today, Internet and mobile applications demand a huge amount of bandwidth for transferring photos, videos, music, and other multimedia files. For example, a social network like Facebook processes more than 500 TB of data daily. With such high demands on data and data transfer, existing data communication systems need to be improved to address these needs.
  • CMOS technology is commonly used to design communication systems implementing optical fiber links. As CMOS technology is scaled down to make circuits and systems run at higher speed and occupy smaller chip (die) area, the operating supply voltage is reduced for lower power. Conventional FET transistors in deep-submicron CMOS processes have very low breakdown voltages; as a result, the operating supply voltage is maintained at around 1 volt. These limitations pose significant challenges to the continued scaling and performance improvement of communication systems.
  • There have been many types of communication systems and methods. Unfortunately, they have been inadequate for various applications. Therefore, improved computing/communication systems and methods are desired.
  • BRIEF SUMMARY OF THE INVENTION
  • According to the present invention, techniques are directed to computing systems and methods. Additionally, various embodiments enable separate computer systems having such memory systems to send and receive data to and from other memory systems having such auxiliary interfaces.
  • In an example, the present invention provides a massively parallel and memory centric computing system. The system has a plurality of processing units operably coupled to each other through one or more communication channels. Each of the plurality of processing units has an ISMn interface device. Each of the plurality of ISMn interface devices is coupled to an ISMe endpoint connected to each of the processing units. The system has a plurality of DRAM or Flash memories configured in a disaggregated architecture and one or more switch nodes operably coupling the plurality of DRAM or Flash memories in the disaggregated architecture. The system has a plurality of high speed optical cables configured to communicate at a transmission rate of 100 G or greater to facilitate communication from any one of the plurality of processing units to any one of the plurality of DRAM or Flash memories.
  • Many benefits are recognized through various embodiments of the present invention. Such benefits include an architecture exhibiting superior power efficiency and in-memory computing efficiency. This architecture can involve disaggregating a large pool of memory (NAND flash or DRAM) that is shared amongst multiple CPU server nodes. Another benefit is a low-latency and high-bandwidth interconnect architecture amongst multiple CPU server nodes. Those of ordinary skill in the art will recognize that the mechanisms described can be applied to other communication systems as well.
  • The present invention achieves these benefits and others in the context of known memory technology. These and other features, aspects, and advantages of the present invention will become better understood with reference to the following description, figures, and claims.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The following diagrams are merely examples, which should not unduly limit the scope of the claims herein. One of ordinary skill in the art would recognize many other variations, modifications, and alternatives. It is also understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this process and scope of the appended claims.
  • FIG. 1 is a simplified architecture of a shared memory system according to an embodiment of the present invention.
  • FIG. 2 is a simplified architecture of an in memory computing system according to an embodiment of the present invention.
  • FIG. 3 is a table with information regarding the computing systems according to an embodiment of the present invention.
  • DETAILED DESCRIPTION OF THE INVENTION
  • According to the present invention, techniques are directed to computing systems and methods. Additionally, various embodiments enable separate computer systems having such memory systems to send and receive data to and from other memory systems having such auxiliary interfaces.
  • The following description is presented to enable one of ordinary skill in the art to make and use the invention and to incorporate it in the context of particular applications. Various modifications, as well as a variety of uses in different applications will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to a wide range of embodiments. Thus, the present invention is not intended to be limited to the embodiments presented, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
  • In the following detailed description, numerous specific details are set forth in order to provide a more thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without necessarily being limited to these specific details. In other instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring the present invention.
  • The reader's attention is directed to all papers and documents which are filed concurrently with this specification and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference. All the features disclosed in this specification, (including any accompanying claims, abstract, and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
  • Furthermore, any element in a claim that does not explicitly state “means for” performing a specified function, or “step for” performing a specific function, is not to be interpreted as a “means” or “step” clause as specified in 35 U.S.C. Section 112, Paragraph 6. In particular, the use of “step of” or “act of” in the Claims herein is not intended to invoke the provisions of 35 U.S.C. 112, Paragraph 6.
  • Please note, if used, the labels left, right, front, back, top, bottom, forward, reverse, clockwise and counter clockwise have been used for convenience purposes only and are not intended to imply any particular fixed direction. Instead, they are used to reflect relative locations and/or directions between various portions of an object.
  • This invention describes an architecture for disaggregating a large pool of memory (NAND flash or DRAM) that is shared amongst multiple CPU server nodes. Another aspect of this invention is a low-latency and high-bandwidth interconnect architecture amongst multiple CPU server nodes. The notion of disaggregating storage, memory, and IO devices from monolithic designs is gaining importance and is being driven by the following considerations:
  • Much of today's hardware is highly monolithic in that our CPUs are inextricably linked to our motherboards, which in turn are linked to specific networking technology, IO, storage, and memory devices. This leads to poorly configured systems that cannot adapt to evolving software and waste lots of energy and material. Disaggregation is a way to break these monolithic designs.
  • Disaggregation allows independent replacement or upgrade of various disaggregated components. This reduces upgrade costs as opposed to increased costs due to gratuitous upgrade of components in monolithic designs.
  • FIG. 1 illustrates a block diagram of a 64-server node iSMA. The iSMA comprises two components: 1) a PCI-express endpoint device called iSMe and 2) a switching node called iSMn. Each of the server CPUs has a local DRAM (not shown in FIG. 1) and is connected to a PCIe iSMe endpoint. The iSMe component of each server node connects to one of the iSMn switch nodes. Attached to each iSMn node is a plurality of DRAM memory channels (shown in FIG. 1) or flash memory devices (not shown in FIG. 1). All of the iSMn nodes are interconnected, thereby forming a shared memory interconnect fabric.
  • The following describes a mode of operation of the iSMA according to an embodiment of the present invention. Upon power-on or system boot, each of the iSMn nodes discovers the locally attached DRAM or flash memory capacity. Each iSMn node then broadcasts its DRAM/flash memory capacity and topology information to the other connected nodes. After a settling time, all of the iSMn nodes have learned the topology as well as the sum-total memory capacity. The topology information comprises the number of server CPUs connected to the iSMn nodes and the identification of those server CPUs.
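  • As a rough illustration of this discovery exchange, the following C sketch models the per-node capacity/topology announcement and the aggregation each node performs after the settling time. The structure names, fields, and node counts are assumptions made for illustration; the architecture described above does not prescribe a particular wire format.

    /*
     * Minimal sketch of boot-time discovery, assuming a simple
     * announcement record per iSMn node (illustrative only).
     */
    #include <stdint.h>
    #include <stdio.h>

    /* Capacity/topology record each iSMn node broadcasts after local discovery. */
    struct ismn_announce {
        uint16_t node_id;          /* identity of the announcing iSMn switch node */
        uint16_t num_servers;      /* server CPUs attached through iSMe endpoints */
        uint64_t local_mem_bytes;  /* locally attached DRAM/flash capacity */
    };

    /* After the settling time, every node folds all announcements into one view. */
    struct fabric_view {
        uint32_t total_servers;
        uint64_t total_mem_bytes;  /* sum-total capacity later reported via BAR */
    };

    static struct fabric_view settle(const struct ismn_announce *ann, int n)
    {
        struct fabric_view v = {0, 0};
        for (int i = 0; i < n; i++) {
            v.total_servers += ann[i].num_servers;
            v.total_mem_bytes += ann[i].local_mem_bytes;
        }
        return v;
    }

    int main(void)
    {
        /* Example: four iSMn nodes, each with 8 servers and 1 TiB of memory. */
        struct ismn_announce ann[4];
        for (int i = 0; i < 4; i++)
            ann[i] = (struct ismn_announce){ (uint16_t)i, 8, 1ULL << 40 };

        struct fabric_view v = settle(ann, 4);
        printf("servers=%u total=%llu bytes\n",
               (unsigned)v.total_servers, (unsigned long long)v.total_mem_bytes);
        return 0;
    }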
  • The iSMn nodes communicate the topology and memory capacity information to the iSMe endpoints via upstream transactions. The iSMe nodes subsequently communicate this topology and sum-total memory capacity information to their respective server CPUs during PCIe enumeration. In particular, the sum-total memory capacity information is reported to the respective server CPU as an address range in a PCIe endpoint base address register (BAR).
  • The reporting of the sum-total memory through a BAR allows each of the server CPUs to have a common address view of the disaggregated memory. The BAR range reporting of the disaggregated memory also allows mapping of the physical address range of the disaggregated memory into a common virtual address range, thereby allowing virtual-to-physical address translations of disaggregated memory to be cached in the translation look-aside buffers (TLBs) of the server CPUs.
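  • For a sense of how a server CPU might consume the BAR-reported range, the following C sketch maps a PCIe BAR exposed through the Linux sysfs resource file into the process address space and performs a load and a store through it. The device path and mapping size are placeholders, and this is only a minimal userspace illustration under those assumptions, not the architecture's actual access path.

    /* Sketch: map a PCIe BAR (hypothetical iSMe endpoint) into virtual memory. */
    #include <fcntl.h>
    #include <stdint.h>
    #include <stdio.h>
    #include <sys/mman.h>
    #include <unistd.h>

    int main(void)
    {
        /* Placeholder PCI address; resource0 corresponds to BAR0. */
        const char *bar = "/sys/bus/pci/devices/0000:03:00.0/resource0";
        size_t len = 1UL << 30;   /* map 1 GiB of the reported address range */

        int fd = open(bar, O_RDWR | O_SYNC);
        if (fd < 0) { perror("open"); return 1; }

        /* Server CPUs mapping the same BAR share a common view of the
         * disaggregated memory; the MMU/TLB caches these translations. */
        volatile uint64_t *mem = mmap(NULL, len, PROT_READ | PROT_WRITE,
                                      MAP_SHARED, fd, 0);
        if (mem == MAP_FAILED) { perror("mmap"); close(fd); return 1; }

        mem[0] = 0xdeadbeef;            /* write through to remote memory */
        printf("readback: %#llx\n", (unsigned long long)mem[0]);

        munmap((void *)mem, len);
        close(fd);
        return 0;
    }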
  • The visibility of the disaggregated memory as a common virtual address simplifies programming models. Also, sharing of this common pool of disaggregated memory by server CPUs is decided through software convention and is influenced by the application use case models.
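  • One simple software convention for sharing the common pool is equal slicing by server identifier, sketched below. The equal-slice policy and helper names are assumptions for illustration; a real deployment would choose a partitioning policy to match its application use case model.

    /* Sketch: derive each server's slice of the common virtual range by convention. */
    #include <stdint.h>
    #include <stdio.h>

    struct mem_slice { uint64_t base; uint64_t size; };

    /* Equal partitioning of the BAR-reported sum-total capacity across servers. */
    static struct mem_slice slice_for(uint64_t pool_bytes, uint32_t num_servers,
                                      uint32_t server_id)
    {
        struct mem_slice s;
        s.size = pool_bytes / num_servers;
        s.base = (uint64_t)server_id * s.size;
        return s;
    }

    int main(void)
    {
        uint64_t pool = 64ULL << 40;      /* e.g. 64 TiB of disaggregated memory */
        for (uint32_t id = 0; id < 4; id++) {
            struct mem_slice s = slice_for(pool, 64, id);
            printf("server %u: base=%#llx size=%llu bytes\n", (unsigned)id,
                   (unsigned long long)s.base, (unsigned long long)s.size);
        }
        return 0;
    }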
  • In an example, the iSMn nodes also have processing capability to do data transformation operations to the locally connected memory. The server CPUs, through posted-write transactions or through downloaded executable programs in disaggregated memory, communicate the nature of data transformation. The iSMn nodes with their local processing capability act upon these posted transactions or executable programs to perform data transformation operations. These data transformation operations are often called in-memory computations.
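  • The following C sketch shows one way a posted-write command interface for such in-memory computations could look. The descriptor layout, opcodes, and doorbell mechanism are illustrative assumptions rather than the actual interface; the command slot and doorbell are simulated with ordinary memory so the sketch runs without hardware.

    /* Sketch of an assumed in-memory-compute command descriptor and posting path. */
    #include <stdint.h>
    #include <stdio.h>
    #include <string.h>

    enum ismn_op { ISMN_OP_FILL = 1, ISMN_OP_XOR = 2, ISMN_OP_SCATTER_ADD = 3 };

    /* One command, written into a command slot that would live in the
     * BAR-mapped disaggregated address range. */
    struct ismn_cmd {
        uint32_t opcode;      /* which transformation to run near memory */
        uint32_t flags;
        uint64_t src_offset;  /* offsets within the disaggregated memory pool */
        uint64_t dst_offset;
        uint64_t length;      /* bytes to transform */
        uint64_t operand;     /* e.g. fill pattern or XOR mask */
    };

    /* Post a command: copy the descriptor into the slot, then ring the doorbell.
     * On real hardware a write barrier would precede the doorbell write;
     * it is omitted here for brevity. */
    static void ismn_post(volatile struct ismn_cmd *slot,
                          volatile uint32_t *doorbell,
                          const struct ismn_cmd *cmd)
    {
        memcpy((void *)slot, cmd, sizeof(*cmd));
        *doorbell = 1;
    }

    int main(void)
    {
        /* Simulate the command slot and doorbell with ordinary memory. */
        struct ismn_cmd slot = {0};
        uint32_t doorbell = 0;

        struct ismn_cmd cmd = {
            .opcode = ISMN_OP_XOR, .src_offset = 0, .dst_offset = 0,
            .length = 1ULL << 20, .operand = 0xffULL
        };
        ismn_post(&slot, &doorbell, &cmd);
        printf("posted opcode %u, doorbell=%u\n", slot.opcode, doorbell);
        return 0;
    }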
  • FIG. 2 is a simplified architecture of an in-memory computing system according to an embodiment of the present invention. In-memory compute capability allows server CPUs to off-load to iSMn nodes various data transformation operations on large pools of data stored in disaggregated memory. This offloading is most beneficial, in both performance and energy, for large data-set payloads that exhibit poor cache locality; moving computation closer to the memory thus improves both power and performance efficiency. FIG. 2 illustrates the in-memory compute idea for an 8-server node configuration connected via iSMe endpoints to a single iSMn node.
  • FIG. 3 is a table with information regarding the computing systems according to an embodiment of the present invention. To demonstrate the efficiency of the in-memory compute capability of the iSMn nodes, we estimated the performance improvement on the GUPS benchmark. The table in FIG. 3 presents the performance improvement estimates, which indicate that offloading data transformation operations to disaggregated memory can yield two to three orders of magnitude of performance improvement for the GUPS class of applications.
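  • For context, GUPS (giga-updates per second) measures random read-modify-write updates scattered across a large table, a pattern with essentially no cache or TLB locality. The simplified C kernel below, which uses a 64-bit LCG in place of the official RandomAccess generator, conveys the access pattern that benefits from in-memory compute; table size and update count are chosen only to keep the sketch small.

    /* Reduced GUPS-style kernel: random 8-byte updates into a large table. */
    #include <stdint.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>

    #define TABLE_BITS 24                      /* 16 Mi entries = 128 MiB table */
    #define TABLE_SIZE (1ULL << TABLE_BITS)
    #define NUPDATE    (4 * TABLE_SIZE)

    int main(void)
    {
        uint64_t *table = malloc(TABLE_SIZE * sizeof(*table));
        if (!table) return 1;
        for (uint64_t i = 0; i < TABLE_SIZE; i++)
            table[i] = i;

        struct timespec t0, t1;
        clock_gettime(CLOCK_MONOTONIC, &t0);

        uint64_t ran = 1;
        for (uint64_t i = 0; i < NUPDATE; i++) {
            ran = ran * 6364136223846793005ULL + 1442695040888963407ULL;
            table[ran >> (64 - TABLE_BITS)] ^= ran;   /* random read-modify-write */
        }

        clock_gettime(CLOCK_MONOTONIC, &t1);
        double secs = (t1.tv_sec - t0.tv_sec) + (t1.tv_nsec - t0.tv_nsec) / 1e9;
        printf("GUPS ~= %.4f\n", NUPDATE / secs / 1e9);

        free(table);
        return 0;
    }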
  • In various embodiments, a memory buffer as described herein could be implemented as a single integrated circuit (IC), or with a multi-chip chipset with various functions spread among several ICs. For example, a memory system based on the DDR4 standard employs DIMMs which include nine separate data buffer chips arranged close to the connector contacts that provide an interface between the connector and the DRAMs. The standard also provides for a central control element which functions as the register section of the DIMM and includes an extra interface to control the data buffers. For this type of chipset implementation, implementing an auxiliary port as described herein requires a new path from the data buffers to the central controller.
  • In an embodiment, the present invention can include a massively parallel and memory centric computing system. This system can include a plurality of processing units operably coupled to each other through one or more communication channels. Each of the plurality of processing units can have an ISMn (Isolated Shared Memory network) interface device. Each of the plurality of ISMn interface devices can be coupled to an ISMe (Isolated Shared Memory endpoint) device connected to each of the processing units. Each of the plurality of processing units can be numbered from 1 through N, where N is an integer greater than or equal to 32. Each of these processing units can be an ARM or an Intel based x86 processor.
  • In a specific embodiment, the system can be configured to initiate a power on or system boot. Each of the iSMn interface devices can be configured to determine a capacity of any one or all of the plurality of DRAM or Flash memories. Each of the iSMn interface devices can be configured to communicate in a broadcast process among any other iSMn interface device, each of which can be coupled to at least one of the plurality of DRAM or Flash memories. This broadcast process can be provided to determine a capacity and a topology of any or all of the system including the plurality of DRAM or Flash memories or networking configuration. The topology can include information selected from at least one of a number of connected processing units and identification information of the processing units to the iSMn devices.
  • Also, each of the iSMn devices can be configured to initiate communication of the topology and capacity information to any one or all of the iSMe devices using a communication direction from the iSMn devices to the iSMe devices. Each of the iSMe devices can be configured to thereafter communicate the topology and a collective capacity of a sum-total of the capacity to a particular processing unit during a PCIe enumeration process. The sum-total memory capacity information can be transferred to a particular processing unit as an address range in a PCIe endpoint base address register.
  • The transferring of the sum-total memory capacity can be provided using a base address register (BAR) characterized by allowing each of the processing units to have a common address view of the disaggregated memory. The BAR range reporting of the disaggregated memory can allow mapping of a physical address range of the disaggregated memory into a common virtual address range. This can allow the caching of a virtual to physical address translation of the disaggregated memory provided by a translation look-aside buffer in the processing unit. The common address view of the disaggregated memory can be configured as a common virtual address.
  • A plurality of DRAM or Flash memories can be configured in a disaggregated architecture. One or more switch nodes can be operably coupled to the plurality of DRAM or Flash memories in the disaggregated architecture. Also, a plurality of high speed optical cables can be configured to communicate at a transmission rate of 100 G or greater to facilitate communication from any one of the plurality of processing units to any one of the plurality of DRAM or Flash memories. Each of the plurality of high speed optical cables can have a length of 1 meter to about 10 kilometers. The transmission rate can be 100 G PAM or other protocol.
  • The embodiments shown in the figures and described above are merely exemplary. The present system encompasses any memory system which employs a memory buffer that serves as an interface between the individual memory chips on a DIMM and a host, and which includes at least one additional, auxiliary interface which enables the memory buffer to serve as an interface between the host and/or memory chips and additional external devices.
  • In other embodiments, a system may include more than one host computer (each with host controller) wherein each host computer includes a memory buffer having a RAM interface and an auxiliary interface, as described herein. The auxiliary interfaces of the memory buffer of one host computer may be directly coupled to an auxiliary interface of the memory buffer of another host computer, or may be coupled via one or more switches. As described herein, such configurations enable the transfer of data from one RAM to another RAM bypassing data paths of the host controllers.
  • Various example embodiments are described with reference to the accompanying drawings, in which embodiments are shown. This inventive concept may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure is thorough and complete, and fully conveys the scope of the inventive concept to those skilled in the art. Like reference numerals refer to like elements throughout this application.
  • It will be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited by these terms. These terms are used to distinguish one element from another. For example, a first element could be termed a second element, and, similarly, a second element could be termed a first element, without departing from the scope of the inventive concept. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
  • It will be understood that when an element is referred to as being “connected” or “coupled” to another element, it can be directly connected or coupled to the other element, or intervening elements may be present. In contrast, when an element is referred to as being “directly connected” or “directly coupled” to another element, there are no intervening elements present. Other words used to describe the relationship between elements should be interpreted in a like fashion (e.g., “between” versus “directly between,” “adjacent” versus “directly adjacent,” etc.).
  • The terminology used herein is for the purpose of describing particular embodiments and is not intended to be limiting of the inventive concept. As used herein, the singular forms “a,” “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises,” “comprising,” “includes” and/or “including,” when used herein, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, and/or components.
  • Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this inventive concept belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
  • While the above is a full description of the specific embodiments, various modifications, alternative constructions and equivalents may be used. Therefore, the above description and illustrations should not be taken as limiting the scope of the present invention which is defined by the appended claims.

Claims (20)

What is claimed is:
1. A massively parallel and memory centric computing system, the system comprising:
an ISMn (Isolated Shared Memory network) interface device provided in each of a plurality of processing units, each of the plurality of processing units operably coupled to each other through at least a communication channel, the ISMn interface device being coupled to an ISMe (Isolated Shared Memory end point) device connected to each of the processing units;
a disaggregated architecture comprising a plurality of DRAM or Flash memories configured in the disaggregated architecture;
a switch node operably coupling the plurality of DRAM or Flash memories in the disaggregated architecture; and
a high speed optical cable configured to communicate at a transmission rate of 100 G or greater to facilitate communication from any one of the plurality of processing units to any one of the plurality of DRAM or Flash memories.
2. The system of claim 1 wherein the plurality of high speed optical cables has a length of 1 meter to about 10 kilometers.
3. The system of claim 1 wherein the transmission rate is 100 G PAM or other protocol.
4. The system of claim 1 wherein the plurality of processing units is a number from 1 through N, where N is an integer greater than or equal to thirty two.
5. The system of claim 1 wherein each of the processing units is either an ARM or an Intel based x86 processor.
6. The system of claim 1 wherein the system is configured to initiate a power on or system boot, the iSMn interface devices being configured to determine a capacity of any one or all of the plurality of DRAM or Flash memories.
7. The system of claim 1 wherein the iSMn interface devices are configured to communicate in a broadcast process among any other iSMn interface device, each of which is coupled to at least one of the plurality of DRAM or Flash memories; whereupon the broadcast process is provided to determine a capacity and a topology of any or all of the system including the plurality of DRAM or Flash memories or networking configuration.
8. The system of claim 7 wherein the topology comprises information selected from at least one of a number of connected processing units and identification information of the processing units to the iSMn devices.
9. The system of claim 8 wherein the iSMn device is configured to initiate communication of the topology and capacity information to the iSMe device using a communication direction from iSMn device to the iSMe device.
10. The system of claim 9 wherein the iSMe devices are configured to thereafter communicate the topology and a collective capacity of a sum-total of the capacity to a particular processing unit during a PCIe enumeration process.
11. The system of claim 10 wherein the sum-total memory capacity information is transferred to a particular processing unit as an address range in a PCIe endpoint base address register.
12. The system of claim 11 wherein transferring of the sum-total memory capacity is provided using a base address register (BAR) characterized by allowing each of the processing units to have a common address view of the disaggregated memory.
13. The system of claim 12 wherein the BAR range reporting of the disaggregated memory is configured to provide a mapping of a physical address range of the disaggregated memory into a common virtual address range, thereby configured to provide caching of a virtual to physical address translation of the disaggregated memory provided by a translation look-aside buffer in the processing unit.
14. The system of claim 13 wherein the common address view of the disaggregated memory is configured as a common virtual address.
15. A massively parallel and memory centric computing system, the system comprising:
a plurality of processing units operably coupled to each other through a communication channel;
an ISMe (Isolated Shared Memory endpoint) device coupled to each of the processing units;
an ISMn (Isolated Shared Memory network) interface device coupled to each of the ISMe devices;
a disaggregated architecture comprising a plurality of DRAM or Flash memories configured in the disaggregated architecture and coupled to the plurality of iSMn interface devices;
a switch node operably coupling the plurality of DRAM or Flash memories in the disaggregated architecture; and
a plurality of high speed optical cables configured to communicate at a transmission rate of 100 G or greater to facilitate communication from any one of the plurality of processing units to any one of the plurality of DRAM or Flash memories.
16. The system of claim 15 wherein each of the iSMn interface devices is configured to communicate in a broadcast process among any other iSMn interface device, each of which is coupled to at least one of the plurality of DRAM or Flash memories; whereupon the broadcast process is provided to determine a capacity and a topology of any or all of the system including the plurality of DRAM or Flash memories or networking configuration.
17. The system of claim 16 wherein the topology comprises information selected from at least one of a number of connected processing units and identification information of the processing units to the iSMn devices; and wherein each of the iSMn devices is configured to initiate communication of the topology and capacity information to any one or all of the iSMe devices using a communication direction from iSMn devices to the iSMe devices.
18. The system of claim 17 wherein each of the iSMe devices is configured to thereafter communicate the topology and a collective capacity of a sum-total of the capacity to a particular processing unit during a PCIe enumeration process; and wherein the sum-total memory capacity information is transferred to a particular processing unit as an address range in a PCIe endpoint base address register.
19. The system of claim 18 wherein transferring of the sum-total memory capacity is provided using a base address register (BAR) characterized by allowing each of the processing units to have a common address view of the disaggregated memory; and wherein the BAR range reporting of the disaggregated memory is configured to provide a mapping of a physical address range of the disaggregated memory into a common virtual address range, thereby configured to provide caching of a virtual to physical address translation of the disaggregated memory provided by a translation look-aside buffer in the processing unit.
20. The system of claim 19 wherein the common address view of the disaggregated memory is configured as a common virtual address.
US14/975,273 2013-03-14 2015-12-18 ISOLATED SHARED MEMORY ARCHITECTURE (iSMA) Abandoned US20160110136A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/975,273 US20160110136A1 (en) 2013-03-14 2015-12-18 ISOLATED SHARED MEMORY ARCHITECTURE (iSMA)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201361781928P 2013-03-14 2013-03-14
US201414187082A 2014-02-21 2014-02-21
US14/194,574 US9250831B1 (en) 2013-03-14 2014-02-28 Isolated shared memory architecture (iSMA)
US14/975,273 US20160110136A1 (en) 2013-03-14 2015-12-18 ISOLATED SHARED MEMORY ARCHITECTURE (iSMA)

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/194,574 Continuation US9250831B1 (en) 2013-03-14 2014-02-28 Isolated shared memory architecture (iSMA)

Publications (1)

Publication Number Publication Date
US20160110136A1 true US20160110136A1 (en) 2016-04-21

Family

ID=55174925

Family Applications (2)

Application Number Title Priority Date Filing Date
US14/194,574 Expired - Fee Related US9250831B1 (en) 2013-03-14 2014-02-28 Isolated shared memory architecture (iSMA)
US14/975,273 Abandoned US20160110136A1 (en) 2013-03-14 2015-12-18 ISOLATED SHARED MEMORY ARCHITECTURE (iSMA)

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US14/194,574 Expired - Fee Related US9250831B1 (en) 2013-03-14 2014-02-28 Isolated shared memory architecture (iSMA)

Country Status (1)

Country Link
US (2) US9250831B1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11853798B2 (en) * 2020-09-03 2023-12-26 Microsoft Technology Licensing, Llc Disaggregated memory pool assignment
US11481116B2 (en) * 2020-09-09 2022-10-25 Microsoft Technology Licensing, Llc Computing device with independently coherent nodes

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2367912B (en) 2000-08-08 2003-01-08 Sun Microsystems Inc Apparatus for testing computer memory
US20060294443A1 (en) 2005-06-03 2006-12-28 Khaled Fekih-Romdhane On-chip address generation
US8930507B2 (en) * 2012-06-12 2015-01-06 International Business Machines Corporation Physical memory shared among logical partitions in a VLAN
US9910816B2 (en) * 2013-07-22 2018-03-06 Futurewei Technologies, Inc. Scalable direct inter-node communication over peripheral component interconnect-express (PCIe)
US9977618B2 (en) * 2013-12-27 2018-05-22 Intel Corporation Pooling of memory resources across multiple nodes

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170344575A1 (en) * 2016-05-27 2017-11-30 Netapp, Inc. Methods for facilitating external cache in a cloud storage environment and devices thereof
US10394475B2 (en) 2017-03-01 2019-08-27 International Business Machines Corporation Method and system for memory allocation in a disaggregated memory architecture
US10394477B2 (en) 2017-03-01 2019-08-27 International Business Machines Corporation Method and system for memory allocation in a disaggregated memory architecture

Also Published As

Publication number Publication date
US9250831B1 (en) 2016-02-02

Legal Events

Date Code Title Description
AS Assignment

Owner name: RAMBUS INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INPHI CORPORATION;REEL/FRAME:040038/0898

Effective date: 20160804

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE