US20150067243A1 - System and method for executing map-reduce tasks in a storage device - Google Patents

System and method for executing map-reduce tasks in a storage device

Info

Publication number
US20150067243A1
Authority
US
United States
Prior art keywords
data
server
solid state drive (SSD)
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/465,505
Inventor
Nader Salessi
Joao Alcantara
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NXGN Data Inc
Original Assignee
NXGN Data Inc
Application filed by NXGN Data Inc
Priority to US14/465,505
Publication of US20150067243A1

Classifications

    • G: Physics
    • G06: Computing; Calculating or Counting
    • G06F: Electric digital data processing
    • G06F 12/00: Accessing, addressing or allocating within memory systems or architectures
    • G06F 12/02: Addressing or allocation; relocation
    • G06F 12/0223: User address space allocation, e.g. contiguous or non-contiguous base addressing
    • G06F 12/023: Free address space management
    • G06F 12/0238: Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory
    • G06F 12/0246: Memory management in non-volatile block erasable memory, e.g. flash memory
    • G06F 9/00: Arrangements for program control, e.g. control units
    • G06F 9/06: Arrangements for program control using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F 9/46: Multiprogramming arrangements
    • G06F 9/50: Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F 9/5061: Partitioning or combining of resources
    • G06F 9/5066: Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • G06F 16/00: Information retrieval; database structures therefor; file system structures therefor
    • G06F 16/20: Information retrieval of structured data, e.g. relational data
    • G06F 16/28: Databases characterised by their database models, e.g. relational or object models
    • G06F 2212/00: Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F 2212/72: Details relating to flash memory management
    • G06F 2212/7208: Multiple device management, e.g. distributing data over multiple flash devices
    • Y: General tagging of new technological developments; general tagging of cross-sectional technologies spanning over several sections of the IPC; technical subjects covered by former USPC cross-reference art collections [XRACs] and digests
    • Y02: Technologies or applications for mitigation or adaptation against climate change
    • Y02D: Climate change mitigation technologies in information and communication technologies [ICT], i.e. information and communication technologies aiming at the reduction of their own energy use
    • Y02D 10/00: Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • The intelligent SSD 130 includes an SSD controller 240 that is designed for data operations or analytics, such as search and analysis of a large volume of unstructured data.
  • The SSD controller 240 can include, for example, a reconfigurable digital signal processing (DSP) core containing arithmetic and logic units and other dedicated hardware units that may be used to perform data analytics, and other operations such as compression, decompression, encryption, and decryption.
  • The intelligent SSD 130 includes an ARM-based core or any other suitable CPU.
  • The intelligent SSD 130 may include an SSD controller 240 and a flash memory 150.
  • The SSD controller 240 performs querying of data.
  • A Map-Reduce job may be composed of a request to find a text word, and/or the number of occurrences of that text word, in the storage nodes in the server.
  • The task can be computed locally within the storage node.
  • The server 110 ′ may be configured to receive queries.
  • The server (also referred to as the data node) passes the query to the storage nodes in the server.
  • Each of these storage nodes, which may be SSDs, may then process the query and return the results to the server, which may compile them. While this process is illustrated with reference to a query, described by Map-Reduce functions, a similar process enables data analytics, machine learning algorithms, and other such operations to be performed on the SSD controller 240.
  • A query may include pattern matching, word count, or occurrence counting.
  • In pattern matching and occurrence counting, the data are searched for matches to one or more specified patterns; in pattern matching the matching data are returned, whereas in occurrence counting only the number of matches is returned.
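As an illustration of the difference between these two query types, here is a minimal Java sketch. It is hypothetical code, not taken from the patent: it scans a byte array for a pattern and either collects the match offsets or merely counts them.

    import java.util.ArrayList;
    import java.util.List;

    // Minimal sketch: occurrence counting returns only the number of
    // matches, while pattern matching returns the matching data (here,
    // just the offsets of the matches).
    public class QueryKinds {

        // Occurrence counting: count matches of 'pattern' in 'data'.
        static int countOccurrences(byte[] data, byte[] pattern) {
            int count = 0;
            for (int i = 0; i + pattern.length <= data.length; i++) {
                if (matchesAt(data, pattern, i)) count++;
            }
            return count;
        }

        // Pattern matching: return the offsets of all matches; a real
        // engine would return the matching records themselves.
        static List<Integer> findMatches(byte[] data, byte[] pattern) {
            List<Integer> offsets = new ArrayList<>();
            for (int i = 0; i + pattern.length <= data.length; i++) {
                if (matchesAt(data, pattern, i)) offsets.add(i);
            }
            return offsets;
        }

        private static boolean matchesAt(byte[] data, byte[] pattern, int off) {
            for (int j = 0; j < pattern.length; j++) {
                if (data[off + j] != pattern[j]) return false;
            }
            return true;
        }

        public static void main(String[] args) {
            byte[] data = "the cat sat on the mat".getBytes();
            byte[] pat = "the".getBytes();
            System.out.println(countOccurrences(data, pat)); // 2
            System.out.println(findMatches(data, pat));      // [0, 15]
        }
    }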
  • The SSD controller 240 may run a Java™ engine.
  • The ability to run a Java™ engine on the SSD controller 240 may enable the SSD controller 240 to participate in a Hadoop™ system and to execute map and reduce tasks. It may also enable the storage node to perform more complex operations in response to communications supported by standard storage node interfaces.
  • The ability to run Java™ requires a higher-performance CPU and may require an operating system.
  • The unused portion of the SSD controller 240 may be used for running the operating system and Java™ for high-level operation. Other operations, such as counting the number of occurrences of a string in the SSD data, for which high-speed execution at low power consumption is important, may be performed by purpose-designed processing elements or by a DSP core in the SSD controller 240.
  • FIG. 3 illustrates a flow diagram of an exemplary method for performing data queries, which may be described by Map-Reduce functions, pursuant to aspects of the present disclosure.
  • The system of FIG. 2 receives a query 302 for data.
  • The server then determines whether the query will be processed in the server or whether it will be passed to the SSD. If the system performs the query in the server, then at act 306 the server sends a data request to the SSD. The server receives the requested data from the SSD at act 308 and processes the query in the server at act 310. Finally, the server returns the query results at act 312.
  • Otherwise, the server passes the query to the SSD.
  • The SSD processes the query at act 316 and passes the query results to the server at act 318.
  • Finally, the server returns the query results at act 320. While this process is illustrated with reference to a query, a similar process enables data analytics, pattern matching and searching, and other such operations to be performed on the SSD controller 240.
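The branch in this flow can be condensed into a short Java sketch. The Ssd interface and QueryRouter class below are hypothetical names for illustration, not part of any real drive API; the point of the second branch is that only the results cross the host interface, not the raw data.

    // Illustrative sketch of the FIG. 3 flow: the server either processes
    // the query itself or offloads it to the intelligent SSD.
    interface Ssd {
        byte[] readData(String request);     // standard read path
        String processQuery(String query);   // in-drive query execution
    }

    public class QueryRouter {
        private final Ssd ssd;

        public QueryRouter(Ssd ssd) { this.ssd = ssd; }

        public String handle(String query, boolean runInServer) {
            if (runInServer) {
                // Acts 306-310: fetch raw data from the SSD, process on
                // the server CPU, and return the results.
                byte[] data = ssd.readData(query);
                return processOnServer(query, data);
            }
            // Otherwise: pass the query to the SSD (acts 316-318); only
            // the results travel back over the bus.
            return ssd.processQuery(query);
        }

        private String processOnServer(String query, byte[] data) {
            return "processed " + data.length + " bytes for: " + query;
        }

        public static void main(String[] args) {
            Ssd fake = new Ssd() {
                public byte[] readData(String request) { return new byte[42]; }
                public String processQuery(String query) { return "7 matches"; }
            };
            System.out.println(new QueryRouter(fake).handle("count 'error'", false));
        }
    }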
  • FIG. 4 is a block diagram of an SSD controller 240 according to an embodiment of the present invention.
  • The SSD controller 240 includes a host interface 400, a microcontroller 410, a hardware engine 420, a digital signal processor (DSP) or DSP core 430, a processing unit buffer 460, and a flash interface 440.
  • The host interface 400 may be, for example, PCIe, NVMe, SATA, Fibre Channel, SAS, SCSI over PCIe, or Ethernet.
  • There may be other components on the SSD controller 240 which are not illustrated, such as buffers and the like.
  • The SSD controller 240 communicates with the flash memory 150 through the flash interface 440, which may implement a flash channel interface.
  • The hardware engine 420 may include a hardware-based search engine.
  • A search engine is a digital hardware block designed to test a set of data to determine whether and where it contains matches to a specified pattern.
  • A comparator implemented in hardware is an example of a search engine, and a more complex search engine may include, among its functional elements, a comparator implemented in hardware.
  • The SSD controller 240 may be a single silicon chip, e.g., a system on a chip (SOC).
  • The DSP core is a reconfigurable DSP.
  • The microcontroller 410, the processing unit buffer 460, and the flash interface 440 are all integrated onto a single semiconductor chip (e.g., a single silicon chip), along with a hardware engine 420 or a DSP core 430, or both.
  • The SSD controller 240 may receive a query through the host interface 400.
  • The microcontroller 410 processes the query by using the hardware engine 420 to compare the requested content of the query with the data on the flash memory 150.
  • The hardware engine 420 may be a hardware-based search engine and may work in conjunction with the DSP 430.
  • For example, the DSP may perform encryption and decryption and the hardware engine may perform searching.
  • The microcontroller 410 returns the results of the query to the server 110 ′ through the host interface 400.
  • The results of the query or other operation in the intelligent SSD 130 may, instead of being returned to the server CPU 120 or host CPU, be stored in the flash memory 150 or flash memories 150 (FIG. 7), for subsequent retrieval by the server CPU or host CPU.
  • FIG. 5 shows a representation of the storage area of an intelligent SSD 130, in relation with search processes conducted on the storage area, in accordance with an embodiment of the present disclosure.
  • FIG. 5 includes a data storage area 510, which reflects the storage area on the flash memory 150 where data may be located.
  • The server 110 ′ may send queries regularly based on requests it may receive. Once a query, e.g., a request to search for occurrences of a first pattern, is passed to the intelligent SSD 130, the intelligent SSD 130 begins processing it, searching the flash memory 150. If an additional query, related to a second pattern, arrives before the intelligent SSD 130 has completed processing of the initial query, the intelligent SSD 130 may continue to search for the first pattern while simultaneously or concurrently beginning to search for the second pattern. Once it reaches the end of the flash memory 150, the SSD may then start again at the beginning of the flash memory 150, searching for only the second pattern, until it reaches the point in the flash memory 150 at which it received the second query.
  • A wide range of capabilities may be implemented in a system constructed according to the embodiment illustrated in FIG. 4.
  • When the hardware engine 420 includes a hardware-based search engine, the system may be capable of high-speed, power-efficient searches of the flash memory 150, for example to count the number of occurrences of a string, pattern, or number.
  • The DSP 430 may be used for functions such as encryption of the data written to the flash memory 150 and decryption of the data read from the flash memory 150, or similarly for compression and decompression at write and read respectively, to increase the effective storage capacity of the intelligent SSD 130.
  • An application layer, running for example in the server CPU 120, may perform encryption or compression, and the DSP 430 may decrypt or decompress the data as needed based on the algorithm used in the application layer.
  • The algorithm used by the application layer may be communicated to the DSP by the host CPU.
  • A DSP may also be used for other functions such as in-line indexing, which may be performed when data is written to the flash memory 150, producing an index which may subsequently be used to accelerate search operations of the flash memory 150.
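As a rough illustration of in-line indexing, the hypothetical Java sketch below builds a small inverted index as a side effect of each write, so that a later search can consult the index instead of scanning the whole store. It is a simplification for illustration, not the patent's design.

    import java.util.*;

    // Hypothetical sketch of in-line indexing: each write updates an
    // inverted index (term -> block numbers), which later accelerates
    // searches by narrowing them to candidate blocks.
    public class InlineIndexer {
        private final Map<String, Set<Integer>> index = new HashMap<>();
        private final List<String> blocks = new ArrayList<>();

        // Write path: store the block and index its terms as a side effect.
        public int write(String block) {
            int blockNo = blocks.size();
            blocks.add(block);
            for (String term : block.toLowerCase().split("\\W+")) {
                if (!term.isEmpty()) {
                    index.computeIfAbsent(term, t -> new HashSet<>()).add(blockNo);
                }
            }
            return blockNo;
        }

        // Read path: the index yields the candidate blocks for a term.
        public Set<Integer> search(String term) {
            return index.getOrDefault(term.toLowerCase(), Collections.emptySet());
        }

        public static void main(String[] args) {
            InlineIndexer drive = new InlineIndexer();
            drive.write("error in flash channel three");
            drive.write("all channels nominal");
            System.out.println(drive.search("channel")); // [0]
        }
    }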
  • The intelligent SSD 130 may perform sophisticated analysis including searches and conditional searches. For example, a server may have stored in it a very large number of email messages, and a user may wish to find messages satisfying certain criteria, e.g., having been sent by a particular sender to any recipient at a particular company. The combination of these two criteria may be tested for by a suitably selected pattern, but if the user wishes to narrow the search further, e.g., with an intelligent search of the body of each email to determine whether a particular transaction was discussed, a more sophisticated algorithm than pattern matching may be required.
  • A conditional search may be used in this example, where criteria related to the body of an email are tested only if the email first meets a first set of criteria, e.g., related to the header of the email; in this case, additional criteria, e.g., a second or third set of criteria related to the body of the email, may be added to the search.
  • A system constructed according to the embodiment illustrated in FIG. 4 is suitable for implementing a broad class of analysis algorithms, including searches and conditional searches.
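A minimal Java sketch of such a conditional search follows; the Email record and its fields are hypothetical stand-ins for the header and body criteria described above.

    import java.util.List;
    import java.util.stream.Collectors;

    // Sketch of a conditional search: body criteria are evaluated only
    // for emails that first satisfy the header criteria.
    public class ConditionalSearch {
        record Email(String sender, String recipientDomain, String body) {}

        static List<Email> search(List<Email> emails,
                                  String sender, String company, String keyword) {
            return emails.stream()
                    // First set of criteria: header fields.
                    .filter(e -> e.sender().equals(sender)
                              && e.recipientDomain().equals(company))
                    // Second set, tested only on emails that passed the
                    // first: a (much simplified) check on the body.
                    .filter(e -> e.body().contains(keyword))
                    .collect(Collectors.toList());
        }

        public static void main(String[] args) {
            List<Email> emails = List.of(
                new Email("alice@a.com", "acme.com",  "about the merger"),
                new Email("alice@a.com", "other.com", "about the merger"),
                new Email("bob@b.com",   "acme.com",  "lunch?"));
            System.out.println(
                search(emails, "alice@a.com", "acme.com", "merger").size()); // 1
        }
    }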
  • Upon receipt of a query requiring a first search, referred to as Search A, the process begins reading the data storage area 510 at a storage location 520 and progresses through the data storage area 510. When a query requiring a second search, referred to as Search B, arrives, Search B joins the scan in progress at a storage location 530, and a third search, Search C, is added to the set of simultaneously conducted searches at a storage location 540. Search A completes at a storage location 550, while Search B and Search C continue, wrapping around to the storage location 520. Search B then completes at the storage location 530, and Search C completes at the storage location 540.
  • In this manner, the search process continues previously initiated searches across the data storage area 510, while simultaneously commencing newly requested searches, as modeled in the sketch below.
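The Java sketch below models that behavior under a simplifying assumption: a single sequential scan visits each storage location in turn, each search joins at the scan's current location, and a search retires when the scan wraps around to its joining point. This is an illustrative model, not the controller's actual implementation.

    import java.util.*;

    // Illustrative model of FIG. 5: one sequential pass over the storage
    // area serves several concurrent searches; each search retires when
    // the scan returns to the location at which that search joined.
    public class CircularScan {
        private final int areaSize;
        private final Map<String, Integer> joinedAt = new LinkedHashMap<>();
        private int location = 0;

        CircularScan(int areaSize) { this.areaSize = areaSize; }

        void join(String search) { joinedAt.put(search, location); }

        void step() {
            location = (location + 1) % areaSize;
            // ... each active search would test the data at 'location'
            // against its own pattern here ...
            joinedAt.entrySet().removeIf(e -> e.getValue() == location);
        }

        public static void main(String[] args) {
            CircularScan scan = new CircularScan(4);
            scan.join("Search A");               // A joins at location 0
            scan.step();
            scan.join("Search B");               // B joins at location 1
            for (int i = 0; i < 6; i++) scan.step();
            // Both searches have now covered all 4 locations and retired.
            System.out.println(scan.joinedAt);   // {}
        }
    }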
  • The data storage area 510 may be composed of logical or physical addresses.
  • The addresses may be defined as blocks, pages, sectors, or the like.
  • FIG. 6 shows a chart of the power usage of an existing system in relation to the power usage of a system constructed according to an embodiment of the present invention, for an exemplary query of a solid state drive.
  • The solid line depicts typical power usage when a server processes a query.
  • The dashed line depicts typical power usage when the query is instead performed by the intelligent SSD 130.
  • Because the server CPU both consumes a higher level of power (level A in FIG. 6) and spends more time processing the query, the energy consumed, i.e., the area under the solid line, is significantly greater than the energy consumed (represented by the cross-hatched area under the dashed line) when the query is processed by the intelligent SSD 130, which consumes power at a lower level (level B in FIG. 6) and completes the processing more quickly.
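For the constant power levels sketched in FIG. 6 the energy is simply power multiplied by processing time. With purely illustrative numbers (not taken from the figure), the comparison might look like:

    E = P \cdot t, \qquad
    E_{\mathrm{server}} = 40\,\mathrm{W} \times 10\,\mathrm{s} = 400\,\mathrm{J}, \qquad
    E_{\mathrm{SSD}} = 15\,\mathrm{W} \times 4\,\mathrm{s} = 60\,\mathrm{J}.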
  • As shown in FIG. 7, the microcontroller 410 may be connected to a plurality of hardware engines 420, each of which may access a plurality of flash memories 150 through a buffer 460 and a flash interface 440.
  • This structure allows searches to be performed in parallel, resulting in shorter processing times (as illustrated in FIG. 6).
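A Java sketch of that parallelism, modeling each hypothetical hardware engine as a thread that searches its own flash region, with the partial counts summed at the end:

    import java.util.List;
    import java.util.concurrent.*;

    // Sketch of FIG. 7-style parallelism: each engine searches its own
    // flash region concurrently, and the partial counts are summed.
    public class ParallelSearch {
        static int countInRegion(String region, String pattern) {
            int count = 0, from = 0;
            while ((from = region.indexOf(pattern, from)) >= 0) {
                count++;
                from++;
            }
            return count;
        }

        public static void main(String[] args) throws Exception {
            List<String> regions = List.of("abcabc", "xxabyy", "abcxab");
            String pattern = "ab";
            ExecutorService pool = Executors.newFixedThreadPool(regions.size());
            List<Future<Integer>> partials = pool.invokeAll(
                regions.stream()
                       .map(r -> (Callable<Integer>) () -> countInRegion(r, pattern))
                       .toList());
            int total = 0;
            for (Future<Integer> f : partials) total += f.get();
            pool.shutdown();
            System.out.println(total); // 5
        }
    }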
  • The disclosed systems and methods have many potential applications, including but not limited to data queries, data analytics, pattern mining, machine learning algorithms, and encryption and decryption. While the illustrations above relate to a query, a similar process may be performed, for example, in relation to data analytics, pattern mining, machine learning algorithms, classification, compression and decompression, and encryption and decryption.
  • Embodiments of the present invention may be used in a redundant array of independent disks (RAID) to achieve similar advantages in optimizing performance and resource utilization, while taking advantage of efficiencies in RAID parity calculations and the number of physical inputs and outputs (I/Os) performed. Accordingly, these embodiments can be used to make RAID controllers and subsystems more efficient.
  • One or more computer processors operating in accordance with instructions may implement the functions associated with managing use of cache devices in accordance with the present disclosure as described above. If such is the case, it is within the scope of the present disclosure that such instructions may be stored on one or more non-transitory processor readable storage media (e.g., a magnetic disk, non-volatile random-access memory, phase-change memory or other storage medium). Additionally, modules implementing functions may also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations.
  • An intelligent SSD 130 containing one or more CPUs is capable of running an operating system on one or more of its CPUs and also of running programs written in high-level languages, e.g., object-oriented languages such as Java™, Python™, or C++.
  • Such programs may describe Map-Reduce functions in a cluster of compute nodes.
  • The cluster may include a master node 810 and a number of slave nodes 820; the master node and the slave nodes may each include or be connected to a storage node 830.
  • Some or all of the storage nodes may be intelligent SSDs 130, and the remainder may be comparable storage nodes such as comparable SSDs or comparable HDDs.
  • The intelligent SSDs 130 may contain hardware-accelerated engines to improve the performance of some tasks, for example search.
  • In one embodiment, the intelligent SSD 130 is part of a data node 820 in the same way as a comparable SSD 125 or other storage node may be part of a data node 820, but it is capable of executing Map-Reduce tasks without transferring the data from the intelligent SSD 130 out to the main CPU.
  • In another embodiment, a data node 820 ′ is an intelligent SSD 130 itself, as illustrated by the data node 820 ′ of FIG. 8.
  • The data node 820 may have the architecture of a server, with a server CPU 120, DRAM memory, an Ethernet or gigabit Ethernet (GbE) connection for external communications, and a storage node, which, as illustrated, may be an intelligent SSD 130.
  • The intelligent SSD 130 may communicate with the CPU using vendor-specific commands, i.e., commands specified by the vendor of the intelligent SSD 130, which may be executed by the CPU after the installation of appropriate drivers in the data node 820.
  • This embodiment provides value especially in legacy, i.e., existing, installations in which it may be more cost-effective to replace comparable storage nodes with intelligent SSDs 130 when the comparable storage nodes fail, or as performance improvements and power efficiency improvements are needed.
  • Alternatively, the data node 820 ′ may be the intelligent SSD 130 itself, having an Ethernet, or gigabit Ethernet (GbE), connection for external communications, a buffer 460, an SSD controller 240, and a flash array.
  • The SSD controller 240 may for example include, as shown in FIG. 4, a microcontroller 410, a hardware engine 420, and a DSP 430.
  • This embodiment may be of particular benefit in new installations, in which the use of an intelligent SSD 130 instead of a data node containing a comparable storage node may provide cost savings, in addition to improvements in performance and reductions in power consumption.
  • Unstructured data is placed on the data nodes via the master node (FIG. 11), and queries are executed on the data using the Map-Reduce framework (FIG. 12).
  • The process of storing unstructured data on the Hadoop™ distributed file system (HDFS) includes three acts.
  • The data are chopped into multiple data blocks, which are scattered, in an act 1110, over multiple data nodes 820.
  • Metadata for the data blocks are stored, in an act 1115, on the master node.
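A toy Java sketch of these acts: the data are split into fixed-size blocks, the blocks are scattered over the data nodes, and the placement is recorded as metadata on the master. The block size, the round-robin policy, and the absence of replication are hypothetical simplifications of HDFS.

    import java.util.*;

    // Toy sketch of acts 1110-1115: chop the data into blocks, scatter
    // the blocks over data nodes, keep block -> node metadata on the
    // master node.
    public class BlockPlacement {
        public static void main(String[] args) {
            byte[] data = new byte[1000];        // the unstructured data
            int blockSize = 256;                 // hypothetical block size
            List<String> dataNodes = List.of("node1", "node2", "node3");

            Map<Integer, String> metadata = new LinkedHashMap<>(); // on master
            int blockNo = 0;
            for (int off = 0; off < data.length; off += blockSize) {
                String node = dataNodes.get(blockNo % dataNodes.size());
                // ... send data[off .. min(off + blockSize, data.length))
                // to 'node' ...
                metadata.put(blockNo++, node);
            }
            System.out.println(metadata);
            // {0=node1, 1=node2, 2=node3, 3=node1}
        }
    }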
  • Unstructured data may include social feeds, geographic information systems (GIS) data, documents, extensible markup language (XML), email, web logs, images, audit logs, data from field sensors, radio frequency identification (RFID) data, data about world events, data about market events, and closed-circuit television (CCTV) footage.
  • A user submits a query to the system; this may be accomplished through an application interface.
  • The query is submitted, e.g., by a client machine running the application interface, to the master node 810, which, in an act 1215, uses the map process to assign sub-jobs for executing the query to slave nodes 820.
  • Each slave node 820 to which a sub-job is assigned by the master node 810 may optionally, in an act 1220, further assign sub-jobs to other slave nodes 820.
  • The sub-jobs are executed in parallel in an act 1225, the slave nodes 820 return the results to the master node 810 in an act 1230, and the master node 810, in an act 1235, aggregates the results.
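The overall flow condenses into a short Java sketch: the master fans sub-jobs out to the slave nodes, the slaves execute them in parallel (on an intelligent SSD, inside the drive itself), and the master aggregates the returned partial results. The SlaveNode interface and the hard-coded partial counts are hypothetical.

    import java.util.List;
    import java.util.concurrent.*;

    // Condensed sketch of FIG. 12: the master assigns sub-jobs to slave
    // nodes, the slaves execute in parallel, and the master aggregates.
    public class MasterNode {
        interface SlaveNode { long executeSubJob(String subJob); }

        public static void main(String[] args) throws Exception {
            // Hypothetical slaves, each returning a partial count as if
            // it had searched its own share of the data.
            List<SlaveNode> slaves = List.of(
                subJob -> 3L,    // e.g., an intelligent SSD counting in-drive
                subJob -> 5L,
                subJob -> 2L);

            String query = "count occurrences of 'error'";
            ExecutorService pool = Executors.newFixedThreadPool(slaves.size());
            List<Future<Long>> partials = pool.invokeAll(
                slaves.stream()
                      .map(s -> (Callable<Long>) () -> s.executeSubJob(query))
                      .toList());

            long total = 0;                  // act 1235: aggregation
            for (Future<Long> f : partials) total += f.get();
            pool.shutdown();
            System.out.println(total);       // 10
        }
    }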

Abstract

A system and method of providing enhanced data processing and analysis in an infrastructure for distributed computing and large-scale data processing. This infrastructure uses the Hadoop™ framework to divide an application into a large number of small fragments of work, each of which may be performed on one of a large number of compute nodes. The work may involve map tasks and reduce tasks which may be used to categorize and analyze large amounts of data in distributed systems. This infrastructure includes a cluster with a master node and a plurality of slave nodes. The slave nodes may include, or may be, intelligent solid-state drives capable of executing Map-Reduce functions. The use of intelligent solid-state drives reduces the need to exchange data with a CPU in a server.

Description

    CROSS-REFERENCE TO RELATED APPLICATION
  • This application is a continuation of U.S. patent application Ser. No. 14/015,815, filed on Aug. 30, 2013, now U.S. Pat. No. 8,819,335, the entire content of which is incorporated herein by reference.
  • FIELD
  • The following description relates to cloud data storage systems and more particularly to a system and method for efficiently executing Map-Reduce tasks on large volumes of data, and on data stored in master-slave hardware configurations.
  • BACKGROUND
  • Every day, several quintillion bytes of data may be created around the world. These data come from everywhere: posts to social media sites, digital pictures and videos, purchase transaction records, bank transactions, sensors used to gather data and intelligence, like climate information, cell phone GPS signals, and many others. This type of data and its vast accumulation is often referred to as “big data.” This vast amount of data eventually is stored and maintained in storage nodes, such as hard disk drives (HDDs), solid-state storage drives (SSDs), or the like, and these may reside on networks or on storage accessible via the Internet, which may be referred to as the “cloud.” This stored data may also require processing, or be subject to operations, such as during a search, Pattern Mining, Classification, or other processes. Typically, a processing device, such as a central processing unit (CPU), in a server performs operations on the data. The data is read from the storage node, processed by the CPU and the processed data is sent to the source of a request and/or stored back on the storage node. Standard storage nodes generally do not include computational resources to perform such operations on data stored in the storage node.
  • Moreover, standard storage node interfaces, such as Serial Advanced Technology Attachment (SATA), Fibre Channel, or Serial Attached SCSI (SAS), do not define commands to trigger the storage node to perform data operations in the storage node. Accordingly, operations are performed outside of the storage node, e.g., in a server CPU. To perform such an operation, a server uses standard read and write commands supported by existing storage node interfaces to move data from and to the storage node. Specifically, the server sends a standard read command to the storage node via a bus. The storage node then sends the stored data over the bus to the server, which typically holds the data in its main memory. The CPU of the server then performs operations on the data to produce a result. Depending on the type of operation, the server provides the result to a requesting source and/or stores the result on the storage node.
  • There are several disadvantages associated with this process of reading the data from the storage node, and processing the data within the server, and potentially storing the processed data back on the storage node. Because of these disadvantages, the process of performing data operations on the server is referred to as “costly” or “expensive” in terms of device performance and power consumption. Because the server CPU is involved in every step of the process, this process occupies the CPU of the server, consumes power, blocks other user operations that otherwise could have been performed, and requires that the server contain a buffer, or a larger buffer than would otherwise be needed. The buffer is typically the main memory of the CPU, or double data rate (DDR) random access memory. This process also ties up the communication bus between the server and the storage node since data is sent from the storage node to the server and then back to the storage node. In other words, existing processes for searching and analyzing large distributed unstructured databases are time-consuming and use large amounts of resources such as CPU utilization, memory footprint, or energy.
  • In summary, typical operations like search, pattern mining, classification, machine learning algorithms and data analysis are, in existing systems, performed on the local server's CPU. Search and processing may be performed over the entire data residing in storage nodes (e.g., solid state drives (SSDs), hard disk drives (HDDs), etc.) within the server. Data needs to be moved from the storage node into the CPU memory where it can then be processed. This is inefficient, e.g., slow, because a single server CPU, which may control a large collection of storage nodes, has relatively little processing power with which to process the large volume of data stored on the collection of storage nodes. Efficiency may also be compromised by one or more data bottlenecks between the server CPU and the storage nodes. Moreover, requiring the server's CPU to do this work makes inefficient use of energy as well, in part because a general-purpose CPU like a server CPU generally is not optimized for large data set processing, and in part because transferring data over a data bus and across the interface to the storage node requires a significant amount of power.
  • Big data may be managed and analyzed using the Hadoop™ software framework and using the Map-Reduce programming model. The Hadoop™ framework may implement Map-Reduce functions to distribute the data query, which may be a Map-Reduce job, into a large number of small fragments of work, referred to herein as tasks, each of which may be performed on one of a large number of compute nodes. In particular, the work may involve map tasks and reduce tasks which may be used to categorize and analyze large amounts of data in distributed systems. As used herein, a compute node is a piece of hardware capable of performing operations, and a storage node is a piece of hardware capable of storing data. Thus, for example, a piece of hardware may be, or contain, both a compute node and a storage node, and, as another example, a compute node may include or contain a storage node.
  • Related art Map-Reduce systems for large-scale processing of data in a parallel processing environment include one or more map modules configured to read input data and to apply at least one application-specific map operation to the input data to produce intermediate data values. An intermediate data structure stores the intermediate data values. These systems also include reduce modules, which are configured to retrieve the intermediate data values from the intermediate data structure and to apply at least one user-specified reduce operation to the intermediate data values to provide output data. Preferably, the map and/or reduce tasks are automatically parallelized across multiple compute nodes in the parallel processing environment. The programs or instructions for handling parallelization of the map and reduce tasks are application independent. The input data and the intermediate data values can include key/value pairs and the reduce operation can include combining intermediate data values having the same key. The intermediate data structure can include one or more intermediate data files coupled to each map module for storing intermediate data values. The map and reduce tasks can be executed on different compute nodes. The output data can be written to the local storage node or to another compute node using a distributed file system, for instance, a Hadoop™ distributed file system (HDFS).
  • Map-Reduce (M-R) is a programming model that allows large amounts of data to be processed on parallel computer platforms using two basic functions: map and reduce. Data is first mapped (for grouping purposes) using the map function and then reduced (aggregated) using the reduce function. For example, records having different attributes such as “dog” and “cat” could be mapped, for grouping purposes, to new records (or tuples) where each has attributes of “animal” instead of “dog” or “cat”. Then, by a reduce function, all the “animal” records (or tuples) could be aggregated. A Map-Reduce model implemented in a parallel processing computer system may enhance the processing of massive quantities of data by a “divide-and-conquer” strategy that may result from dividing the data into portions and processing it on parallel-processing computer installations.
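The dog/cat example can be written directly as map and reduce steps. The Java sketch below is a self-contained stand-in rather than actual Hadoop™ API code; the record set and the keying rule are hypothetical.

    import java.util.*;
    import java.util.stream.Collectors;

    // The "dog"/"cat" -> "animal" example as literal map and reduce steps:
    // map re-keys each record for grouping and emits (key, 1) pairs;
    // reduce aggregates the values that share a key.
    public class AnimalMapReduce {
        public static void main(String[] args) {
            List<String> records = List.of("dog", "cat", "dog", "fish", "cat");

            // Map: "dog" and "cat" are re-keyed to "animal".
            List<Map.Entry<String, Integer>> mapped = records.stream()
                .map(r -> Map.entry(
                     (r.equals("dog") || r.equals("cat")) ? "animal" : r, 1))
                .collect(Collectors.toList());

            // Reduce: combine intermediate values having the same key.
            Map<String, Integer> reduced = mapped.stream()
                .collect(Collectors.groupingBy(Map.Entry::getKey,
                         Collectors.summingInt(Map.Entry::getValue)));

            System.out.println(reduced); // {animal=4, fish=1} (order may vary)
        }
    }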
  • Related art hardware systems may include a set of data nodes, which may also be referred to as slave nodes, controlled by a master node which may also be referred to as a job tracker or name node. Within the Hadoop™ framework, the master node may use the Map-Reduce process to assign tasks to slave nodes, the slave nodes may complete the tasks, and the master node may then aggregate the results produced by the slave nodes.
  • The master node and the slave nodes may be servers, each including a CPU and a storage node. As in the case of other operations, slave node sub-job operations executed in a CPU, which retrieves data from a storage node and may save results on a storage node, are relatively slow and power-inefficient. Thus, there is a need for a system and method, in, e.g., a Hadoop™ system, for more efficiently processing data stored on storage nodes.
  • SUMMARY
  • Aspects of embodiments of the present disclosure are directed toward a system and method of providing enhanced data processing and analysis in a cluster of compute nodes executing Map-Reduce tasks in a Hadoop™ framework. The Hadoop™ framework divides a data query (Map-Reduce job) into a large number of small fragments of work, each of which may be performed on one of a large number of compute nodes. The work may involve a map task and a reduce task which may be used to categorize and analyze large amounts of data in distributed systems. A Hadoop™ cluster contains a master node and a plurality of slave nodes. The slave nodes include intelligent solid-state drives capable of executing Map-Reduce tasks. The use of intelligent solid-state drives reduces the need to exchange data with a CPU in a server.
  • According to an embodiment of the present invention there is provided an intelligent solid state drive including: a processing unit; and a flash memory; the processing unit configured to be in communication with the flash memory, and including: a hardware engine; and a microcontroller; the solid state drive configured to perform map and reduce tasks.
  • In one embodiment, the intelligent solid state drive is configured to run an operating system.
  • In one embodiment, the operating system is configured to enable the drive to execute a high-level computer language.
  • In one embodiment, the computer language is an object-oriented programming language.
  • In one embodiment, a cluster of nodes includes: a master node; and a plurality of slave nodes; wherein a slave node of the plurality of slave nodes includes a server including a server central processing unit (CPU) and an intelligent solid state drive.
  • In one embodiment, a cluster of nodes includes: a master node; and a plurality of slave nodes; wherein a slave node of the plurality of slave nodes is an intelligent solid state drive.
  • In one embodiment, a method is provided for executing a query on a cluster of nodes, the cluster of nodes including a master node and a plurality of slave nodes, a slave node of the plurality of slave nodes including an intelligent solid state drive, the method including: submitting the query to the master node; assigning a plurality of tasks to the plurality of slave nodes, by the master node, the plurality of tasks being configured to execute portions of the query; executing the plurality of tasks, by the plurality of slave nodes; returning the results of the execution of the plurality of tasks, by the plurality of slave nodes, to the master node; and aggregating, by the master node, the results of the execution of the plurality of tasks.
  • In one embodiment, the method includes assigning of a task by a first slave node of the plurality of slave nodes to a second slave node of the plurality of slave nodes.
  • In one embodiment, a task of the plurality of tasks includes an execution of a map function within a Map-Reduce framework.
  • In one embodiment, a task of the plurality of tasks includes an execution of a reduce function within a Map-Reduce framework.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • In order to facilitate a fuller understanding of the present disclosure, reference is now made to the accompanying drawings, in which like elements are referenced with like numerals. These drawings should not be construed as limiting the present disclosure, but are intended to be exemplary only.
  • FIG. 1 shows an exemplary block diagram depicting a solid state drive in communication with a server, in accordance with related art systems;
  • FIG. 2 depicts an exemplary block diagram of a solid state drive in communication with a server, in accordance with an embodiment of the present invention;
  • FIG. 3 shows exemplary acts for performing queries in a solid state drive, in accordance with an embodiment of the present invention;
  • FIG. 4 depicts an exemplary block diagram of a processing unit of a solid state drive in communication with a server and flash memory storage, in accordance with an embodiment of the present invention;
  • FIG. 5 shows a representation of the storage area of a solid state drive, in relation with searches conducted on the storage area, in accordance with an embodiment of the present invention;
  • FIG. 6 shows a chart comparing the power usage of an existing system with the power usage of a system employing a storage node constructed according to an embodiment of the present invention, during an exemplary query of a solid state drive;
  • FIG. 7 depicts an exemplary block diagram of a processing unit of a solid state drive in communication with a server and a plurality of flash memories, in accordance with an embodiment of the present invention;
  • FIG. 8 shows a cluster of nodes in a distributed computing system according to an embodiment of the present invention;
  • FIG. 9 shows a data node architecture according to an embodiment of the present invention;
  • FIG. 10 shows a data node architecture according to another embodiment of the present invention;
  • FIG. 11 is a flowchart illustrating acts involved in the storage of unstructured data in a distributed file system according to an embodiment of the present invention; and
  • FIG. 12 is a flowchart illustrating acts involved in execution of a query in a distributed file system according to an embodiment of the present invention.
  • DETAILED DESCRIPTION
  • The detailed description set forth below in connection with the appended drawings is intended as a description of exemplary embodiments of a system and method for performing efficient data operations and analytics provided in accordance with the present invention and is not intended to represent the only forms in which the present invention may be constructed or utilized. The description sets forth the features of the present invention in connection with the illustrated embodiments. It is to be understood, however, that the same or equivalent functions and structures may be accomplished by different embodiments that are also intended to be encompassed within the spirit and scope of the invention. As denoted elsewhere herein, like element numbers are intended to indicate like elements or features.
  • The present invention relates to systems and methods for processing data in large systems using solid state storage. According to an embodiment of the present invention, processing of data stored on an intelligent solid state storage node, which may be referred to as an intelligent solid state drive (SSD), does not require comparatively slow reading and re-writing of the data and, instead, is accommodated by performing the processing within the intelligent SSD.
  • Comparable SSDs typically include a controller for facilitating the transfer of data to and from the SSD. The CPU in a typical comparable SSD has limited processing capability, which is an obstacle to running an operating system and to running Java™. It also lacks a hardware engine for performing a word count or pattern matching.
  • FIG. 1 is a block diagram of a system which includes a server 110 in communication with a storage node for performing data queries according to prior art implementations. FIG. 1 includes a server 110, which can include a processor, such as a server central processing unit (CPU) 120, and a comparable SSD 125, which can include a storage node controller 140 and a non-volatile memory, e.g., a flash memory 150. The server 110 and comparable SSD 125 may be implemented in a cloud-based computing environment. The server 110 and comparable SSD 125 may communicate using any storage bus, as well as PCIe with any protocol that runs over it. In other embodiments, storage nodes may be connected to, and controlled by, a host CPU which need not be a server CPU but may be a CPU in an application not configured as a server. Thus, a server CPU may be an example of a host CPU.
  • As used herein, the phrase “in communication with” refers to in direct communication with or in indirect communication with via one or more components named or unnamed herein. The server 110 and the comparable SSD 125 can be in communication with each other via a wired or wireless connection. For example, in one embodiment, the comparable SSD 125 may comprise pins (or a socket) to mate with a corresponding socket (or pins) on the server 110 to establish an electrical and physical connection. In another embodiment, the comparable SSD 125 can comprise a wireless transceiver to place the server 110 and the comparable SSD 125 in wireless communication with each other. The server 110 and the comparable SSD 125 may be separately housed from each other, or contained in the same housing.
As shown in FIG. 1, in operation, the server 110 may receive a query, which may, for example, entail finding the number of occurrences of a certain pattern or text. As used herein, a pattern is a combination of strings and logical operations, in which the logical operations determine which combinations of the strings, if they are found in a set of data, will constitute a match for the pattern in the set of data. In response, the server 110 may send a data request to the comparable SSD 125. The comparable SSD 125 receives the data request, and retrieves the requested data. The comparable SSD 125 then sends the data to the server 110. The server CPU 120 processes the data and returns the results. The server 110 and the comparable SSD 125 may include additional components, which are not shown in FIG. 1 to simplify the drawing.
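By way of illustration only, a pattern in the sense defined above, i.e., strings combined by logical operations, can be modeled in software as in the following minimal sketch; the function names and the sample data block are hypothetical and are not part of the described embodiments.

```python
# Illustrative sketch of a "pattern": strings joined by logical operations.
# A block of data matches the pattern when the logical combination holds.

def match_and(data: bytes, *terms: bytes) -> bool:
    # AND pattern: every string must be present in the data.
    return all(t in data for t in terms)

def match_or(data: bytes, *terms: bytes) -> bool:
    # OR pattern: at least one string must be present in the data.
    return any(t in data for t in terms)

block = b"error: disk 3 timeout; retrying"
print(match_and(block, b"error", b"timeout"))  # True: both strings found
print(match_or(block, b"panic", b"timeout")))  if False else print(match_or(block, b"panic", b"timeout"))  # True: one string found
```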
In one embodiment of the present invention and referring to FIG. 2, an intelligent SSD 130 may have the capabilities of a comparable SSD 125 in addition to further capabilities discussed in detail below. Thus, an intelligent SSD 130 may be used in applications in which a comparable SSD 125 might otherwise be used, such as those described above with respect to FIG. 1.
In particular and as shown in FIG. 2, a server 110′ may include a processor, such as a server central processing unit (CPU) 120, and an intelligent SSD 130. The server 110′ and intelligent SSD 130 may be implemented in a cloud-based computing environment. The server 110′ and intelligent SSD 130 may communicate using any storage bus, including PCIe with any protocol that runs over it. In other embodiments storage nodes may be connected to, and controlled by, a host CPU which need not be a server CPU but may be a CPU in an application not configured as a server.
The server 110′ and the intelligent SSD 130 can be in communication with each other via a wired or wireless connection. For example, in one embodiment, the intelligent SSD 130 may comprise pins (or a socket) to mate with a corresponding socket (or pins) in the server 110′ to establish an electrical and physical connection with, e.g., the CPU 120. In another embodiment, the intelligent SSD 130 can comprise a wireless transceiver to place the server 110′ and the intelligent SSD 130 in wireless communication with each other. The server 110′ and the intelligent SSD 130 may be separately housed from each other, or contained in the same housing.
As shown in FIG. 2, in operation, the server 110′ may receive a query, described by map and reduce functions, which may, for example, entail finding the number of occurrences of a certain pattern or text. As used herein, a pattern is a combination of strings and logical operations, in which the logical operations determine which combinations of the strings, if they are found in a set of data, will constitute a match for the pattern in the set of data. In response, the server 110′ may send a data request to the intelligent SSD 130. The intelligent SSD 130 receives the data request, and retrieves the requested data. The intelligent SSD 130 then sends the data to the server 110′. The server CPU 120 processes the data and returns the results.
FIG. 2 is a block diagram of a system which includes a server 110′ containing, and in communication with, an intelligent SSD 130 for performing data queries according to aspects of the present disclosure. The server 110′ and intelligent SSD 130 may be part of a cloud-based computing environment, a network, or a separate subsystem. The server may also contain a server CPU 120, and a data buffer 260, which may be composed of DDR memory.
According to aspects of the present disclosure, the intelligent SSD 130 includes an SSD controller 240 that is designed for data operations or analytics, such as search and analysis of a large volume of unstructured data. The SSD controller 240 can include, for example, a reconfigurable digital signal processing (DSP) core containing arithmetic and logic units and other dedicated hardware units that may be used to perform data analytics, and other operations such as compression, decompression, encryption, and decryption. In one embodiment, the intelligent SSD 130 includes an ARM-based core or any other suitable CPU. These additional cores and circuitry within the silicon of the SSD controller 240 occupy a small area and as a result consume little power. Although these functions could also be performed on a server CPU, transferring data over a data bus and across the interface to the storage node requires a significant amount of power. By designing and/or integrating the silicon of the SSD controller 240 to perform the desired functions, their execution can be made significantly more power-efficient. The intelligent SSD 130 may include an SSD controller 240 and a flash memory 150.
In one embodiment, the SSD controller 240 performs querying of data. For example, a Map-Reduce job may be composed of a request to find a text word, and/or the number of occurrences of that text word in the storage nodes in the server. According to aspects of the present disclosure, instead of reading contents of the storage node into the server CPU and counting the number of matches, the task can be computed locally within the storage node. The server 110′ may be configured to receive queries. When the server (also referred to as the data node) receives a query, the server passes the query to the storage nodes in the server. Each of these storage nodes, which may be SSDs, may then process the query and return the results to the server, which may compile them. While this process is illustrated with reference to a query, described by Map-Reduce functions, a similar process enables data analytics, machine learning algorithms, and other such operations to be performed on the SSD controller 240.
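As an illustrative sketch of this division of labor (not the claimed implementation), the map step can run locally on each storage node and the reduce step on the server; only the small per-node results cross the host interface. The node contents below are hypothetical stand-ins for flash-resident data.

```python
# Word-count example: map runs inside each storage node, reduce on the server.

def map_count(node_data: str, word: str) -> int:
    # Executed locally within a storage node: only a small integer
    # is returned to the server, not the raw data.
    return node_data.split().count(word)

def reduce_counts(partial_counts) -> int:
    # Executed on the server: compile the per-node results.
    return sum(partial_counts)

nodes = ["the quick brown fox", "the lazy dog", "the end"]
total = reduce_counts(map_count(d, "the") for d in nodes)
print(total)  # prints 3
```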
A query may include pattern matching, word count, or occurrence counting. In both pattern matching and occurrence counting, the data are searched for matches to one or more specified patterns; in pattern matching the matching data are returned, whereas in occurrence counting only the number of matches is returned. In addition to pattern matching, word count, and occurrence count, the SSD controller 240 may run a Java™ engine. The ability to run a Java™ engine on the SSD controller 240 may enable the SSD controller 240 to participate in a Hadoop™ system and to execute map and reduce tasks. It may also enable the storage node to perform more complex operations in response to communications supported by standard storage node interfaces. The ability to run Java™ requires a higher-performance CPU and may require an operating system. The unused portion of the SSD controller 240 may be used for running the operating system and Java™ for high-level operation. Other operations, such as counting the number of occurrences of a string in the SSD data, for which high-speed execution at low power consumption is important, may be performed by purpose-designed processing elements or by a DSP core in the SSD controller 240.
FIG. 3 illustrates a flow diagram of an exemplary method for performing data queries, which may be described by Map-Reduce functions, pursuant to aspects of the present disclosure. Specifically, the system of FIG. 2 receives a query 302 for data. At act 304, the server determines whether the query will be processed in the server or whether it will be passed to the SSD. If the system performs the query in the server, then at act 306, the server sends a data request to the SSD. The server receives the requested data from the SSD at act 308 and processes the query in the server at act 310. Finally, the server returns the query results at act 312.
If the system performs the query in the SSD, then at act 314, the server passes the query to the SSD. The SSD processes the query at act 316 and passes the query results to the server at act 318. Finally, the server returns the query results at act 320. While this process is illustrated with reference to a query, a similar process enables data analytics, pattern matching and searching, and other such operations to be performed on the SSD controller 240.
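A minimal sketch of this routing decision follows. The offload predicate (offload when the SSD supports the operation) and the classes are illustrative assumptions; FIG. 3 does not prescribe how the determination at act 304 is made.

```python
# Sketch of the branch at act 304: execute in the server or pass to the SSD.

class IntelligentSSD:
    """Hypothetical stand-in for an intelligent SSD 130."""
    def __init__(self, data: bytes):
        self.data = data
    def supports(self, query) -> bool:
        return query["op"] == "count"        # capability advertised by the SSD
    def read(self) -> bytes:                 # acts 306-308: return raw data
        return self.data
    def process(self, query) -> int:         # acts 314-316: run query in place
        return self.data.count(query["pattern"])

def handle_query(query, ssd: IntelligentSSD):
    if ssd.supports(query):                  # act 304: choose execution site
        return ssd.process(query)            # act 318: only the result moves
    data = ssd.read()                        # otherwise fetch the raw data
    return data.count(query["pattern"])      # act 310: process in the server

ssd = IntelligentSSD(b"abc abc xyz")
print(handle_query({"op": "count", "pattern": b"abc"}, ssd))  # prints 2
```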
FIG. 4 is a block diagram of an SSD controller 240 according to an embodiment of the present invention. The SSD controller 240 includes a host interface 400, a microcontroller 410, a hardware engine 420, a digital signal processor (DSP) or DSP core 430, a processing unit buffer 460, and a flash interface 440. The host interface 400 may be, for example, PCIe, NVMe, SATA, Fibre Channel, SAS, SCSI over PCIe, or Ethernet. There may be other components on the SSD controller 240 which are not illustrated, such as buffers and the like. The SSD controller 240 communicates with the flash memory 150 through the flash interface 440, which may implement a flash channel interface. In one embodiment, there may be a separate hardware engine 420 for each flash channel; the hardware engine 420 may include a hardware-based search engine. As used herein, a search engine is a digital hardware block designed to test a set of data to determine whether and where it contains matches to a specified pattern. Thus, a comparator implemented in hardware is an example of a search engine, and a more complex search engine may include, among its functional elements, a comparator implemented in hardware. The SSD controller 240 may be a single silicon chip, e.g., a system on a chip (SOC). In one embodiment, the DSP core is a reconfigurable DSP. In one embodiment, the microcontroller 410, the processing unit buffer 460, and the flash interface 440 are all integrated onto a single semiconductor chip (e.g., a single silicon chip), along with a hardware engine 420 or a DSP core 430, or both.
As shown in FIG. 4, the SSD controller 240 may receive a query through the host interface 400. In one embodiment, the microcontroller 410 processes the query by using the hardware engine 420 to compare the requested content of the query with the data on the flash memory 150. The hardware engine 420 may be a hardware-based search engine and may work in conjunction with the DSP 430. In one example the DSP may perform encryption and decryption and the hardware engine may perform searching. The microcontroller 410 returns the results of the query to the server 110′ through the host interface 400. In other embodiments, the results of the query or other operation in the intelligent SSD 130 may, instead of being returned to the server CPU 120 or host CPU, be stored in the flash memory 150 or flash memories 150 (FIG. 7), for subsequent retrieval by the server CPU or host CPU.
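Purely as a software model of what the hardware-based search engine does (the actual engine is a digital hardware block, as defined above), a comparator slid across a block of data reports whether and where a pattern matches:

```python
# Software model of a comparator-based search engine: test a set of data
# to determine whether and where it contains matches to a pattern.

def comparator_scan(data: bytes, pattern: bytes):
    hits = []
    for offset in range(len(data) - len(pattern) + 1):
        # In hardware this comparison is a parallel comparator,
        # not a byte-by-byte software loop.
        if data[offset:offset + len(pattern)] == pattern:
            hits.append(offset)
    return hits

print(comparator_scan(b"abracadabra", b"abra"))  # prints [0, 7]
```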
FIG. 5 shows a representation of the storage area of an intelligent SSD 130, in relation with search processes conducted on the storage area, in accordance with an embodiment of the present disclosure. FIG. 5 includes a data storage area 510, which reflects the storage area on the flash memory 150 where data may be located. The server 110′ may send queries regularly based on requests it may receive. Once a query, e.g., a request to search for occurrences of a first pattern, is passed to the intelligent SSD 130, the intelligent SSD 130 begins processing it, searching the flash memory 150. If an additional query, related to a second pattern, arrives before the intelligent SSD 130 has completed processing of the initial query, the intelligent SSD 130 may continue to search for the first pattern while simultaneously or concurrently beginning to search for the second pattern. Once it reaches the end of the flash memory 150, the SSD may then start again at the beginning of the flash memory 150, searching for only the second pattern, until it reaches the point in the flash memory 150 at which it received the second query.
A wide range of capabilities may be implemented in a system constructed according to the embodiment illustrated in FIG. 4. For example, if the hardware engine 420 includes a hardware-based search engine, the system may be capable of high-speed, power-efficient searches of the flash memory 150, for example to count the number of occurrences of a string, pattern, or number. The DSP 430 may be used for functions such as encryption of the data written to the flash memory 150 and decryption of the data read from the flash memory 150, or similarly for compression and decompression at write and read respectively, to increase the effective storage capacity of the intelligent SSD 130. In one embodiment an application layer, running for example in the server CPU 120, may perform encryption or compression, and the DSP 430 may decrypt or decompress the data as needed based on the algorithm used in the application layer. The algorithm used by the application layer may be communicated to the DSP by the host CPU. A DSP may also be used for other functions such as in-line indexing, which may be performed when data is written to the flash memory 150, producing an index which may subsequently be used to accelerate search operations of the flash memory 150.
In one embodiment, the intelligent SSD 130 may perform sophisticated analysis including searches and conditional searches. For example, a server may store a very large number of email messages, and a user may wish to find messages satisfying certain criteria, e.g., messages sent by a particular sender to any recipient at a particular company. The combination of these two criteria may be tested for by a suitably selected pattern, but if the user wishes to narrow the search further, e.g., with an intelligent search of the body of each email to determine whether a particular transaction was discussed, a more sophisticated algorithm than pattern matching may be required. A conditional search may be used in this example, in which criteria related to the body of an email are tested only if the email first meets a first set of criteria, e.g., criteria related to the header of the email; in this case, additional criteria, e.g., a second or third set of criteria related to the body of the email, may be added to the search. A system constructed according to the embodiment illustrated in FIG. 4 is suitable for implementing a broad class of analysis algorithms including searches and conditional searches.
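The following sketch illustrates the conditional structure of such a search; the message format, the criteria, and the sample inbox are hypothetical assumptions used only for illustration.

```python
# Conditional search: the body is examined only if the header first
# satisfies the initial set of criteria.

def conditional_search(messages, sender, company, body_terms):
    hits = []
    for msg in messages:
        header_ok = (msg["from"] == sender and
                     msg["to"].endswith("@" + company))    # first set of criteria
        if not header_ok:
            continue                                       # body is never inspected
        if all(t in msg["body"] for t in body_terms):      # second set of criteria
            hits.append(msg)
    return hits

inbox = [
    {"from": "alice@a.com", "to": "bob@acme.com", "body": "re: the merger terms"},
    {"from": "alice@a.com", "to": "eve@other.org", "body": "lunch on friday?"},
]
print(len(conditional_search(inbox, "alice@a.com", "acme.com", ["merger"])))  # 1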
In another example as illustrated in FIG. 5, upon receipt of a query requiring a first search, referred to as Search A, the process begins reading the data storage area 510 at a storage location 520 and progresses through the data storage area 510. During Search A, while at a storage location 530, a query requiring a second search, referred to as Search B, is received, and the process begins Search B at storage location 530, in addition to continuing Search A at storage location 530. Similarly, Search C is added to the set of simultaneously conducted searches at storage location 540. Search A completes at storage location 550, while Search B and Search C wrap around and continue from storage location 520. Search B then completes at storage location 530, and Search C completes at storage location 540.
As illustrated in FIG. 5, the search process continues previously initiated searches across the data storage area 510, while simultaneously commencing newly requested searches. The data storage area 510 may comprise logical or physical addresses. The addresses may be defined as blocks, pages, sectors, or the like.
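The wrap-around behavior of FIG. 5 can be sketched as follows: the drive makes a circular pass over the storage area, a search that arrives mid-pass keeps scanning past the end, wraps to the beginning, and completes when it returns to the location at which it was requested. The block contents and the arrival schedule are hypothetical.

```python
# Circular scan with searches joining mid-pass (in the spirit of FIG. 5).

def circular_scan(blocks, arrivals):
    # arrivals maps a block index to a (name, pattern) request that is
    # received when the scan reaches that index.
    n = len(blocks)
    active, counts = {}, {}
    for step in range(2 * n):                    # two passes cover all wrap-arounds
        idx = step % n
        if idx in arrivals and arrivals[idx][0] not in counts:
            name, pattern = arrivals[idx]        # a new query arrives mid-scan
            active[name] = [pattern, n]          # it must still visit all n blocks
            counts[name] = 0
        for name in list(active):
            pattern, remaining = active[name]
            counts[name] += blocks[idx].count(pattern)
            if remaining == 1:
                del active[name]                 # wrapped back to its start: done
            else:
                active[name][1] = remaining - 1
    return counts

blocks = [b"aab", b"bca", b"aaa", b"cbc"]
arrivals = {0: ("A", b"a"), 1: ("B", b"b"), 2: ("C", b"c")}
print(circular_scan(blocks, arrivals))  # {'A': 6, 'B': 3, 'C': 3}
```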
FIG. 6 shows a chart of the power usage of existing systems in relation to the power usage of a system constructed according to an embodiment of the present invention, for an exemplary query of a solid state drive. The solid line depicts typical power usage when a server processes a query. The dashed line depicts typical power usage when the query is instead performed by the intelligent SSD 130. Because the server CPU both consumes power at a higher level (level A in FIG. 6) and spends more time processing the query, the energy consumed, i.e., the area under the solid line, is significantly greater than the energy consumed (represented by the cross-hatched area under the dashed line) when the query is processed by the intelligent SSD 130, which consumes power at a lower level (level B in FIG. 6) and completes the processing more quickly.
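The comparison is simple arithmetic: energy is power integrated over time, so a lower power level sustained for a shorter time reduces both factors. The numbers below are illustrative assumptions, not measurements from FIG. 6.

```python
# Back-of-the-envelope version of FIG. 6: energy = power x time.

server_power_w, server_time_s = 80.0, 10.0  # level A, longer processing time
ssd_power_w, ssd_time_s = 15.0, 4.0         # level B, shorter processing time

server_energy_j = server_power_w * server_time_s  # 800 J (area under solid line)
ssd_energy_j = ssd_power_w * ssd_time_s           # 60 J (area under dashed line)
print(server_energy_j / ssd_energy_j)             # ~13x less energy in the SSD
```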
Referring to FIG. 7, in one embodiment of an SSD controller 240, the microcontroller 410 may be connected to a plurality of hardware engines 420, each of which may access a plurality of flash memories 150 through a buffer 460 and a flash interface 440. This structure allows searches to be performed in parallel, resulting in shorter processing times (as illustrated in FIG. 6).
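A sketch of this parallel structure follows: one search task per flash channel, run concurrently, with the results merged at the end. A thread pool stands in for the per-channel hardware engines, and the channel contents are hypothetical.

```python
# Parallel per-channel search in the spirit of FIG. 7.
from concurrent.futures import ThreadPoolExecutor

def channel_search(channel_data: bytes, pattern: bytes) -> int:
    # Stand-in for the hardware engine attached to one flash channel.
    return channel_data.count(pattern)

channels = [b"abc abc", b"xyz abc", b"abc", b"xyz"]
with ThreadPoolExecutor(max_workers=len(channels)) as pool:
    partial = list(pool.map(channel_search, channels, [b"abc"] * len(channels)))
print(sum(partial))  # 4 occurrences found across all channels in parallel
```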
Applications
The disclosed systems and methods have many potential applications, including but not limited to data queries, data analytics, pattern mining, machine learning algorithms, and encryption and decryption. While the illustrations above relate to a query, a similar process may be performed, for example, in relation to data analytics, pattern mining, machine learning algorithms, classification, compression and decompression, and encryption and decryption.
There are many alternatives that can be used with these embodiments. For example, while solid state drives (SSDs) were discussed in examples above, any type of suitable memory device, such as a hard disk drive (HDD), can be used. Further, embodiments of the present invention may be used in a redundant array of independent disks (RAID) to achieve similar advantages in optimizing performance and resource utilization, while taking advantage of efficiencies in RAID parity calculations and the number of physical inputs and outputs (I/Os) performed. Accordingly, these embodiments can be used to make RAID controllers and subsystems more efficient.
Other embodiments are within the scope and spirit of the invention. For example, the functionality described above can be implemented using software, hardware, firmware, hardwiring, or combinations of any of these. One or more computer processors operating in accordance with instructions may implement the functions associated with managing use of cache devices in accordance with the present disclosure as described above. If such is the case, it is within the scope of the present disclosure that such instructions may be stored on one or more non-transitory processor readable storage media (e.g., a magnetic disk, non-volatile random-access memory, phase-change memory or other storage medium). Additionally, modules implementing functions may also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations.
Referring to FIG. 8, in one embodiment, an intelligent SSD 130 containing one or more CPUs is capable of running an operating system on one or more of its CPUs and also of running programs written in high-level languages, e.g., object-oriented languages such as Java™, Python™, or C++. Such programs may describe Map-Reduce functions in a cluster of compute nodes. The cluster may include a master node 810 and a number of slave nodes 820; the master node and the slave nodes may each include or be connected to a storage node 830. Some or all of the storage nodes may be intelligent SSDs 130, and the remainder may be comparable storage nodes such as comparable SSDs or comparable HDDs. The intelligent SSDs 130 may contain hardware accelerated engines to improve the performance of some tasks, for example search.
In one embodiment, the intelligent SSD 130 is part of a data node 820 in the same way as a comparable SSD 125 or other storage node may be part of a data node 820, but it is capable of executing Map-Reduce tasks without transferring the data from the intelligent SSD 130 out to the main CPU. In another embodiment a data node 820′ is an intelligent SSD 130 itself, as illustrated by the data node 820′ of FIG. 8.
These two possibilities are illustrated in FIGS. 9 and 10. Referring to FIG. 9, the data node 820 may have the architecture of a server, with a server CPU 120, DRAM memory, an Ethernet or gigabit Ethernet (GbE) connection for external communications, and a storage node, which, as illustrated, may be an intelligent SSD 130. The intelligent SSD 130 may communicate with the CPU using vendor-specific commands, i.e., commands specified by the vendor of the intelligent SSD 130, which may be executed by the CPU after the installation of appropriate drivers in the data node 820. This embodiment provides value especially in legacy, i.e., existing, installations in which it may be more cost-effective to replace comparable storage nodes with intelligent SSDs 130 when the comparable storage nodes fail, or as performance improvements and power efficiency improvements are needed.
Referring to FIG. 10, in another embodiment, the data node 820′ may be the intelligent SSD 130 itself, having an Ethernet, or gigabit Ethernet (GbE), connection for external communications, a buffer 460, an SSD controller 240, and a flash array. The SSD controller 240 may for example include, as shown in FIG. 4, a microcontroller 410, a hardware engine 420, and a DSP 430. This embodiment may be of particular benefit in new installations, in which the use of an intelligent SSD 130 instead of a data node containing a comparable storage node may provide cost savings, in addition to improvements in performance and reductions in power consumption.
In operation, unstructured data are placed on the data nodes via the master node (FIG. 11), and queries are executed on the data using the Map-Reduce framework (FIG. 12). Referring to FIG. 11, in one embodiment, the process of storing unstructured data on the Hadoop™ distributed file system (HDFS) includes three acts. In an act 1105, the data are chopped into multiple data blocks, which are scattered, in an act 1110, over multiple data nodes 820. Metadata for the data blocks are stored, in an act 1115, on the master node. Unstructured data may include social feeds, geographic information systems (GIS) data, documents, extensible markup language (XML), email, web logs, images, audit logs, data from field sensors, radio frequency identification (RFID) data, data about world events, data about market events, and closed-circuit television (CCTV) footage.
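A minimal sketch of these three acts follows; the block size, node count, and round-robin placement are illustrative assumptions (HDFS block sizes are far larger in practice, e.g., tens of megabytes).

```python
# Sketch of FIG. 11: chop (act 1105), scatter (act 1110), record metadata (act 1115).

BLOCK_SIZE = 8   # bytes, for demonstration only
NUM_NODES = 3

def store(data: bytes):
    blocks = [data[i:i + BLOCK_SIZE]
              for i in range(0, len(data), BLOCK_SIZE)]     # act 1105: chop
    nodes = {n: [] for n in range(NUM_NODES)}
    metadata = []
    for seq, block in enumerate(blocks):
        node = seq % NUM_NODES                              # act 1110: scatter
        nodes[node].append(block)
        metadata.append({"block": seq, "node": node})       # act 1115: master metadata
    return nodes, metadata

nodes, metadata = store(b"unstructured data such as logs and feeds")
print(metadata[:2])  # [{'block': 0, 'node': 0}, {'block': 1, 'node': 1}]
```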
Referring to FIG. 12, the execution of queries involves several acts. In an act 1205, a user submits a query to the system. This may be accomplished through an application interface. In an act 1210, the query is submitted, e.g., by a client machine running the application interface, to the master node 810, which, in an act 1215, uses the map process to assign sub-jobs for executing the query to slave nodes 820. Each slave node 820 to which a sub-job is assigned by the master node 810 may optionally, in an act 1220, further assign sub-jobs to other slave nodes 820. The sub-jobs are executed in parallel in an act 1225, the slave nodes 820 return the results to the master node 810 in an act 1230, and the master node 810, in an act 1235, aggregates the results.
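The following sketch condenses this flow: the master assigns sub-jobs (act 1215), the sub-jobs run in parallel (act 1225), and the master aggregates the returned results (acts 1230-1235). The slave data and the choice of a sum as the aggregation are illustrative assumptions.

```python
# Sketch of FIG. 12: master assigns sub-jobs, slaves run in parallel, master aggregates.
from concurrent.futures import ThreadPoolExecutor

slave_data = {
    "slave1": "to be or not to be",
    "slave2": "be here now",
    "slave3": "let it be",
}

def sub_job(data: str, word: str) -> int:          # runs on one slave node
    return data.split().count(word)

def master_query(word: str) -> int:
    with ThreadPoolExecutor() as pool:              # act 1225: parallel execution
        futures = [pool.submit(sub_job, d, word)    # act 1215: assign sub-jobs
                   for d in slave_data.values()]
        return sum(f.result() for f in futures)     # act 1235: aggregate results

print(master_query("be"))  # prints 4
```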
The present disclosure is not to be limited in scope by the specific embodiments described herein. Indeed, other various embodiments of and modifications to the present disclosure, in addition to those described herein, will be apparent to those of ordinary skill in the art from the foregoing description and accompanying drawings. Thus, such other embodiments and modifications are intended to fall within the scope of the present disclosure. Further, although the present disclosure has been described herein in the context of a particular implementation in a particular environment for a particular purpose, those of ordinary skill in the art will recognize that its usefulness is not limited thereto and that the present disclosure may be beneficially implemented in any number of environments for any number of purposes. Accordingly, the claims set forth below should be construed in view of the full breadth and spirit of the present disclosure as described herein and equivalents thereof.

Claims (2)

What is claimed is:
1. An intelligent solid state drive comprising:
a solid state drive controller;
a solid state drive host interface; and
a flash memory;
the solid state drive controller configured to be in communication with the flash memory, and comprising:
a hardware engine; and
a microcontroller;
the solid state drive configured to:
receive, through the solid state drive host interface, a data query described by Map-Reduce functions;
execute map and reduce tasks, in the solid state drive controller, to process the query; and
return a query result through the solid state drive host interface.
2. An intelligent solid state drive comprising:
a solid state drive controller;
a solid state drive host interface; and
a flash memory;
the solid state drive host interface being selected from the group consisting of: PCIe, NVMe, SATA, Fibre Channel, SAS, SCSI over PCIe, and Ethernet,
the solid state drive controller configured to be in communication with the flash memory, and comprising:
a hardware engine; and
a microcontroller;
the solid state drive configured to perform map and reduce tasks.
US14/465,505 2013-08-30 2014-08-21 System and method for executing map-reduce tasks in a storage device Abandoned US20150067243A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/465,505 US20150067243A1 (en) 2013-08-30 2014-08-21 System and method for executing map-reduce tasks in a storage device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/015,815 US8819335B1 (en) 2013-08-30 2013-08-30 System and method for executing map-reduce tasks in a storage device
US14/465,505 US20150067243A1 (en) 2013-08-30 2014-08-21 System and method for executing map-reduce tasks in a storage device

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/015,815 Continuation US8819335B1 (en) 2013-08-30 2013-08-30 System and method for executing map-reduce tasks in a storage device

Publications (1)

Publication Number Publication Date
US20150067243A1 (en) 2015-03-05

Family

ID=51358748

Family Applications (2)

Application Number Title Priority Date Filing Date
US14/015,815 Active US8819335B1 (en) 2013-08-30 2013-08-30 System and method for executing map-reduce tasks in a storage device
US14/465,505 Abandoned US20150067243A1 (en) 2013-08-30 2014-08-21 System and method for executing map-reduce tasks in a storage device

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US14/015,815 Active US8819335B1 (en) 2013-08-30 2013-08-30 System and method for executing map-reduce tasks in a storage device

Country Status (1)

Country Link
US (2) US8819335B1 (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9298760B1 (en) * 2012-08-03 2016-03-29 Google Inc. Method for shard assignment in a large-scale data processing job
US8819335B1 (en) * 2013-08-30 2014-08-26 NXGN Data, Inc. System and method for executing map-reduce tasks in a storage device
US11966554B2 (en) * 2013-09-16 2024-04-23 Field Squared, Inc. User interface defined document
US9858007B2 (en) * 2013-11-12 2018-01-02 Avago Technologies General Ip (Singapore) Pte. Ltd. Decoupling host and device address maps for a peripheral component interconnect express controller
JP6519111B2 (en) * 2014-07-18 2019-05-29 富士通株式会社 Data processing control method, data processing control program and data processing control device
US9753661B2 (en) 2014-08-06 2017-09-05 Ngd Systems, Inc. Power efficient method and system for executing host data processing tasks during data retention operations in a storage device
US20160094619A1 (en) * 2014-09-26 2016-03-31 Jawad B. Khan Technologies for accelerating compute intensive operations using solid state drives
US9632831B2 (en) * 2014-09-29 2017-04-25 Samsung Electronics Co., Ltd. Distributed real-time computing framework using in-storage processing
US10198293B2 (en) 2014-09-29 2019-02-05 Samsung Electronics Co., Ltd. Distributed real-time computing framework using in-storage processing
US11487779B2 (en) 2014-11-05 2022-11-01 Sap Se Virtual function as query operator
KR102410289B1 (en) 2014-12-18 2022-06-17 인텔 코포레이션 Resistive memory cells including localized filamentary channels, devices including the same, and methods of making the same
EP3238281A4 (en) 2014-12-24 2018-08-08 INTEL Corporation Resistive memory cells and precursors thereof, methods of making the same, and devices including the same
US9766818B2 (en) 2014-12-31 2017-09-19 Samsung Electronics Co., Ltd. Electronic system with learning mechanism and method of operation thereof
US10198185B2 (en) 2014-12-31 2019-02-05 Samsung Electronics Co., Ltd. Computing system with processing and method of operation thereof
US9459933B1 (en) 2015-01-30 2016-10-04 Amazon Technologies, Inc. Contention and selection of controlling work coordinator in a distributed computing environment
KR101656360B1 (en) * 2015-02-12 2016-09-22 소프트온넷(주) Cloud System for supporting auto-scaled Hadoop Distributed Parallel Processing System
JP2016206757A (en) * 2015-04-16 2016-12-08 富士通株式会社 Distributed processing program, distributed processing method, and information processing apparatus
US11042328B2 (en) 2015-06-01 2021-06-22 Samsung Electronics Co., Ltd. Storage apparatus and method for autonomous space compaction
US10489197B2 (en) 2015-06-01 2019-11-26 Samsung Electronics Co., Ltd. Highly efficient inexact computing storage device
US10216418B2 (en) 2015-06-01 2019-02-26 Samsung Electronics Co., Ltd. Storage apparatus and method for autonomous space compaction
US9811379B2 (en) 2015-06-01 2017-11-07 Samsung Electronics Co., Ltd. Highly efficient inexact computing storage device
US10102029B2 (en) * 2015-06-30 2018-10-16 International Business Machines Corporation Extending a map-reduce framework to improve efficiency of multi-cycle map-reduce jobs
WO2017035651A1 (en) * 2015-09-01 2017-03-09 Laszlo Osvath A system and a method for remote health testing and diagnostics
KR102435863B1 (en) 2015-09-16 2022-08-25 삼성전자주식회사 Method of searching a matching key of storage device and server system comprising the same
US10055142B1 (en) 2015-10-13 2018-08-21 Maxlinear Asia Singapore Pte Ltd. Apparatus and method for command processing for a fast block input/output device
CN107924399B (en) * 2015-12-24 2021-08-10 株式会社日立制作所 Search processing system and method for processing search request with unknown data transmission amount to host
US10268521B2 (en) 2016-01-22 2019-04-23 Samsung Electronics Co., Ltd. Electronic system with data exchange mechanism and method of operation thereof
US10003634B2 (en) 2016-05-14 2018-06-19 Richard Banister Multi-threaded download with asynchronous writing
EP3279810B1 (en) 2016-06-14 2019-09-04 Huawei Technologies Co., Ltd. Data access method and related apparatus and system
US10289568B2 (en) 2016-07-07 2019-05-14 Microsoft Technology Licensing, Llc Application-driven storage systems for a computing system
US10176092B2 (en) 2016-09-21 2019-01-08 Ngd Systems, Inc. System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device
US10417733B2 (en) 2017-05-24 2019-09-17 Samsung Electronics Co., Ltd. System and method for machine learning with NVMe-of ethernet SSD chassis with embedded GPU in SSD form factor
US10565014B2 (en) 2017-12-05 2020-02-18 Western Digital Technologies, Inc. Data processing offload using in-storage code execution
US11360829B2 (en) 2018-10-16 2022-06-14 Ngd Systems, Inc. System and method for communicating between computational storage devices
US11080152B2 (en) 2019-05-15 2021-08-03 Western Digital Technologies, Inc. Optimized neural network data organization
US11194769B2 (en) 2020-04-27 2021-12-07 Richard Banister System and method for re-synchronizing a portion of or an entire source database and a target database

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7650331B1 (en) 2004-06-18 2010-01-19 Google Inc. System and method for efficient large-scale data processing
US8560779B2 (en) 2011-05-20 2013-10-15 International Business Machines Corporation I/O performance of data analytic workloads
US8549518B1 (en) * 2011-08-10 2013-10-01 Nutanix, Inc. Method and system for implementing a maintenanece service for managing I/O and storage for virtualization environment
WO2013052555A1 (en) * 2011-10-03 2013-04-11 Kyaw Thu Systems and methods for performing contextual classification using supervised and unsupervised training
JP5831324B2 (en) * 2012-03-27 2015-12-09 富士通株式会社 Control device, control method, program, and distributed processing system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140136764A1 (en) * 2012-11-09 2014-05-15 Sandisk Technologies Inc. Use of bloom filter and improved program algorithm for increased data protection in cam nand memory
US20140136761A1 (en) * 2012-11-09 2014-05-15 Sandisk Technologies Inc. Architectures for data analytics using computational nand memory
US8819335B1 (en) * 2013-08-30 2014-08-26 NXGN Data, Inc. System and method for executing map-reduce tasks in a storage device

Cited By (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10120586B1 (en) 2007-11-16 2018-11-06 Bitmicro, Llc Memory transaction with reduced latency
US10149399B1 (en) 2009-09-04 2018-12-04 Bitmicro Llc Solid state drive with improved enclosure assembly
US10133686B2 (en) 2009-09-07 2018-11-20 Bitmicro Llc Multilevel memory bus system
US9484103B1 (en) 2009-09-14 2016-11-01 Bitmicro Networks, Inc. Electronic storage device
US10082966B1 (en) 2009-09-14 2018-09-25 Bitmicro Llc Electronic storage device
US9372755B1 (en) 2011-10-05 2016-06-21 Bitmicro Networks, Inc. Adaptive power cycle sequences for data recovery
US10180887B1 (en) 2011-10-05 2019-01-15 Bitmicro Llc Adaptive power cycle sequences for data recovery
US9996419B1 (en) 2012-05-18 2018-06-12 Bitmicro Llc Storage system with distributed ECC capability
US9977077B1 (en) 2013-03-14 2018-05-22 Bitmicro Llc Self-test solution for delay locked loops
US9423457B2 (en) 2013-03-14 2016-08-23 Bitmicro Networks, Inc. Self-test solution for delay locked loops
US9501436B1 (en) 2013-03-15 2016-11-22 Bitmicro Networks, Inc. Multi-level message passing descriptor
US10210084B1 (en) 2013-03-15 2019-02-19 Bitmicro Llc Multi-leveled cache management in a hybrid storage system
US10423554B1 (en) 2013-03-15 2019-09-24 Bitmicro Networks, Inc Bus arbitration with routing and failover mechanism
US9842024B1 (en) 2013-03-15 2017-12-12 Bitmicro Networks, Inc. Flash electronic disk with RAID controller
US9858084B2 (en) 2013-03-15 2018-01-02 Bitmicro Networks, Inc. Copying of power-on reset sequencer descriptor from nonvolatile memory to random access memory
US9875205B1 (en) 2013-03-15 2018-01-23 Bitmicro Networks, Inc. Network of memory systems
US9916213B1 (en) 2013-03-15 2018-03-13 Bitmicro Networks, Inc. Bus arbitration with routing and failover mechanism
US9934160B1 (en) 2013-03-15 2018-04-03 Bitmicro Llc Bit-mapped DMA and IOC transfer with dependency table comprising plurality of index fields in the cache for DMA transfer
US9934045B1 (en) 2013-03-15 2018-04-03 Bitmicro Networks, Inc. Embedded system boot from a storage device
US9798688B1 (en) 2013-03-15 2017-10-24 Bitmicro Networks, Inc. Bus arbitration with routing and failover mechanism
US9971524B1 (en) 2013-03-15 2018-05-15 Bitmicro Networks, Inc. Scatter-gather approach for parallel data transfer in a mass storage system
US9734067B1 (en) 2013-03-15 2017-08-15 Bitmicro Networks, Inc. Write buffering
US9720603B1 (en) 2013-03-15 2017-08-01 Bitmicro Networks, Inc. IOC to IOC distributed caching architecture
US10013373B1 (en) 2013-03-15 2018-07-03 Bitmicro Networks, Inc. Multi-level message passing descriptor
US9672178B1 (en) 2013-03-15 2017-06-06 Bitmicro Networks, Inc. Bit-mapped DMA transfer with dependency table configured to monitor status so that a processor is not rendered as a bottleneck in a system
US9430386B2 (en) 2013-03-15 2016-08-30 Bitmicro Networks, Inc. Multi-leveled cache management in a hybrid storage system
US9400617B2 (en) 2013-03-15 2016-07-26 Bitmicro Networks, Inc. Hardware-assisted DMA transfer with dependency table configured to permit-in parallel-data drain from cache without processor intervention when filled or drained
US10042799B1 (en) 2013-03-15 2018-08-07 Bitmicro, Llc Bit-mapped DMA transfer with dependency table configured to monitor status so that a processor is not rendered as a bottleneck in a system
US10120694B2 (en) 2013-03-15 2018-11-06 Bitmicro Networks, Inc. Embedded system boot from a storage device
US10489318B1 (en) 2013-03-15 2019-11-26 Bitmicro Networks, Inc. Scatter-gather approach for parallel data transfer in a mass storage system
US9811461B1 (en) 2014-04-17 2017-11-07 Bitmicro Networks, Inc. Data storage system
US9952991B1 (en) 2014-04-17 2018-04-24 Bitmicro Networks, Inc. Systematic method on queuing of descriptors for multiple flash intelligent DMA engine operation
US10055150B1 (en) 2014-04-17 2018-08-21 Bitmicro Networks, Inc. Writing volatile scattered memory metadata to flash device
US10042792B1 (en) 2014-04-17 2018-08-07 Bitmicro Networks, Inc. Method for transferring and receiving frames across PCI express bus for SSD device
US10078604B1 (en) 2014-04-17 2018-09-18 Bitmicro Networks, Inc. Interrupt coalescing
US10025736B1 (en) 2014-04-17 2018-07-17 Bitmicro Networks, Inc. Exchange message protocol message transmission between two devices
CN104794231A (en) * 2015-05-04 2015-07-22 罗敬宁 Unstructured data processing method and device based on Map/Reduce
CN105426119A (en) * 2015-10-28 2016-03-23 上海新储集成电路有限公司 Storage apparatus and data processing method
WO2017193108A3 (en) * 2016-05-06 2018-07-26 ZeroDB, Inc. Encryption for distributed storage and processing
US10691817B2 (en) 2016-05-06 2020-06-23 ZeroDB, Inc. Encryption for distributed storage and processing
US10359953B2 (en) * 2016-12-16 2019-07-23 Western Digital Technologies, Inc. Method and apparatus for offloading data processing to hybrid storage devices
US10552050B1 (en) 2017-04-07 2020-02-04 Bitmicro Llc Multi-dimensional computer storage system
WO2019033292A1 (en) * 2017-08-16 2019-02-21 Intel Corporation Networked shuffle storage
US11194522B2 (en) * 2017-08-16 2021-12-07 Intel Corporation Networked shuffle storage
US20190324692A1 (en) * 2018-04-24 2019-10-24 Fujitsu Limited Storage device, information processing system, and non-transitory computer-readable storage medium for storing program
US11074012B2 (en) * 2018-04-24 2021-07-27 Fujitsu Limited Storage device, information processing system, and non-transitory computer-readable storage medium for storing program
US10990324B2 (en) 2019-06-25 2021-04-27 Western Digital Technologies, Inc. Storage node processing of predefined data functions
US11281531B2 (en) * 2019-06-25 2022-03-22 Western Digital Technologies, Inc. Serial storage node processing of data functions
US11314593B2 (en) * 2019-06-25 2022-04-26 Western Digital Technologies, Inc. Storage node processing of data functions using overlapping symbols

Also Published As

Publication number Publication date
US8819335B1 (en) 2014-08-26

Similar Documents

Publication Publication Date Title
US8819335B1 (en) System and method for executing map-reduce tasks in a storage device
US10176092B2 (en) System and method for executing data processing tasks using resilient distributed datasets (RDDs) in a storage device
US9021189B2 (en) System and method for performing efficient processing of data stored in a storage node
US9092321B2 (en) System and method for performing efficient searches and queries in a storage node
US10162598B2 (en) Flash optimized columnar data layout and data access algorithms for big data query engines
Ji et al. Big data processing in cloud computing environments
Kang et al. Enabling cost-effective data processing with smart SSD
US11681754B2 (en) Technologies for managing connected data on persistent memory-based systems
Kim et al. Fast, energy efficient scan inside flash memory SSDs
US9336135B1 (en) Systems and methods for performing search and complex pattern matching in a solid state drive
KR20170008153A (en) A heuristic interface for enabling a computer device to utilize data property-based data placement inside a nonvolatile memory device
US11048753B2 (en) Flexible record definitions for semi-structured data in a relational database system
US10296250B2 (en) Method and apparatus for improving performance of sequential logging in a storage device
WO2013155751A1 (en) Concurrent-olap-oriented database query processing method
US11709835B2 (en) Re-ordered processing of read requests
US11797506B2 (en) Database management systems for managing data with data confidence
US20180059985A1 (en) Dynamic management of relationships in distributed object stores
WO2024032526A1 (en) Data retrieval processing method and system
CN105426119A (en) Storage apparatus and data processing method
US11200210B2 (en) Method of efficient backup of distributed file system files with transparent data access
US10394472B1 (en) Classification and identification from raw data within a memory domain
US11249916B2 (en) Single producer single consumer buffering in database systems
US20220050807A1 (en) Prefix probe for cursor operations associated with a key-value database system
US9336313B1 (en) Systems and methods for performing single and multi threaded searches and complex pattern matching in a solid state drive
WO2022222523A1 (en) Log management method and apparatus

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION