CN101135953A - Storage control system and processing node thereof - Google Patents

Storage control system and processing node thereof

Publication number
CN101135953A
Authority
CN
China
Prior art keywords
processing node
data
cache
request
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2007101546662A
Other languages
Chinese (zh)
Other versions
CN100495312C (en
Inventor
安辉
赖守锋
刘介良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New H3C Technologies Co Ltd
Original Assignee
Hangzhou H3C Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou H3C Technologies Co Ltd filed Critical Hangzhou H3C Technologies Co Ltd
Priority to CNB2007101546662A priority Critical patent/CN100495312C/en
Publication of CN101135953A publication Critical patent/CN101135953A/en
Application granted granted Critical
Publication of CN100495312C publication Critical patent/CN100495312C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Memory System Of A Hierarchy Structure (AREA)

Abstract

The storage control system is located between an application host and disk media and processes the data requests sent from the application host to the disk media. It has a fully connected topology composed of multiple processing nodes, with a data exchange channel between every two processing nodes. Each processing node executes the storage control operation corresponding to the type of the received data request, within itself or, through the data exchange channels, with other processing nodes. The invention also discloses a processing node for such a system.

Description

Storage control system and processing node thereof
Technical field
The present invention relates to storage technology, and in particular to a storage control system and to a processing node in a storage control system.
Background
Fig. 1 is a schematic diagram of an existing storage system architecture. As shown in Fig. 1, the storage system comprises a storage control subsystem and at least one disk medium. The storage control subsystem performs read and write operations on the corresponding disk media according to the data requests received from the application host. The application host is an application server or a user host; the disk media store data.
The storage control subsystem is the control core of storage operations. It comprises at least one host channel adapter (HCA, Host Channel Adapter), at least one cache (CACHE) and at least one disk channel adapter (DCA, Disk Channel Adapter). The HCA, CACHE and DCA are the processing nodes of the storage control subsystem. The HCA provides the host channel: it is coupled to the application host and handles the data requests that the host initiates. The DCA provides the disk channel: it is coupled to the disk media and performs read and write operations on them. The CACHE is coupled between the HCA and the DCA and provides a caching channel between the host channel and the disk channel. Because the CACHE reads and writes faster than the disk media, it is normally used to buffer pending data, which improves the response speed of the storage control subsystem toward the application host.
Existing storage control operation is divided into three stages: first, a user interaction stage handled by the HCA; second, a caching stage handled by the CACHE; and third, a disk interaction stage handled by the DCA.
As Fig. 1 shows, the HCAs and the CACHEs are connected in a star topology, as are the DCAs and the CACHEs. Read and write commands can be passed directly between an HCA and a DCA, or relayed through a CACHE. Read and write data, however, must be buffered in a CACHE, so data transferred between an HCA and a DCA always passes through a CACHE.
Referring to Fig. 1, present storage control systems also include a CACHE management unit that manages the resources of each CACHE (Fig. 1 does not show the connections between the CACHE management unit and the other devices). When read or write data is transferred between an HCA and a DCA, the CACHE management unit is asked to allocate cache space for the data request.
As can be seen, the existing storage control subsystem has the following shortcomings:
First, the existing storage control subsystem uses a star topology. When several HCAs access the same DCA simultaneously, the data passes through the CACHE and reaches the DCA over the same link. Because the bandwidth of the link to that DCA is limited, data congestion may occur, which lowers storage control efficiency.
Second, multiple CACHEs require a dedicated CACHE management unit, which allocates cache space for write requests and also handles tasks such as checking whether a read request hits the CACHE. A dedicated CACHE management unit not only raises the cost of the storage control subsystem; it also means that CACHE management can only be done by that unit, so the management approach is inflexible. If the CACHE management unit fails, the many processing nodes in the storage control system can no longer communicate normally, which reduces the reliability of the storage control system.
Third, the processing nodes in the storage control subsystem are single-function. A processing node such as an HCA, CACHE or DCA can perform only one function, and that function is fixed. In practice, HCAs, CACHEs and DCAs are implemented on different hardware boards, and these boards are installed in a carrier such as a blade server. The carrier normally has slots for HCAs, CACHEs and DCAs. Because the physical structures of HCA, CACHE and DCA boards differ, their slots differ as well, and the number and type of slots are designed in advance and fixed. For example, 5 HCA slots, 4 CACHE slots and 5 DCA slots are provided in one carrier according to the expected need, and the corresponding hardware board is plugged into each slot. When the actual need changes, for example the demand for DCAs grows while the demand for HCAs shrinks, the usual approach is to add DCAs in idle DCA slots of the current carrier and to remove the surplus HCAs. If the current carrier has no idle DCA slot, the increased DCA demand can only be met by adding another carrier that holds DCAs. With either way of expanding, the slot type restriction means that a freed HCA slot cannot accept a DCA. Likewise, because the function of a processing node is single and fixed, an existing HCA cannot take over the function of a DCA. Expansion of a storage control subsystem built from existing processing nodes is therefore inflexible.
Fourth, in a storage control subsystem composed of HCAs, CACHEs and DCAs, data to be written must pass in turn through the switching module (SWITCH) at the HCA, the SWITCH at the CACHE and the SWITCH at the DCA, three switching hops in total, before it reaches the DCA. Likewise, data that a DCA reads from the disk media needs three switching hops to reach the HCA. The response speed of storage control suffers as a result.
In addition, because processing nodes such as the HCA, CACHE and DCA are each implemented on an independent hardware board, even a storage control subsystem of minimum capacity needs at least three hardware boards. The circuit scale is correspondingly large, which raises the implementation cost of a minimum-capacity storage control subsystem.
Summary of the invention
In view of this, the present invention provides a storage control system that can increase the bandwidth between the processing nodes in the storage control system.
The storage control system is located between an application host and disk media and handles the data requests that the application host initiates toward the disk media. It is characterized in that the system is a fully connected topology formed by a plurality of processing nodes, with a data exchange channel between every two processing nodes;
each processing node executes the data storage control operation corresponding to the type of the received data request, either within itself or with other processing nodes over the data exchange channels.
Preferably, the processing node further negotiates with the other processing nodes in its system to determine the processing node that serves as the management node;
the processing node serving as the management node manages the CACHE resources in the storage control system.
The CACHE resources managed by the management node are either the CACHE resources in all processing nodes of the system, or the CACHE resources that the processing nodes of the system offer for sharing.
Preferably, the processing nodes comprise primary and standby processing nodes that appear in pairs. When a primary processing node fails, its standby processing node detects the failure and starts its own processing operation.
The present invention also provides a processing node for use in a storage control system, which can increase the bandwidth between the processing nodes of the storage control system it belongs to.
The storage control system is located between an application host and disk media and handles the data requests that the application host initiates toward the disk media. It is characterized in that the processing node comprises a switching module that provides data exchange channels between the processing node and every other processing node.
As the above technical solution shows, applying the present invention increases the bandwidth between the processing nodes in a storage control system. Specifically, it has the following beneficial effects:
First, every two processing nodes can exchange information, so the topology of the storage control system formed by the processing nodes is fully connected. When several processing nodes access one processing node at the same time, each communicates with the accessed node over a different link, so no data congestion arises on a shared link to the accessed node. The fully connected topology thus increases the bandwidth between processing nodes and improves storage control efficiency.
Second, no dedicated CACHE management unit is needed. Any processing node in the fully connected storage control system can become the management node through negotiation. The processing node acting as management node allocates cache space for the whole storage control system or for itself, which avoids using a dedicated CACHE management unit for cache space management and so reduces the cost of the storage control system. In addition, a processing node can manage the CACHE resources within itself independently, or manage them jointly with the management node, which makes management more flexible.
Description of drawings
Fig. 1 is a schematic diagram of an existing storage system architecture.
Fig. 2 is a structural diagram of the storage control system in an embodiment of the invention.
Fig. 3 is a structural diagram of a storage control system with 16 processing nodes in an embodiment of the invention.
Fig. 4 is a diagram of the basic structure of a processing node of the storage control system of the invention.
Fig. 5 is a structural diagram of one processing node in an embodiment of the invention.
Fig. 6 is a structural diagram of the HCC in an embodiment of the invention.
Fig. 7 is a structural diagram of the DCC in an embodiment of the invention.
Fig. 8 is another structural diagram of the DCC in an embodiment of the invention.
Fig. 9 is a structural diagram of the CACHE in an embodiment of the invention.
Fig. 10 is a structural diagram of the processing node in embodiment two of the invention.
Fig. 11 is a structural diagram of the processing node in embodiment three of the invention.
Fig. 12 is a structural diagram of the processing node in embodiment four of the invention.
Fig. 13 is a structural diagram of the processing node in embodiment five of the invention.
Fig. 14 is a structural diagram of the processing node in embodiment six of the invention.
Fig. 15 is a structural diagram of the processing node in embodiment seven of the invention.
Fig. 16 is a structural diagram of the storage control system in an example of the invention.
Detailed description of the embodiments
The present invention is described in detail below with reference to the accompanying drawings and embodiments.
The invention first proposes a storage control system with a fully connected topology. Fig. 2 is a structural diagram of the fully connected storage control system proposed in an embodiment of the invention. As shown in Fig. 2, in this embodiment the fully connected storage control system is a fully connected topology formed by a plurality of processing nodes, with a data exchange channel between every two processing nodes.
Each processing node executes the data storage control operation corresponding to the type of the received data request, either within itself or with other processing nodes over the data exchange channels. The processing node contains a switching module that provides the data exchange channels between the processing node and every other processing node.
A processing node in the fully connected storage control system can be configured to have the function of an HCA, and/or a cache, and/or a DCA. A processing node with the HCA function is coupled to the application host and provides the transmission channel between the processing node and the application host; a processing node with the DCA function is coupled to the disk media and provides the transmission channel between the processing node and the disk media; a processing node with the CACHE function serves as cache in the storage control system. At least one processing node has the HCA function, at least one has the DCA function, and at least one has the cache function. A processing node can have more than one function at the same time.
As Fig. 2 shows, there is a data exchange channel between every two processing nodes, so the processing nodes of the storage control system in this embodiment form a fully connected topology. When several processing nodes with the HCA function access the same processing node with the DCA function, they do not use the same link, which avoids the data congestion that can occur in existing storage control systems. This fully connected topology therefore increases the bandwidth between processing nodes and improves storage control efficiency.
The number of processing nodes in the fully connected storage control system is not limited, and nodes can be added or removed as needed. Fig. 3 shows the structure of a storage control system with 16 processing nodes in an embodiment of the invention.
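As a rough illustration of what full connection implies for the number of links, the following Python sketch enumerates one data exchange channel per pair of processing nodes. The function names `full_mesh_links` and `peers_of` and the integer node numbering are assumptions made for this example; the patent does not prescribe any such representation.

```python
from itertools import combinations
from typing import List, Tuple


def full_mesh_links(n: int) -> List[Tuple[int, int]]:
    """Return one link per unordered pair of the n processing nodes."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return list(combinations(range(n), 2))


def peers_of(node: int, n: int) -> List[int]:
    """Return the nodes that `node` is directly linked to: all the others."""
    return [p for p in range(n) if p != node]


links = full_mesh_links(16)
print(len(links))             # 120 pairwise channels for 16 nodes
print(len(peers_of(0, 16)))   # 15 direct peers per node
```

With n nodes the sketch yields n(n-1)/2 channels, so several nodes that access the same target each use their own channel to it.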
In the fully connected storage control system, CACHE resources can be managed in the following ways:
1. Centralized management: the CACHE resource management function is performed by one processing node that has been determined to be the management node. Whichever processing node needs to cache data must request cache space from this management node. The CACHE management function is implemented as a program loaded in the processing node, which makes full use of the processing node's resources.
The management node is determined by negotiation among the processing nodes in the system. Among the processing nodes, any one can negotiate with the others to determine the processing node that serves as management node, and performs the CACHE management work when it is itself the management node. One way to negotiate is for each processing node to compare its own priority with the priorities of the other processing nodes and to take the processing node with the highest priority as the management node. The CACHE resources that the management node can manage are reported to it by the processing nodes that have the CACHE function.
In practice, if the management node fails, CACHE resources can no longer be allocated, the storage control system cannot work normally, and its reliability is low. To improve reliability, the management node exchanges heartbeat signals with the other processing nodes while it is working normally. When the other processing nodes can no longer receive the management node's heartbeat signal, they determine a new management node, which takes over CACHE resource management. The heartbeat signal is a periodic signal.
The benefit of centralized management is that the CACHE resources of the whole system are shared and managed uniformly. In practice, any of several existing CACHE read/write techniques, such as Mailbox, can be used to read and write the CACHE.
2. Decentralized management: each processing node manages the CACHE resources it owns. A processing node without a CACHE requests cache space from a processing node that has one. Under decentralized management, a processing node can also be set up to manage the CACHE resources in itself and in certain pre-assigned processing nodes. For example, processing node 1 can manage its local CACHE and the CACHE of processing node 2, or only a part of the CACHE of processing node 2. The CACHE resources that each processing node manages are assigned during system initialization.
3. Combined centralized and decentralized management: at system initialization, each processing node that has a CACHE sends the management node of the system the address range of its own CACHE that can be shared, and manages the unshared part of its CACHE itself. The management node manages the shared CACHE resources in all processing nodes.
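The priority-based negotiation and heartbeat-triggered re-election described for centralized management in item 1 can be sketched as follows. This is a minimal sketch under stated assumptions: node names, the numeric priorities, the tie-break by smallest name, and the timeout value are all hypothetical, since the text does not specify how ties are resolved or how long a missing heartbeat is tolerated.

```python
from typing import Dict


def elect_manager(priorities: Dict[str, int]) -> str:
    """Return the node with the highest priority.

    Ties go to the smallest node name (an assumption of this sketch).
    Every node running this on the same table reaches the same result.
    """
    if not priorities:
        raise ValueError("no processing nodes available")
    return min(priorities, key=lambda name: (-priorities[name], name))


def manager_after_check(manager: str, priorities: Dict[str, int],
                        last_heartbeat: float, now: float,
                        timeout: float) -> str:
    """Keep the manager while its heartbeat is fresh, otherwise re-elect."""
    if now - last_heartbeat <= timeout:
        return manager
    # Heartbeat missing for too long: exclude the old manager and re-elect.
    survivors = {n: p for n, p in priorities.items() if n != manager}
    return elect_manager(survivors)


nodes = {"node1": 3, "node2": 7, "node3": 5}
manager = elect_manager(nodes)
print(manager)  # node2
print(manager_after_check(manager, nodes, 0.0, 1.0, timeout=3.0))   # node2
print(manager_after_check(manager, nodes, 0.0, 10.0, timeout=3.0))  # node3
```

Because the rule is deterministic, nodes that share the same priority table agree on the management node without a dedicated CACHE management unit.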
As can be seen, the embodiment of the invention provides a variety of schemes for CACHE management.
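To make the combined scheme (item 3) more concrete, here is a small Python sketch in which each node with a CACHE reports a shared portion to the management node and keeps the rest for itself. The classes `CacheNode` and `ManagementNode`, the block-count bookkeeping, and the first-fit allocation are illustrative assumptions; the patent speaks of shared address ranges and does not define an allocation policy.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class CacheNode:
    name: str
    capacity: int           # total cache blocks in this node
    shared: int             # blocks reported to the management node
    private_used: int = 0   # blocks allocated from the unshared part

    def alloc_private(self, blocks: int) -> bool:
        """Allocate from the part of the CACHE this node manages itself."""
        if self.private_used + blocks > self.capacity - self.shared:
            return False
        self.private_used += blocks
        return True


@dataclass
class ManagementNode:
    free_shared: Dict[str, int] = field(default_factory=dict)

    def register(self, node: CacheNode) -> None:
        """Record the shared portion reported at system initialization."""
        self.free_shared[node.name] = node.shared

    def alloc_shared(self, blocks: int) -> Optional[str]:
        """First-fit allocation from the shared pool; return the owner."""
        for name, free in self.free_shared.items():
            if free >= blocks:
                self.free_shared[name] = free - blocks
                return name
        return None


n1 = CacheNode("n1", capacity=100, shared=40)
mgr = ManagementNode()
mgr.register(n1)
print(mgr.alloc_shared(30))   # n1
print(mgr.alloc_shared(30))   # None: only 10 shared blocks remain
print(n1.alloc_private(60))   # True: fits in the 60 unshared blocks
```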
In practice, if a processing node fails, the work it is responsible for cannot be carried out normally, which also lowers the reliability of the storage control system. To further improve reliability, a standby node is configured for each processing node when the function of the processing node is configured. Primary and standby processing nodes appear in pairs and have the same function. While the primary processing node is working normally, the standby processing node also receives the same data requests as the primary but does not process them. When the primary processing node fails, its standby processing node takes over its work, which safeguards the reliability of the storage control system. While it is working normally, the primary processing node generates a periodic signal and outputs it to its standby processing node; when it is abnormal, it outputs no signal. When the standby processing node does not receive the periodic signal from its primary, it concludes that the primary has failed and starts its own processing function to take over the primary's work.
In addition, the processing nodes in the storage control system support hot plugging. Similarly to a primary failure, a processing node that is about to leave the system sends a removal event message to its standby processing node. On receiving this message, the standby processing node starts its data request processing function and takes over the service. When a processing node is inserted, the system administrator configures the function of that processing node and its standby processing node.
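The standby behaviour in the two paragraphs above can be sketched as a small state machine: the standby sees every request, ignores it while the primary's periodic signal is fresh, and starts processing once the signal stops or a removal event message arrives. The class name `StandbyNode`, the explicit `now` timestamps, and the timeout value are assumptions introduced for this example.

```python
from typing import List


class StandbyNode:
    """Standby that takes over when its primary goes silent or is removed."""

    def __init__(self, timeout: float, start: float = 0.0) -> None:
        self.timeout = timeout
        self.last_signal = start      # time of the last periodic signal
        self.active = False           # True once the standby has taken over
        self.handled: List[str] = []  # requests this standby processed

    def on_primary_signal(self, now: float) -> None:
        self.last_signal = now

    def on_removal_notice(self) -> None:
        # Hot-plug case: the primary announces that it is leaving.
        self.active = True

    def on_request(self, request: str, now: float) -> bool:
        """Process the request only if this standby is (or becomes) active."""
        if not self.active and now - self.last_signal > self.timeout:
            self.active = True        # primary judged to have failed
        if self.active:
            self.handled.append(request)
        return self.active


standby = StandbyNode(timeout=2.0)
standby.on_primary_signal(now=1.0)
print(standby.on_request("write-1", now=2.0))  # False: primary is alive
print(standby.on_request("write-2", now=5.0))  # True: signal timed out
print(standby.handled)                          # ['write-2']
```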
Fig. 4 is a diagram of the basic structure of a processing node that the invention applies to a storage control system. As shown in Fig. 4, the processing node comprises functional modules, a processing module and a switching module.
The processing module controls the functional modules to execute, within the processing node, the storage control operation corresponding to the type of the received data request; and, over the data exchange channels provided by the switching module, executes the data storage control operation corresponding to the type of the received data request between this processing node and other processing nodes. The types of data request comprise read data requests and write data requests directed at the disk media, and read cache requests and write cache requests directed at the cache.
A functional module executes the storage control operation corresponding to its function under the control of the processing module.
The switching module is connected to the switching modules of the other processing nodes and provides the data exchange channels between this processing node and every other processing node.
When a processing node is fitted with different functional modules, it can have the function of an HCA, and/or a cache, and/or a DCA.
Preferably, the processing module has at least two unified interfaces, each of which connects one functional module. Because the unified interfaces all have the same physical structure and protocol, functional modules can be interchanged freely, which makes the function of the processing node configurable. In this case, before exercising control, the processing module must determine the function type of its processing node from the functional modules connected to it; after receiving a data request, it controls the functional modules to execute, under the determined function type, the storage control operation corresponding to the type of the received data request. In addition, the processing module redetermines the function of its processing node when the storage control system is initialized, when a new functional module is inserted, or when a connected functional module is removed. In this way the function of a processing node in the embodiment can be configured as needed: when demand changes, functional modules are added or removed accordingly, or an existing functional module is replaced with one that implements a different function.
Fig. 5 shows the structure of one processing node applied to a storage control system in an embodiment of the invention. As shown in Fig. 5, the processing node comprises: a processing module composed of a processor and a bridge chip, a switching module (SWITCH), a host channel card (HCC, Host Channel Card), a cache module (CACHE) and a disk channel card (DCC, Disk Channel Card). The HCC, DCC and CACHE are functional modules.
The bridge chip in the processing module is the hardware that connects the processor, the SWITCH and the configurable functional modules. It offers the functional modules at least two unified interfaces with a unified specification and protocol definition. Any one of an HCC, a DCC or a CACHE can be plugged into a unified interface, so the function of the processing node is no longer single or fixed. The storage control system can thus be configured flexibly by configuring the functional modules in its processing nodes. The processor in the processing module can trigger the operation that determines the function type of the processing node at regular intervals; alternatively, when the processing module detects that a functional module has been inserted, the processor detects the type of the inserted module and redetermines the function type of the processing node. The processor handles received data requests according to the determined function type.
The SWITCH provides the transmission channels between this processing node and the other, external processing nodes. The processing node sends data requests and data to other processing nodes through the SWITCH, and receives data requests and data from other processing nodes through the SWITCH. The number of processing nodes that the SWITCH connects to depends on the number of processing nodes in the storage control system. Because the system is fully connected, when the storage control system comprises n processing nodes (n being an integer greater than or equal to 1), the SWITCH in each processing node is connected to the other (n-1) processing nodes.
When a processing node is fitted with at least one HCC, it has the HCA function; when it is fitted with at least one DCC, it has the DCA function; when it is fitted with at least one CACHE, it has the cache function.
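The rule above, that the attached functional modules determine the function type of the node, can be sketched in a few lines of Python. The mapping table and the function name `node_functions` are assumptions for illustration; how a real processing module probes its unified interfaces is not described at this level in the text.

```python
from typing import Iterable, Set

# Which node function each kind of functional module provides.
MODULE_TO_FUNCTION = {"HCC": "HCA", "DCC": "DCA", "CACHE": "CACHE"}


def node_functions(attached_modules: Iterable[str]) -> Set[str]:
    """Derive the node's functions from the modules on its unified interfaces."""
    functions = set()
    for module in attached_modules:
        if module not in MODULE_TO_FUNCTION:
            raise ValueError(f"unknown functional module: {module}")
        functions.add(MODULE_TO_FUNCTION[module])
    return functions


# Re-run the determination whenever a module is inserted or removed.
modules = ["HCC", "CACHE", "CACHE"]
print(sorted(node_functions(modules)))   # ['CACHE', 'HCA']
modules.append("DCC")                    # a DCC is hot-inserted
print(sorted(node_functions(modules)))   # ['CACHE', 'DCA', 'HCA']
```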
The HCC is coupled to the application host and provides the transmission channel between the external application host and the processing node. The HCC performs protocol conversion on data requests received from the application host, converting them into a format that the processing module can recognize, and sends them to the processing module. It also performs protocol conversion on data from the processing module, converting it into a format that the application host can recognize, and sends it to the application host.
Fig. 6 is a structural diagram of the HCC in an embodiment of the invention. As shown in Fig. 6, the HCC comprises a user interface, a protocol converter and a node interface. The user interface passes the data requests and data received from the application host, which conform to the user interface protocol type, to the protocol converter. The protocol converter converts them into data requests and data that conform to the protocol type of the unified interface on the bridge chip, and they are then sent to the processing module through the node interface. In the other direction, the node interface passes data sent by the processing module to the protocol converter, which converts it into data conforming to the user interface protocol type; the data is then returned to the application host through the user interface. The user interface protocol type is determined by the protocol type that the application host supports, for example the protocol used by an optical fiber interface, a gigabit network interface or a 10-gigabit network interface. The unified interface protocol type can be the PCI Express protocol.
The DCC provides the transmission channel between the disk media and the processing node. The DCC performs protocol conversion on the data requests and data received from the processing module, converting them into a format that the disk media can recognize, and sends them to the disk media. It also performs protocol conversion on data from the disk media, converting it into a format that the processing module can recognize, and sends it to the processing module.
Fig. 7 is a structural diagram of the DCC in an embodiment of the invention. As shown in Fig. 7, the DCC comprises a disk interface, a protocol converter and a node interface. The node interface passes the data requests and data received from the processing module, which conform to the unified interface protocol type, to the protocol converter. The protocol converter converts them into data requests and data that conform to the disk interface protocol type, and they are then sent to the disk media through the disk interface. In the other direction, the disk interface passes data read from the disk media to the protocol converter, which converts it into data conforming to the unified interface protocol type; the data is then sent to the processing module through the node interface. The disk interface protocol type is determined by the protocol type that the disk media support, for example an optical fiber interface protocol. The unified interface protocol type can be the PCI Express protocol.
The DCC shown in Fig. 7 serves only as a data channel; the processing module parses the data requests from the user, generates the disk read/write commands and sends them to the disk media through the DCC. To lighten the load on the processing module, the DCC structure shown in Fig. 8 can be used instead. Fig. 8 differs from Fig. 7 in that an independent DCC processor is added to the DCC. The DCC processor generates, from the data requests and data received through the node interface, the disk read/write commands for reading and writing the disk media, and sends them to the disk media via the protocol converter and the disk interface; it also passes the read data received from the protocol converter to the node interface. The processing module then only needs to send data requests and data to the DCC. The DCC processor can be implemented with an existing disk controller, for example a Redundant Array of Inexpensive Disks (RAID) card.
Fig. 9 is a structural diagram of the CACHE in an embodiment of the invention. As shown in Fig. 9, the CACHE comprises a node interface, a cache controller and a cache medium. The node interface connects to the processing module, serves as the data transmission channel between the processing module and the cache controller, and passes the data requests received from the processing module to the cache controller. The cache controller reads and writes the cache medium according to the data requests received from the node interface; the cache medium holds the cached data. Typically the node interface and the cache controller are implemented with a field programmable gate array (FPGA, Field Programmable Gate Array) and the cache medium uses double data rate 2 (DDR2) memory, so the CACHE has an FPGA+DDR2 structure. The allocation of cache space in the CACHE is managed by the processing module, and its capacity depends on the size of the processing module's address space. The commands that the node interface receives from the processing module conform to the unified interface protocol type, for example the PCI Express protocol. If the cache controller does not support the unified interface protocol type, a protocol conversion module must be added between the node interface and the cache controller to convert the commands received through the node interface into the protocol format that the cache controller supports.
For a plurality of embodiment, the principle of work when processing node of the present invention is disposed the difference in functionality module is described in detail below.Among each embodiment, HCC adopts the structure shown in Fig. 6, and DCC can adopt the structure shown in Fig. 7 or Fig. 8, and CACHE adopts the structure shown in Fig. 9.Respectively form module because the bridge sheet only is responsible for connection, simple transfer carried out in data request and data, below succinct in order to describe, omitted request of data and data process through the gap bridge sheet.
Embodiment one
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one HCC, at least one DCC and at least one CACHE. With this combination, the function type of the processing node is the (HCC+DCC+CACHE) type. The DCC in this embodiment adopts the structure with a DCC processor shown in Fig. 8.
Referring again to Fig. 5, the processing node in this embodiment comprises a processor, a bridge chip, a SWITCH, at least one HCC, at least one CACHE and at least one DCC. The dotted lines in Fig. 5 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 5, the processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the (HCC+DCC+CACHE) type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the (HCC+DCC+CACHE) type.
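The way a processor could derive the function type from the modules attached to the bridge chip can be sketched as follows. This is only an illustration under stated assumptions: the function name `function_type`, the string labels and the fixed HCC, DCC, CACHE ordering are hypothetical and are not specified by the description.

```python
# Hypothetical sketch: derive a node's function type from its attached modules.
# The labels and their ordering are assumptions made for illustration only.
KNOWN_MODULES = ("HCC", "DCC", "CACHE")


def function_type(attached_modules):
    """Return a label such as 'HCC+DCC+CACHE' for the attached module kinds."""
    kinds = set(attached_modules)
    unknown = kinds - set(KNOWN_MODULES)
    if unknown:
        raise ValueError(f"unknown module kinds: {sorted(unknown)}")
    # Several modules of the same kind do not change the function type.
    return "+".join(kind for kind in KNOWN_MODULES if kind in kinds)
```

For a node with two HCCs, one DCC and one CACHE the sketch yields the (HCC+DCC+CACHE) type of this embodiment; a node with only CACHE modules yields the CACHE type.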
Specifically, the processor first determines the type of the data request and performs the corresponding storage control operation according to the result.
When it determines that a write data request has been received, the processor caches the data to be written in the CACHE and judges whether the processing nodes corresponding to the write data request include this processing node. If so, it directs the local DCC to obtain the data to be written from the cache and write them to the disk medium connected to that DCC. If not, it may go on to judge whether the processing nodes corresponding to the write data request include other processing nodes; if so, it sends the cached data to be written through the SWITCH to the processing node corresponding to the write data request, and otherwise it performs no operation. When the processing node of this embodiment does not include a SWITCH, it only needs to cache the data in the CACHE after receiving the write data request and then directly direct the local DCC to obtain the data to be written from the cache and write them to the corresponding disk medium.
Specifically, the processor first allocates cache space in the CACHE for the data to be written and saves the data in the allocated cache space. According to the logical block address (LBA, Logical Block Address) in the write data request and the correspondence between LBAs and DCCs that the processor itself maintains, it determines the DCC corresponding to the LBA in the write data request and takes the processing node where that DCC resides as the processing node corresponding to the write data request. When the corresponding DCC is in the local processing node, the processor judges that the processing node corresponding to the write data request is this processing node and sends the address of the cache space previously allocated for the write data request received from the HCC to the corresponding DCC in the local processing node; that DCC obtains the data from the CACHE according to the received cache space address and saves them to the disk medium indicated by the LBA in the write data request. When the corresponding DCC is in another processing node, the processor judges that the processing node corresponding to the write data request is another processing node and sends the received write data request, together with the data saved in the cache space, through the SWITCH to the processing node that has the corresponding DCC.
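The LBA-based routing decision for a write data request can be sketched as below. The description only states that the processor keeps a correspondence between LBAs and DCCs; the range-table layout, the names `DccEntry`, `find_dcc` and `route_write`, and the returned action labels are assumptions made for this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DccEntry:
    """One row of a hypothetical LBA-to-DCC correspondence table."""
    lba_start: int  # first LBA served by this DCC (inclusive)
    lba_end: int    # end of the LBA range (exclusive)
    node_id: str    # processing node where the DCC resides
    dcc_id: str


def find_dcc(table, lba):
    """Return the entry whose LBA range contains lba, or None."""
    for entry in table:
        if entry.lba_start <= lba < entry.lba_end:
            return entry
    return None


def route_write(table, local_node, lba):
    """Decide what to do with data already cached for a write request."""
    entry = find_dcc(table, lba)
    if entry is None:
        return ("no_operation", None)
    if entry.node_id == local_node:
        # Local DCC fetches the data from the cache space and writes the disk.
        return ("local_dcc", entry.dcc_id)
    # Request and cached data go through the SWITCH to the owning node.
    return ("forward_via_switch", entry.node_id)
```

A table with one DCC on the local node and one on a remote node is enough to exercise both branches.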
When it determines that a read data request has been received, the processor judges whether the processing node corresponding to the read data request is this processing node. If so, the local DCC reads the data from the disk medium connected to it, the data are cached in the cache of this processing node and then returned through the HCC to the application host that initiated the read data request. Otherwise, the read data request is sent through the SWITCH to the corresponding processing node, the read data returned by that node are received and cached in the cache of this processing node, and then returned through the HCC to the application host that initiated the read data request.
Specifically, when it determines that a read data request has been received, the processor determines the DCC corresponding to the LBA in the read data request according to that LBA and the correspondence between LBAs and DCCs that it maintains. It then judges whether this DCC is in the local processing node, that is, whether the processing node corresponding to the read data request is this processing node. If so, the processor sends the read data request to the corresponding DCC in the local processing node. That DCC reads the data from the disk medium connected to it as indicated by the LBA in the read data request and applies to the processor for cache space. The processor allocates cache space according to the data to be cached and sends the address of the allocated cache space to the DCC; the DCC writes the read data into the allocated cache space and notifies the processor; the processor obtains the data written by the DCC from the allocated cache space and returns them through the HCC to the application host that initiated the read data request. Otherwise, that is, when the corresponding DCC is in another processing node, the processor sends the read data request through the SWITCH to the processing node where the corresponding DCC resides and receives through the SWITCH the data returned by that node. The processor allocates cache space in the CACHE for the received data, saves the received data in the allocated cache space, and then returns the data saved in the CACHE through the HCC to the application host that initiated the read data request.
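The two branches of the read path can be sketched as follows. Disks and the CACHE are modelled as plain dictionaries, the toy allocator simply uses the next integer as the cache address, and `fetch_remote` stands in for the exchange over the SWITCH; all of these names and simplifications are assumptions for illustration.

```python
def cache_and_fetch(cache, data):
    """Allocate a cache address (toy allocator), store data, read it back."""
    address = len(cache)   # next unused integer address; entries are never freed
    cache[address] = data  # DCC (or processor) writes into the allocated space
    return cache[address]  # processor fetches the data to hand to the HCC


def handle_read(lba, owner_node, local_node, local_disk, cache, fetch_remote):
    """Return the data for lba, caching it locally in both branches."""
    if owner_node == local_node:
        data = local_disk[lba]                # local DCC reads its disk medium
    else:
        data = fetch_remote(owner_node, lba)  # request forwarded via the SWITCH
    return cache_and_fetch(cache, data)
```

In both branches the data pass through the local cache before being returned to the application host, which mirrors the text above.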
The above are the storage control operations that the processor directs each functional module to perform after receiving a data request sent by the HCC. A data request may also be sent by another processing node through the SWITCH; in that case, the processor determines the type of the data request received from the SWITCH and accesses the disk medium connected to the local DCC according to the result.
When it determines that a write data request has been received from another processing node, the processor allocates cache space in the CACHE for the received data, saves the received data in the allocated cache space, and notifies the DCC of the write data request and the address of the allocated cache space. The DCC reads the data from the allocated cache space and writes them to the corresponding disk medium according to the LBA in the write data request.
When it determines that a read data request has been received from another processing node, the processor sends the read data request to the DCC. The DCC reads the data from the corresponding disk medium according to the LBA in the read data request and applies to the processor for cache space. The processor allocates cache space according to the data to be cached and sends the address of the allocated cache space to the DCC; the DCC writes the read data into the allocated cache space and notifies the processor. The processor obtains the data written by the DCC from the allocated cache space and returns them through the SWITCH to the processing node that initiated the read data request.
As the scheme of this embodiment shows, a processing node can handle not only local data requests and data but also data requests and data from other processing nodes. When two processing nodes cooperate to complete storage control, the processing node that receives the data request and data from its local HCC and sends them to another processing node is the master node, and the processing node that receives the data request and data from the SWITCH and accesses the disk medium connected to its local DCC is the slave node. A processing node may be a master node, a slave node, or both at the same time.
In practical applications, the processing node in this embodiment can perform not only read and write operations but also operations such as data backup and mirroring. For example, when at least two DCCs are connected to the processing node, the processing node can perform the same data storage operation on the disk media connected to two DCCs to back up or mirror the data. Alternatively, while performing a data storage operation on the disk medium connected to a local DCC, the processing node can at the same time perform the same data storage operation, through the SWITCH, on the disk medium connected to a DCC in another processing node with which it communicates, to back up or mirror the data.
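A minimal sketch of the mirroring idea, assuming each disk medium is modelled as a dictionary keyed by LBA; the function name `mirrored_write` is hypothetical, and a real implementation would also have to handle partial failures, which the description does not discuss.

```python
def mirrored_write(disks, lba, data):
    """Apply the same write to every disk medium in disks; return the copy count."""
    if len(disks) < 2:
        raise ValueError("mirroring needs at least two disk media")
    for disk in disks:
        disk[lba] = data  # identical storage operation on each DCC's disk
    return len(disks)
```

The list of disks may mix media behind local DCCs with media reached through the SWITCH; the sketch does not distinguish the two cases.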
When the processing node is a management node, it also needs to handle cache requests, for example allocating cache space for a write request and checking whether a read request hits in the CACHE.
In this embodiment, the DCC adopts the structure with a DCC processor shown in Fig. 8. The DCC processor can operate on the disk medium and can also apply for cache space for data. When the DCC structure shown in Fig. 7 is adopted, the operations on the disk medium are performed by the processor of the processing node, and the processor allocates cache space for the data directly. In that case the DCC serves only as the data transmission channel between the disk medium and the processor and is responsible for the protocol conversion from the unified interface protocol type to the disk interface protocol type.
As this embodiment shows, a processing node can flexibly expand the storage control system by adding HCCs, CACHEs or DCCs, or by substituting HCCs, CACHEs and DCCs for one another, and thereby meet different needs. For example, when the demand for DCCs increases and the demand for HCCs decreases, the increased DCC demand can be met not only by plugging DCCs into idle unified interfaces but also by replacing surplus HCCs with DCCs.
If a processing node is configured with an HCC, a CACHE and a DCC, it can also perform storage control at minimum capacity on its own. Three existing hardware boards (HCA, CACHE and DCA) are no longer needed to achieve storage control at minimum capacity, which reduces the cost of the storage control system.
In addition, the processor of each node is responsible for allocating cache space for that node. This is equivalent to providing an internal CACHE management module in the processor, which can be implemented as a management program run by the processor, so the resources of the processor are fully used. Because no additional CACHE management module has to be provided outside the processing node to manage the cache space, the cost of the storage control system is reduced further.
Embodiment two
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one HCC. With this combination, the function type of the processing node is the HCC type.
Fig. 10 is a structural diagram of the processing node applied to the storage control system in Embodiment two of the invention. The dotted lines in Fig. 10 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 10, the processing node comprises a processor, a bridge chip, a SWITCH and at least one HCC. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the HCC type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the HCC type.
Specifically, the processor receives data requests from the application host through the HCC, determines the type of each received data request, and performs the corresponding storage control operation according to the result.
When it determines that a write data request has been received, the processor sends the received write data request through the SWITCH to the processing node corresponding to the write data request. Specifically, the processor determines the DCC corresponding to the LBA in the write data request according to that LBA and the correspondence between LBAs and DCCs that it stores, and takes the processing node where that DCC resides as the processing node corresponding to the write data request. In the following embodiments, the operation of determining the processing node corresponding to a write data request is the same as in this embodiment and is not described again.
When it determines that a read data request has been received, the processor sends the received read data request through the SWITCH to the processing node corresponding to the read data request, receives through the SWITCH the read data returned by that node, and returns the received data through the HCC to the application host that initiated the read data request. The processing node corresponding to the read data request is likewise determined from the stored correspondence between LBAs and DCCs: the DCC corresponding to the LBA in the read data request is determined, and the processing node where that DCC resides is taken as the processing node corresponding to the read data request. In the following embodiments, the operation of determining the processing node corresponding to a read data request is the same as in this embodiment and is not described again.
The multiple HCCs configured in the same processing node may be host channel adapters that support different user interface protocol types. Because the HCCs all connect to the bridge chip through the same interface, HCCs that support different user interface protocol types can be plugged into the bridge chip to meet different user-side needs.
The present invention designs the interfaces between the bridge chip and the HCC, DCC and CACHE as a unified interface, so a DCC or a CACHE can be connected wherever an HCC can be connected. The function of the processing node with the HCA function in this embodiment is therefore not fixed: when the number of users decreases and the amount of system cache or the storage capacity needs to increase, the HCC in this embodiment can be replaced with a DCC or a CACHE as required. Of course, more HCCs, DCCs or CACHEs can also be connected as far as the number of bridge chip interfaces allows.
As described in this embodiment, when only HCCs are configured in a processing node, the processing node can implement the existing HCA function: on receiving a data request, it performs the part of the storage control operation that interacts with the user. Precisely because the interfaces on the bridge chip are identical, the function of the processing node in this embodiment is not fixed and can be flexibly expanded as needed at any time.
Embodiment three
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one DCC. With this combination, the function type of the processing node is the DCC type.
Fig. 11 is a structural diagram of the processing node applied to the storage control system in Embodiment three of the invention. The dotted lines in Fig. 11 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 11, the processing node comprises a processor, a bridge chip, a SWITCH and at least one DCC. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the DCC type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the DCC type.
Specifically, the processor receives data requests from other processing nodes through the SWITCH, determines the type of each received data request, and performs the corresponding storage control operation according to the result.
When it determines that a write data request has been received, the processor performs a write operation, through the local DCC, on the disk medium connected to that DCC.
When it determines that a read data request has been received, the processor performs a read operation according to the received read data request, through the local DCC, on the disk medium connected to that DCC, and sends the data read from the disk medium through the SWITCH. The target to which the data are sent may be the processing node that initiated the read data request. If the processing node that initiated the read data request has no CACHE, the data may instead be sent to a processing node with a caching function to be cached, and the processing node that initiated the data request is notified to read the data from the processing node in which they are cached. When the data are sent to a processing node with a caching function, that processing node may be designated by the processing node that initiated the read data request or by this processing node. The processing node that provides the caching function may be the processing node described in Embodiment four below.
As in Embodiment two, the multiple DCCs configured in the same processing node in this embodiment may be disk channel adapters that support different disk interface protocol types. Because the DCCs are all connected to the unified interface of the bridge chip, DCCs that support different disk interface protocol types can be plugged into the bridge chip to meet the needs of different disk media. The DCC in this embodiment has a processor, which can perform the various RAID read/write functions. When a DCC without a processor is adopted, the various RAID read/write functions can be implemented by the processor of the processing node where the DCC resides.
As described in this embodiment, when only DCCs are configured in a processing node, the processing node can implement the existing DCA function: on receiving a data request, it performs the part of the storage control operation that interacts with the disk medium. Precisely because the unified interfaces on the bridge chip are identical, the function of the processing node in this embodiment is not fixed and can be flexibly expanded as needed at any time.
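The choice of where a DCC-type node sends the data it has read can be sketched as below. The description leaves the selection policy open, so the rule used here (prefer the initiator when it has a CACHE, otherwise use a designated caching node if one exists) and the name `choose_reply_target` are assumptions.

```python
def choose_reply_target(initiator, nodes_with_cache, designated_cache_node=None):
    """Return (target_node, must_notify_initiator) for data read by the DCC."""
    if initiator in nodes_with_cache:
        return initiator, False            # initiator can cache the data itself
    if designated_cache_node is not None:
        return designated_cache_node, True # cache elsewhere, then notify
    return initiator, False                # no caching node available
```

When the second element of the result is true, the initiator still has to be told which node holds the cached data so that it can read them from there.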
Embodiment four
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one CACHE. With this combination, the function type of the processing node is the CACHE type.
Fig. 12 is a structural diagram of the processing node applied to the storage control system in Embodiment four of the invention. The dotted lines in Fig. 12 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 12, the processing node comprises a processor, a bridge chip, a SWITCH and at least one CACHE. The processor in this processing node manages all the CACHEs connected to the bridge chip. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the CACHE type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the CACHE type.
Specifically, the processor receives data requests from other processing nodes through the SWITCH, determines the type of each received data request, and performs the corresponding storage control operation according to the result. Because this processing node contains only CACHEs, the types of data request it receives are read-cache requests and write-cache requests.
When it determines that a write-cache request has been received, the processor allocates cache space for the data to be cached and writes the data into the allocated cache space. It also sends the address of the allocated cache space through the SWITCH to the processing node that initiated the write-cache request, or sends the address through the SWITCH to the processing node that will read the cached data. For example, for the processing node with a DCC in Embodiment three, after reading data the processing node with the DCC can send a write-cache request to the processing node in this embodiment, send the read data to it as the data to be cached, and receive the cache space address returned by the processing node in this embodiment. The processing node with the DCC then notifies the processing node that initiated the write data request of the cache space address, so that the latter can obtain the cached data from the processing node in this embodiment. Alternatively, if the processing node in this embodiment also obtains the address of the processing node that initiated the write data request when it receives the write-cache request, it sends the cache space address directly to that processing node, so that the latter can obtain the cached data from the processing node in this embodiment.
When it determines that a read-cache request has been received, the processor reads the data from the corresponding CACHE according to the cache space address carried in the read-cache request and returns the read data to the processing node that initiated the read-cache request.
Multiple CACHEs can be configured on the processing node; the cache capacity of the system can be expanded by adding CACHEs to the processing node.
As described in this embodiment, when only CACHEs are configured in a processing node, the processing node can implement the existing caching function: on receiving a data request, it performs the caching part of the storage control operation. Precisely because the interfaces on the bridge chip are identical, the function of the processing node in this embodiment is not fixed and can be flexibly expanded as needed at any time.
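The write-cache and read-cache handling of a CACHE-type node can be sketched as below. The class name `CacheNode`, the slot-count capacity limit and the integer addresses are assumptions for illustration; the description does not define the address format or what happens when the cache is full.

```python
class CacheNode:
    """Toy model of a CACHE-type processing node (hypothetical interface)."""

    def __init__(self, capacity):
        self.capacity = capacity  # number of cache slots, an assumed unit
        self._store = {}
        self._next_address = 0

    def write_cache(self, data):
        """Allocate cache space, store data, and return the space address."""
        if len(self._store) >= self.capacity:
            raise MemoryError("no free cache space")
        address = self._next_address
        self._next_address += 1
        self._store[address] = data
        return address  # sent back through the SWITCH to the requester

    def read_cache(self, address):
        """Return the data held at the address carried in a read-cache request."""
        return self._store[address]
```

A requesting node keeps the returned address and later presents it in a read-cache request, either itself or by passing it on to the node that needs the data.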
Embodiment five
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one HCC and at least one CACHE. With this combination, the function type of the processing node is the (HCC+CACHE) type.
Fig. 13 is a structural diagram of the processing node applied to the storage control system in Embodiment five of the invention. The dotted lines in Fig. 13 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 13, the processing node comprises a processor, a bridge chip, a SWITCH, at least one HCC and at least one CACHE. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the (HCC+CACHE) type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the (HCC+CACHE) type.
Specifically, the processor receives data requests from the application host through the HCC, determines the type of each received data request, and performs the corresponding storage control operation according to the result.
When it determines that a write data request has been received, the processor caches the data to be written in the CACHE and notifies, through the SWITCH, the processing node corresponding to the write data request so that the latter fetches the data to be written from the CACHE through the SWITCH.
Specifically, the processor allocates cache space in the CACHE for the data to be written and writes the data into the allocated cache space. Because there is no local DCC, the processor sends the received write data request and the address of the allocated cache space through the SWITCH to the processing node corresponding to the write data request. When a read-cache request from another processing node is received through the SWITCH, the processor obtains the data to be written from the CACHE according to the cache space address in the read-cache request and returns them through the SWITCH to the processing node that initiated the read-cache request.
When it determines that a read data request has been received, the processor sends the read data request through the SWITCH to another processing node, which returns the read data through the SWITCH to be cached in the CACHE; the data cached in the CACHE are then returned through the HCC to the application host that initiated the read data request.
Specifically, the processor sends the received read data request through the SWITCH to the processing node corresponding to the read data request and receives through the SWITCH the read data returned by that node. The processor allocates cache space in the CACHE for the read data, saves the read data in the allocated cache space, and then returns the data saved in the CACHE through the HCC to the application host that initiated the read data request.
As this embodiment shows, if an HCC and a CACHE are configured in a processing node, then on receiving a data request the processing node performs the user-interaction and caching parts of the storage control operation. Such a processing node must therefore cooperate with a processing node that has a DCC in order to complete the full three-stage storage control operation; for example, the processing node of this embodiment can cooperate with the processing node of Embodiment one or Embodiment three. The processing node in this embodiment can implement the existing HCA and caching functions. Precisely because the interfaces on the bridge chip are identical, the function of the processing node in this embodiment is not fixed and can be flexibly expanded as needed at any time.
Embodiment six
In this embodiment, the functional modules connected to the processing module through the unified interface comprise at least one DCC and at least one CACHE. With this combination, the function type of the processing node is the (DCC+CACHE) type.
Fig. 14 is a structural diagram of the processing node applied to the storage control system in Embodiment six of the invention. The dotted lines in Fig. 14 schematically show the transmission directions of data requests and data within the processing node. As shown in Fig. 14, the processing node comprises a processor, a bridge chip, a SWITCH, at least one DCC and at least one CACHE. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the (DCC+CACHE) type. After receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the (DCC+CACHE) type.
Specifically, the processor receives data requests from other nodes through the SWITCH, determines the type of each received data request, and performs the corresponding storage control operation according to the result.
When it determines that a write data request has been received, the processor caches the data to be written in the CACHE and notifies the DCC to write the data cached in the CACHE to the disk medium connected to the DCC.
Specifically, the processor allocates cache space in the CACHE for the data to be written and writes the data obtained from the SWITCH into the allocated cache space. According to the LBA in the write data request and the correspondence between LBAs and DCCs that it stores, the processor determines the DCC corresponding to the LBA in the write data request and notifies that DCC of the write data request and the address of the allocated cache space. The DCC reads the data to be written from the CACHE according to the cache space address and writes them to the corresponding disk medium according to the LBA in the write data request. When the processor obtains the data to be written from another processing node through the SWITCH, it may receive data that the other processing node sends on its own initiative, or it may actively fetch the data from the other processing node according to a received address of the cache space that holds the data in that node.
When it determines that a read data request has been received, the processor notifies the DCC to read the data from the disk medium connected to it and cache them in the CACHE; the processor then returns the data cached in the CACHE through the SWITCH to the processing node that initiated the read data request.
Specifically, the processor determines the DCC corresponding to the LBA in the read data request according to that LBA and the correspondence between LBAs and DCCs that it stores, and sends the received read data request to that DCC. The DCC reads the data from the disk medium connected to it as indicated by the LBA in the read data request and sends a write-cache request to the processor. After receiving the write-cache request, the processor allocates cache space according to the data to be cached and sends the address of the allocated cache space to the DCC; the DCC caches the read data in the allocated cache space and notifies the processor. The processor obtains the cached data from the allocated cache space and, since this processing node has no HCC, returns them through the SWITCH to the processing node that initiated the read data request.
As this embodiment shows, if a DCC and a CACHE are configured in a processing node, then on receiving a data request the processing node performs the caching part of the storage control operation and the part that interacts with the disk. Such a processing node must therefore cooperate with a processing node that has an HCC in order to complete the full three-stage storage control operation; for example, the processing node in this embodiment can cooperate with the processing node of Embodiment one, Embodiment two or Embodiment five. The processing node in this embodiment can implement the existing DCA and caching functions. Precisely because the interfaces on the bridge chip are identical, the function of the processing node in this embodiment is not fixed and can be flexibly expanded as needed at any time.
Embodiment seven
In the present embodiment, the functional module that is connected on the processing module by unified interface comprises at least one HCC and at least one DCC.Under this combination, the function type of processing node is (HCC+DCC) type.
Figure 15 is a structural schematic diagram of the processing node applied to the storage control system in embodiment seven of the present invention. The dotted lines in Figure 15 schematically show the direction in which data requests and data travel within the processing node. As shown in Figure 15, the processing node comprises a processor, a bridge chip, a SWITCH, at least one DCC, and at least one HCC. The processor determines from the functional modules connected to the bridge chip that the function type of the current processing node is the (HCC+DCC) type; after receiving a data request, the processor performs the storage control operation that corresponds to the received data request under the (HCC+DCC) type.
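How a processor might derive the function type from the modules attached to the bridge chip can be illustrated with a minimal sketch. The function name and the fixed ordering of module names are assumptions made for the example; the patent does not prescribe a particular procedure.

```python
# Hypothetical helper: the ordering below is only a naming convention chosen
# so that results read like the type names used in the description.
MODULE_ORDER = ("HCC", "DCC", "CACHE")


def function_type(attached_modules):
    """Derive a node's function type from the modules on its unified interfaces."""
    attached = set(attached_modules)
    present = [m for m in MODULE_ORDER if m in attached]
    if not present:
        raise ValueError("a processing node needs at least one functional module")
    return "+".join(present)


print(function_type(["DCC", "HCC"]))           # HCC+DCC
print(function_type(["HCC", "HCC", "CACHE"]))  # HCC+CACHE
```

Because the type is computed from whatever is plugged in, adding or swapping a module changes the result without any other change to the node, which is the flexibility the unified interfaces are meant to provide.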
The difference from embodiment one is that this processing node has no CACHE. While data is being read from or written to the DCC, the data to be cached is therefore cached in another processing node and read back from that caching node when needed. The caching node can be the processing node shown in embodiment four. The function of the processor is similar to that of the processor in embodiment one, except that the processor in this embodiment is not responsible for allocating cache space addresses, that is, it does not need to manage a CACHE.
Specifically, when the processor determines that a write data request has been received, it carries the data to be written in a write cache request and sends it through the SWITCH to a processing node that provides the caching function. It receives, through the SWITCH, the address of the cache space returned by that processing node, carries the received address in the write data request, and sends the write data request to the corresponding DCC. When the DCC that received the write data request processes it, the DCC notifies the processor. The processor then sends a read cache request carrying the cache space address through the SWITCH to the caching processing node, receives the cached data returned through the SWITCH, and sends the data to the DCC. The DCC can then write the received data to the disk medium corresponding to the LBA in the write data request.
When the processor determines that a read data request has been received, it sends the read data request to the DCC. The DCC performs a read operation on the disk medium corresponding to the LBA in the read data request and sends the data it has read to the processor. The processor carries the read data in a write cache request, sends it through the SWITCH to the processing node that provides caching, and receives through the SWITCH the address of the cache space returned by that processing node. Then, when the read data is to be returned to the user, the processor sends a read cache request carrying the cache space address through the SWITCH to the caching processing node, receives the cached data returned through the SWITCH, and returns the received data through the HCC to the application host that initiated the read data request.
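The write and read sequences of an (HCC+DCC) node that borrows cache space from another node can be sketched as follows. The sketch is a simplified model under stated assumptions: `CacheNode` and `HccDccNode` are hypothetical names, the SWITCH is reduced to direct method calls, and the disk medium is a dictionary keyed by LBA.

```python
class CacheNode:
    """Stand-in for a remote processing node that provides caching."""

    def __init__(self):
        self._slots = {}
        self._next_addr = 0

    def write_cache(self, data):
        """Store data and return the address of the allocated cache space."""
        addr = self._next_addr
        self._next_addr += 1
        self._slots[addr] = data
        return addr

    def read_cache(self, addr):
        """Return the data cached at the given cache space address."""
        return self._slots[addr]


class HccDccNode:
    """Toy (HCC+DCC) node: it has no local CACHE and buffers data remotely."""

    def __init__(self, cache_node, disk):
        self.cache_node = cache_node  # reached through the SWITCH in the text
        self.disk = disk              # lba -> data, stands in for the disk medium

    def write(self, lba, data):
        addr = self.cache_node.write_cache(data)     # write cache request
        # The write data request now carries addr and goes to the DCC; when the
        # DCC processes it, the processor fetches the buffered data back.
        buffered = self.cache_node.read_cache(addr)  # read cache request
        self.disk[lba] = buffered                    # DCC writes to the disk

    def read(self, lba):
        data = self.disk[lba]                        # DCC reads from the disk
        addr = self.cache_node.write_cache(data)     # buffer in the remote cache
        return self.cache_node.read_cache(addr)      # fetched when replying via HCC


node = HccDccNode(CacheNode(), disk={})
node.write(7, b"payload")
print(node.read(7))  # b'payload'
```

Every `write_cache` and `read_cache` call in the sketch corresponds to one exchange through the SWITCH in the description, which is why this node type depends on a cache-providing peer being reachable.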
As in the previous embodiments, the bridge chip in the processing node of this embodiment also has unified interfaces, so the function of the processing node is not fixed and can be flexibly extended at any time as required.
In the seven embodiments above, the processor of a processing node manages the CACHE of the processing node in which it resides. When a processing node has no CACHE, it requests cache space from another processing node. The processing node asked for cache space can be the management node of the system, in which case the management node allocates cache space in the system to the requester. It can also be a cache-providing processing node configured in advance for the processing node that has no CACHE, in which case that node allocates cache space in its own CACHE to the requesting processing node.
If data requests from users cannot be answered in time, a CACHE can also be used to cache the data requests. The CACHE that caches data requests and the CACHE that caches data can be realized by the same physical entity; alternatively, two CACHEs can be provided, one caching data requests and the other caching data.
The following is an example of a storage control system formed from the processing nodes described in the above embodiments. Figure 16 is a structural schematic diagram of the storage control system in one example of the present invention. As shown in Figure 16, the storage control system comprises 4 processing nodes, and every two processing nodes can communicate directly. Processing node 1 and processing node 2 are processing nodes of the (HCC+CACHE) type, each exchanging information with the application host connected to it. Processing node 3 is a processing node of the (HCC+DCC+CACHE) type and exchanges information with the application host and the disk medium connected to it. Processing node 4 is a processing node of the DCC type and exchanges information with the disk medium connected to it. As Figure 16 shows, processing node 1 can complete the storage control operation in combination with processing node 3 or processing node 4; processing node 2 can complete the storage control operation in combination with processing node 3 or processing node 4; processing node 3 can complete the storage control operation in combination with processing node 4 and, of course, can also complete it on its own.
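The combinations listed for Figure 16 can be checked mechanically. The sketch below assumes that a storage control operation is complete when the cooperating nodes together provide an HCC, a CACHE, and a DCC; the function name and data layout are illustrative only.

```python
from itertools import combinations

# Functional modules of the four processing nodes in the Figure 16 example.
NODES = {
    1: {"HCC", "CACHE"},
    2: {"HCC", "CACHE"},
    3: {"HCC", "DCC", "CACHE"},
    4: {"DCC"},
}
# Assumption: the three stages need one module of each kind.
STAGES = {"HCC", "CACHE", "DCC"}


def complete_groups(nodes, max_size=2):
    """List single nodes and pairs whose combined modules cover all stages."""
    groups = []
    for size in range(1, max_size + 1):
        for group in combinations(sorted(nodes), size):
            modules = set().union(*(nodes[n] for n in group))
            if modules >= STAGES:
                groups.append(group)
    return groups


print(complete_groups(NODES))
# [(3,), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]
```

The result matches the text: node 3 alone, nodes 1 or 2 paired with node 3 or node 4, and node 3 paired with node 4. Nodes 1 and 2 together do not qualify because neither has a DCC.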
In this storage control system every two processing nodes are directly connected through the SWITCH, forming a fully connected topology. When processing nodes 1, 2, and 3 access processing node 4 at the same time, each communicates directly over its own link to processing node 4 rather than over a common link, which avoids the data congestion that occurs in the prior art when a link is shared.
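A small sketch makes the link count and the absence of shared links concrete. It models each link as an unordered pair of node numbers; the helper names are hypothetical.

```python
from itertools import combinations


def full_mesh_links(node_ids):
    """Return one dedicated link per unordered pair of processing nodes."""
    return {frozenset(pair) for pair in combinations(node_ids, 2)}


def links_to(target, sources):
    """Links used when every source node accesses the target at the same time."""
    return [frozenset((src, target)) for src in sources]


links = full_mesh_links([1, 2, 3, 4])
used = links_to(4, [1, 2, 3])

print(len(links))                        # 6
print(all(link in links for link in used))  # True
print(len(set(used)) == len(used))       # True: no link is shared
```

With n processing nodes a full mesh has n(n-1)/2 links, so 4 nodes need 6; the three simultaneous accesses to node 4 each use a different one of them.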
Under centralized management, suppose processing node 1 is elected as the management node at system initialization; processing nodes 2, 3, and 4 then apply to processing node 1 for cache space. Under decentralized management, suppose processing node 4 is preconfigured to apply to processing node 2 for cache space; processing nodes 1, 2, and 3 then manage their own CACHE resources independently, and processing node 4 applies to processing node 2 for cache space. Under combined centralized and decentralized management, processing nodes 2 and 3 share part of their CACHE, and the shared part is managed by processing node 1 acting as the management node; processing nodes 2 and 3 each manage their unshared CACHE resources independently, and processing nodes 1 and 4 can apply to processing node 1 for cache space. Processing nodes 2 and 3 can, of course, also apply to processing node 1 for cache space once their own CACHE has been fully allocated. The storage control system therefore needs no dedicated CACHE management unit outside the processing nodes to allocate cache space for data to be cached; the allocation is performed by the processors in the processing nodes, which makes full use of processor resources and reduces the cost of the storage control system.
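The three management modes differ only in which node a requester asks for cache space. The following sketch captures that decision; the function name, the mode strings, and the parameters are assumptions introduced for illustration and do not appear in the patent.

```python
def cache_provider(requester, mode, *, manager=None, local_cache=frozenset(),
                   preconfigured=None, exhausted=frozenset()):
    """Return the node that allocates cache space for `requester`.

    local_cache:   nodes that manage CACHE resources of their own
    preconfigured: requester -> provider mapping for nodes without a CACHE
    exhausted:     nodes whose own CACHE has been fully allocated
    """
    preconfigured = preconfigured or {}
    if mode == "centralized":
        return manager
    if mode == "decentralized":
        if requester in local_cache:
            return requester
        return preconfigured[requester]
    if mode == "combined":
        if requester in local_cache and requester not in exhausted:
            return requester
        return manager
    raise ValueError(f"unknown mode: {mode}")


# The Figure 16 example under each mode.
print(cache_provider(4, "centralized", manager=1))                  # 1
print(cache_provider(4, "decentralized", local_cache={1, 2, 3},
                     preconfigured={4: 2}))                         # 2
print(cache_provider(2, "combined", manager=1, local_cache={2, 3})) # 2
print(cache_provider(2, "combined", manager=1, local_cache={2, 3},
                     exhausted={2}))                                # 1
```

In every case the answer is another processing node's processor, which is the point of the passage: no separate CACHE management unit is needed.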
When actual demand changes, for example when the demand for DCCs increases and the demand for HCCs decreases, a DCC can be added to any one or more of processing nodes 1 to 4 to provide more channels for interacting with the disk media. Alternatively, the HCC in processing node 3 can be replaced with a DCC to the same effect. An HCC in processing node 1 or processing node 2 can also be replaced with a DCC, changing that node to the (HCC+DCC+CACHE) function type and meeting the increased demand for DCCs. The storage control system formed from the processing nodes provided by the embodiments of the invention is thus very flexible to extend. In practical applications, the functions of some processing nodes in the storage control system can also be fixed while the functions of the others are configurable. Such a partly fixed, partly variable configuration can likewise complete the storage control operation, and when demand changes, the change can be met by reconfiguring the processing nodes whose functions are configurable. When the function of a processing node changes, the topology of the storage control system remains fully connected, needs no modification, and continues to work normally.
Processing node 3 in Figure 16 adopts the processing node structure shown in embodiment one and can complete all 3 stages of the storage control operation on its own. Data to be stored reaches the disk medium without passing through the SWITCH, which saves data exchange time. When two processing nodes cooperate to complete the storage control operation, only one exchange through the SWITCH is needed, which still takes less time than the 3 exchanges performed in an existing storage control system.
As can be seen from the above, in the storage control scheme provided by the present invention, when several processing nodes access one processing node at the same time, they communicate with the accessed node over different links, so no data congestion arises on the links leading to the accessed node. This increases the bandwidth between processing nodes and improves storage control efficiency. The scheme also improves the flexibility with which the storage control system can be extended, raises the response speed of storage control, and lowers the cost of the storage control system.
In summary, the above are merely preferred embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (17)

1. A storage control system, located between an application host and a disk medium and used to process data requests initiated by the application host to the disk medium, characterized in that the system comprises a plurality of processing nodes forming a fully connected topology, wherein a data interaction channel exists between every two processing nodes;
each processing node is configured to perform, within itself or with other processing nodes through the data interaction channels, the data storage control operation corresponding to the type of the received data request.
2. The system as claimed in claim 1, characterized in that the processing node comprises a functional module, a processing module, and a switching module;
the processing module is configured to control the functional module to perform, in the processing node in which it resides, the storage control operation corresponding to the type of the received data request, and to perform, through the data interaction channel provided by the switching module, the data storage control operation corresponding to the type of the received data request between that processing node and other processing nodes;
the functional module is configured to perform the storage control operation corresponding to its function under the control of the processing module;
the switching module is configured to provide data interaction channels between the processing node in which it resides and all other processing nodes.
3. The system as claimed in claim 2, characterized in that the processing module has unified interfaces, each unified interface connecting one functional module;
the processing module is further configured to determine the function type of the processing node in which it resides according to the functional modules connected to it and, under the determined function type, to control the functional modules to perform, in that processing node, the storage control operation corresponding to the type of the received data request, and to perform, through the data interaction channel provided by the switching module, the data storage control operation corresponding to the type of the received data request between that processing node and other processing nodes.
4. The system as claimed in claim 3, characterized in that the functional module comprises a host channel adapter (HCC), coupled to the application host and configured to provide a data interaction channel to the application host;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the HCC type; receives a data request from the application host through the HCC; and, through the data interaction channel provided by the switching module, performs the cache operation and the disk operation corresponding to the data request between the processing node in which it resides and the processing node corresponding to the data request.
5. The system as claimed in claim 3, characterized in that the functional module comprises a disk channel adapter (DCC), coupled to the disk medium and configured to provide a data interaction channel to the disk medium;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the DCC type; receives a data request from another processing node through the data interaction channel provided by the switching module; and performs the disk operation corresponding to the data request in the processing node in which it resides.
6. The system as claimed in claim 3, characterized in that the functional module comprises a cache (CACHE) configured to cache data;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the cache type; receives a data request from another processing node through the data interaction channel provided by the switching module; and performs the cache operation corresponding to the data request in the processing node in which it resides.
7. The system as claimed in claim 3, characterized in that the functional module comprises an HCC and a CACHE;
the HCC is coupled to the application host and configured to provide a data interaction channel to the application host;
the CACHE is configured to cache data;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the HCC+CACHE type; receives a data request from the application host through the HCC; performs the cache operation corresponding to the data request in the processing node in which it resides; and, through the data interaction channel provided by the switching module, performs the disk operation corresponding to the data request between the processing node in which it resides and the processing node corresponding to the data request.
8. The system as claimed in claim 3, characterized in that the functional module comprises a DCC and a CACHE;
the DCC is coupled to the disk medium and configured to provide a data interaction channel to the disk medium;
the CACHE is configured to cache data;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the DCC+CACHE type; receives a data request from another processing node through the switching module; and performs the cache operation and the disk operation corresponding to the data request in the processing node in which it resides.
9. The system as claimed in claim 3, characterized in that the functional module comprises an HCC and a DCC;
the HCC is coupled to the application host and configured to provide a data interaction channel to the application host;
the DCC is coupled to the disk medium and configured to provide a data interaction channel to the disk medium;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the HCC+DCC type; receives a data request from the application host through the HCC; and performs the disk operation corresponding to the data request in the processing node in which it resides.
10. The system as claimed in claim 9, the processing module being further configured to perform, through the data interaction channel provided by the switching module, the cache operation corresponding to the data request between the processing node in which it resides and a processing node in the system that provides a CACHE.
11. The system as claimed in claim 3, characterized in that the functional module comprises an HCC, a DCC, and a CACHE;
the HCC is coupled to the application host and configured to provide a data interaction channel to the application host;
the DCC is coupled to the disk medium and configured to provide a data interaction channel to the disk medium;
the CACHE is configured to cache data;
the processing module determines, according to the functional modules connected to it, that the function type of the processing node in which it resides is the HCC+DCC+CACHE type; receives a data request from the application host through the HCC; and performs the cache operation and the disk operation corresponding to the data request in the processing node in which it resides.
12. The system as claimed in claim 11, the processing module being further configured to perform, through the data interaction channel provided by the switching module, the disk operation corresponding to the data request between the processing node in which it resides and the processing node corresponding to the data request.
13. The system as claimed in claim 9, 10, 11, or 12, characterized in that the processing module is further configured to receive a data request from another processing node through the data interaction channel provided by the switching module.
14. The system as claimed in claim 1, characterized in that the processing node is further configured to negotiate with the other processing nodes in the system to determine the processing node that serves as the management node;
the processing node serving as the management node manages the CACHE resources in the storage control system.
15. The system as claimed in claim 14, characterized in that the CACHE resources managed by the management node are the CACHE resources of all processing nodes in the system, or are the CACHE resources that the processing nodes in the system provide for sharing.
16. The system as claimed in claim 1, characterized in that the processing nodes comprise primary processing nodes and standby processing nodes arranged in pairs; when a primary processing node fails, its corresponding standby processing node detects the failure event and starts its own processing operation.
17. A processing node in a storage control system, the storage control system being located between an application host and a disk medium and used to process data requests initiated by the application host to the disk medium, characterized in that the processing node comprises a switching module configured to provide data interaction channels between the processing node in which it resides and all other processing nodes.
CNB2007101546662A 2007-08-17 2007-09-21 Storage control system and processing node thereof Expired - Fee Related CN100495312C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007101546662A CN100495312C (en) 2007-08-17 2007-09-21 Storage control system and processing node thereof

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN200710120445 2007-08-17
CN200710120445.3 2007-08-17
CNB2007101546662A CN100495312C (en) 2007-08-17 2007-09-21 Storage control system and processing node thereof

Publications (2)

Publication Number Publication Date
CN101135953A true CN101135953A (en) 2008-03-05
CN100495312C CN100495312C (en) 2009-06-03

Family

ID=39160066

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007101546662A Expired - Fee Related CN100495312C (en) 2007-08-17 2007-09-21 Storage control system and processing node thereof

Country Status (1)

Country Link
CN (1) CN100495312C (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014121588A1 (en) * 2013-02-06 2014-08-14 华为技术有限公司 Data access system, memory sharing device, and data access method
CN105893156A (en) * 2016-04-20 2016-08-24 北京云宏信达信息科技有限公司 Request processing method of storage and computing system and storage and computing system
CN106161492A (en) * 2015-03-24 2016-11-23 华为技术有限公司 A kind of decision data method, Apparatus and system
CN112416259A (en) * 2020-12-04 2021-02-26 海光信息技术股份有限公司 Data access method and data access device
CN113704160A (en) * 2021-08-17 2021-11-26 深圳市安信达存储技术有限公司 Data storage method and system based on Feiteng processor and storage mainboard
CN113704161A (en) * 2021-08-17 2021-11-26 深圳市安信达存储技术有限公司 Data storage method and system based on Loongson processor and storage mainboard
CN117633119A (en) * 2024-01-25 2024-03-01 平凯星辰(北京)科技有限公司 Data synchronization method, node, equipment and storage medium of data synchronization system

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014121588A1 (en) * 2013-02-06 2014-08-14 华为技术有限公司 Data access system, memory sharing device, and data access method
US9594682B2 (en) 2013-02-06 2017-03-14 Huawei Technologies Co., Ltd. Data access system, memory sharing device, and data reading method
CN106161492A (en) * 2015-03-24 2016-11-23 华为技术有限公司 A kind of decision data method, Apparatus and system
CN106161492B (en) * 2015-03-24 2019-05-10 华为技术有限公司 A kind of decision data method, apparatus and system
US10484248B2 (en) 2015-03-24 2019-11-19 Huawei Technologies Co., Ltd. Data decision method, apparatus and system
CN105893156A (en) * 2016-04-20 2016-08-24 北京云宏信达信息科技有限公司 Request processing method of storage and computing system and storage and computing system
CN105893156B (en) * 2016-04-20 2018-02-09 北京云宏信达信息科技有限公司 Store the request processing method and storage computing system in computing system
CN112416259A (en) * 2020-12-04 2021-02-26 海光信息技术股份有限公司 Data access method and data access device
CN112416259B (en) * 2020-12-04 2022-09-13 海光信息技术股份有限公司 Data access method and data access device
CN113704160A (en) * 2021-08-17 2021-11-26 深圳市安信达存储技术有限公司 Data storage method and system based on Feiteng processor and storage mainboard
CN113704161A (en) * 2021-08-17 2021-11-26 深圳市安信达存储技术有限公司 Data storage method and system based on Loongson processor and storage mainboard
CN117633119A (en) * 2024-01-25 2024-03-01 平凯星辰(北京)科技有限公司 Data synchronization method, node, equipment and storage medium of data synchronization system

Also Published As

Publication number Publication date
CN100495312C (en) 2009-06-03

Similar Documents

Publication Publication Date Title
CN100492274C (en) Storage control system and process node thereof
CN100495312C (en) Storage control system and processing node thereof
US6976116B2 (en) Disk subsystem
US6654831B1 (en) Using multiple controllers together to create data spans
JP4338068B2 (en) Storage system
US7389367B2 (en) Method of managing I/O interface modules in a computer system
US7216148B2 (en) Storage system having a plurality of controllers
JP4508612B2 (en) Cluster storage system and management method thereof
CN100562842C (en) Storage system
CN1723434A (en) Apparatus and method for a scalable network attach storage system
JP4786255B2 (en) Storage system and storage control method
US20110145452A1 (en) Methods and apparatus for distribution of raid storage management over a sas domain
JP2001256003A (en) Disk array controller, its disk array control unit and its expanding method
CN101222490B (en) Network server system and method
JP2003323263A (en) Common memory control method and control system
US20060156055A1 (en) Storage network that includes an arbiter for managing access to storage resources
KR20080096547A (en) Virtual network storage system, network storage device and virtual method
US7299334B2 (en) Storage system configurations
US20020124134A1 (en) Data storage system cluster architecture
CN101370027A (en) Network storage system, method and application server
JP2005056200A (en) Data management method, disk storage device and disk storage system
JP2000089916A (en) File controller
US7043603B2 (en) Storage device control unit and method of controlling the same
CN111684421B (en) Server system
CN101232513B (en) storage device array and manufacturing method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP03 Change of name, title or address

Address after: No. 466 Changhe Road, Binjiang District, Zhejiang 310052, China

Patentee after: New H3C Technologies Co., Ltd.

Address before: Huawei Hangzhou Production Base, No. 310 Liuhe Road, Science and Technology Industrial Park, Hangzhou Hi-Tech Industrial Development Zone, Zhejiang Province 310053

Patentee before: Hangzhou H3C Technologies Co., Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090603

Termination date: 20200921