CN112181314A - Distributed storage method and system - Google Patents

Publication number
CN112181314A
Authority
CN
China
Prior art keywords
data
node
nodes
data node
name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011169756.0A
Other languages
Chinese (zh)
Inventor
李菲菲
赵文慧
于昊
董信超
靳登科
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Cloud Information Technology Co Ltd
Original Assignee
Inspur Cloud Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-10-28
Filing date
2020-10-28
Publication date
2021-01-05
Application filed by Inspur Cloud Information Technology Co Ltd
Priority to CN202011169756.0A
Publication of CN112181314A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0602 Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0604 Improving or facilitating administration, e.g. storage management
    • G06F3/0614 Improving the reliability of storage systems
    • G06F3/0625 Power saving in storage systems
    • G06F3/0668 Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/067 Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the field of cloud storage and distributed storage, and in particular provides a distributed storage method. A data node cluster is first established; the running state of each data node is then judged, and error data information is recorded for data nodes that are not running normally while data information is recorded for those that are; finally, the name node judges and selects a suitable data node. Compared with the prior art, the distributed storage method and system provided by the invention can improve the robustness and fault tolerance of the distributed storage system, improve overall system performance, and optimize data storage performance.

Description

Distributed storage method and system
Technical Field
The invention relates to the field of cloud storage and distributed storage, and particularly provides a distributed storage method and system.
Background
With the explosive growth of data, daily life and every industry generate massive amounts of data that must be managed and stored, and as data volumes keep growing, distributed storage systems have become a necessary choice for data storage. At the same time, higher requirements are placed on storage performance, including read-write performance and fault tolerance. Storing massive data efficiently and stably has therefore become a challenge, and improving the robustness and fault tolerance of distributed storage systems is a problem that urgently needs to be solved.
Disclosure of Invention
In view of the defects of the prior art, the invention provides a highly practical distributed storage method.
A further object of the invention is to provide a distributed storage system that is reasonably designed, safe, and applicable.
The technical solution adopted by the invention to solve the technical problem is as follows:
A distributed storage method comprises: first, establishing a data node cluster; then, judging whether each data node is running normally, recording error data information of the data node if it is not and recording data information of the data node if it is; and finally, having the name node judge and select a suitable data node.
Further, the data node cluster comprises a plurality of data nodes, and the running state of each data node in the data node cluster is judged.
Further, to judge whether each data node in the data node cluster is running normally, the state of each data node in the cluster is traversed; if the running state of a data node is normal, the correct data information of that data node is recorded and stored;
if the running state of a data node is abnormal, the error data information of that data node is recorded and stored.
Further, for the traversed data nodes, the client initiates a read request to the name node; the name node judges the running state of the data nodes through a feedback adjustment mechanism, and the result is fed back so that the state is judged and confirmed a second time; once the name node has completed its judgment, a suitable nearby data node is selected to store the data information.
Further, the feedback adjustment mechanism records problem information existing in the data nodes and summarizes and feeds it back to the name node in real time; it involves two data information judgments, one performed by the data node and one by the name node.
A distributed storage system comprises a client, a name node, a data node cluster, and a feedback adjustment mechanism, wherein:
the client is used for initiating a read request to the name node;
the name node is used for recording the relevant data information of the data node cluster and feeding back the available data information of the data node cluster to the client;
the data node cluster comprises a plurality of data nodes, and adjacent data nodes transmit data to one another for storage, realizing multi-copy storage and backup of data;
the feedback adjustment mechanism is used for recording problem information existing in the data nodes and summarizing and feeding back the problem information to the name node in real time.
Further, after each data node in the data node cluster starts up, it periodically sends the data list it holds to the name node through the client.
Further, the name node makes a comprehensive judgment based on the problem information and uses the feedback adjustment mechanism to select a valid nearby data node for feeding back and transmitting the data information.
Further, the feedback adjustment mechanism records and stores the data nodes that have problems.
Compared with the prior art, the distributed storage method and system of the invention have the following notable beneficial effects:
(1) The distributed storage method and system provided by the invention can improve the robustness and fault tolerance of the distributed storage system, improve overall system performance, and optimize data storage performance.
(2) The invention improves the accuracy and efficiency of data reading and writing, avoids repeated judgment of the data node cluster, and improves the judgment efficiency of the system.
Drawings
To illustrate the embodiments of the invention or the technical solutions of the prior art more clearly, the drawings used in describing them are briefly introduced below. The drawings described below show only some embodiments of the invention; a person skilled in the art can obtain other drawings from them without creative effort.
FIG. 1 is a schematic flow diagram of the distributed storage method;
FIG. 2 is a schematic structural diagram of the distributed storage system.
Detailed Description
The present invention will be described in further detail with reference to specific embodiments in order to better understand the technical solutions of the present invention. It is to be understood that the described embodiments are merely exemplary of the invention, and not restrictive of the full scope of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
A preferred embodiment is given below.
As shown in FIG. 1, the distributed storage method of this embodiment includes the following steps:
S01, establishing a data node cluster comprising 3 data nodes, and judging the running state of the 3 data nodes in the cluster.
S02, judging whether the 3 data nodes in the data node cluster are running normally by traversing the state of each data node in the cluster; if the running state of a data node is normal, recording and storing the correct data information of that data node.
S03, while traversing the state of each data node in the cluster, recording and storing the error data information of a data node if its running state is abnormal.
S04, for the traversed data nodes, the client initiates a read request to the name node; the name node judges the running state of the data nodes through the feedback adjustment mechanism, and the result is fed back so that the state is judged and confirmed a second time.
S05, after the name node has completed its judgment, selecting a suitable nearby data node to store the data information.
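Steps S01 to S05 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the implementation described by the patent: the `DataNode` class, its `distance` field (a stand-in for whatever "nearby" means in a deployment, for example network hops), and the boolean `healthy` flag (a stand-in for a real liveness probe) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class DataNode:
    node_id: str
    distance: int   # hypothetical nearness metric, e.g. network hops to the client
    healthy: bool   # stand-in for the result of a real liveness probe


def traverse(cluster):
    """S02/S03: visit every node and record normal and abnormal nodes separately."""
    normal, faulty = {}, {}
    for node in cluster:
        if node.healthy:
            normal[node.node_id] = node
        else:
            faulty[node.node_id] = "unreachable"  # placeholder error information
    return normal, faulty


def select_node(normal):
    """S05: choose the nearest node among those recorded as normal."""
    if not normal:
        raise RuntimeError("no healthy data node available")
    return min(normal.values(), key=lambda n: n.distance)


# S01: a cluster of 3 data nodes, as in the embodiment.
cluster = [
    DataNode("dn1", distance=3, healthy=True),
    DataNode("dn2", distance=1, healthy=False),
    DataNode("dn3", distance=2, healthy=True),
]
normal, faulty = traverse(cluster)
chosen = select_node(normal)
print(chosen.node_id)  # dn3: the nearest node, dn2, is abnormal and is skipped
```

The second confirmation of step S04 is left out here to keep the sketch short.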
The feedback adjustment mechanism records problem information existing in the data nodes and summarizes and feeds it back to the name node in real time. It involves two data information judgments, one performed by the data nodes and one by the name node. It can also store the state of the data nodes and other data information, which avoids repeated judgment and improves judgment efficiency.
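One possible reading of the two judgments and the stored state is sketched below. The class name `FeedbackMechanism`, its methods, and the `probe` callback are assumptions introduced for illustration; the patent does not specify an interface. The first judgment is modeled as a self-report from the data node, the second as a confirmation by the name node, and the confirmed verdict is cached so the same node is not probed twice.

```python
class FeedbackMechanism:
    """Hypothetical sketch: stores problem reports and cached verdicts."""

    def __init__(self):
        self.problems = {}    # node_id -> problem description
        self.confirmed = {}   # node_id -> cached verdict (True means usable)

    def report(self, node_id, problem=None):
        """First judgment: the data node reports its own state."""
        if problem is None:
            self.problems.pop(node_id, None)
        else:
            self.problems[node_id] = problem
        # A fresh report invalidates any previously cached verdict.
        self.confirmed.pop(node_id, None)

    def confirm(self, node_id, probe):
        """Second judgment: the name node confirms, reusing a cached verdict."""
        if node_id in self.confirmed:
            return self.confirmed[node_id]
        ok = node_id not in self.problems and probe(node_id)
        self.confirmed[node_id] = ok
        return ok


calls = []

def probe(node_id):
    # Stand-in for a real check by the name node; records each invocation.
    calls.append(node_id)
    return True

fb = FeedbackMechanism()
fb.report("dn1")                 # dn1 reports itself healthy
fb.report("dn2", "disk error")   # dn2 reports a problem
first = fb.confirm("dn1", probe)
second = fb.confirm("dn1", probe)  # answered from the cache, no second probe
bad = fb.confirm("dn2", probe)     # rejected from the recorded problem alone
```

In this sketch `probe` runs only once, for the first confirmation of `dn1`, which is the sense in which storing node state avoids repeated judgment.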
As shown in FIG. 2, the distributed storage system includes a client, a name node, a data node cluster, and a feedback adjustment mechanism.
The client is mainly responsible for initiating read requests to the name node and for submitting the data that needs to be stored.
The name node is the core module and master node of the distributed storage system. It does not store actual file data; instead it records the relevant data information of all data nodes, and it is mainly responsible for feeding back the available data information of the data nodes to the client.
The data node cluster of this embodiment includes 3 data nodes. The data nodes are the auxiliary modules and slave nodes of the distributed storage system and store the actual file data; adjacent data nodes can transmit data to one another for storage, realizing multi-copy storage and backup of the data. When the three data nodes start up, they report the data lists they hold to the name node, and then resend this information to the name node at regular intervals.
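The division of labor described above, in which the name node keeps only metadata and data nodes report the data lists they hold, can be illustrated as follows. The `NameNode` class, its method names, and the block identifiers are hypothetical; the sketch assumes each report carries the complete list currently held by the reporting node, so a later report replaces an earlier one.

```python
from collections import defaultdict


class NameNode:
    """Hypothetical metadata-only node: maps each block to the nodes holding it."""

    def __init__(self):
        self.block_locations = defaultdict(set)  # block_id -> set of node_ids

    def receive_report(self, node_id, block_ids):
        # Drop stale entries for this node, then record the fresh list.
        for holders in self.block_locations.values():
            holders.discard(node_id)
        for block_id in block_ids:
            self.block_locations[block_id].add(node_id)

    def locate(self, block_id):
        """Return the nodes currently known to hold a block."""
        return sorted(self.block_locations.get(block_id, ()))


nn = NameNode()
# Startup reports from the 3 data nodes; b1 and b2 each have two copies.
nn.receive_report("dn1", ["b1", "b2"])
nn.receive_report("dn2", ["b1"])
nn.receive_report("dn3", ["b2"])
# A later periodic report: dn1 no longer holds b1.
nn.receive_report("dn1", ["b2"])

b1_holders = nn.locate("b1")
b2_holders = nn.locate("b2")
```

Replacing a node's entries on every report keeps the name node's view consistent with what the node actually holds, at the cost of a full scan per report; a real system would likely index by node as well.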
Because a data node may be inaccessible or down, the feedback adjustment mechanism is added to ensure stable and efficient transmission of data information. For a data node that has a problem, the mechanism records the problem information and summarizes and feeds it back to the name node in real time. The name node makes a comprehensive judgment based on the problem information and, by means of feedback adjustment, quickly selects another valid nearby data node for feeding back and transmitting the data information. Data nodes that have problems are recorded and stored by the feedback adjustment mechanism, which avoids repeated judgment work.
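The failover behavior in this paragraph can be sketched as a selection loop that walks candidates from nearest to farthest, skips nodes already recorded as problematic, and records any newly failed node. The function `pick_replica`, the `(node_id, distance)` tuples, and the `probe` callback are assumptions made for illustration only.

```python
def pick_replica(candidates, problem_nodes, probe):
    """Return the nearest usable node, or None if every candidate fails.

    candidates:    list of (node_id, distance) pairs (distance is hypothetical)
    problem_nodes: set of node_ids already recorded as faulty; updated in place
    probe:         callable returning True when a node responds
    """
    for node_id, _distance in sorted(candidates, key=lambda c: c[1]):
        if node_id in problem_nodes:
            continue  # already known to be faulty: skip without probing again
        if probe(node_id):
            return node_id
        problem_nodes.add(node_id)  # record the failure for later requests
    return None


down = {"dn1"}   # simulated outage
probed = []

def probe(node_id):
    # Stand-in for a real reachability check; records each invocation.
    probed.append(node_id)
    return node_id not in down

candidates = [("dn1", 1), ("dn2", 2), ("dn3", 3)]
problem_nodes = set()
first_choice = pick_replica(candidates, problem_nodes, probe)
second_choice = pick_replica(candidates, problem_nodes, probe)
```

On the first request `dn1` is probed, fails, and is recorded; on the second request it is skipped outright, so only `dn2` is probed again.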
The feedback adjustment mechanism involves two data information judgments, one performed by the data nodes and one by the name node. It can store the state of the data nodes and other data information, which avoids repeated judgment and improves judgment efficiency.
The above embodiments are only specific examples of the invention. The scope of protection of the invention includes, but is not limited to, these embodiments; any suitable change or substitution that is consistent with the claims of the distributed storage method and system of the invention and is made by a person of ordinary skill in the art shall fall within the scope of protection of the invention.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (9)

1. A distributed storage method, characterized in that: first, a data node cluster is established; then, whether each data node is running normally is judged, error data information of the data node being recorded if it is not and data information of the data node being recorded if it is; and finally, the name node judges and selects a suitable data node.
2. The distributed storage method according to claim 1, wherein the data node cluster includes a plurality of data nodes, and the running state of each data node in the data node cluster is judged.
3. The distributed storage method according to claim 2, wherein, to judge whether each data node in the data node cluster is running normally, the state of each data node in the data node cluster is traversed, and if the running state of a data node is normal, the correct data information of that data node is recorded and stored;
and if the running state of a data node is abnormal, the error data information of that data node is recorded and stored.
4. The distributed storage method according to claim 3, wherein, for the traversed data nodes, the client initiates a read request to the name node, the name node judges the running state of the data nodes through a feedback adjustment mechanism, the result is fed back so that the state is judged and confirmed a second time, and after the name node has completed its judgment, a suitable nearby data node is selected to store the data information.
5. The distributed storage method according to claim 4, wherein the feedback adjustment mechanism records problem information existing in the data nodes and summarizes and feeds the problem information back to the name node in real time, and involves two data information judgments, one performed by the data node and one by the name node.
6. A distributed storage system, characterized by comprising a client, a name node, a data node cluster, and a feedback adjustment mechanism, wherein:
the client is used for initiating a read request to the name node;
the name node is used for recording the relevant data information of the data node cluster and feeding back the available data information of the data node cluster to the client;
the data node cluster comprises a plurality of data nodes, and adjacent data nodes transmit data to one another for storage, realizing multi-copy storage and backup of data;
the feedback adjustment mechanism is used for recording problem information existing in the data nodes and summarizing and feeding back the problem information to the name node in real time.
7. The distributed storage system according to claim 6, wherein each data node in the data node cluster, after starting up, periodically sends the data list it holds to the name node through the client.
8. The distributed storage system according to claim 7, wherein the name node makes a comprehensive judgment based on the problem information and uses the feedback adjustment mechanism to select a valid nearby data node for feeding back and transmitting the data information.
9. The distributed storage system according to claim 8, wherein the feedback adjustment mechanism records and stores the data nodes that have problems.
CN202011169756.0A 2020-10-28 2020-10-28 Distributed storage method and system Pending CN112181314A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011169756.0A CN112181314A (en) 2020-10-28 2020-10-28 Distributed storage method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011169756.0A CN112181314A (en) 2020-10-28 2020-10-28 Distributed storage method and system

Publications (1)

Publication Number Publication Date
CN112181314A true CN112181314A (en) 2021-01-05

Family

ID=73922875

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011169756.0A Pending CN112181314A (en) 2020-10-28 2020-10-28 Distributed storage method and system

Country Status (1)

Country Link
CN (1) CN112181314A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019119311A1 (en) * 2017-12-20 2019-06-27 华为技术有限公司 Data storage method, device, and system
CN111274205A (en) * 2020-01-07 2020-06-12 北京松果电子有限公司 Data block access method and device and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
肖君: "《教育大数据》", 30 January 2020 *

Similar Documents

Publication Publication Date Title
EP2474919B1 (en) System and method for data replication between heterogeneous databases
US9785498B2 (en) Archival storage and retrieval system
US9747301B2 (en) Distributed file system using consensus nodes
EP1625502B1 (en) Redundant data assigment in a data storage system
EP1625501B1 (en) Read, write, and recovery operations for replicated data
CN101655813B (en) Storage system
CN101334797B (en) Distributed file systems and its data block consistency managing method
CN100458721C (en) Method for automatic backup using computer network
US9424272B2 (en) Distributed file system using consensus nodes
CN111630826A (en) Consensus system and method
CN101079896B (en) A method for constructing multi-availability mechanism coexistence framework of concurrent storage system
CN101751415B (en) Metadata service system, metadata synchronized method and writing server updating method
CN101137984B (en) Systems, methods, and software for distributed loading of databases
CN113987064A (en) Data processing method, system and equipment
CN112039970B (en) Distributed business lock service method, server, system and storage medium
CN113905054B (en) RDMA (remote direct memory access) -based Kudu cluster data synchronization method, device and system
CN114385755A (en) Distributed storage system
CN103544081A (en) Management method and device for double metadata servers
CN112181314A (en) Distributed storage method and system
CN109815207A (en) Date storage method and Client Agent
CN113204424A (en) Method and device for optimizing Raft cluster and storage medium
CN114330698B (en) Neural model storage system and method of brain-like computer operating system
CN114741367B (en) Data synchronization method, data synchronization device, electronic equipment, storage medium and program product
CN112667698B (en) MongoDB data synchronization method based on converged media platform
CN112256202B (en) Distributed storage system and method for deleting volumes in distributed storage system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210105