CN112148219A - Design method and device for ceph type distributed storage cluster - Google Patents

Design method and device for ceph type distributed storage cluster Download PDF

Info

Publication number
CN112148219A
CN112148219A CN202010973881.0A CN202010973881A CN112148219A CN 112148219 A CN112148219 A CN 112148219A CN 202010973881 A CN202010973881 A CN 202010973881A CN 112148219 A CN112148219 A CN 112148219A
Authority
CN
China
Prior art keywords
storage
different types
performances
performance
osd
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010973881.0A
Other languages
Chinese (zh)
Inventor
孙家彦
余意成
位自豪
李达
陈多美
刘宇航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Uxsino Software Co ltd
Original Assignee
Beijing Uxsino Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Uxsino Software Co ltd filed Critical Beijing Uxsino Software Co ltd
Priority to CN202010973881.0A priority Critical patent/CN112148219A/en
Publication of CN112148219A publication Critical patent/CN112148219A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/067Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0662Virtualisation aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0685Hybrid storage combining heterogeneous device types, e.g. hierarchical storage, hybrid arrays

Abstract

The embodiment of the invention discloses a ceph type distributed storage cluster design method and a device, wherein the method comprises the following steps: selecting a hard disk according to different application scene requirements, and creating osd with different types and performances; creating storage pools of different types and performances according to osds of different types and performances; wherein the storage pools of different types and performance correspond to osds of different types and performance; and determining storage modes corresponding to the storage pools with different types and performances according to the storage pools with different types and performances. According to the invention, different types and performances of osd hard disks are created according to different application scene requirements, so that different types and performances of storage pools are obtained, corresponding storage modes are determined according to the different types and performances of the storage pools, and storage devices and storage types can be reasonably selected according to enterprise requirements and existing hardware conditions, so that hardware resources are utilized to the maximum extent, and a set of cloud storage design use scheme suitable for most cloud platform product computing modules is generated.

Description

Design method and device for ceph type distributed storage cluster
Technical Field
The invention relates to the technical field of distributed storage, in particular to a design method and device of a ceph type distributed storage cluster.
Background
The rapid development of information technology generates massive information data, and how to store the information and the huge cost brought by the information storage bring new challenges to enterprises. The disadvantages of the traditional storage scheme are more and more prominent in the big data era, and the distributed storage system is produced on the background. Distributed storage is a data storage technology, which uses disk space on each machine in an enterprise through a network and forms a virtual storage device with these distributed storage resources, and data is distributed and stored in each corner of the enterprise.
In the internet era, the agility, flexibility and robustness of enterprise networks are important to meeting strict business requirements, the importance of cloud computing schemes is increasingly remarkable, the most important ring of cloud computing is storage, and a good distributed storage can enable the performance of a cloud platform to be greatly improved as a foundation for computing. Common distributed storage systems include cfs, lustre, ceph, cluster, and the like. ceph (a distributed storage system) is a typical distributed system in the open source field, and can provide object storage, file storage and block storage at the same time, and is popular with more and more enterprises under the guidance of openstack.
However, in the current market, the cloud computing platforms have few excellent design and use schemes for ceph, hardware resources used as cloud storage cannot be fully utilized, network segment configuration and storage types are unreasonable in selection, and a set of cloud storage design and use schemes suitable for most cloud platform product computing modules cannot be efficiently generated.
Disclosure of Invention
Because the existing method has the above problems, embodiments of the present invention provide a ceph type distributed storage cluster design method and apparatus.
In a first aspect, an embodiment of the present invention provides a ceph-type distributed storage cluster design method, including:
selecting a hard disk according to different application scene requirements, and creating osd with different types and performances;
according to the read-write speed requirement of the hard disk, cluster network configuration is carried out;
creating storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
and determining a storage mode corresponding to the storage pool with different types and performances according to the storage pool with different types and performances.
Further, each osd is composed of a solid state disk SSD, which serves as a log disk or data disk, and a mechanical hard disk HHD, which serves as a data disk, collocated.
Further, the hard disk selection according to different application scene requirements and creating osds of different types and performances includes:
different types and capabilities of osd are created according to the respective read-write speeds and storage capabilities of the SSD and HHD.
Further, the creating osd of different types and performances according to the respective read-write speed and storage performance of the SSD and the HHD includes:
when the read-write speed of the osd is required to be 300-600M per second, a SSD data disk and an SSD log disk are adopted to form the high-performance osd;
when the read-write speed of the osd is required to be 210-380M per second, an SSD log disk and an HHD data disk are adopted to form the osd with general performance;
when the read-write speed of the osd is required to be 60-80M per second, a HHD data disc is adopted to form low-performance osd.
Further, the creating the storage pool of different types and performances according to the osd of different types and performances comprises:
and corresponding osds with the same type and performance in the osds with different types and performances to the same storage pool to obtain a plurality of storage pools with different performances and types.
Further, the determining, according to the storage pools of different types and performances, a storage manner corresponding to the storage pools of different types and performances includes:
and determining the corresponding storage mode to be block storage, object storage or file storage according to the storage pools with different types and performances.
Further, after determining the storage mode corresponding to the storage pool of different type and performance according to the storage pool of different type and performance, the method further includes:
if the cluster application scene is high in reading and writing speed, selecting an object for storage, and further creating a virtual machine containing high-performance osd;
if the cluster application scene is of a common read-write speed, selecting block storage, and further creating a virtual machine containing common performance osd;
and if the cluster application scene is low in read-write speed but high in storage space requirement, selecting files for storage, and further creating a virtual machine containing low-performance osd.
In a second aspect, an embodiment of the present invention further provides a ceph type distributed storage cluster apparatus, including:
the first determining module is used for selecting hard disks according to different application scene requirements and creating osds with different types and performances;
the second determining module is used for configuring the cluster network according to the read-write speed requirement of the hard disk;
a third determining module, configured to create storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
and the fourth determining module is used for determining the storage mode corresponding to the storage pool with different type and performance according to the storage pool with different type and performance.
In a third aspect, an embodiment of the present invention further provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor, when executing the computer program, implements the ceph-type distributed storage cluster design method according to the first aspect.
In a fourth aspect, an embodiment of the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the method for designing a ceph-type distributed storage cluster according to the first aspect is implemented.
According to the technical scheme, the ceph type distributed storage cluster design method, the device, the electronic equipment and the storage medium provided by the embodiment of the invention create different types and performances of osd hard disks according to different application scene requirements, further obtain different types and performances of storage pools, and determine corresponding storage modes according to the different types and performances of storage pools.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a flowchart of a ceph type distributed storage cluster design method according to an embodiment of the present invention;
fig. 2 is a schematic diagram of hard disk type selection of a ceph-type distributed storage cluster design method according to an embodiment of the present invention;
fig. 3 is a schematic diagram of a network configuration of a ceph-type distributed storage cluster design method according to an embodiment of the present invention;
fig. 4 is a schematic diagram of a storage manner of a ceph-type distributed storage cluster design method according to an embodiment of the present invention;
fig. 5 is a schematic diagram of a storage cluster structure of a ceph-type distributed storage cluster design method according to an embodiment of the present invention;
FIG. 6 is a schematic diagram of a ceph-type distributed storage cluster apparatus according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of an electronic device according to an embodiment of the invention.
Detailed Description
The following further describes embodiments of the present invention with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present invention is not limited thereby.
Fig. 1 shows a flowchart of a ceph-type distributed storage cluster design method according to an embodiment of the present invention, fig. 2 is a schematic diagram of hard disk type selection of the ceph-type distributed storage cluster design method according to an embodiment of the present invention, fig. 3 is a schematic diagram of network configuration of the ceph-type distributed storage cluster design method according to an embodiment of the present invention, and fig. 4 is a schematic diagram of a storage manner of the ceph-type distributed storage cluster design method according to an embodiment of the present invention. The design method of a ceph-type distributed storage cluster according to the embodiment of the present invention is explained and explained in detail with reference to fig. 1, fig. 2, fig. 3, and fig. 4. As shown in fig. 1, a ceph-type distributed storage cluster design method provided in an embodiment of the present invention specifically includes the following steps:
step 101: selecting a hard disk according to different application scene requirements, and creating osd with different types and performances;
in this step, the different application scenario requirements are distributed storage requirements generated by the enterprise in different application scenarios, including storage requirements related to storage space and reading speed. And carrying out model selection combination on the hard disks according to storage requirements of enterprises generated under different application scenes to obtain osds with different types and performances, and meanwhile, reasonably selecting the osd as the ceph cluster storage device according to the existing hardware conditions of the enterprises. The osd is a process responsible for physical storage, and is generally configured to correspond to disks one to one, one disk starts an osd process, and one disk and a daemon process corresponding to the disk are called an osd. In this step, one disk may include one or two hard disks. The hard disk with high read-write speed corresponds to high-performance osd, the hard disk with high read-write speed and the hard disk with large storage space corresponds to common-performance osd, and the hard disk with large storage space independently corresponds to low-performance osd.
In this step, it should be noted that the meaning thereof is: the method can create the osd with high, medium and low performance through reasonable collocation according to the storage requirements of enterprises under different application scenes and the existing hardware conditions, thereby enabling the hardware utilization to be more efficient and reasonable, providing convenience for the division of subsequent storage pools and the selection of storage types, avoiding the resource waste generated by the random combination of hard disks with different types and performances, and simultaneously reducing the condition that the storage or reading requirements cannot be met due to unreasonable hardware type selection.
Step 102: according to the read-write speed requirement of the hard disk, cluster network configuration is carried out;
in this step, it should be noted that, when network configuration is performed on the ceph storage cluster, a corresponding IP address needs to be added, as shown in fig. 3, multiple ceph cluster nodes may be set, and it is determined whether data packets can be successfully exchanged (sent and received) between different hosts according to ping, so as to ensure that each cluster node must ping each other.
In this step, the ceph cluster uses two networks, i.e., an internal network and an external network, the external network is used for providing services for external users, the ceph cluster relies on an internal network to maintain the document of the cluster, the Monitor node needs to Monitor the heartbeat of the osd through the internal network at any time, and meanwhile, the data replication needs to be performed through the internal network. Each block of SSD read-write speed can reach 500MB/s, and in order to prevent the network from becoming the performance bottleneck of the Ceph cluster, a trillion network is recommended to be used; thus, the server is required to be configured with two gigabit optical ports and two gigabit switches as main switches and standby switches. In order to provide better performance for the outside, the gigabit optical port is also used for providing service for the outside, the gigabit optical port is configured redundantly and is connected to an external gigabit switch, so that the performance of the system is improved. In consideration of the work of system deployment, subsequent cluster maintenance and the like, the server uses the independent network port for the work of system deployment, cluster maintenance and the like, the work does not need high network speed, and the gigabit network port can completely meet the requirement.
In this step, it should be noted that the meaning of configuring the cluster network according to different application scene requirements is as follows: the cluster network can be selected according to the existing network environment of the enterprise, and the network segments can be selected according to the requirements, so that the requirements of constructing various cluster environments of the enterprise are met.
Step 103: creating storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
in this step, the storage pool is the logical partition where ceph stores data, which acts as a namespace. Each storage pool may correspond to multiple osds, thus creating storage pools of different types and capabilities based on osds of different types and capabilities. With each storage pool corresponding to osd of the same type and performance. For example: all high-performance osds composed of hard disks with high read-write speed in the cluster correspond to a high-performance storage pool, all general-performance osds composed of hard disks with high read-write speed and hard disks with large storage space in the cluster correspond to a general-performance storage pool, and all low-performance osds composed of hard disks with large storage space in the cluster correspond to a low-performance storage pool.
In this step, it should be noted that, according to the osd of different types and performances, the meaning of creating the storage pools of different types and performances is: the types and the performances of the storage pools are further divided according to the osds with different types and performances, and convenience is provided for selecting corresponding storage modes according to the types and the performances of the storage pools in the follow-up process. Meanwhile, each storage pool points to osd with the same type and performance, so that the storage pools can be used as fault isolation domains, each storage pool only uses a specific osd, once a certain osd is damaged, only a certain storage pool of a certain service is influenced, the fault range is controlled in a small range, and the damage of the osd only causes data balance in the corresponding storage pool, so that global influence cannot be caused. However, in the prior art, no scheme for designing cluster hardware type selection and storage pool division is provided, and at this time, when a certain hard disk in ceph fails, the corresponding osd exits from service, and data belonging to the hard disk is reconstructed and redistributed to other hard disks, so that storage cluster resources are occupied. In addition, if a storage pool points to osds of multiple types and performances, when one osd is damaged, the storage pool will be affected, and thus the osd of the other types and performances pointed to by the storage pool will be affected.
Step 104: and determining a storage mode corresponding to the storage pool with different types and performances according to the storage pool with different types and performances.
In this step, the storage means includes a block storage (rbd), an object storage (obj), and a file storage (fs). The file storage and reading speed is low, the transmission speed is low, and a third party is required to transfer the file storage; the block storage read-write speed is high, and the capacity of a logic disk is large; the object storage read-write speed is high, and the transmission speed is high. And determining the storage mode matched with the storage pool according to the type and the performance of the storage pool. For example: matching a high-performance storage pool corresponding to a high-performance osd consisting of hard disks with high read-write speed with an object storage with better read-write capability and storage capability; matching the read-write capacity and the block storage with the common storage capacity by the common performance storage pool corresponding to the hard disk with the high read-write speed and the hard disk with the large storage space; and a low-performance storage pool corresponding to the low-performance osd formed by the hard disks with large storage space has poor matching read-write capability and better storage capability.
In this step, it should be noted that, according to the storage pools of different types and performances, the significance of determining the storage manner corresponding to the storage pools of different types and performances is: because the osds with different types and performances are determined according to different application scenes and the existing hardware conditions and requirements of enterprises, the storage mode matched by the storage pools with different types and performances corresponding to the osds with different types and performances can meet the requirements of enterprise cloud storage, and the ceph storage type can be reasonably selected according to the requirements of the enterprises.
According to the technical scheme, the ceph-type distributed storage cluster design method provided by the embodiment of the invention has the advantages that different types and performances of osd hard disks are created according to different application scene requirements, storage pools with different types and performances are further obtained, corresponding storage modes are determined according to the storage pools with different types and performances, and the hard disks and the storage types can be reasonably selected according to enterprise requirements and existing hardware conditions, so that hardware resources are utilized to the maximum extent, and a cloud storage design use scheme suitable for most cloud platform product computing modules is generated.
Based on the contents of the above embodiments, in the present embodiment, each osd is composed of a solid state disk SSD serving as a log disk or a data disk and a mechanical hard disk HHD serving as a data disk collocated.
In this embodiment, it should be noted that one osd corresponds to one disk, and each disk includes two hard disks or one hard disk. The optional hard disk is a Solid State Disk (SSD) and a mechanical hard disk (HHD), and the SSD has the advantages that the read-write speed is dozens of times to hundreds of times faster than the HHD, and has the defect of high unit cost and is not suitable for large-capacity storage. HHD has the advantages of low unit cost and suitability for mass storage, but the speed is far lower than that of SSD. Based on the different characteristics of SSD and HHD, SSD is used as a data disk for read performance focus, SSD is used as a data disk for write performance focus optimization for storage devices, namely SSD log disks, while HHD is selected as a data disk.
In this embodiment, it should be noted that the meaning of different osd composed of SSD and HHD collocation is: according to the characteristics of SSD and HHD, two hard disks are collocated to form osd with different performances, so that enterprise hardware resources are utilized to the maximum extent, resource waste caused by random combination of hard disks with different types and performances is avoided, and meanwhile, the situation that storage or reading requirements cannot be met due to unreasonable hardware type selection can be reduced.
Based on the content of the foregoing embodiment, in this embodiment, the selecting a hard disk according to different application scene requirements and creating osds with different types and performances includes:
different types and capabilities of osd are created according to the respective read-write speeds and storage capabilities of the SSD and HHD.
In this embodiment, it can be understood that, when one disk includes two SSDs, the read-write performance of the disk is better; when a disk comprises an SSD and a HHD, the disk has both reading and writing performance and storage performance, the storage performance is higher than the combination, but the reading and writing speed is not as high as the combination; when only one HHD is included in a disk, both read and write performance and storage performance are lower than the above combination.
In this embodiment, it should be noted that the significance of the cluster characteristics exhibited by the read/write/storage performance of the SSD and HHD in different combinations is: the method has the advantages that enterprise hardware resources are fully utilized, resource waste caused by random combination of hard disks of different types and performances is avoided, and meanwhile the situation that storage or reading requirements cannot be met due to unreasonable hardware type selection can be reduced.
Based on the content of the above embodiment, in the present embodiment, the creating osd of different types and performances according to the respective read-write speed and storage performance of SSD and HHD includes:
when the read-write speed of the osd is required to be 300-600M per second, a SSD data disk and an SSD log disk are adopted to form the high-performance osd;
when the read-write speed of the osd is required to be 210-380M per second, an SSD log disk and an HHD data disk are adopted to form the osd with general performance;
when the read-write speed of the osd is required to be 60-80M per second, a HHD data disc is adopted to form low-performance osd.
In the embodiment, the cluster characteristics are exerted through the read-write and storage performances of different combination modes of SSD and HHD, for example, when the read-write speed requirement of osd is 300-600M per second, a high-performance osd with higher read-write speed requirement is formed by adopting an SSD data disc and an SSD log disc; when the read-write speed of the osd is required to be 210-380M per second, a general performance osd with certain requirements on both storage performance and read-write speed is formed by adopting a HHD data disc and an SSD log disc; when the read-write speed of the osd is required to be 60-80M per second, a low-performance osd which is composed of one HHD data disk and has high storage performance but low read-write speed requirement is adopted.
Based on the content of the foregoing embodiment, in this embodiment, the creating the storage pools of different types and performances according to the osd of different types and performances includes:
and corresponding osds with the same type and performance in the osds with different types and performances to the same storage pool to obtain a plurality of storage pools with different performances and types.
In this embodiment, it should be noted that the storage pools of different types and performances are created according to osd of different types and performances. With each storage pool corresponding to osd of the same type and performance. For example: all high-performance osds composed of hard disks with high read-write speed in the cluster correspond to a high-performance storage pool, all general-performance osds composed of hard disks with high read-write speed and hard disks with large storage space in the cluster correspond to a general-performance storage pool, and all low-performance osds composed of hard disks with large storage space in the cluster correspond to a low-performance storage pool.
In this embodiment, it should be noted that, corresponding osds with the same type and performance in osds with different types and performances to the same storage pool, obtaining multiple storage pools with different performances and types is significant in that: osd with the same type and performance corresponds to the same storage pool, so that storage pools with different types and performances are divided, and convenience is provided for selecting corresponding storage modes according to the types and performances of the storage pools. Meanwhile, each storage pool points to osd with the same type and performance, so that the storage pools can be used as fault isolation domains, each storage pool only uses a specific osd, once a certain osd is damaged, only a certain storage pool of a certain service is influenced, the fault range is controlled in a small range, and the damage of the osd only causes data balance in the corresponding storage pool, so that global influence cannot be caused. However, in the prior art, no scheme for designing cluster hardware type selection and storage pool division is provided, and at this time, when a certain hard disk in ceph fails, the corresponding osd exits from service, and data belonging to the hard disk is reconstructed and redistributed to other hard disks, so that storage cluster resources are occupied. In addition, if a storage pool points to osds of multiple types and performances, when one osd is damaged, the storage pool will be affected, and thus the osd of the other types and performances pointed to by the storage pool will be affected.
Based on the content of the foregoing embodiment, in this embodiment, the determining, according to the storage pools of different types and performances, the storage manner corresponding to the storage pools of different types and performances includes:
and determining the corresponding storage mode to be block storage, object storage or file storage according to the storage pools with different types and performances.
In this embodiment, it should be noted that the storage manner includes block storage, object storage, and file storage. The file storage and reading speed is low, the transmission speed is low, and a third party is required to transfer the file storage; the block storage read-write speed is high, and the capacity of a logic disk is large; the object storage read-write speed is high, and the transmission speed is high. And determining the storage mode matched with the storage pool according to the type and the performance of the storage pool. For example: matching a high-performance storage pool corresponding to a high-performance osd consisting of hard disks with high read-write speed with an object storage with better read-write capability and storage capability; matching the read-write capacity and the block storage with the common storage capacity by the common performance storage pool corresponding to the hard disk with the high read-write speed and the hard disk with the large storage space; and a low-performance storage pool corresponding to the low-performance osd formed by the hard disks with large storage space has poor matching read-write capability and better storage capability.
In this step, it should be noted that, according to the storage pools of different types and performances, the significance of determining the storage manner corresponding to the storage pools of different types and performances is: because the osds with different types and performances are determined according to different application scenes and the existing hardware conditions and requirements of enterprises, the storage mode matched by the storage pools with different types and performances corresponding to the osds with different types and performances can meet the requirements of enterprise cloud storage, and the ceph storage type can be reasonably selected according to the requirements of the enterprises.
Based on the content of the foregoing embodiment, in this embodiment, after determining, according to the storage pools of different types and performances, the storage manner corresponding to the storage pools of different types and performances, the method further includes:
if the cluster application scene is high in reading and writing speed, selecting an object for storage, and further creating a virtual machine containing high-performance osd;
if the cluster application scene is of a common read-write speed, selecting block storage, and further creating a virtual machine containing common performance osd;
and if the cluster application scene is low in read-write speed but high in storage space requirement, selecting files for storage, and further creating a virtual machine containing low-performance osd.
In this embodiment, it should be noted that the virtual machine is created by selecting a corresponding storage manner according to different application scenarios. Specifically, a high-performance storage pool corresponding to a high-performance osd composed of hard disks with high read-write speed is matched with object storage with better read-write capability and storage capability, and then a virtual machine containing the high-performance osd is created; matching the read-write capacity and the block storage with the common storage capacity by using a common performance storage pool corresponding to a hard disk with a high read-write speed and a hard disk with a large storage space, and further creating a virtual machine containing the common performance osd; and a low-performance storage pool corresponding to the low-performance osd consisting of hard disks with large storage space is used for storing files with poor matching read-write capability and excellent storage capability, so that a virtual machine containing the low-performance osd is created.
In this embodiment, it should be noted that the method and the system can solve the problems of automatic construction of a distributed storage Ceph cluster, construction (repeated construction) of a Ceph cluster after formatting, external connection of a Ceph cluster, addition and deletion of Ceph cluster nodes, OSD disk selection of the Ceph cluster, performance setting of a Ceph cluster storage pool, interaction between Ceph multiple storage types and a computing module, and the like. The method comprises the following steps: the method comprises the steps of configuring a ceph cluster node, reserving a formatted ceph environment, deleting a ceph cluster main node, externally connecting a ceph cluster software and hardware information input database, selecting the type and the capacity of a hard disk used by OSD, dividing the performance of a storage pool, and interacting the storage type (file storage, block storage and object storage) with a computing module. According to the cloud storage design and use method, the cloud computing storage module suitable for various computing modules is designed through the selection combination of the operating system, the hard disk and the network, the cloud storage module is easy to build, maintain and use, various hardware resources are reasonably and efficiently utilized, and a set of cloud storage design and use scheme suitable for most cloud platform product computing modules is researched under the condition that the hardware resources are maximally utilized as cloud storage.
Based on the same inventive concept, another embodiment of the present invention provides a ceph-type distributed storage cluster device, and referring to fig. 6, an embodiment of the present invention provides a schematic diagram of a ceph-type distributed storage cluster device, where the ceph-type distributed storage cluster device includes: a first determination module 61, a second determination module 62, a third determination module 63, and a fourth determination module 64, wherein: the first determining module is used for selecting hard disks according to different application scene requirements and creating osds with different types and performances;
the second determining module is used for configuring the cluster network according to the read-write speed requirement of the hard disk;
a third determining module, configured to create storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
and the fourth determining module is used for determining the storage mode corresponding to the storage pool with different type and performance according to the storage pool with different type and performance.
In the embodiment, the different application scenario requirements are distributed storage requirements generated by the enterprise in different application scenarios, including storage requirements related to storage space and reading speed. And carrying out model selection combination on the hard disks according to storage requirements of enterprises generated under different application scenes to obtain osds with different types and performances, and meanwhile, reasonably selecting the osd as the ceph cluster storage device according to the existing hardware conditions of the enterprises. The osd is a process responsible for physical storage, and is generally configured to correspond to disks one to one, one disk starts an osd process, and one disk and a daemon process corresponding to the disk are called an osd. In this step, one disk may include one or two hard disks. The hard disk with high read-write speed corresponds to high-performance osd, the hard disk with high read-write speed and the hard disk with large storage space corresponds to common-performance osd, and the hard disk with large storage space independently corresponds to low-performance osd.
In this embodiment, when network configuration is performed on a ceph storage cluster, a corresponding IP address needs to be added, as shown in fig. 3, multiple ceph cluster nodes may be set, and it is determined whether data packets can be successfully exchanged (transmitted and received) between different hosts according to ping, so as to ensure that each cluster node must ping each other.
In this step, the ceph cluster uses two networks, i.e., an internal network and an external network, the external network is used for providing services for external users, the ceph cluster relies on an internal network to maintain the document of the cluster, the Monitor node needs to Monitor the heartbeat of the osd through the internal network at any time, and meanwhile, the data replication needs to be performed through the internal network. The SSD fast writing speed can reach 500MB/s per block, and in order to prevent the network from becoming the performance bottleneck of the Ceph cluster, a trillion network is recommended to be used; thus, the server is required to be configured with two gigabit optical ports and two gigabit switches as main switches and standby switches. In order to provide better performance for the outside, the gigabit optical port is also used for providing service for the outside, the gigabit optical port is configured redundantly and is connected to an external gigabit switch, so that the performance of the system is improved. In consideration of the work of system deployment, subsequent cluster maintenance and the like, the server uses the independent network port for the work of system deployment, cluster maintenance and the like, the work does not need high network speed, and the gigabit network port can completely meet the requirement.
In this embodiment, a storage pool is a logical partition when ceph stores data, which acts as a namespace. Each storage pool may correspond to multiple osds, thus creating storage pools of different types and capabilities based on osds of different types and capabilities. With each storage pool corresponding to osd of the same type and performance. For example: all high-performance osds composed of hard disks with high read-write speed in the cluster correspond to a high-performance storage pool, all general-performance osds composed of hard disks with high read-write speed and hard disks with large storage space in the cluster correspond to a general-performance storage pool, and all low-performance osds composed of hard disks with large storage space in the cluster correspond to a low-performance storage pool.
In this embodiment, the storage means includes block storage, object storage, and file storage. The file storage and reading speed is low, the transmission speed is low, and a third party is required to transfer the file storage; the block storage read-write speed is high, and the capacity of a logic disk is large; the object storage read-write speed is high, and the transmission speed is high. And determining the storage mode matched with the storage pool according to the type and the performance of the storage pool. For example: matching a high-performance storage pool corresponding to a high-performance osd consisting of hard disks with high read-write speed with an object storage with better read-write capability and storage capability; matching the read-write capacity and the block storage with the common storage capacity by the common performance storage pool corresponding to the hard disk with the high read-write speed and the hard disk with the large storage space; and a low-performance storage pool corresponding to the low-performance osd formed by the hard disks with large storage space has poor matching read-write capability and better storage capability.
According to the technical scheme, the ceph-type distributed storage cluster device provided by the embodiment of the invention creates osd hard disks with different types and performances according to different application scene requirements, further obtains storage pools with different types and performances, determines corresponding storage modes according to the storage pools with different types and performances, and can reasonably select the hard disks and the storage types according to enterprise requirements and existing hardware conditions, so that hardware resources are utilized to the maximum extent, and a cloud storage design use scheme suitable for most cloud platform product computing modules is generated.
The ceph type distributed storage cluster device described in this embodiment may be used to execute the foregoing method embodiments, and the principle and technical effect are similar, which are not described herein again.
Based on the same inventive concept, another embodiment of the present invention provides an electronic device, which is shown in fig. 7, and specifically includes the following contents: a processor 701, a memory 702, a communication interface 703 and a communication bus 704;
the processor 701, the memory 702 and the communication interface 703 complete mutual communication through the communication bus 704; the communication interface 703 is used for implementing information transmission between the devices;
the processor 701 is configured to call a computer program in the memory 702, and when the processor executes the computer program, the processor implements all the steps of the above-mentioned ceph type distributed storage cluster design method, for example, when the processor executes the computer program, the processor implements the following steps: selecting a hard disk according to different application scene requirements, and creating osd with different types and performances; carrying out cluster network configuration according to different application scene requirements; creating storage pools of different types and performances according to the osds of different types and performances; wherein the storage pools of different types and performance correspond to osds of different types and performance; and determining a storage mode corresponding to the storage pool with different types and performances according to the storage pool with different types and performances.
Based on the same inventive concept, a further embodiment of the present invention provides a non-transitory computer-readable storage medium, having stored thereon a computer program, which when executed by a processor implements all the steps of one of the above-mentioned ceph-type distributed storage cluster design methods, for example, the processor implements the following steps when executing the computer program: selecting a hard disk according to different application scene requirements, and creating osd with different types and performances; carrying out cluster network configuration according to different application scene requirements; creating storage pools of different types and performances according to the osds of different types and performances; wherein the storage pools of different types and performance correspond to osds of different types and performance; and determining a storage mode corresponding to the storage pool with different types and performances according to the storage pool with different types and performances.
In addition, the logic instructions in the memory may be implemented in the form of software functional units and may be stored in a computer readable storage medium when sold or used as a stand-alone product. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment of the present invention. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding, the technical solutions mentioned above may be embodied in the form of a software product, which may be stored in a computer-readable storage medium, such as a ROM/RAM, a magnetic disk, an optical disk, or the like, and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device) to execute the ceph type distributed storage cluster design method described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A design method of a ceph type distributed storage cluster is characterized by comprising the following steps:
selecting a hard disk according to different application scene requirements, and creating osd with different types and performances;
according to the read-write speed requirement of the hard disk, cluster network configuration is carried out;
creating storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
and determining a storage mode corresponding to the storage pool with different types and performances according to the storage pool with different types and performances.
2. The ceph-type distributed storage cluster design method according to claim 1, wherein each osd is composed of a solid state disk SSD and a mechanical hard disk HHD collocated, where SSD is used as log disk or data disk and HHD is used as data disk.
3. The ceph-type distributed storage cluster design method according to claim 1 or 2, wherein the hard disk selection according to different application scenario requirements and the creation of osds of different types and performances comprises:
different types and capabilities of osd are created according to the respective read-write speeds and storage capabilities of the SSD and HHD.
4. The ceph-type distributed storage cluster design method according to claim 3, wherein the creating osd of different types and performances according to the respective read-write speed and storage performance of SSD and HHD comprises:
when the read-write speed of the osd is required to be 300-600M per second, a SSD data disk and an SSD log disk are adopted to form the high-performance osd;
when the read-write speed of the osd is required to be 210-380M per second, an SSD log disk and an HHD data disk are adopted to form the osd with general performance;
when the read-write speed of the osd is required to be 60-80M per second, a HHD data disc is adopted to form low-performance osd.
5. The ceph-type distributed storage cluster design method according to claim 1, wherein said creating different types and capabilities of storage pools according to said different types and capabilities of osd comprises:
and corresponding osds with the same type and performance in the osds with different types and performances to the same storage pool to obtain a plurality of storage pools with different performances and types.
6. The ceph-type distributed storage cluster design method according to claim 1, wherein the determining, according to the storage pools of different types and performances, the storage manner corresponding to the storage pools of different types and performances includes:
and determining the corresponding storage mode to be block storage, object storage or file storage according to the storage pools with different types and performances.
7. The ceph-type distributed storage cluster design method according to claim 1 or 6, after determining, according to the storage pools of different types and performances, the storage manner corresponding to the storage pools of different types and performances, the method further comprising:
if the cluster application scene is high in reading and writing speed, selecting an object for storage, and further creating a virtual machine containing high-performance osd;
if the cluster application scene is of a common read-write speed, selecting block storage, and further creating a virtual machine containing common performance osd;
and if the cluster application scene is low in read-write speed but high in storage space requirement, selecting files for storage, and further creating a virtual machine containing low-performance osd.
8. A ceph-type distributed storage cluster apparatus, comprising:
the first determining module is used for selecting hard disks according to different application scene requirements and creating osds with different types and performances;
the second determining module is used for configuring the cluster network according to the read-write speed requirement of the hard disk;
a third determining module, configured to create storage pools of different types and performances according to the osds of different types and performances;
wherein the storage pools of different types and performance correspond to osds of different types and performance;
and the fourth determining module is used for determining the storage mode corresponding to the storage pool with different type and performance according to the storage pool with different type and performance.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the ceph-type distributed storage cluster design method according to any of claims 1 to 7 are implemented by the processor when executing the program.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the ceph-type distributed storage cluster design method according to any one of claims 1 to 7.
CN202010973881.0A 2020-09-16 2020-09-16 Design method and device for ceph type distributed storage cluster Pending CN112148219A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010973881.0A CN112148219A (en) 2020-09-16 2020-09-16 Design method and device for ceph type distributed storage cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010973881.0A CN112148219A (en) 2020-09-16 2020-09-16 Design method and device for ceph type distributed storage cluster

Publications (1)

Publication Number Publication Date
CN112148219A true CN112148219A (en) 2020-12-29

Family

ID=73892928

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010973881.0A Pending CN112148219A (en) 2020-09-16 2020-09-16 Design method and device for ceph type distributed storage cluster

Country Status (1)

Country Link
CN (1) CN112148219A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112988066A (en) * 2021-02-23 2021-06-18 新华三大数据技术有限公司 Data processing method and device
CN113031857A (en) * 2021-01-30 2021-06-25 柏科数据技术(深圳)股份有限公司 Data writing method, device, server and storage medium
CN113254969A (en) * 2021-06-08 2021-08-13 挂号网(杭州)科技有限公司 Service data processing method and device, electronic equipment and storage medium
CN113821165A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040199566A1 (en) * 2003-03-14 2004-10-07 International Business Machines Corporation System, method, and apparatus for policy-based data management
DE102008009851A1 (en) * 2007-02-13 2008-09-04 Samsung Electronics Co., Ltd., Suwon Host data processing system e.g. computer, operating method, involves sending file delete command to external storage device that includes erase before write memory device for storing file
EP2081115A1 (en) * 2008-01-16 2009-07-22 Samsung Electronics Co., Ltd. Durable data storage system and method
CN104283960A (en) * 2014-10-15 2015-01-14 福建亿榕信息技术有限公司 System for achieving heterogeneous network storage virtualization integration and hierarchical management
US20150058487A1 (en) * 2013-08-26 2015-02-26 Vmware, Inc. Translating high level requirements policies to distributed configurations
US20150160872A1 (en) * 2013-12-09 2015-06-11 Hsun-Yuan Chen Operation method of distributed memory disk cluster storage system
CN108418860A (en) * 2018-01-26 2018-08-17 郑州云海信息技术有限公司 A kind of osd heartbeat means of communication based on ceph clusters
CN109189583A (en) * 2018-09-20 2019-01-11 郑州云海信息技术有限公司 A kind of distributed lock implementation method and device
CN109669631A (en) * 2018-12-10 2019-04-23 浪潮(北京)电子信息产业有限公司 A kind of storage pool planing method, device, equipment and computer readable storage medium
CN109714229A (en) * 2018-12-27 2019-05-03 山东超越数控电子股份有限公司 A kind of performance bottleneck localization method of distributed memory system
CN109992204A (en) * 2019-03-19 2019-07-09 新华三技术有限公司成都分公司 Date storage method and device
CN110222013A (en) * 2019-06-11 2019-09-10 深信服科技股份有限公司 A kind of method, system, equipment and storage medium that cluster storage capacity determines
CN110764953A (en) * 2019-09-30 2020-02-07 浪潮电子信息产业股份有限公司 Data recovery method, device and equipment and readable storage medium
CN110989924A (en) * 2019-10-30 2020-04-10 烽火通信科技股份有限公司 Metadata storage performance optimization method and storage server
CN111124301A (en) * 2019-12-18 2020-05-08 深圳供电局有限公司 Data consistency storage method and system of object storage device
CN111124282A (en) * 2019-11-29 2020-05-08 浪潮电子信息产业股份有限公司 Storage method, storage device and storage equipment in object storage system
CN111414136A (en) * 2020-03-13 2020-07-14 苏州浪潮智能科技有限公司 Method, system, device and medium for creating storage pool

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040199566A1 (en) * 2003-03-14 2004-10-07 International Business Machines Corporation System, method, and apparatus for policy-based data management
DE102008009851A1 (en) * 2007-02-13 2008-09-04 Samsung Electronics Co., Ltd., Suwon Host data processing system e.g. computer, operating method, involves sending file delete command to external storage device that includes erase before write memory device for storing file
EP2081115A1 (en) * 2008-01-16 2009-07-22 Samsung Electronics Co., Ltd. Durable data storage system and method
US20150058487A1 (en) * 2013-08-26 2015-02-26 Vmware, Inc. Translating high level requirements policies to distributed configurations
US20150160872A1 (en) * 2013-12-09 2015-06-11 Hsun-Yuan Chen Operation method of distributed memory disk cluster storage system
CN104283960A (en) * 2014-10-15 2015-01-14 福建亿榕信息技术有限公司 System for achieving heterogeneous network storage virtualization integration and hierarchical management
CN108418860A (en) * 2018-01-26 2018-08-17 郑州云海信息技术有限公司 A kind of osd heartbeat means of communication based on ceph clusters
CN109189583A (en) * 2018-09-20 2019-01-11 郑州云海信息技术有限公司 A kind of distributed lock implementation method and device
CN109669631A (en) * 2018-12-10 2019-04-23 浪潮(北京)电子信息产业有限公司 A kind of storage pool planing method, device, equipment and computer readable storage medium
CN109714229A (en) * 2018-12-27 2019-05-03 山东超越数控电子股份有限公司 A kind of performance bottleneck localization method of distributed memory system
CN109992204A (en) * 2019-03-19 2019-07-09 新华三技术有限公司成都分公司 Date storage method and device
CN110222013A (en) * 2019-06-11 2019-09-10 深信服科技股份有限公司 A kind of method, system, equipment and storage medium that cluster storage capacity determines
CN110764953A (en) * 2019-09-30 2020-02-07 浪潮电子信息产业股份有限公司 Data recovery method, device and equipment and readable storage medium
CN110989924A (en) * 2019-10-30 2020-04-10 烽火通信科技股份有限公司 Metadata storage performance optimization method and storage server
CN111124282A (en) * 2019-11-29 2020-05-08 浪潮电子信息产业股份有限公司 Storage method, storage device and storage equipment in object storage system
CN111124301A (en) * 2019-12-18 2020-05-08 深圳供电局有限公司 Data consistency storage method and system of object storage device
CN111414136A (en) * 2020-03-13 2020-07-14 苏州浪潮智能科技有限公司 Method, system, device and medium for creating storage pool

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113031857A (en) * 2021-01-30 2021-06-25 柏科数据技术(深圳)股份有限公司 Data writing method, device, server and storage medium
CN112988066A (en) * 2021-02-23 2021-06-18 新华三大数据技术有限公司 Data processing method and device
CN112988066B (en) * 2021-02-23 2022-08-30 新华三大数据技术有限公司 Data processing method and device
CN113254969A (en) * 2021-06-08 2021-08-13 挂号网(杭州)科技有限公司 Service data processing method and device, electronic equipment and storage medium
CN113821165A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment
CN113821165B (en) * 2021-08-20 2023-12-22 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment

Similar Documents

Publication Publication Date Title
CN112148219A (en) Design method and device for ceph type distributed storage cluster
US10735509B2 (en) Systems and methods for synchronizing microservice data stores
US8938604B2 (en) Data backup using distributed hash tables
US10481804B2 (en) Cluster system with calculation and storage converged
US20140136892A1 (en) Virtual tape library device and data recovery method
US20190235979A1 (en) Systems and methods for performing computing cluster node switchover
CN101808127B (en) Data backup method, system and server
CN107666493B (en) Database configuration method and equipment thereof
US9336093B2 (en) Information processing system and access control method
WO2019085975A1 (en) Network topology display method and network management device
US20210263968A1 (en) Context preservation
CN112202853B (en) Data synchronization method, system, computer device and storage medium
CN111913782A (en) Method and equipment for realizing virtual machine flow mirror image based on tunnel technology
US8060773B1 (en) Systems and methods for managing sub-clusters within a multi-cluster computing system subsequent to a network-partition event
US9513996B2 (en) Information processing apparatus, computer-readable recording medium having stored program for controlling information processing apparatus, and method for controlling information processing apparatus
KR20230024416A (en) Virtual Machine Migration Methods, Devices, Storage Media and Electronic Devices Between Cloud Platforms
CN111865632A (en) Switching method of distributed data storage cluster and switching instruction sending method and device
CN109254873B (en) Data backup method, related device and system
US9015433B2 (en) Resiliency with a destination volume in a replication environment
US8621260B1 (en) Site-level sub-cluster dependencies
US11301436B2 (en) File storage method and storage apparatus
EP4357949A1 (en) Authentication method and apparatus, and storage system
JP5016696B2 (en) High availability system, server, high availability maintenance method and program
CN115510016A (en) Client response method, device and medium based on directory fragmentation
JP2024506131A (en) Virtual machine memory sharing methods and devices

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination