CN111917853A - Optimization method for distributed cache scaling of content distribution network - Google Patents


Info

Publication number
CN111917853A
Authority
CN
China
Prior art keywords
cache
hash
cache server
server
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010724041.0A
Other languages
Chinese (zh)
Inventor
赵明
房兰涛
谢恩鹏
杨明生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Yunman Intelligent Technology Co ltd
Original Assignee
Shandong Yunman Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Yunman Intelligent Technology Co ltd filed Critical Shandong Yunman Intelligent Technology Co ltd
Priority to CN202010724041.0A priority Critical patent/CN111917853A/en
Publication of CN111917853A publication Critical patent/CN111917853A/en
Pending legal-status Critical Current

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 Network arrangements or protocols for supporting network services or applications
    • H04L 67/50 Network services
    • H04L 67/56 Provisioning of proxy services
    • H04L 67/568 Storing data temporarily at an intermediate stage, e.g. caching
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/10 File systems; File servers
    • G06F 16/13 File access structures, e.g. distributed indices
    • G06F 16/137 Hash-based
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/10 File systems; File servers
    • G06F 16/17 Details of further file system functions
    • G06F 16/172 Caching, prefetching or hoarding of files
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/10 File systems; File servers
    • G06F 16/18 File system types
    • G06F 16/182 Distributed file systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 Network arrangements or protocols for supporting network services or applications
    • H04L 67/01 Protocols
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L 67/00 Network arrangements or protocols for supporting network services or applications
    • H04L 67/50 Network services
    • H04L 67/60 Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H04L 67/63 Routing a service request depending on the request content or context

Abstract

The invention provides an optimization method for scaling the distributed cache of a content distribution network, which preserves the validity of the existing cache as far as possible when a distributed cache-server cluster expands, contracts, or adjusts weights. The method comprises the following steps: S1, a client sends a file request; S2, the request passes through a scheduling service; S3, a consistent-hash distribution map of the cache servers is generated; S4, a hash value of the file path requested by the client is generated; S5, the position of the file-path hash value is looked up in the consistent-hash distribution map; S6, the cache server to use is determined from the position of the hash value; S7, it is judged whether the requested file hits the cache; if yes, step S8 is executed, and if not, step S9 is executed; S8, the client's file request is answered; S9, the file is fetched from the origin station and cached locally on the cache server, and then step S8 is executed.

Description

Optimization method for distributed cache scaling of content distribution network
Technical Field
The invention relates to an optimization method for scaling the distributed cache of a content distribution network, and belongs to the technical field of networks.
Background
A cache server typically has several characteristics: user access traffic is large; back-to-source (origin) bandwidth is limited; and the cache space of the server is limited, so full storage of all resources cannot be guaranteed. For these reasons a cache server stores only hot resources. The more content the cache servers hold, the more likely a client's requested file download is to hit the cache, reducing traffic back to the origin station. When the number and storage capacity of the cache servers are fixed, caching as many hot files as possible becomes the key to improving cache utilization. The best way to achieve this is for each server in the cluster to cache different files, avoiding duplicates. There are generally two approaches to avoiding duplication: first, building an index dictionary that records the correspondence between files and cache servers; second, deriving the file-to-server mapping from a hash of the file path. The index dictionary of the former consumes considerable time and computation on every query. The invention therefore distributes files across the cache servers by hashing the file path, building a distributed cache with no duplicated files. However, when such a distributed cache is scaled, traditional hash distribution redistributes the files, invalidating a large share of the cached files on every server. Once the cache is invalidated on a large scale, back-to-source bandwidth spikes in a short time and the service capability of the content distribution network drops.
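The mass-invalidation problem of traditional hash distribution can be illustrated with a short sketch (Python is used purely for illustration; the file paths and the CRC32 hash are hypothetical stand-ins, not part of the invention): with naive hash-mod-N placement, growing a cluster from 3 to 4 servers reassigns roughly three quarters of all files.

```python
import zlib

# Hypothetical file paths; any stable hash works, CRC32 is used here
# only as a convenient stand-in for illustration.
paths = [f"/video/{i}.mp4" for i in range(10_000)]

def assign(path: str, n_servers: int) -> int:
    # Naive placement: server index = hash(path) mod server count
    return zlib.crc32(path.encode()) % n_servers

# Count files whose owning server changes when a 4th server is added
moved = sum(assign(p, 3) != assign(p, 4) for p in paths)
print(f"{moved / len(paths):.0%} of cached files change server")
```

For uniform hash values the survival rate is only 3/12 = 25% (a value keeps its server only when its hash mod 12 is 0, 1, or 2), so about 75% of the cache is invalidated, which is exactly the mass invalidation the consistent-hash scheme of the invention avoids.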
Disclosure of Invention
The invention aims to provide an optimization method for scaling the distributed cache of a content distribution network that preserves the validity of the existing cache as far as possible when the distributed cache-server cluster expands, contracts, or adjusts weights.
In order to achieve the purpose, the invention is realized by the following technical scheme:
a method for optimizing distributed cache scaling for a content distribution network, comprising the steps of:
s1, setting a client, and sending a request file by the client;
s2, using a calling service;
s3, generating a cache server consistency Hash distribution graph;
s4, generating a file path hash value requested by the client;
s5, searching the position of the file path hash value in the cache server consistency hash distribution graph;
s6, determining which cache server is used according to the position of the hash value;
s7, whether the files in the cache server hit the cache or not is judged, if yes, the step S8 is executed, and if not, the step S9 is executed;
s8, responding to a file request of the client;
s9, returning the source from the source station and locally caching the source in a cache server, and then executing the step S8.
On the basis of the above optimization method for distributed cache scaling of a content distribution network, the consistent-hash distribution map of the cache servers is generated as follows:
S301, setting several virtual-node names for each cache server;
S302, computing a 32-bit hash value for each server virtual node with the following algorithm (32-bit FNV-1a):
hash = 32-bit FNV offset basis
for each byte in the name string:
    hash = (hash XOR byte) × 32-bit FNV prime (mod 2^32)
return hash
The resulting value is an integer in the range 0-4,294,967,295;
S303, placing the integer values on a line segment of length 2^32 to form the server hash distribution map.
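The pseudocode in S302 matches the published 32-bit FNV-1a algorithm. A minimal illustrative sketch in Python (the constants are the standard FNV-1a offset basis and prime; the patent itself gives only pseudocode):

```python
FNV_OFFSET_BASIS = 2166136261   # standard 32-bit FNV offset basis
FNV_PRIME = 16777619            # standard 32-bit FNV prime

def fnv1a_32(data: bytes) -> int:
    """32-bit FNV-1a: XOR each byte into the hash, then multiply by the prime."""
    h = FNV_OFFSET_BASIS
    for byte in data:
        h = ((h ^ byte) * FNV_PRIME) & 0xFFFFFFFF  # keep 32 bits
    return h

# The result is always an integer in 0 .. 4,294,967,295 (2**32 - 1)
print(hex(fnv1a_32(b"a")))  # 0xe40c292c, the published FNV-1a test vector
```

Hashing each virtual-node name (e.g. "s1n1") with this function yields the integer points that S303 places on the 2^32-long line segment.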
On the basis of the above optimization method for distributed cache scaling of a content distribution network, the scheduling service uses Keepalived, NLB, or similar software for high availability.
On the basis of the above optimization method for distributed cache scaling of a content distribution network, the back-to-source caching function of the cache server, namely reverse proxying and caching, is implemented with Nginx software.
The invention has the advantage that, because the cache files are distributed by consistent hashing, most of the cache remains valid after the distributed cache-server cluster is scaled up or down or the weight ratio among the cache servers is adjusted, optimizing both the back-to-source bandwidth of the cache servers and the utilization of the cache disks.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.
FIG. 1 is a schematic diagram of a cache server consistent hash distribution according to the present invention.
Fig. 1-1 is a schematic diagram of the initial distribution states of the cache servers s1, s2, and s3.
Fig. 1-2 are schematic diagrams of the distribution status of the cache server s3 after removal.
Fig. 1-3 are schematic diagrams illustrating distribution states after a cache server s4 is added.
Fig. 1-4 are schematic diagrams of the distribution status of the cache server s3 after doubling the weight.
Fig. 2 is a flowchart of the operation of the distributed cache server.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to optimize the distributed cache in a content distribution network and preserve the validity of the existing cache as much as possible during expansion, contraction, and weight adjustment, the invention provides an optimization method for distributed cache scaling comprising the following steps:
S1, a client sends a file request;
S2, the request passes through a scheduling service at an edge-node cache cluster of the content distribution network; the scheduling service may run on an independent server or be co-hosted on a cache server; it achieves high availability with Keepalived, NLB, or similar software, and needs no session-sharing function; several scheduling services may be set up in a distributed cache-server cluster, with load balancing among them provided by software such as LVS;
S3, a consistent-hash distribution map of the cache servers is generated;
S4, a hash value of the file path requested by the client is generated;
S5, the position of the file-path hash value is looked up in the consistent-hash distribution map;
S6, the cache server to use is determined from the position of the hash value;
S7, it is judged whether the requested file hits the cache; if yes, step S8 is executed, and if not, step S9 is executed;
S8, the client's file request is answered;
S9, the file is fetched from the origin station and cached locally on the cache server, and then step S8 is executed.
The consistent-hash distribution map of the cache servers is generated as follows:
S301, setting several virtual-node names for each cache server;
S302, computing a 32-bit hash value for each server virtual node with the following algorithm (32-bit FNV-1a):
hash = 32-bit FNV offset basis
for each byte in the name string:
    hash = (hash XOR byte) × 32-bit FNV prime (mod 2^32)
return hash
The value computed by the hash algorithm is an integer in the range 0-4,294,967,295 (i.e., 0 to 2^32 - 1);
S303, placing the integer values on a line segment of length 2^32 to form the server hash distribution map.
Detailed Description of Embodiments of the Invention
Taking fig. 1 as an example, assume the distributed cluster contains three cache servers (denoted s1, s2, s3), each configured with 4 virtual nodes (denoted n1, n2, n3, n4), giving 12 virtual nodes in total (named s1n1, s1n2, s1n3, s1n4, s2n1, s2n2, s2n3, s2n4, s3n1, s3n2, s3n3, s3n4). Hashing the 12 virtual-node names with 32-bit FNV-1a yields 12 integers, all in the range 0-4,294,967,295. As shown in fig. 1-1, the line segment spans 0-4,294,967,295, and the integer of each of the 12 server virtual nodes is placed at its position on the segment. Each integer point extends rightward to the next integer point, forming one closed segment (the leftmost region, from 0 to s1n3, merges into the segment from s1n3 to s2n2), so the result is a hash distribution map containing 12 segments.
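The worked example above (3 servers × 4 virtual nodes = 12 points) can be sketched as follows; Python is used for illustration, and the `sXnY` node-name format follows the example in the text:

```python
FNV_OFFSET_BASIS, FNV_PRIME = 2166136261, 16777619

def fnv1a_32(data: bytes) -> int:
    """32-bit FNV-1a hash, as in step S302."""
    h = FNV_OFFSET_BASIS
    for byte in data:
        h = ((h ^ byte) * FNV_PRIME) & 0xFFFFFFFF
    return h

def build_ring(servers, vnodes_per_server=4):
    """Hash every virtual-node name to a point on the 0..2**32-1 line
    and return the points sorted, as (point, server) pairs."""
    ring = []
    for s in servers:
        for i in range(1, vnodes_per_server + 1):
            ring.append((fnv1a_32(f"{s}n{i}".encode()), s))
    ring.sort()
    return ring

ring = build_ring(["s1", "s2", "s3"])
print(len(ring))  # 12 virtual-node points, as in fig. 1-1
```

Sorting the points is what turns the 12 hashes into the ordered sequence of segments that the distribution map describes.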
When the scheduling system of the cache-server cluster assigns a cache file, it computes an integer from the target file's path with the 32-bit FNV-1a hash, then finds the segment containing that integer in the generated cache-server hash distribution map, thereby locating the cache server that should hold the target file.
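The placement lookup can be sketched as a binary search over the sorted points. Following the extend-right rule above, a file belongs to the nearest virtual-node point at or below its hash, with the region before the first point merged into the first segment (helper names are illustrative; Python for illustration):

```python
import bisect

FNV_OFFSET_BASIS, FNV_PRIME = 2166136261, 16777619

def fnv1a_32(data: bytes) -> int:
    h = FNV_OFFSET_BASIS
    for byte in data:
        h = ((h ^ byte) * FNV_PRIME) & 0xFFFFFFFF
    return h

def build_ring(servers, vnodes_per_server=4):
    ring = [(fnv1a_32(f"{s}n{i}".encode()), s)
            for s in servers for i in range(1, vnodes_per_server + 1)]
    ring.sort()
    return ring

def locate(ring, file_path: str) -> str:
    """Map a file path to its cache server on the hash distribution line."""
    h = fnv1a_32(file_path.encode())
    points = [p for p, _ in ring]
    i = bisect.bisect_right(points, h) - 1  # nearest point at or below h
    if i < 0:
        i = 0  # region before the first point merges into the first segment
    return ring[i][1]

ring = build_ring(["s1", "s2", "s3"])
print(locate(ring, "/video/movie.mp4"))  # one of s1, s2, s3
```

Because the lookup depends only on the file-path hash and the sorted points, every scheduling server that builds the same ring resolves a given path to the same cache server.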
Building on this example, the following sections explain how the scheduling service preserves the original cache distribution as much as possible when the distributed cache-server cluster shrinks, grows, or adjusts weights.
Starting from the distribution of fig. 1-1, consider cluster contraction: assume cache server 3 is removed. As shown in fig. 1-2, the 4 virtual-node integer points originally belonging to cache server 3 (s3n1, s3n2, s3n3, s3n4) are removed; they are marked in gray in fig. 1-2 as -s3n1, -s3n2, -s3n3, -s3n4. After each remaining integer point is again extended to the next integer point, some virtual nodes of servers 1 and 2 stretch to cover the positions previously held by server 3 (the changed regions are the gray parts of fig. 1-2). After this adjustment, server 3's former range is split between servers 1 and 2. The net effect is that the existing cache files on servers 1 and 2 remain valid, and those servers also take over part of server 3's cache files. Contraction therefore preserves the validity of the original cache files to the greatest extent while maintaining load balance.
Starting from the distribution of fig. 1-1, consider cluster expansion: assume a new cache server 4 is added. As shown in figs. 1-3, 4 virtual-node integer points (s4n1, s4n2, s4n3, s4n4) are generated for server 4 in the range 0-4,294,967,295; they are marked in gray in figs. 1-3 as +s4n1, +s4n2, +s4n3, +s4n4. After each integer point is again extended to the next integer point, parts of the original servers' segments are taken over by the virtual nodes of the new server 4 (the changed regions are the gray parts of figs. 1-3). After this adjustment, server 4's segments occupy a portion of the distribution segments of the original three cache servers. Only the cache files falling in those portions become invalid on the original servers, and they are taken over by the new cache server 4. Expansion therefore minimizes the invalidation rate of the original cache files while maintaining load balance.
Starting from the distribution of fig. 1-1, consider adjusting the load weights: assume the weight of cache server 3 is doubled, increasing its virtual nodes from 4 to 8. As shown in figs. 1-4, 4 additional virtual-node integer points (s3n5, s3n6, s3n7, s3n8) are generated for server 3 in the range 0-4,294,967,295; they are marked in gray in figs. 1-4 as +s3n5, +s3n6, +s3n7, +s3n8. After each integer point is again extended to the next integer point, parts of the other servers' segments are taken over by server 3's added virtual nodes (the changed regions are the gray parts of figs. 1-4). After this adjustment, the up-weighted server 3 occupies a portion of the original servers' distribution segments, so a portion of the files originally on cache servers 1 and 2 is reassigned to server 3. Weight adjustment therefore minimizes the invalidation rate of the original cache files while maintaining load balance.
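The three scenarios share one property: only the segments actually claimed or released change owners. A sketch of the expansion case (illustrative paths and helpers; Python for illustration) shows that every reassigned file lands on the new server, while files that stay put keep their cache entries valid:

```python
import bisect

FNV_OFFSET_BASIS, FNV_PRIME = 2166136261, 16777619

def fnv1a_32(data: bytes) -> int:
    h = FNV_OFFSET_BASIS
    for byte in data:
        h = ((h ^ byte) * FNV_PRIME) & 0xFFFFFFFF
    return h

def build_ring(servers, vnodes=4):
    ring = [(fnv1a_32(f"{s}n{i}".encode()), s)
            for s in servers for i in range(1, vnodes + 1)]
    ring.sort()
    return ring

def locate(ring, path):
    h = fnv1a_32(path.encode())
    points = [p for p, _ in ring]
    return ring[max(bisect.bisect_right(points, h) - 1, 0)][1]

paths = [f"/video/{i}.mp4" for i in range(10_000)]
old_ring = build_ring(["s1", "s2", "s3"])
new_ring = build_ring(["s1", "s2", "s3", "s4"])
old = {p: locate(old_ring, p) for p in paths}
new = {p: locate(new_ring, p) for p in paths}

moved = [p for p in paths if old[p] != new[p]]
# No file ever moves between the surviving servers s1..s3; a file either
# keeps its server (cache still valid) or is claimed by the new s4.
assert all(new[p] == "s4" for p in moved)
print(f"{len(moved) / len(paths):.0%} of files move, all to s4")
```

Contrast this with the hash-mod-N sketch in the Background section, where adding a server also shuffles files among the servers that remain.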
The present invention is described in detail with reference to fig. 2.
1. A highly available scheduling service must be established; this can be achieved with Keepalived, NLB, or similar software, together with load balancing among the scheduling servers.
2. When a client uses the cache-server cluster, all client requests pass through the scheduling server. The scheduling server generates the appropriate number of virtual nodes for the cache servers according to their number and weights in the cluster, computes an integer hash value for each node with the 32-bit FNV-1a hash algorithm, and finally places those integers on a line segment of length 2^32 to form the server hash distribution map.
3. Based on the hash distribution map, the scheduling server computes an integer hash value from the file path requested by the client with the same 32-bit FNV-1a algorithm, then looks up the position of that value in the cache servers' hash distribution map, thereby determining which cache server should serve the file requested by the client.
4. The cache server then completes a series of operations (checking whether the file is cached, fetching it from the origin on a miss, and caching the fetched file) before providing the download service to the client. The back-to-source and caching functions of the cache server can be handled by high-performance software such as Nginx.
With the method of the invention, when the distributed cache-server cluster expands, contracts, or adjusts weights, the consistent-hash distribution among the servers ensures that most cached files remain valid.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (4)

1. A method for optimizing distributed cache scaling of a content distribution network, characterized by comprising the following steps:
S1, a client sends a file request;
S2, the request passes through a scheduling service;
S3, a consistent-hash distribution map of the cache servers is generated;
S4, a hash value of the file path requested by the client is generated;
S5, the position of the file-path hash value is looked up in the consistent-hash distribution map;
S6, the cache server to use is determined from the position of the hash value;
S7, it is judged whether the requested file hits the cache; if yes, step S8 is executed, and if not, step S9 is executed;
S8, the client's file request is answered;
S9, the file is fetched from the origin station and cached locally on the cache server, and then step S8 is executed.
2. The method for optimizing distributed cache scaling of a content distribution network according to claim 1, wherein the consistent-hash distribution map of the cache servers is generated as follows:
S301, setting several virtual-node names for each cache server;
S302, computing a 32-bit hash value for each server virtual node with the following algorithm (32-bit FNV-1a):
hash = 32-bit FNV offset basis
for each byte in the name string:
    hash = (hash XOR byte) × 32-bit FNV prime (mod 2^32)
return hash
The resulting value is an integer in the range 0-4,294,967,295;
S303, placing the integer values on a line segment of length 2^32 to form the server hash distribution map.
3. The method for optimizing distributed cache scaling of a content distribution network according to claim 1, wherein the scheduling service uses Keepalived or NLB software.
4. The method for optimizing distributed cache scaling of a content distribution network according to claim 1, 2, or 3, wherein the back-to-source caching function of the cache server, namely reverse proxying and caching, is implemented with Nginx software.
CN202010724041.0A 2020-07-24 2020-07-24 Optimization method for distributed cache scaling of content distribution network Pending CN111917853A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010724041.0A CN111917853A (en) 2020-07-24 2020-07-24 Optimization method for distributed cache scaling of content distribution network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010724041.0A CN111917853A (en) 2020-07-24 2020-07-24 Optimization method for distributed cache scaling of content distribution network

Publications (1)

Publication Number Publication Date
CN111917853A true CN111917853A (en) 2020-11-10

Family

ID=73280796

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010724041.0A Pending CN111917853A (en) 2020-07-24 2020-07-24 Optimization method for distributed cache scaling of content distribution network

Country Status (1)

Country Link
CN (1) CN111917853A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650451A (en) * 2020-12-28 2021-04-13 杭州趣链科技有限公司 Optimization method and device for searching network server, computer equipment and storage medium
CN113852643A (en) * 2021-10-21 2021-12-28 西安电子科技大学 Content distribution network cache pollution defense method based on content popularity

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107197035A (en) * 2017-06-21 2017-09-22 中国民航大学 A kind of compatibility dynamic load balancing method based on uniformity hash algorithm
CN108124012A (en) * 2017-12-21 2018-06-05 中通服公众信息产业股份有限公司 A kind of distributed caching computational methods based on hash algorithm
CN111177154A (en) * 2019-12-27 2020-05-19 掌迅亿通(北京)信息科技有限公司 Distributed database caching method and hash ring optimization thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107197035A (en) * 2017-06-21 2017-09-22 中国民航大学 A kind of compatibility dynamic load balancing method based on uniformity hash algorithm
CN108124012A (en) * 2017-12-21 2018-06-05 中通服公众信息产业股份有限公司 A kind of distributed caching computational methods based on hash algorithm
CN111177154A (en) * 2019-12-27 2020-05-19 掌迅亿通(北京)信息科技有限公司 Distributed database caching method and hash ring optimization thereof

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650451A (en) * 2020-12-28 2021-04-13 杭州趣链科技有限公司 Optimization method and device for searching network server, computer equipment and storage medium
CN113852643A (en) * 2021-10-21 2021-12-28 西安电子科技大学 Content distribution network cache pollution defense method based on content popularity
CN113852643B (en) * 2021-10-21 2023-11-14 西安电子科技大学 Content distribution network cache pollution defense method based on content popularity

Similar Documents

Publication Publication Date Title
US10911530B2 (en) Content delivery method, virtual server management method, cloud platform, and system
CN108551474B (en) Load balancing method of server cluster
US7127513B2 (en) Method and apparatus for distributing requests among a plurality of resources
JP5901024B2 (en) Dynamic binding used for content delivery
US20070050475A1 (en) Network memory architecture
CN111338806B (en) Service control method and device
CN111917853A (en) Optimization method for distributed cache scaling of content distribution network
CN104811493A (en) Network-aware virtual machine mirroring storage system and read-write request handling method
US11140220B1 (en) Consistent hashing using the power of k choices in server placement
CN111177154A (en) Distributed database caching method and hash ring optimization thereof
CN106156255A (en) A kind of data buffer storage layer realization method and system
Ling et al. CDN cloud: A novel scheme for combining CDN and cloud computing
CN111159193A (en) Multi-layered consistent hash ring and its application in creating distributed database
CN105007328A (en) Network cache design method based on consistent hash
CN103226520B (en) Self-adaptive cluster memory management method, server cluster system
Sundarrajan et al. Midgress-aware traffic provisioning for content delivery
Zhang et al. A multi-level cache framework for remote resource access in transparent computing
Shoaib et al. Fast Data Access through Nearest Location-Based Replica Placement
CN117057799B (en) Asset data processing method, device, equipment and storage medium
CN113271215B (en) Network hierarchical control method, device and storage medium
WO2021166249A1 (en) Communication device, communication method, and program
Bok et al. Load Balancing with Load Threshold Adjustment in Structured P2P
Kim et al. Delay-aware distributed caching scheme in edge network
Feenan et al. Clustering Web Accelerators
Zhang Dynamic Load Balance Strategy Based on Hash Slots in Distributed Storage Environment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201110