CN109271236A

CN109271236A - A kind of method, apparatus of traffic scheduling, computer storage medium and terminal

Info

Publication number: CN109271236A
Application number: CN201811084824.6A
Authority: CN
Inventors: 何翔龙
Original assignee: Zhengzhou Yunhai Information Technology Co Ltd
Current assignee: Zhengzhou Yunhai Information Technology Co Ltd
Priority date: 2018-09-18
Filing date: 2018-09-18
Publication date: 2019-01-25

Abstract

A kind of method, apparatus of traffic scheduling, computer storage medium and terminal, comprising: the operation in system is divided into two or more priority levels；The scheduling of operation is carried out according to the priority level of division.The embodiment of the present invention realizes the scheduling of operation according to the division of priority level, optimizes the process of job run, improves the efficiency of job run.

Description

Service scheduling method, device, computer storage medium and terminal

Technical Field

The present disclosure relates to, but not limited to, computer clustering technologies, and in particular, to a method, an apparatus, a computer storage medium, and a terminal for scheduling a service.

Background

Kubernets (K8 s for short, is an open source, used for managing containerized applications on multiple hosts in a cloud platform, and its objective is to make deploying containerized applications simple and efficient, and provide a mechanism for application deployment, planning, updating, and maintenance) the main functions of the container cluster management system include: packaging, instantiating and running the application program by using a Docker (an open-source application container engine, so that developers can package their applications and dependence packages into a portable container and then release the package to any popular Linux machine, and can also realize virtualization); running and managing containers across hosts in a cluster manner; the problem of communication between containers running between different hosts is solved, and the like. The Scheduler (Scheduler) is a Scheduler loaded and operated in the kubernets container cluster management system, which is responsible for collecting and statistically analyzing the resource usage of all nodes (nodes) in the system, and then based on this, sends the newly-built Pod (Pod is an abstract concept, which contains a container group composed of one or more containers, and also the resources shared by these containers; Pod is similar to a logical host with specific application in model, which can contain different application containers that are relatively tightly coupled; Pod is an atomic unit in the kubernets platform, and Pod is operated on Node) to the available Node with the highest priority for building.

In the process of adding Pod, the scheduling strategy of the scheduler is divided into two stages: a preparation (predites) phase and a priority scheduling (Priorities) phase; the Predicates stage is mainly used for determining whether the Pod can be dispatched to a specific Node to run, outputting and recording all nodes meeting the requirements at the stage, and taking the result as the input of the second stage; the priority stage is to answer "which is the more appropriate question", i.e. to screen the nodes again, to screen out the node that is most appropriate to operate Pod. Specifically, the Predicates stage includes five scheduling strategies: respectively determining nodes suitable for operating Pod from the aspects of ports, resources, hard disks and the like, and recording the names of the nodes; the priority phase includes three scheduling policies: a least request resource priority scheduling policy (leaserequest priority), a least equal service priority scheduling policy (servicespareadingpriority), and an equal priority scheduling policy (equal priority). Comparing the nodes determined in the Predicates stage through the three scheduling strategies to determine the nodes finally used for operating Pod; specifically, the calculation principle of LeastRequestedpriority is to schedule the newly created Pod to a Node with smaller occupied calculation resources as much as possible, where the "calculation resources" include Central Processing Unit (CPU) resources and Memory (Memory) resources; the calculation principle of the serviceSpreadingpriority is that the number of the Pods belonging to the same service on the same Node is reduced as much as possible, so that the dispatched Pods can realize high availability and flow load balance of the service as much as possible; the calculation principle of equal priority is to equally treat each available Node screened out in the Predicates stage; at present, LeastRequestedpriority is the main basis for determining the node that is ultimately used to run the Pod.

Currently, a Kubernetes container cluster management system mainly carries out scheduling operation of operations according to time sequence, and when key operations do not operate timely, system service is influenced; in order to avoid the influence on system service, the operation scheduling is mainly performed manually by technicians, so that the operation is complex and the efficiency is low.

Disclosure of Invention

The following is a summary of the subject matter described in detail herein. This summary is not intended to limit the scope of the claims.

Embodiments of the present invention provide a method and an apparatus for scheduling a service, a computer storage medium, and a terminal, which can optimize a flow of job operation and improve efficiency of job operation.

The embodiment of the invention provides a method for scheduling service, which comprises the following steps:

dividing the jobs in the system into two or more priority levels;

and scheduling the job according to the divided priority level.

Optionally, the dividing the jobs in the system into two or more priority levels includes:

dividing the jobs in the system into a first priority and a second priority according to business logic and/or system available resources;

wherein the system available resources include part or all of the following resources: central processing unit resources and memory resources; the first priority is higher than the second priority.

Optionally, before dividing the jobs in the system into two or more priority levels, the method further includes, according to a preset period, performing the following determination processing:

when the system available resources in the system meet the running requirements of all jobs in the system, determining not to divide the priority level of the jobs;

and when the available system resources in the system cannot meet the running requirements of all the jobs in the system, determining to divide the priority levels of the jobs.

Optionally, after dividing the job in the system into two or more priority levels and before scheduling the job according to the divided priority levels, the method further includes:

judging whether the available resources of the system can meet the running of all the jobs with higher priority levels;

the system available resources can not meet the running of all the jobs with higher priority levels, and the running jobs with lower priority levels are stopped one by one until the system available resources can meet the running of all the jobs with higher priority levels;

wherein the higher priority level job comprises: the job with the highest priority level.

Optionally, the scheduling the job according to the divided priority levels includes:

adding the jobs with the divided priority levels into a preset scheduling queue according to the high-low sequence of the priority levels;

and scheduling the jobs according to the sequence of the jobs added to the scheduling queue.

Optionally, when the job is scheduled according to the divided priority levels, the method further includes:

judging the waiting time of the operation with lower priority level;

when the waiting time of the operation with the lower priority level is longer than the preset time, the priority level of the operation is adjusted to be a high priority level;

wherein the lower priority level job comprises: the job with the lowest priority level.

On the other hand, an embodiment of the present invention further provides a device for service scheduling, including: a dividing unit and a scheduling unit; wherein,

the dividing unit is used for: dividing the jobs in the system into two or more priority levels;

the scheduling unit is used for: and scheduling the job according to the divided priority level.

Optionally, the dividing unit is specifically configured to:

Optionally, the apparatus further includes a determining unit, configured to:

when the system available resources in the system meet the running requirements of all jobs in the system, determining not to divide the priority level of the jobs; and when the available system resources in the system cannot meet the running requirements of all the jobs in the system, determining to divide the priority levels of the jobs.

In another aspect, an embodiment of the present invention further provides a computer storage medium, where computer-executable instructions are stored in the computer storage medium, and the computer-executable instructions are used to execute the service scheduling method.

In another aspect, an embodiment of the present invention further provides a terminal, including: a memory and a processor; wherein,

the processor is configured to execute program instructions in the memory;

the program instructions read on the processor to perform the following operations:

dividing the jobs in the system into two or more priority levels;

and scheduling the job according to the divided priority level.

Compared with the related art, the technical scheme of the application comprises the following steps: dividing the jobs in the system into two or more priority levels; and scheduling the job according to the divided priority level. According to the embodiment of the invention, the scheduling of the operation is realized according to the division of the priority level, the operation flow is optimized, and the operation efficiency is improved.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

Drawings

The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the example serve to explain the principles of the invention and not to limit the invention.

Fig. 1 is a flowchart of a method for scheduling services according to an embodiment of the present invention;

fig. 2 is a block diagram of a service scheduling apparatus according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that the embodiments and features of the embodiments in the present application may be arbitrarily combined with each other without conflict.

The steps illustrated in the flow charts of the figures may be performed in a computer system such as a set of computer-executable instructions. Also, while a logical order is shown in the flow diagrams, in some cases, the steps shown or described may be performed in an order different than here.

Fig. 1 is a flowchart of a method for scheduling a service according to an embodiment of the present invention, as shown in fig. 1, including:

step 101, dividing the operation in the system into two or more priority levels;

It should be noted that, according to the service logic and the available resources of the system, a person skilled in the art can determine that the previous job with important service logic is a service with higher priority level.

Optionally, the dividing the jobs in the system into a first priority and a second priority according to the embodiment of the present invention includes:

and receiving the division information of the priority level through a preset information interaction interface, and dividing the jobs in the system into a first priority and a second priority.

It should be noted that the interactive interface of the embodiment of the present invention may include an Application Programming Interface (API).

And step 102, scheduling the job according to the divided priority level.

Optionally, before dividing the job in the system into two or more priority levels, the method according to the embodiment of the present invention further includes, according to a preset period, performing the following determination processing:

when the system available resources in the system cannot meet the running requirements of all jobs in the system, determining to divide the priority levels of the jobs;

it should be noted that, in the embodiment of the present invention, the preset period may be set according to a unit time length of the operation, and may be the minimum unit time length. Whether the available resources of the system meet the running requirements of the job can be determined by analyzing the occupation condition of the resources when the job runs according to the amount of the available resources of the system by a person skilled in the art.

Optionally, after dividing the job in the system into two or more priority levels and before scheduling the job according to the divided priority levels, the method according to the embodiment of the present invention further includes:

It should be noted that, for the reason that the higher priority level is relativity, in the embodiment of the present invention, it is assumed that the priority level is divided into two levels, that is, priority processing is required and priority processing is not required, and the job with the higher priority level is a job with a priority level that needs priority processing; assuming that three or more priority levels are set in the embodiment of the present invention, the embodiment of the present invention may select, as the job with the higher priority level, a plurality of jobs with higher priority levels after sorting the jobs in the order from the higher priority level to the lower priority level.

Optionally, the scheduling the job according to the divided priority levels in the embodiment of the present invention includes:

It should be noted that, for jobs with the same priority, the job scheduling order may be determined according to the job start time with reference to the related art. And adding the terminated low-priority jobs to a pre-established scheduling queue.

Optionally, when scheduling the job according to the divided priority levels, the method according to the embodiment of the present invention further includes:

judging the waiting time of the operation with lower priority level;

It should be noted that, in the embodiment of the present invention, the job adjusted to the high priority may be added to the head of the scheduling queue, or the job whose priority adjustment is completed may be added to the scheduling queue according to time.

After the priority level of the job is determined, the job scheduling according to the priority level of the job may include: when the available resources of the system are insufficient, the running jobs with lower priorities are terminated, and the resources are released to be used for running the jobs with higher priorities. The operation scheduled by the embodiment of the invention realizes the scheduling through the scheduling queue, and the terminated low-priority operation can be added into the scheduling queue again to wait for the resource allocation. When the scheduling queue is empty, scheduling ends. Taking Kubernetes' resource scheduling as an example, the embodiment of the present invention may include: creating a scheduling queue, assigning a scheduling strategy and scheduling; the specific process may include: and providing an interface for performing priority level division on the jobs in Kubernetes for a user, and realizing the priority level division of the jobs. And (3) specifying a scheduling strategy: if the resources of the jobs with the high priority level are insufficient, the resources of the jobs with the low priority level which are started to run recently (the jobs with the low priority level are run according to time sequence) can be preempted; the method specifically comprises the following steps: terminating the operation process with the low priority level until the available system resources (residual resources) are enough for the operation with the high priority level, and waiting for the completion of the operation of the existing high-level operation if the operation process is not enough after all the operations are terminated; then adding the terminated jobs with low priority levels into the scheduling queue again to wait for scheduling; jobs with low priority levels cannot be preempted, and jobs with high priority levels cannot be preempted; the embodiment of the invention sets the maximum tolerance time, and avoids the situation that a certain low-priority-level job is always at the tail of the scheduling queue and can not be executed. When the waiting time of a certain low-priority-level operation is longer than the preset time, the embodiment of the invention adjusts the priority level of the operation to be a high-priority level; the embodiment of the invention can move the operation with the adjusted priority level to the head of the scheduling queue; the embodiment of the invention can set the scheduling queue to store the node data in a bidirectional linked list mode. And when the head node is empty, the scheduling queue is empty, and the scheduling task is completed. And for each node newly added into the scheduling queue, firstly judging the priority level of the node, if the priority level is low, directly adding the node to the tail of the queue, and if the priority level is high, traversing each node from the head of the queue in sequence and adding the node to the front of the node with the first priority level being low. The embodiment of the invention perfects the Kubernetes resource scheduling system based on the priority resource preemption scheduling strategy, and realizes more reasonable and efficient utilization of computing resources in practical application.

Fig. 2 is a block diagram of a service scheduling apparatus according to an embodiment of the present invention, and as shown in fig. 2, the apparatus includes: a dividing unit and a scheduling unit; wherein,

Optionally, the dividing unit in the embodiment of the present invention is specifically configured to:

Optionally, the apparatus in this embodiment of the present invention further includes a determining unit, configured to:

when the system available resources in the system meet the running requirements of all jobs in the system, determining not to divide the priority level of the jobs; when the system available resources in the system cannot meet the running requirements of all jobs in the system, determining to divide the priority levels of the jobs;

it should be noted that, in the embodiment of the present invention, the preset period may be set according to a unit time length of the operation, and may be the minimum unit time length.

Optionally, the device scheduling unit in the embodiment of the present invention is further configured to:

Optionally, the scheduling of the job by the scheduling unit according to the divided priority levels in the embodiment of the present invention includes:

judging the waiting time of the operation with lower priority level;

The embodiment of the invention also provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used for executing the service scheduling method.

An embodiment of the present invention further provides a terminal, including: a memory and a processor; wherein,

the processor is configured to execute program instructions in the memory;

dividing the jobs in the system into two or more priority levels;

and scheduling the job according to the divided priority level.

It will be understood by those skilled in the art that all or part of the steps of the above methods may be implemented by a program instructing associated hardware (e.g., a processor) to perform the steps, and the program may be stored in a computer readable storage medium, such as a read only memory, a magnetic or optical disk, and the like. Alternatively, all or part of the steps of the above embodiments may be implemented using one or more integrated circuits. Accordingly, each module/unit in the above embodiments may be implemented in hardware, for example, by an integrated circuit to implement its corresponding function, or in software, for example, by a processor executing a program/instruction stored in a memory to implement its corresponding function. The present invention is not limited to any specific form of combination of hardware and software.

Although the embodiments of the present invention have been described above, the above description is only for the convenience of understanding the present invention, and is not intended to limit the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A method for traffic scheduling, comprising:

dividing the jobs in the system into two or more priority levels;

and scheduling the job according to the divided priority level.

2. The method of claim 1, wherein the dividing jobs in a system into two or more priority levels comprises:

3. The method according to claim 1 or 2, wherein before dividing the jobs in the system into two or more priority levels, the method further comprises, in a preset cycle, performing the following determination process:

4. The method of claim 3, wherein after dividing the jobs in the system into two or more priority levels and before scheduling the jobs according to the divided priority levels, the method further comprises:

5. The method according to claim 1 or 2, wherein said scheduling of jobs according to the prioritized level comprises:

6. The method according to claim 1 or 2, wherein when scheduling of the job is performed according to the divided priority levels, the method further comprises:

judging the waiting time of the operation with lower priority level;

7. An apparatus for traffic scheduling, comprising: a dividing unit and a scheduling unit; wherein,

8. The apparatus according to claim 7, wherein the dividing unit is specifically configured to:

9. The apparatus according to claim 7 or 8, characterized in that the apparatus further comprises a determining unit for:

10. A computer storage medium having stored thereon computer-executable instructions for performing the method of traffic scheduling of any of claims 1-6.

11. A terminal, comprising: a memory and a processor; wherein,

the processor is configured to execute program instructions in the memory;

dividing the jobs in the system into two or more priority levels;

and scheduling the job according to the divided priority level.