CN112214285A - Docker-based model service deployment system - Google Patents
- Publication number
- CN112214285A (application CN202011135150.5A)
- Authority
- CN
- China
- Prior art keywords
- model
- service
- management module
- image
- docker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5077—Logical partitioning of resources; Management or configuration of virtualized resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
- G06F2009/45562—Creating, deleting, cloning virtual machine instances
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
- G06F2009/4557—Distribution of virtual machine instances; Migration and load balancing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/455—Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
- G06F9/45533—Hypervisors; Virtual machine monitors
- G06F9/45558—Hypervisor-specific management and integration aspects
- G06F2009/45595—Network integration; Enabling network access in virtual machine instances
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to a Docker-based model service deployment system comprising a model repository management module, an image library management module, a resource quota management module, a service management module, and a service monitoring module. The model repository management module manages models in a unified way. The image library management module provides base environment images for model training and service release. The resource quota management module manages the running resources of each model service and can schedule multiple types of cluster resources in a unified way, supporting heterogeneous hardware scheduling. The service management module deploys models and manages their life cycle according to the model management and resource quota policies output by the model repository management module and the resource quota management module. The service monitoring module monitors the running status and service records of the model services in real time. The invention makes model service deployment more convenient and achieves high availability and support for multiple model types.
Description
Technical Field
The invention relates to the field of artificial intelligence, and in particular to a Docker-based model service deployment system.
Background
With the development of artificial intelligence, machine learning and deep learning technologies are being applied in many industries and fields. Model deployment is a critical link in applying artificial intelligence: building a model is hard, and deploying it is harder. At present, models are commonly deployed manually as single services, so updating a model service later requires downtime; the running resources of model services cannot be controlled, so server resources cannot be allocated reasonably; access rights of the various model services cannot be controlled in a unified way; and neither the running state nor the access patterns of the services can be monitored. To address these shortcomings, some model service management systems have appeared in the industry that manage model service authentication, service state monitoring, and so on in a unified way, which solves the model service deployment management problem to a certain extent. However, these systems still have drawbacks: deployment resources cannot be controlled at fine granularity, elastic scaling is not supported, and gray (canary) release is not supported.
Disclosure of Invention
To solve these problems, the invention provides a Docker-based model service deployment system.
The specific scheme is as follows:
A Docker-based model service deployment system comprises a model repository management module, an image library management module, a resource quota management module, a service management module, and a service monitoring module;
the model repository management module manages models in a unified way, where models are obtained by uploading local model files, by transfer from a connected training platform, or both;
the image library management module provides base environment images for model training and service release;
the resource quota management module manages the running resources of each model service and can schedule multiple types of cluster resources in a unified way, supporting heterogeneous hardware scheduling;
the service management module deploys models and manages their life cycle according to the model management and resource quota policies output by the model repository management module and the resource quota management module;
the service monitoring module monitors the running status and service records of the model services in real time.
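The patent describes the cooperation of these modules only in prose. As a non-authoritative illustration, it can be sketched in Python; all class, method, and registry names below are hypothetical and not taken from the patent:

```python
from dataclasses import dataclass

@dataclass
class Model:
    name: str
    version: str
    framework: str      # e.g. "sklearn", "tensorflow"
    artifact_path: str

class ModelRepository:
    """Model repository management: unified storage and lookup of models."""
    def __init__(self):
        self._models = {}
    def register(self, model):
        self._models[(model.name, model.version)] = model
    def get(self, name, version):
        return self._models[(name, version)]

class ImageLibrary:
    """Image library management: a base environment image per framework."""
    BUILT_IN = {"sklearn": "registry/sklearn-base:latest",
                "tensorflow": "registry/tf-base:latest"}
    def image_for(self, framework):
        # Fall back to a user-defined image when no built-in image fits.
        return self.BUILT_IN.get(framework, "registry/user-defined:latest")

class ResourceQuotaManager:
    """Resource quota management: per-service CPU/memory grants."""
    def __init__(self):
        self.quotas = {}
    def grant(self, service, cpu, mem_gb):
        self.quotas[service] = {"cpu": cpu, "mem_gb": mem_gb}

class ServiceManager:
    """Service management: deploy a model using the other modules' output."""
    def __init__(self, repo, images, quotas):
        self.repo, self.images, self.quotas = repo, images, quotas
        self.deployed = {}
    def deploy(self, name, version, cpu=1, mem_gb=2):
        model = self.repo.get(name, version)
        image = self.images.image_for(model.framework)
        self.quotas.grant(f"{name}:{version}", cpu, mem_gb)
        self.deployed[f"{name}:{version}"] = image
        return image
```

Deploying a registered model then resolves its base image and records its resource quota in a single call, mirroring how the service management module consumes the output of the repository and quota modules.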
Further, the model repository management module also provides online model compression and conversion, model version management, and one-click service release.
Further, the base environment images comprise built-in images and user-defined images; a built-in image contains the dependency environment of a fixed (built-in) framework, while a user-defined image contains a user-defined environment beyond those fixed-framework dependencies.
Further, a user-defined image can be built either by uploading a Dockerfile or by selecting environment options online.
Further, the resource quota management module provides resource application, resource approval, resource monitoring, and elastic scaling.
Further, model deployment in the service management module supports both built-in fixed frameworks and user-defined frameworks.
Further, the service monitoring module provides a visual interface and, after the service management module deploys each model service, monitors the performance and running status of each model service in real time.
Further, the running status monitored in real time for each model service includes deployment resource usage, request data, service response data, and service authentication statistics.
By adopting this technical scheme, model services can be deployed more conveniently, with high availability and support for multiple model types.
Drawings
FIG. 1 is a block diagram of a system according to an embodiment of the present invention.
Detailed Description
The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate embodiments of the invention and, together with the description, serve to explain its principles. Those skilled in the art will appreciate other possible embodiments and advantages of the invention from these figures. The invention is now further described with reference to the drawings and specific embodiments.
The first embodiment is as follows:
An embodiment of the invention provides a Docker-based model service deployment system. As shown in FIG. 1, the system includes a model repository management module, an image library management module, a resource quota management module, a service management module, and a service monitoring module, where:
(1) The model repository management module manages models in a unified way; models are obtained by uploading local model files, by transfer from a connected training platform, or both.
A model is produced by machine learning or deep learning training.
The model repository management module includes an external model operation interface through which it connects to other training platforms.
The model repository management module also provides online model compression and conversion, model version management, one-click service release, and similar functions.
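Version management and one-click release can be illustrated with a minimal sketch; the class and method names are hypothetical:

```python
from collections import defaultdict

class VersionedModelStore:
    """Sketch of model version management: each model name maps to an
    ordered list of versions, and one call publishes a chosen version."""
    def __init__(self):
        self._versions = defaultdict(list)   # name -> [(version, artifact), ...]
        self._published = {}                 # name -> version currently serving

    def upload(self, name, version, artifact):
        self._versions[name].append((version, artifact))

    def versions(self, name):
        return [v for v, _ in self._versions[name]]

    def latest(self, name):
        return self._versions[name][-1][0]

    def publish(self, name, version=None):
        """'One-click' release: publish the given version, or the latest."""
        self._published[name] = version or self.latest(name)
        return self._published[name]

    def rollback(self, name, version):
        if version not in self.versions(name):
            raise ValueError(f"unknown version {version}")
        self._published[name] = version

    def current(self, name):
        return self._published.get(name)
```

Uploading two versions and calling `publish` with no argument releases the newest one; `rollback` restores any earlier version without re-uploading it.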
(2) The image library management module provides base environment images for model training and service release, enabling flexible framework support.
The base environment images comprise built-in images and user-defined images. A built-in image contains the dependency environment of a fixed framework, such as the common frameworks Sklearn, XGBoost, LightGBM, SparkML, TensorFlow, Caffe, Theano, Keras, PyTorch, MXNet, and so on. If the built-in images do not meet a user's needs, the user can configure a user-defined image online. A user-defined image contains a user-defined environment beyond the fixed-framework dependencies, and can be built either by uploading a Dockerfile or by selecting environment options online.
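Building a user-defined image from options selected online amounts to generating a Dockerfile. A hedged sketch follows; the base image, package names, and entry command are illustrative placeholders, not part of the patent:

```python
def build_dockerfile(base_image, pip_packages=(), system_packages=(), workdir="/app"):
    """Generate Dockerfile text from options a user might pick in a web form."""
    lines = [f"FROM {base_image}"]
    if system_packages:
        # OS-level dependencies the chosen frameworks need.
        lines.append("RUN apt-get update && apt-get install -y "
                     + " ".join(system_packages))
    if pip_packages:
        # Framework and serving dependencies pinned by the user.
        lines.append("RUN pip install --no-cache-dir " + " ".join(pip_packages))
    lines.append(f"WORKDIR {workdir}")
    lines.append('CMD ["python", "serve.py"]')
    return "\n".join(lines)
```

The resulting text can be fed to a normal `docker build`, which is exactly what the "upload a Dockerfile" path would also do.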
(3) The resource quota management module manages the running resources of each model service and can schedule multiple types of cluster resources in a unified way, supporting heterogeneous hardware scheduling. In this embodiment the module provides resource application, resource approval, resource monitoring, elastic scaling, and similar functions, so the running resources of each model service can be controlled flexibly and at fine granularity.
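The application-then-approval workflow and a utilization-driven scaling rule can be sketched as follows; the thresholds and the CPU-only accounting are simplifying assumptions:

```python
class QuotaManager:
    """Sketch of the resource application -> approval -> grant workflow."""
    def __init__(self, total_cpu):
        self.total_cpu = total_cpu
        self.granted = {}          # service -> granted cpu cores
        self.pending = {}          # service -> requested cpu cores

    def apply(self, service, cpu):
        self.pending[service] = cpu

    def approve(self, service):
        cpu = self.pending.pop(service)
        if cpu + sum(self.granted.values()) > self.total_cpu:
            raise RuntimeError("insufficient cluster resources")
        self.granted[service] = cpu

    def free_cpu(self):
        return self.total_cpu - sum(self.granted.values())

def scale_replicas(current, cpu_utilization, low=0.2, high=0.8):
    """Elastic scaling rule: add a replica when hot, remove one when idle."""
    if cpu_utilization > high:
        return current + 1
    if cpu_utilization < low and current > 1:
        return current - 1
    return current
```

An approval that would overcommit the cluster is rejected, which is one simple way to keep per-service quotas within the total pool.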
(4) The service management module deploys models and manages their life cycle according to the model management and resource quota policies output by the model repository management module and the resource quota management module.
Model deployment is highly reliable and scalable. It supports built-in fixed frameworks such as PMML, Sklearn, XGBoost, LightGBM, and SparkML, as well as deployment of ONNX, TensorFlow, Caffe, Theano, Keras, PyTorch, MXNet, and user-defined frameworks.
The service management module provides one-stop model service release management, service authentication management, service version control, gray (canary) release, online real-time prediction, batch prediction, model evaluation, and similar functions. Service authentication management can restrict access to a service interface by day and by call volume.
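Gray release and per-key access limits can each be sketched in a few lines. The hash-bucket routing scheme and the daily-budget policy below are illustrative assumptions, not the patent's specification:

```python
import hashlib

class GrayReleaseRouter:
    """Route a configurable fraction of traffic to the new model version.
    Hashing the caller id keeps each caller on a stable version."""
    def __init__(self, old_version, new_version, new_fraction=0.1):
        self.old, self.new = old_version, new_version
        self.new_fraction = new_fraction

    def route(self, caller_id):
        # Deterministic bucket in [0, 100) derived from the caller id.
        bucket = int(hashlib.md5(caller_id.encode()).hexdigest(), 16) % 100
        return self.new if bucket < self.new_fraction * 100 else self.old

class AccessQuota:
    """Service authentication limit: a per-key daily call budget."""
    def __init__(self, daily_limit):
        self.daily_limit = daily_limit
        self.counts = {}           # (api_key, day) -> calls used

    def allow(self, api_key, day):
        used = self.counts.get((api_key, day), 0)
        if used >= self.daily_limit:
            return False
        self.counts[(api_key, day)] = used + 1
        return True
```

Raising `new_fraction` step by step from 0.0 to 1.0 completes a gray rollout without restarting either version.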
(5) The service monitoring module monitors the running status and service records of the model services in real time.
The module provides a visual interface and, after the service management module deploys each model service, monitors the performance and running status of each service in real time. The monitored running status of each model service includes deployment resource usage, request data, service response data, and service authentication statistics.
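The monitored quantities listed above can be aggregated by a simple collector. The record fields and summary statistics in this sketch are hypothetical choices:

```python
import statistics

class ServiceMonitor:
    """Collect per-service request records and summarize them."""
    def __init__(self):
        self.records = []   # (service, latency_ms, status_code)

    def record(self, service, latency_ms, status_code=200):
        self.records.append((service, latency_ms, status_code))

    def summary(self, service):
        latencies = [l for s, l, _ in self.records if s == service]
        errors = sum(1 for s, _, c in self.records
                     if s == service and c >= 500)
        return {
            "requests": len(latencies),
            "errors": errors,
            "mean_latency_ms": statistics.mean(latencies) if latencies else 0.0,
        }
```

A visual interface would then render these summaries per deployed model service, refreshed as new records arrive.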
The embodiment of the invention thus provides a visual, one-stop model version control system. Through model service version management and monitoring of deployment performance, running status, and web service metrics, model services can be deployed more conveniently, with high availability and support for multiple model types.
While the invention has been particularly shown and described with reference to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (8)
1. A Docker-based model service deployment system, characterized by comprising a model repository management module, an image library management module, a resource quota management module, a service management module, and a service monitoring module;
wherein the model repository management module manages models in a unified way, the models being obtained by uploading local model files, by transfer from a connected training platform, or both;
the image library management module provides base environment images for model training and service release;
the resource quota management module manages the running resources of each model service and can schedule multiple types of cluster resources in a unified way, supporting heterogeneous hardware scheduling;
the service management module deploys models and manages their life cycle according to the model management and resource quota policies output by the model repository management module and the resource quota management module;
and the service monitoring module monitors the running status and service records of the model services in real time.
2. The Docker-based model service deployment system of claim 1, wherein the model repository management module further provides online model compression and conversion, model version management, and one-click service release.
3. The Docker-based model service deployment system of claim 1, wherein the base environment images comprise built-in images and user-defined images, a built-in image containing the dependency environment of a fixed framework and a user-defined image containing a user-defined environment beyond the fixed-framework dependencies.
4. The Docker-based model service deployment system of claim 3, wherein a user-defined image can be built either by uploading a Dockerfile or by selecting environment options online.
5. The Docker-based model service deployment system of claim 1, wherein the resource quota management module provides resource application, resource approval, resource monitoring, and elastic scaling.
6. The Docker-based model service deployment system of claim 1, wherein model deployment in the service management module supports both built-in fixed frameworks and user-defined frameworks.
7. The Docker-based model service deployment system of claim 1, wherein the service monitoring module provides a visual interface and, after the service management module deploys each model service, monitors the performance and running status of each model service in real time.
8. The Docker-based model service deployment system of claim 7, wherein the running status monitored in real time for each model service includes deployment resource usage, request data, service response data, and service authentication statistics.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011135150.5A | 2020-10-22 | 2020-10-22 | Docker-based model service deployment system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112214285A | 2021-01-12 |
Family
ID=74056306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011135150.5A (pending) | Docker-based model service deployment system | 2020-10-22 | 2020-10-22 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112214285A (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014058411A1 (en) * | 2012-10-08 | 2014-04-17 | Hewlett-Packard Development Company, L.P. | Hybrid cloud environment |
WO2017045424A1 (en) * | 2015-09-18 | 2017-03-23 | 乐视控股(北京)有限公司 | Application program deployment system and deployment method |
CN109508238A (en) * | 2019-01-05 | 2019-03-22 | 咪付(广西)网络技术有限公司 | A kind of resource management system and method for deep learning |
CN109600269A (en) * | 2019-01-21 | 2019-04-09 | 云南电网有限责任公司信息中心 | A kind of cloud management platform based on DCOS |
US20190171438A1 (en) * | 2017-12-05 | 2019-06-06 | Archemy, Inc. | Active adaptation of networked compute devices using vetted reusable software components |
CN110413294A (en) * | 2019-08-06 | 2019-11-05 | 中国工商银行股份有限公司 | Service delivery system, method, apparatus and equipment |
CN111414233A (en) * | 2020-03-20 | 2020-07-14 | 京东数字科技控股有限公司 | Online model reasoning system |
CN111629061A (en) * | 2020-05-28 | 2020-09-04 | 苏州浪潮智能科技有限公司 | Inference service system based on Kubernetes |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113283855A (en) * | 2021-04-29 | 2021-08-20 | 成都商高智能科技有限公司 | Recruitment system based on containerized resource programming |
CN113283855B (en) * | 2021-04-29 | 2024-01-30 | 成都商高智能科技有限公司 | Recruitment system based on containerized resource programming |
Legal Events
Date | Code | Title |
---|---|---|
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |