CN112906907B - Method and system for layering management and distribution of machine learning pipeline model - Google Patents
- Publication number
- CN112906907B (application CN202110313978.3A)
- Authority
- CN
- China
- Prior art keywords
- model
- machine learning
- mirror image
- pipeline
- warehouse
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/60—Software deployment
- G06F8/61—Installation
- G06F8/63—Image based installation; Cloning; Build to order
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/445—Program loading or initiating
- G06F9/44505—Configuring for program initiating, e.g. using registry, configuration files
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Abstract
The invention discloses a method and system for hierarchical management and distribution of machine learning pipeline models. The system comprises a model management client and a model image registry. The model management client comprises a command-line management tool module and a Docker backend module. The command-line management tool module provides tools for building, uploading, and downloading machine learning pipeline models from the command line. The Docker backend module provides an API that receives requests from the command-line tool, supports users in managing machine learning pipeline models as custom images, and dispatches each request to the appropriate module for execution. The model image registry supports uploading and downloading of pipeline models by users, performs layered storage management of model files and model attributes on the server side, and supports an image-registry scheme conforming to the OCI standard.
Description
Technical Field
The invention relates to the field of model repositories in computer storage systems, and in particular to a method and system for hierarchical management and distribution of machine learning pipeline models.
Background
Machine learning model distribution systems in industry generally follow one scheme: a model repository built on a file system or object storage system, with a single model file as the basic storage object. The user uploads the model to the repository through an SDK or a UI. After upload, the repository stores the model, or the model's metadata, in a storage backend that it maintains itself. When the model is needed for inference, the user downloads it through the SDK or interface provided by the repository and runs the inference service. This approach does not distinguish between a single model and a pipeline model (i.e., a combination of several models arranged in a workflow).
In practice, this approach stores the pipeline model as a single model file, which cannot meet the flexibility requirements of pipeline model training and deployment:
a) The user may want to reorganize and recombine the models in the pipeline into a new pipeline model;
b) The user may want to extract one or more models from the pipeline and retrain or deploy them separately.
in addition, complex pipeline models present the following performance challenges:
c) Slower pipeline model upload and download speeds;
d) Higher model persistence storage costs.
Disclosure of Invention
The invention aims to overcome the shortcomings of the existing scheme, namely inflexible management and distribution of pipeline models and poor performance, by providing a method and system for hierarchical management and distribution of machine learning pipeline models. Borrowing the way an image registry distributes images, the method stores and distributes machine learning pipeline models so that each machine learning model in the pipeline is stored as a single layer of the image's file system. The computational relationship graph of the pipeline is defined with a DAG; the DAG is then serialized and stored as a model of its own (the DAG model), preserving the pipeline's computational relationships, and this DAG model is stored as a separate, topmost layer in the file system.
The invention is realized by the following technical scheme:
a method of machine learning pipeline model hierarchical management and distribution, comprising the steps of:
s1: obtaining a plurality of machine learning models, defining pipeline relations among the machine learning models by using a DAG, and obtaining the DAG model through serialization operation;
s2: customizing a configuration file;
s3: generating a mirror image construction script according to the self-defined configuration file;
s4: constructing the plurality of machine learning models and the DAG model according to the mirror image construction script, and generating a Docker mirror image of a machine learning pipeline model and a corresponding workpiece type file;
s5: and pushing the Docker mirror image of the machine learning pipeline model and the corresponding workpiece type file to a model mirror image warehouse.
By borrowing the way an image registry distributes images, the method stores and distributes machine learning pipeline models so that each machine learning model in the pipeline is stored as a single layer of the image's file system; the computational relationship graph of the pipeline is defined with a DAG, which is serialized and stored as the DAG model, and the DAG model is stored as a separate, topmost layer. Complex pipeline models can thus be shared among data scientists while keeping training and serving consistent across parties. Users can build and test arbitrarily complex models on the basis of one another's work, which also opens the door to the more complex model structures required by ensemble learning, multi-task learning, and federated learning, and can dynamically perform model operations and custom evaluations.
Further, the DAG is used to define the pipeline relationship among the plurality of machine learning models; the DAG that invokes the models is written in a programming language and is stored as a model through a serialization operation.
Further, the hierarchical relationship of the plurality of machine learning models and the DAG model within the Docker image to be built is specified through the custom configuration file.
Further, the custom configuration file specifically includes: the read locations of the machine learning models; the constraint that each image layer of the Docker image contains exactly one machine learning model; and the build order and build level of the image layers corresponding to the machine learning models and to the DAG model. That is, the n machine learning models correspond in order to layer 1, layer 2, ..., layer n-1, layer n, and the configuration file must also describe the build order and build level of the image layer corresponding to the DAG model, i.e., layer n+1, located at the very top.
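A minimal sketch of such a configuration file, expressed as JSON-compatible data (the field names `models`, `path`, `layer`, and `dag_model` are illustrative assumptions, not the patent's actual schema):

```python
import json

# Hypothetical configuration: read locations, one model per layer,
# and the DAG model pinned to the top layer n + 1.
config = {
    "models": [
        {"name": "preprocess", "path": "./models/preprocess.pkl", "layer": 1},
        {"name": "classifier", "path": "./models/classifier.pkl", "layer": 2},
    ],
    "dag_model": {"path": "./models/dag.pkl", "layer": 3},
}

# Sanity check the layering rule described in the text:
# n model layers below, DAG model at layer n + 1.
n = len(config["models"])
assert config["dag_model"]["layer"] == n + 1

print(json.dumps(config, indent=2))
```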
Further, the specific process of generating the image build script includes: after the model management client reads and parses the configuration file, it generates an image build script conforming to the Docker specification.
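Because each Docker `COPY` instruction produces one image layer, script generation can be sketched as emitting one `COPY` per model in layer order, with the DAG model last. This is a sketch under assumptions: the function name and the configuration layout are hypothetical, not the patent's implementation.

```python
# Turn a parsed configuration into a Dockerfile-style build script.
# One COPY per model (in layer order) yields one image layer per model;
# the DAG model is copied last, becoming the topmost layer.
def generate_build_script(config):
    lines = ["FROM scratch"]
    for model in sorted(config["models"], key=lambda m: m["layer"]):
        lines.append(f"COPY {model['path']} /models/{model['name']}")
    lines.append(f"COPY {config['dag_model']['path']} /models/dag")
    return "\n".join(lines)

config = {
    "models": [{"name": "classifier", "path": "classifier.pkl", "layer": 2},
               {"name": "preprocess", "path": "preprocess.pkl", "layer": 1}],
    "dag_model": {"path": "dag.pkl", "layer": 3},
}
print(generate_build_script(config))
```

Sorting by the configured layer number means the order of entries in the configuration file does not affect the resulting layer hierarchy.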
Further, the plurality of machine learning models and the DAG model are built according to the image build script; the model management client builds a Docker image of the machine learning pipeline model conforming to the Docker specification and, by extending the Docker backend functionality, generates an artifact Manifest file of the pipeline model conforming to the OCI standard.
Further, the structure of the Manifest file follows the OCI distribution specification. The Manifest is a JSON file comprising two parts: a Config part and a Layers part. The Config part records the image's configuration (its metadata) and is used to display information in a registry UI and to distinguish builds for different operating systems. The Layers part consists of multiple layers whose mediaType is application/vnd.oci.image.layer.v1.tar in the OCI standard. The Config and each entry of Layers are stored in the model image registry as Blobs, with the digest of the pipeline model image serving as the key. The image registry provides a low-latency infrastructure for pipeline model management and distribution and can substantially reduce model storage space.
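The Manifest structure described here can be illustrated by constructing an OCI-style manifest in code. Digests follow the registry convention of "sha256:" plus the SHA-256 hex of the blob content; the layer bytes below are placeholders, not real model layers.

```python
import hashlib
import json

# Content-addressed digest, as registries use for Blob keys.
def digest(blob: bytes) -> str:
    return "sha256:" + hashlib.sha256(blob).hexdigest()

# Placeholder blob contents: two model layers plus the DAG-model layer.
layers = [b"model-layer-1", b"model-layer-2", b"dag-model-layer"]
config_blob = json.dumps({"pipeline": "demo"}).encode()

manifest = {
    "schemaVersion": 2,
    "config": {
        "mediaType": "application/vnd.oci.image.config.v1+json",
        "size": len(config_blob),
        "digest": digest(config_blob),
    },
    "layers": [
        {"mediaType": "application/vnd.oci.image.layer.v1.tar",
         "size": len(b), "digest": digest(b)}
        for b in layers
    ],
}
print(json.dumps(manifest, indent=2))
```

Because every Blob is keyed by its content digest, layers shared between pipeline images are stored only once, which is where the storage savings come from.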
Further, pushing the Docker image of the machine learning pipeline model to the model image registry specifically includes: the model management client interacts with the registry through a push command and uploads the pipeline model image; the Config and each entry of Layers are stored in the registry as Blobs, with the digest of the pipeline model image serving as the key.
Further, the model management client pulls the Docker image of the machine learning pipeline model from the model image registry, specifically: the client interacts with the registry through a pull command to download the pipeline model image; after requesting the Manifest from the registry, the client downloads all Blobs (the Config and all Layers) in parallel using their digests.
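The parallel pull described here might be sketched as follows; `fetch_blob` is a hypothetical stand-in for an HTTP GET against the registry's blob endpoint (`/v2/<name>/blobs/<digest>` in the Docker Registry API), not a real network call.

```python
import concurrent.futures

# Stand-in for downloading one blob by digest from the registry.
def fetch_blob(digest: str) -> bytes:
    return f"content-of-{digest}".encode()  # hypothetical transfer

# Pull flow: read digests from the Manifest, then download the Config
# blob and every layer blob in parallel.
def pull_image(manifest: dict) -> dict:
    digests = [manifest["config"]["digest"]]
    digests += [layer["digest"] for layer in manifest["layers"]]
    with concurrent.futures.ThreadPoolExecutor(max_workers=4) as pool:
        blobs = list(pool.map(fetch_blob, digests))
    return dict(zip(digests, blobs))

manifest = {"config": {"digest": "sha256:aaa"},
            "layers": [{"digest": "sha256:bbb"}, {"digest": "sha256:ccc"}]}
blobs = pull_image(manifest)
print(sorted(blobs))  # ['sha256:aaa', 'sha256:bbb', 'sha256:ccc']
```

Downloading blobs concurrently rather than sequentially is what gives the layered scheme its speed advantage over transferring one monolithic model file.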
A system for hierarchical management and distribution of machine learning pipeline models comprises a model management client and a model image registry; the model management client comprises a command-line management tool module and a Docker backend module.
The model management client pulls the Docker image of a machine learning pipeline model from the model image registry, and interacts with the registry through a push command to upload a pipeline model image to it.
The command-line management tool module provides tools for building, uploading, and downloading machine learning pipeline models from the command line.
The Docker backend module provides an API that receives requests from the command-line tool, supports users in managing machine learning pipeline models as custom images, and dispatches each request to the appropriate module for execution.
The model image registry provides the Docker Registry API to receive requests from the model management client, supports users in uploading and downloading pipeline models, performs layered storage management of model files and model attributes on the server side, and supports an image-registry scheme conforming to the OCI standard.
Further, the model management client supports the OCI standard and performs layered storage management of the model files and model attributes it holds.
Further, the model management client pulls the Docker image of the machine learning pipeline model from the model image registry, specifically: the client interacts with the registry through a pull command to download the pipeline model image; after requesting the Manifest file from the registry, the client downloads all Blobs (the Config part and the Layers part) in parallel using their digests.
Compared with the prior art, the invention has the following advantages and beneficial effects:
the invention relates to a method and a system for hierarchically managing and distributing machine learning pipeline models, which support the storage and distribution of the machine learning pipeline models by referring to a mirror image warehouse distribution mirror image mode, so that each machine learning model in the pipeline is stored as a single layer in a file system in the mirror image; the computational relationship graph of the machine-learned pipeline model is defined with the DAG, which is then named as the model of the DAG to store the computational relationship of the pipeline, and the DAG model is stored as the uppermost separate layer in the file system. The invention can lead data scientists to share complex pipeline models and ensure the consistency of multiparty training and service. And can build and test any complex model on the basis of the work of the users, which also provides the possibility for more complex model structures required by the integrated learning, the multitasking learning and the federal learning technologies, and simultaneously enables the users to dynamically realize model operation and custom evaluation. Moreover, the invention uses mirroring to encapsulate the pipeline model, provides a low-latency pipeline model management and distribution infrastructure with a mirrored warehouse, and can greatly save the space for model storage.
Drawings
The accompanying drawings, which are included to provide a further understanding of embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiments of the invention. In the drawings:
FIG. 1 is a system frame diagram of the present invention;
FIG. 2 is a flow chart of the system of the present invention.
Detailed Description
For the purpose of making apparent the objects, technical solutions and advantages of the present invention, the present invention will be further described in detail with reference to the following examples and the accompanying drawings, wherein the exemplary embodiments of the present invention and the descriptions thereof are for illustrating the present invention only and are not to be construed as limiting the present invention.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. However, it will be apparent to one of ordinary skill in the art that: no such specific details are necessary to practice the invention. In other instances, well-known structures, circuits, materials, or methods have not been described in detail in order not to obscure the invention.
Throughout the specification, references to "one embodiment," "an embodiment," "one example," or "an example" mean: a particular feature, structure, or characteristic described in connection with the embodiment or example is included within at least one embodiment of the invention. Thus, the appearances of the phrases "in one embodiment," "in an example," or "in an example" in various places throughout this specification are not necessarily all referring to the same embodiment or example. Furthermore, the particular features, structures, or characteristics may be combined in any suitable combination and/or sub-combination in one or more embodiments or examples. Moreover, those of ordinary skill in the art will appreciate that the illustrations provided herein are for illustrative purposes and that the illustrations are not necessarily drawn to scale. The term "and/or" as used herein includes any and all combinations of one or more of the associated listed items.
In the description of the present invention, it should be understood that the terms "front", "rear", "left", "right", "upper", "lower", "vertical", "horizontal", "high", "low", "inner", "outer", etc. indicate orientations or positional relationships based on the drawings, are merely for convenience in describing the present invention and simplifying the description, and do not indicate or imply that the apparatus or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and thus should not be construed as limiting the scope of the present invention.
Embodiment one:
As shown in FIG. 2, a method for hierarchical management and distribution of machine learning pipeline models includes the following steps:
S1: obtain a plurality of machine learning models, define the pipeline relationship among them using a DAG, and obtain the DAG model through a serialization operation;
S2: customize a configuration file;
S3: generate an image build script according to the custom configuration file;
S4: build the plurality of machine learning models and the DAG model according to the image build script, generating a Docker image of the machine learning pipeline model and a corresponding artifact Manifest file;
S5: push the Docker image of the machine learning pipeline model and the corresponding artifact Manifest file to a model image registry.
By borrowing the way an image registry distributes images, the method stores and distributes machine learning pipeline models so that each machine learning model in the pipeline is stored as a single layer of the image's file system; the computational relationship graph of the pipeline is defined with a DAG, which is serialized and stored as the DAG model, and the DAG model is stored as a separate, topmost layer. Complex pipeline models can thus be shared among data scientists while keeping training and serving consistent across parties. Users can build and test arbitrarily complex models on the basis of one another's work, which also opens the door to the more complex model structures required by ensemble learning, multi-task learning, and federated learning, and can dynamically perform model operations and custom evaluations.
The DAG is used to define the pipeline relationship among the plurality of machine learning models; the DAG that invokes the models is written in a programming language and is stored as a model through a serialization operation.
The hierarchical relationship of the plurality of machine learning models and the DAG model within the Docker image to be built is specified through the custom configuration file.
The custom configuration file specifically includes: the read locations of the machine learning models; the constraint that each image layer of the Docker image contains exactly one machine learning model; and the build order and build level of the image layers corresponding to the machine learning models and to the DAG model. That is, the n machine learning models correspond in order to layer 1, layer 2, ..., layer n-1, layer n, and the configuration file must also describe the build order and build level of the image layer corresponding to the DAG model, i.e., layer n+1, located at the very top.
The specific process of generating the image build script includes: after the model management client reads and parses the configuration file, it generates an image build script conforming to the Docker specification.
The plurality of machine learning models and the DAG model are built according to the image build script; the model management client builds a Docker image of the machine learning pipeline model conforming to the Docker specification and, by extending the Docker backend functionality, generates an artifact Manifest file of the pipeline model conforming to the OCI standard.
The structure of the Manifest file follows the OCI distribution specification. The Manifest is a JSON file comprising two parts: a Config part and a Layers part. The Config part records the image's configuration (its metadata) and is used to display information in a registry UI and to distinguish builds for different operating systems. The Layers part consists of multiple layers whose mediaType is application/vnd.oci.image.layer.v1.tar in the OCI standard. The Config and each entry of Layers are stored in the model image registry as Blobs, with the digest of the pipeline model image serving as the key. The image registry provides a low-latency infrastructure for pipeline model management and distribution and can substantially reduce model storage space.
Pushing the Docker image of the machine learning pipeline model to the model image registry specifically includes: the model management client interacts with the registry through a push command and uploads the pipeline model image; the Config and each entry of Layers are stored in the registry as Blobs, with the digest of the pipeline model image serving as the key.
The model management client pulls the Docker image of the machine learning pipeline model from the model image registry, specifically: the client interacts with the registry through a pull command to download the pipeline model image; after requesting the Manifest from the registry, the client downloads all Blobs (the Config and all Layers) in parallel using their digests.
As shown in FIG. 1, a system for hierarchical management and distribution of machine learning pipeline models comprises a model management client and a model image registry; the model management client comprises a command-line management tool module and a Docker backend module.
The model management client pulls the Docker image of a machine learning pipeline model from the model image registry, and interacts with the registry through a push command to upload a pipeline model image to it.
The command-line management tool module provides tools for building, uploading, and downloading machine learning pipeline models from the command line.
The Docker backend module provides an API that receives requests from the command-line tool, supports users in managing machine learning pipeline models as custom images, and dispatches each request to the appropriate module for execution.
The model image registry provides the Docker Registry API to receive requests from the model management client, supports users in uploading and downloading pipeline models, performs layered storage management of model files and model attributes on the server side, and supports an image-registry scheme conforming to the OCI standard.
The model management client supports the OCI standard and performs layered storage management of the model files and model attributes it holds.
The model management client pulls the Docker image of the machine learning pipeline model from the model image registry, specifically: the client interacts with the registry through a pull command to download the pipeline model image; after requesting the Manifest file from the registry, the client downloads all Blobs (the Config part and the Layers part) in parallel using their digests.
The key steps are as follows:
The developer of the pipeline model builds a Docker image according to the machine learning pipeline model image layering method and the Manifest specification;
The developer pushes the built image to the machine learning pipeline image registry;
The model user pulls the image from the registry using the client; the pull process conforms to the standard image registry service API:
The client first requests the image's Manifest from the registry;
The client pulls the image's Config information;
The client pulls all file layers containing the pipeline model.
The foregoing detailed description further explains the objects, technical solutions, and beneficial effects of the invention. It should be understood that the above are merely specific embodiments of the invention and are not intended to limit its scope of protection; any modifications, equivalent substitutions, improvements, and the like made within the spirit and principles of the invention shall fall within the scope of protection of the invention.
Claims (10)
1. A method for hierarchical management and distribution of machine learning pipeline models, comprising the following steps:
S1: obtaining a plurality of machine learning models, defining the pipeline relationship among them using a DAG, and obtaining the DAG model through a serialization operation;
S2: customizing a configuration file;
S3: generating an image build script according to the custom configuration file;
S4: building the plurality of machine learning models and the DAG model according to the image build script, and generating a Docker image of a machine learning pipeline model and a corresponding artifact Manifest file;
S5: pushing the Docker image of the machine learning pipeline model and the corresponding artifact Manifest file to a model image registry.
2. The method of claim 1, wherein the DAG is configured to define a pipeline relationship between a plurality of machine learning models; the DAG called by a plurality of machine learning models is completed by using a programming language, and the DAG is stored as a model through a serialization operation.
3. The method for hierarchical management and distribution of machine learning pipeline models according to claim 1, wherein the hierarchical relationship of the plurality of machine learning models and DAG models in the Docker mirror image to be constructed is divided by a custom configuration file.
4. A method for hierarchically managing and distributing a machine-learning pipeline model according to claim 3, wherein the custom configuration file specifically comprises: the method comprises the steps that a configuration file needs to be assigned to a read position of a machine learning model, each mirror image layer in a Docker mirror image to be divided by the configuration file comprises a constraint relation of the machine learning model, a construction sequence and a construction hierarchy of the mirror image layers corresponding to a plurality of machine learning models to be divided by the configuration file, and a construction sequence and a construction hierarchy of the mirror image layers corresponding to a DAG model to be divided by the configuration file.
5. The method for hierarchically managing and distributing machine-learning pipeline models according to claim 1, wherein the specific process of generating the mirror image construction script comprises: and after the model management client side reads and analyzes the configuration file operation, generating a mirror image construction script conforming to the Docker specification.
6. The method for hierarchically managing and distributing machine learning pipeline models according to claim 1, wherein the plurality of machine learning models and the DAG model are constructed according to the mirror image construction script, and a Docker mirror image of the machine learning pipeline model conforming to the Docker specification is constructed by a generating model management client and a workpiece type management file of the machine learning pipeline model conforming to the OCI standard is generated by expanding a Docker background function.
7. The method for hierarchical management and distribution of machine learning pipeline models according to claim 6, wherein the management file is structured according to the OCI distribution specification; the Manifest is a JSON file comprising two parts: a Config part and a Layers part; the Config part records the configuration of the image and constitutes its metadata, used to display information in the image repository UI and to distinguish builds for different operating systems; the Layers part consists of multiple layers whose media type in the OCI standard is application/vnd.oci.image.layer.v1; each layer in the Config part and the Layers part is stored in the model image repository as a Blob, with the digest of the pipeline model image serving as the Key.
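The Manifest structure described in claim 7 can be sketched with stock OCI media types; the helper names and blob contents below are illustrative assumptions, not the patent's implementation:

```python
import hashlib

def digest_of(blob: bytes) -> str:
    """Content digest under which a blob is stored (the Key)."""
    return "sha256:" + hashlib.sha256(blob).hexdigest()

def _descriptor(media_type: str, blob: bytes) -> dict:
    """OCI-style content descriptor: media type, digest, and size."""
    return {"mediaType": media_type, "digest": digest_of(blob), "size": len(blob)}

def build_manifest(config_blob: bytes, layer_blobs: list) -> dict:
    """Assemble a Manifest in the shape the OCI image spec defines:
    a Config descriptor plus one Layers entry per blob, each
    referenced by digest."""
    return {
        "schemaVersion": 2,
        "config": _descriptor("application/vnd.oci.image.config.v1+json", config_blob),
        "layers": [
            _descriptor("application/vnd.oci.image.layer.v1.tar+gzip", b)
            for b in layer_blobs
        ],
    }

manifest = build_manifest(b'{"os": "linux"}', [b"model-a", b"dag-model"])
```

Because every entry is addressed by digest, identical model layers shared between pipeline images are stored only once in the repository.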
8. A system for hierarchical management and distribution of machine learning pipeline models, characterized by comprising a model management client and a model image repository; the model management client comprises a command-line management tool module and a Docker daemon module;
the model management client pulls Docker images of machine learning pipeline models from the model image repository, and interacts with the model image repository through a push command to upload pipeline model images to it;
the command-line management tool module is used for providing tools to build, upload, and download machine learning pipeline models from the command line;
the Docker daemon module is used for providing an API that receives requests from the command-line management tool, supporting users in managing machine learning pipeline models as custom images, and dispatching machine learning pipeline models to different modules according to the request to perform the corresponding work;
the model image repository is used for providing the Docker Registry API to receive requests from the model management client, supporting users in uploading and downloading machine learning pipeline models, performing layered storage management of model files and model attributes on the server side, and supporting an image repository scheme conforming to the OCI standard;
the system obtains a plurality of machine learning models; defines the pipeline relations among them with a DAG; obtains a DAG model through a serialization operation, the DAG model defining the computation graph of the machine learning pipeline model; stores the DAG model as the single topmost layer in the file system; builds the machine learning models and the DAG model according to the image build script, generating the Docker image of the machine learning pipeline model and the corresponding artifact-type file; and pushes the Docker image and the corresponding artifact-type file of the machine learning pipeline model to the model image repository.
9. The system for hierarchical management and distribution of machine learning pipeline models of claim 8, wherein the model management client supports the OCI standard, and model files and model attributes in the model management client are managed with layered storage.
10. The system for hierarchical management and distribution of machine learning pipeline models according to claim 8, wherein the model management client pulling Docker images of machine learning pipeline models from the model image repository specifically comprises: the model management client interacts with the model image repository through a pull command to download a machine learning pipeline model image; after requesting the Manifest file from the model image repository, the model management client downloads all Blobs, comprising the Config part and the Layers part, in parallel using their digests.
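The pull flow of claim 10 can be sketched with a toy in-memory blob store standing in for the repository; the digests, contents, and fetch function are simulated stand-ins, not a real Docker Registry client:

```python
from concurrent.futures import ThreadPoolExecutor

# Toy in-memory blob store standing in for the remote model image
# repository; digests and contents are illustrative.
REMOTE_BLOBS = {
    "sha256:aaa": b"config",
    "sha256:bbb": b"model-layer",
    "sha256:ccc": b"dag-layer",
}

def fetch_blob(digest: str) -> bytes:
    """Stand-in for one blob download request, keyed by digest."""
    return REMOTE_BLOBS[digest]

def pull_image(manifest: dict) -> dict:
    """After the Manifest is fetched, download the Config blob and all
    Layers blobs in parallel using their digests."""
    digests = [manifest["config"]] + manifest["layers"]
    with ThreadPoolExecutor(max_workers=4) as pool:
        blobs = list(pool.map(fetch_blob, digests))
    return dict(zip(digests, blobs))

manifest = {"config": "sha256:aaa", "layers": ["sha256:bbb", "sha256:ccc"]}
local = pull_image(manifest)
```

Parallel, digest-keyed downloads are what let the client skip blobs it already holds locally.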
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110313978.3A CN112906907B (en) | 2021-03-24 | 2021-03-24 | Method and system for layering management and distribution of machine learning pipeline model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112906907A CN112906907A (en) | 2021-06-04 |
CN112906907B true CN112906907B (en) | 2024-02-23 |
Family
ID=76106214
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110313978.3A Active CN112906907B (en) | 2021-03-24 | 2021-03-24 | Method and system for layering management and distribution of machine learning pipeline model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112906907B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114327754B (en) * | 2021-12-15 | 2022-10-04 | 中电信数智科技有限公司 | Mirror image exporting and assembling method based on container layering technology |
Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107111787A (en) * | 2014-09-08 | 2017-08-29 | Pivotal Software, Inc. | Stream processing
CN108040074A (en) * | 2018-01-26 | 2018-05-15 | 华南理工大学 | A kind of real-time network unusual checking system and method based on big data |
EP3376361A2 (en) * | 2017-10-19 | 2018-09-19 | Pure Storage, Inc. | Ensuring reproducibility in an artificial intelligence infrastructure |
CN108984257A (en) * | 2018-07-06 | 2018-12-11 | 无锡雪浪数制科技有限公司 | A kind of machine learning platform for supporting custom algorithm component |
CN109740765A (en) * | 2019-01-31 | 2019-05-10 | 成都品果科技有限公司 | A kind of machine learning system building method based on Amazon server |
WO2019143412A1 (en) * | 2018-01-19 | 2019-07-25 | Umajin Inc. | Configurable server kit |
CN110266771A (en) * | 2019-05-30 | 2019-09-20 | 天津神兔未来科技有限公司 | Distributed intelligence node and distributed swarm intelligence system dispositions method |
CN110287171A (en) * | 2019-06-28 | 2019-09-27 | 北京九章云极科技有限公司 | A kind of data processing method and system |
WO2019184750A1 (en) * | 2018-03-30 | 2019-10-03 | Huawei Technologies Co., Ltd. | Deep learning task scheduling method and system and related apparatus
CN110543464A (en) * | 2018-12-12 | 2019-12-06 | 广东鼎义互联科技股份有限公司 | Big data platform applied to smart park and operation method |
CN110716744A (en) * | 2019-10-21 | 2020-01-21 | 中国科学院空间应用工程与技术中心 | Data stream processing method, system and computer readable storage medium |
US10565093B1 (en) * | 2018-10-09 | 2020-02-18 | International Business Machines Corporation | Providing cognitive intelligence across continuous delivery pipeline data |
CN111353609A (en) * | 2020-02-28 | 2020-06-30 | 平安科技(深圳)有限公司 | Machine learning system |
CN111901294A (en) * | 2020-06-09 | 2020-11-06 | 北京迈格威科技有限公司 | Method for constructing online machine learning project and machine learning system |
CN112148810A (en) * | 2020-11-10 | 2020-12-29 | 南京智数云信息科技有限公司 | User portrait analysis system supporting custom label |
CN112418438A (en) * | 2020-11-24 | 2021-02-26 | 国电南瑞科技股份有限公司 | Container-based machine learning procedural training task execution method and system |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10026041B2 (en) * | 2014-07-12 | 2018-07-17 | Microsoft Technology Licensing, Llc | Interoperable machine learning platform |
US10936969B2 (en) * | 2016-09-26 | 2021-03-02 | Shabaz Basheer Patel | Method and system for an end-to-end artificial intelligence workflow |
WO2018069260A1 (en) * | 2016-10-10 | 2018-04-19 | Proekspert AS | Data science versioning and intelligence systems and methods |
US11922564B2 (en) * | 2017-06-05 | 2024-03-05 | Umajin Inc. | Generative content system that supports location-based services and methods therefor |
US10671434B1 (en) * | 2017-10-19 | 2020-06-02 | Pure Storage, Inc. | Storage based artificial intelligence infrastructure |
EP3985684A1 (en) * | 2018-07-18 | 2022-04-20 | NVIDIA Corporation | Virtualized computing platform for inferencing, advanced processing, and machine learning applications |
US20200125639A1 (en) * | 2018-10-22 | 2020-04-23 | Ca, Inc. | Generating training data from a machine learning model to identify offensive language |
US20200193221A1 (en) * | 2018-12-17 | 2020-06-18 | At&T Intellectual Property I, L.P. | Systems, Methods, and Computer-Readable Storage Media for Designing, Creating, and Deploying Composite Machine Learning Applications in Cloud Environments |
US11616839B2 (en) * | 2019-04-09 | 2023-03-28 | Johnson Controls Tyco IP Holdings LLP | Intelligent edge computing platform with machine learning capability |
US20200401930A1 (en) * | 2019-06-19 | 2020-12-24 | Sap Se | Design of customizable machine learning services |
US11966856B2 (en) * | 2019-07-26 | 2024-04-23 | Live Nation Entertainment, Inc. | Enhanced validity modeling using machine-learning techniques |
US11663523B2 (en) * | 2019-09-14 | 2023-05-30 | Oracle International Corporation | Machine learning (ML) infrastructure techniques |
US11562267B2 (en) * | 2019-09-14 | 2023-01-24 | Oracle International Corporation | Chatbot for defining a machine learning (ML) solution |
Non-Patent Citations (2)
Title |
---|
A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and opportunities; S. González et al.; Inf. Fusion; 205-237 *
Multi-version container image loading method based on slice reuse; Lu Zhigang et al.; Journal of Software; Vol. 31, No. 6; 1875-1888 *
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||