WO2023080275A1

WO2023080275A1 - Deep learning framework application database server for classifying gender and age, and method therefor

Info

Publication number: WO2023080275A1
Application number: PCT/KR2021/015950
Authority: WO
Inventors: 이준혁
Original assignee: (주)한국플랫폼서비스기술
Priority date: 2021-11-04
Filing date: 2021-11-04
Publication date: 2023-05-11
Also published as: KR20230065037A

Abstract

The present invention relates to a deep learning framework application database server for classifying gender and age, and a method therefor. More specifically, it relates to a server, and a method therefor, that analyzes a query and classifies the gender and age of a person in an image by deep learning inference using a plurality of learning models, in which a gender and age analysis query is broken down into a plurality of detailed functions so that deep learning inference can be performed according to the priorities of those detailed functions.

Description

Deep learning framework application database server and method for classifying gender and age

The present invention relates to a deep learning framework application database server and method for classifying gender and age, and more particularly to a server that analyzes a query and classifies the gender and age of a person in an image by deep learning inference using a plurality of learning models, and a method therefor.

In order to create a learning engine that provides intelligence based on deep learning technology, there are various difficulties such as deep network design, learning function setting, and parameter tuning. These problems cannot be easily solved unless you are a deep learning expert, so it is difficult for anyone to easily have a deep learning-based learning engine.

In addition, whenever a learning engine is created, common elements of deep learning are used redundantly, and the same process must be performed repeatedly. Furthermore, when a single server or device is used for deep learning, training and inference can take a long time depending on the amount of data.

In addition, various applications are possible through the deep learning framework database server, but applications for two or more functions may be required.

(Patent Document 1) KR10-2058124 B1

An object of the present invention is to provide a deep learning framework application database server and method for classifying gender and age with which even a user without specialized knowledge of deep learning can, without difficulty, train on data stored in an information database using a deep learning method in response to the user's request query and infer data corresponding to the query, which can use a model that has already been trained, which takes less time for deep learning training and inference, and which can be applied to classify the gender and age of a person from an image.

A deep learning framework application database server for classifying gender and age according to an embodiment of the present invention includes: an input/output unit that receives, from a user, an inference query for a gender and age classification function on a dataset for inference; a query analysis unit that analyzes the query and extracts a plurality of detailed functions for achieving the function of the query; a storage unit, serving as a database, that has a plurality of learning model tables and a dataset table; and a framework unit that interworks with the database and performs deep learning of the plurality of detailed functions using the learning model tables and the dataset table. The query analysis unit extracts from the inference query, as the plurality of detailed functions, a face detection function belonging to an upper group and gender and age classification functions belonging to a lower group, and selects, from among the plurality of learning model tables, a pre-trained face detection learning model table, gender classification learning model table, and age classification learning model table. The framework unit performs deep learning inference of each of the plurality of detailed functions in association with the face detection learning model table, the gender classification learning model table, and the age classification learning model table, respectively, and deep learning inference with the face detection learning model according to the face detection learning model table, which belongs to the upper group, may be performed first.

The server may further include: a dataset management module that converts the dataset for inference into a dataset table for inference; and an auxiliary management module that, when the framework unit infers using the face detection learning model that an original image included in the dataset table for inference contains a face, crops the face portion to generate a face image box, and generates mapping information that associates an image ID, which is a unique number of the original image, with a box ID, which is a unique number of the face image box, and maps location information of the face image box.

In addition, when the framework unit infers, by deep learning, the gender and age of the person in the face image box using the gender classification learning model and the age classification learning model according to the gender classification learning model table and the age classification learning model table, the auxiliary management module may map the inferred gender and age to the box ID.

In addition, the dataset management module may generate a result image by adding the gender and age of the person to the original image, using the mapping information, which includes the location information of the face image box in the original image and the image ID of the original image.

In addition, the query analysis unit may further extract a preprocessing detection function for detecting whether deep learning inference of the function of the query, which is the main function, is necessary for the dataset for inference, and the storage unit may include a preprocessing detection learning model table associated with the preprocessing detection function. The server may further include a control unit that has deep learning inference of the preprocessing detection function performed on the dataset for inference and, when the inference data is classified as requiring deep learning inference of the main function, enables deep learning inference of the plurality of detailed functions on the dataset for inference.

In addition, among the plurality of learning models according to the plurality of detailed functions, a first learning model has a plurality of tasks, and the plurality of tasks correspond to a plurality of rows of a network table provided in a first learning model table. Each of the plurality of tasks has a unique number corresponding to a row number of the network table. The server may further include a control unit that performs deep learning inference in a distributed manner such that a series of tasks from a first task to a second task among the plurality of tasks is performed by a first distributed server among a plurality of distributed servers, wherein each of the plurality of distributed servers includes a deep learning framework interworking with a database and includes the first learning model table. The control unit may send, to the first distributed server, the unique number of the first learning model table, a third result value list of a third task immediately preceding the first task, and a second row number of the network table corresponding to the unique number of the second task.

In addition, the control unit may receive, from a second distributed server among the plurality of distributed servers, the unique number of the learning model table and a fourth result value list of a fourth task among the plurality of tasks. Here, the fourth result value list is the list of result values obtained by performing the first task through the fourth task, and a fifth row number of the network table is located immediately after a fourth row number. If the control unit further receives, from the second distributed server, a sixth row number, which is the row at which the task instruction ends, the framework unit may take the fourth result value list as input and perform the operations of the fifth through sixth row numbers.
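The hand-off described above, in which one server passes a result value list and the next server resumes at a given row of the network table and stops at an end row, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the name `run_rows`, the representation of the network table as a Python list of callables, and the four toy tasks do not come from the specification.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a network table: row i (1-based) holds the i-th task.
Task = Callable[[List[float]], List[float]]


def run_rows(network_table: Sequence[Task], inputs: List[float],
             start_row: int, end_row: int) -> List[float]:
    """Run the tasks in rows start_row..end_row (inclusive, 1-based)."""
    values = list(inputs)
    for row in range(start_row, end_row + 1):
        values = network_table[row - 1](values)
    return values


# Toy 4-row network table (assumed tasks, for illustration only).
network_table: List[Task] = [
    lambda v: [x * 2.0 for x in v],      # row 1: scale
    lambda v: [x + 1.0 for x in v],      # row 2: shift
    lambda v: [max(x, 0.0) for x in v],  # row 3: clamp negatives to zero
    lambda v: [sum(v)],                  # row 4: reduce to a single value
]

# One server runs rows 1-2 and hands over its result value list.
partial = run_rows(network_table, [1.0, -3.0], 1, 2)
# Another server resumes at row 3 and stops at the end row 4.
final = run_rows(network_table, partial, 3, 4)
```

Because each row only consumes the previous row's result value list, splitting the rows across servers gives the same result as running rows 1 through 4 on a single server.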

In addition, when there is no pre-trained first learning model table related to a first detailed function among the plurality of detailed functions, the learning model management module selects a second learning model table suitable for the first detailed function. The second learning model table is converted into the first learning model table through deep learning training of the first detailed function based on an associated training dataset table. The deep learning training is distributed across a plurality of distributed servers, each of which may have a deep learning framework that interworks with the database.

The server may further include a control unit that distributes the batch size of the training dataset, the second learning model table, and the training dataset table to the plurality of distributed servers, wherein the deep learning framework application database server functions as a first distributed server among the plurality of distributed servers. The control unit randomly shuffles the data order of the training dataset table and divides it according to the batch size to convert it into a batch dataset table. The framework unit builds a model architecture using an architecture table belonging to the second learning model table, initializes a learning parameter table belonging to the second learning model table and assigns it to the model architecture to create a second learning model, and may perform deep learning training on the second learning model using a plurality of mini-batches of the batch dataset table.
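The shuffle-then-split step that produces mini-batches can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `make_batches`, the `seed` parameter, and the use of a plain Python list in place of a dataset table are all hypothetical.

```python
import random
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def make_batches(rows: Sequence[T], batch_size: int, seed: int = 0) -> List[List[T]]:
    """Shuffle the rows, then cut them into mini-batches of batch_size.

    The last mini-batch may be smaller when the row count is not a multiple
    of batch_size.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    shuffled = list(rows)                  # copy so the input stays untouched
    random.Random(seed).shuffle(shuffled)  # seeded only to keep the sketch reproducible
    return [shuffled[i:i + batch_size] for i in range(0, len(shuffled), batch_size)]


batches = make_batches(range(10), batch_size=4)
```

With ten rows and a batch size of four, this yields three mini-batches of sizes 4, 4, and 2 that together contain every row exactly once.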

In addition, the framework unit derives a new learning parameter when batch learning for one mini-batch among the plurality of mini-batches is completed, and the control unit spreads the new learning parameter to the remaining distributed servers among the plurality of distributed servers. The server may further include an integration unit that, when the new learning parameter is generated, integrates the new learning parameter with at least one learning parameter spread from the remaining distributed servers and updates the learning parameter to be applied to the next batch learning. When all allocated epochs are completed, the integration unit derives a final learning parameter by integrating the last learning parameter derived by the framework unit with at least one learning parameter last spread from the remaining distributed servers, and the control unit may convert the trained model architecture and the final learning parameter into the trained first learning model table.
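The text above does not state how learning parameters are integrated. One common choice, assumed here purely for illustration, is an element-wise average of the local parameter and the parameters received from the other servers. The name `integrate` and the dictionary-of-lists layout are hypothetical.

```python
from typing import Dict, List

# Hypothetical layout: parameter name -> flat list of values.
Params = Dict[str, List[float]]


def integrate(local: Params, received: List[Params]) -> Params:
    """Element-wise average of the local parameters and all received ones.

    Averaging is an assumption of this sketch, not the method stated in the text.
    """
    all_params = [local, *received]
    count = len(all_params)
    return {
        name: [sum(p[name][i] for p in all_params) / count
               for i in range(len(values))]
        for name, values in local.items()
    }


merged = integrate({"w": [1.0, 2.0]}, [{"w": [3.0, 4.0]}])
```

The merged parameters would then be the ones applied to the next batch learning step on each server.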

An inference method of a deep learning framework application database server for classifying gender and age according to an embodiment of the present invention includes receiving an inference query of a gender and age classification function of a dataset for inference from a user; extracting the inference query into a plurality of detailed functions, which are a face detection function of an upper group and a gender and age classification function of a lower group; selecting a pre-learned face detection learning model table, gender classification learning model table, and age classification learning model table; and performing deep learning in the order of deep learning inference of functions corresponding to the upper group and deep learning inference of functions corresponding to the lower group.

The method may further include: converting the dataset for inference into a dataset table for inference; performing deep learning inference of the face detection function on an original image provided in the dataset table for inference, using a face detection learning model according to the face detection learning model table; generating a face image box by cropping the face portion if it is inferred that the original image contains a face; generating mapping information by associating an image ID, which is a unique number of the original image, with a box ID, which is a unique number of the face image box; and adding location information of the face image box to the mapping information including the box ID.

The method may further include: inferring, by deep learning, the gender and age of the person in the face image box using the gender classification learning model and the age classification learning model according to the gender classification learning model table and the age classification learning model table; adding the inferred gender and age to the mapping information including the box ID; and generating a result image by adding the gender and age of the person to the original image, using the location information of the face image box in the original image and the mapping information having the image ID of the original image.
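The mapping information that ties an image ID to box IDs, box locations, and the inferred gender and age can be sketched as a small record type. This is an illustrative sketch only: the `Mapping` class, the helper names, and the `(x, y, width, height)` box layout are assumptions, and the two classifiers are stubs standing in for the gender and age classification learning models (which would operate on the cropped face image rather than on the box coordinates).

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple

Box = Tuple[int, int, int, int]  # assumed layout: x, y, width, height


@dataclass
class Mapping:
    image_id: int                 # unique number of the original image
    box_id: int                   # unique number of the face image box
    location: Box                 # where the box sits in the original image
    gender: Optional[str] = None  # filled in after gender inference
    age: Optional[int] = None     # filled in after age inference


def build_mappings(image_id: int, boxes: Sequence[Box],
                   first_box_id: int = 1) -> List[Mapping]:
    """Create one mapping record per detected face box."""
    return [Mapping(image_id, first_box_id + i, box) for i, box in enumerate(boxes)]


def annotate(mappings: List[Mapping],
             classify_gender: Callable[[Box], str],
             classify_age: Callable[[Box], int]) -> None:
    """Attach the inferred gender and age to each box record."""
    for m in mappings:
        m.gender = classify_gender(m.location)
        m.age = classify_age(m.location)


def labels_for_image(mappings: List[Mapping], image_id: int) -> List[Tuple[Box, str]]:
    """Collect (location, label) pairs to draw onto the original image."""
    return [(m.location, f"{m.gender}, {m.age}")
            for m in mappings if m.image_id == image_id]


mappings = build_mappings(image_id=7, boxes=[(10, 20, 50, 50), (200, 40, 60, 60)])
annotate(mappings, classify_gender=lambda box: "female", classify_age=lambda box: 30)
labels = labels_for_image(mappings, image_id=7)
```

Keeping the image ID and box location in the same record is what lets the result image be produced later without re-running face detection.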

According to the present invention, query-based deep learning technology connects a deep learning framework to a database in the form of a plug-in, so that deep learning training and inference can be performed on data stored in the database in response to a user's request query. As a result, even users without specialized knowledge can use deep learning training and inference. By distributing deep learning, the time required to perform deep learning can be reduced. In addition, the gender and age of a person can be classified from an image through deep learning inference.

FIG. 1 is a configuration diagram schematically showing the overall configuration of a query-based deep learning inference system according to an embodiment of the present invention.

FIG. 2 is a control configuration diagram of a database server according to an embodiment of the present invention.

FIG. 3 is a data management configuration diagram according to an embodiment of the present invention.

FIG. 4 is a database structure diagram according to an embodiment of the present invention.

FIG. 5 is a control configuration diagram of a conversion unit according to an embodiment of the present invention.

FIGS. 6 and 7 are conversion operation diagrams of a conversion unit according to an embodiment of the present invention.

FIG. 8 is a flowchart showing the execution flow of a query-based machine learning technique according to an embodiment of the present invention.

FIG. 9 is an operational flowchart for explaining a query-based deep learning inference method according to an embodiment of the present invention.

FIG. 10 is a schematic configuration diagram of a database-linked deep learning distribution system according to another embodiment of the present invention.

FIG. 11 is a block diagram of a main server and distributed servers according to FIG. 10.

FIG. 12 shows a dataset of a main server and a dataset for training of a distributed server.

FIG. 13 is a flowchart of a training method of the system of FIG. 10.

FIG. 14 is a flowchart of an inference method of the system of FIG. 10.

FIGS. 15 to 17 are signal flow diagrams according to different embodiments of the asynchronous distributed server of FIG. 13.

FIGS. 18 and 19 are signal flow diagrams according to different embodiments of the synchronous distributed server of FIG. 13.

FIG. 20 is a signal flow diagram according to the distributed inference of FIG. 14.

FIG. 21 schematically illustrates the learning model.

FIG. 22 shows part of the intermediate result table according to FIG. 20.

FIG. 23 shows part of the network table.

FIG. 24 is a detailed block diagram of some components of a deep learning framework application database server for classifying gender and age according to another embodiment of the present invention.

FIG. 25 is a flowchart of a method of classifying gender and age according to another embodiment of the present invention.

FIG. 26 shows the intermediate data of FIGS. 24 and 25.

Hereinafter, the present invention will be described in more detail with reference to the drawings.

Terms such as first and second may be used to describe various components, but the components should not be limited by the terms. These terms are only used for the purpose of distinguishing one component from another. For example, a first element may be termed a second element, and similarly, a second element may be termed a first element, without departing from the scope of the present invention. The term "and/or" includes any combination of a plurality of related recited items or any one of a plurality of related recited items.

It should be understood that when an element is referred to as being "connected" or "coupled" to another element, it may be directly connected or coupled to the other element, but other elements may exist in between. On the other hand, when an element is referred to as being "directly connected" or "directly coupled" to another element, it should be understood that no other element exists in between. In addition, that a first component and a second component on a network are connected or coupled means that data can be exchanged between the first component and the second component in a wired or wireless manner.

In addition, the suffixes "module" and "unit" for the components used in the following description are simply given in consideration of ease of preparation of this specification, and do not themselves give a particularly important meaning or role. Accordingly, the “module” and “unit” may be used interchangeably.

When these components are implemented in actual applications, two or more components may be combined into one component, or one component may be subdivided into two or more components as needed. The same reference numerals have been assigned to the same or similar components throughout the drawings, and detailed descriptions of components having the same reference numerals may be omitted as they are replaced with descriptions of the components described above.

Furthermore, the present invention covers all possible combinations of the embodiments shown herein. The various embodiments of the present invention are different but not mutually exclusive. One embodiment of the particular shape, structure, function, and characteristic described herein may be implemented in another embodiment. For example, components mentioned in the first and second embodiments may perform all functions of the first and second embodiments.

FIG. 1 is a configuration diagram schematically showing the overall configuration of a query-based deep learning inference system according to an embodiment of the present invention. FIG. 2 is a control configuration diagram of a database server according to an embodiment of the present invention. FIG. 3 is a data management configuration diagram according to an embodiment of the present invention. FIG. 4 is a database structure diagram according to an embodiment of the present invention. FIG. 5 is a control configuration diagram of a conversion unit according to an embodiment of the present invention. FIGS. 6 and 7 are conversion operation diagrams of a conversion unit according to an embodiment of the present invention. FIG. 8 is a flowchart showing the execution flow of a query-based machine learning technique according to an embodiment of the present invention. FIG. 9 is an operational flowchart for explaining a query-based deep learning inference method according to an embodiment of the present invention.

Referring to FIG. 1 , a query-based deep learning inference system 1 according to an embodiment of the present invention may apply query-based machine learning technology. To this end, the query-based deep learning inference system 1 may include a database server 10 and a terminal 20.

Here, query-based deep learning technology may refer to a technology in which, when a user transmits a request such as deep learning as a query to the database (DB) server 10 through the terminal 20, a deep learning framework connected to the database server 10 performs machine learning, deep learning, inference, and the like using the data stored in the database server 10.

Deep learning can be a set of machine learning algorithms that attempt a high level of abstraction through a combination of several nonlinear transformation methods. Machine learning is a field of artificial intelligence, which can refer to the field of developing algorithms and techniques that allow computers to learn. Artificial intelligence refers to a computer system having functions of human intelligence, and may refer to artificially implementing human intelligence in a machine or the like. In this specification, 'deep learning' is not limited to deep learning technology itself, but can be interpreted as extending to machine learning or artificial intelligence.

The terminal 20 may be any one or a combination of a smartphone, a portable terminal, a mobile terminal, a personal digital assistant (PDA), a portable multimedia player (PMP) terminal, a telematics terminal, a navigation terminal, a personal computer, a notebook computer, a slate PC, a tablet PC, an ultrabook, a wearable device (for example, a watch-type terminal (smartwatch), a glasses-type terminal (smart glasses), or a head mounted display (HMD)), a WiBro terminal, an Internet Protocol Television (IPTV) terminal, a smart TV, a digital broadcasting terminal, an Audio Video Navigation (AVN) terminal, an audio/video (A/V) system, and a flexible terminal. The terminal 20 may further include a server computer.

The terminal 20 can access the database server 10 (hereinafter referred to as DB server). A user or manager may send a query to the DB server 10 through the terminal 20 or receive a result according to the query.

The DB server 10 may be a server that operates a database or is connected to and controls a database. The DB server 10 may refer to a concept including a set of integratedly managed data (database) and middleware that manages them. The database server 10 may mean a database management system (DBMS). The database may also be used in the sense of a DB server 10 or a database management system (DBMS).

The DB server 10 may mean any device that works according to a query or generates a result according to a query. The query may follow SQL (Structured Query Language) syntax. The database of the DB server 10 is preferably a relational database.

The terminal 20 may input a deep learning inference query and receive an inference result corresponding to the query from the DB server 10 .

The terminal 20 may request various functions to the DB server 10 through a query and receive a response from the DB server 10 . The terminal 20 may check or modify data stored in the DB server 10 or add new data through a query. The terminal 20 may check or modify the learning model stored in the DB server 10 through a query and create a learning model for new learning. The terminal 20 may select data and learning models through a query, set parameters, request machine learning, and check intermediate and final results of learning. The terminal 20 may select data and a pre-learned learning model through a query, request machine inference, and check the inference result.

Referring to FIG. 2 , the DB server 10 may include a control unit 100, a storage unit 200, a framework unit 300, a conversion unit 360, and an input/output unit 370.

The input/output unit 370 may be its own interface device. The input/output unit 370 may include an input device and an output device separately.

The output device may output a video signal and/or an audio signal. The output device may be a display device such as a monitor, and/or a speaker.

The input device may generate input data that a user inputs to control the operation of the DB server 10 . The input device may include a user manipulation device such as a keyboard, key pad, touch pad, and mouse.

The input device and the output device may be implemented as a single device, such as a touch screen.

The input device may input an audio signal and/or a video signal to the DB server 10. The input device may include a camera and a microphone.

The input device may include a sensor device. The sensor device may include a temperature sensor, humidity sensor, brightness sensor, dust sensor, pressure sensor, vibration sensor, voltage sensor, current sensor, parallel sensor, magnetic sensor, light sensor, proximity sensor, distance sensor, inclination sensor, gas sensor, thermal sensor, flame detection sensor, metal detection sensor, Hall sensor, and the like. The sensor device may generate temperature, humidity, brightness, dust (carbon), pressure, vibration, voltage, current, parallel, magnetic, illuminance, proximity, distance, tilt, gas, heat detection, flame detection, metal detection, and rotation amount data.

The input/output unit 370 may serve as an interface with all external devices connected to the DB server 10. Examples of the external device may include a wired/wireless data port, a socket for a card such as a memory card, an audio input/output (I/O) terminal, and a video input/output (I/O) terminal. The input/output unit 370 may receive data from such an external device or transmit data inside the DB server 10 to an external device.

The input/output unit 370 may perform a communication function. For communication, at least one short-range communication protocol such as Bluetooth, Radio Frequency Identification (RFID), Ultra Wideband (UWB), and ZigBee may be used. Communications may include Internet access. The input/output unit 370 may exchange data with an external device, for example, the terminal 20 through communication.

Although the terminal 20 is shown as a separate device in this specification, the input/output unit 370 may perform the functions of the terminal 20 . That is, the terminal 20 may be replaced (omitted) by the input/output unit 370, and the present invention may be implemented.

The input/output unit 370 is in charge of communication with the user's communication means (terminal 20), and can control the communication protocol and the data format on the network used with communication equipment and computing equipment, which are the user's various types of connection means.

Examples of the data format may include Open Neural Network Exchange Format (ONNX), Neural Network Exchange Format (NNEF), or Comma-separated values (CSV).

The input/output unit 370 may be a channel that receives a control command or query from a user and provides a result to the user.

The storage unit 200 may store data and programs necessary for the DB server 10 to operate. The storage unit 200 may store programs for processing and control of the control unit 100 and may perform a function for temporarily storing input or output data.

The storage unit 200 may mean a device that stores data as a database or a database itself.

The storage unit 200 may store information about job performance, history of previous jobs, and users. The storage unit 200 may store information and/or data through connection with a storage device provided separately from the outside or a storage device provided in an external computer network. Deep learning results with big data characteristics can be distributed and stored or stored separately externally, and can be called and applied upon request.

The control unit 100 may execute overall control functions of the DB server 10 by controlling the operation of each unit of the DB server 10 .

The control unit 100 may access data in the database, manage data, or create data in a table. Data management may mean inquiring, modifying, and/or uploading data.

The control unit 100 may control all functions for interpreting and executing a user's query, performing a task according to a query, or providing a result.

Referring to FIGS. 3 and 4, the control unit 100 may include a query analysis unit 110, a dataset management module 120, a learning model management module 130, and a result management module 160, and the storage unit 200 may store a query analysis value 210, a dataset 220, a learning model 230, and a learning result 260.

The query analysis unit 110 may interpret and/or analyze a query requested by a user and store it as a query analysis value 210 . The query analysis value 210 may include a function and/or content of a query. Queries can be largely classified into training (learning) and inference. The query analysis value 210 may store a value distinguishing whether a query is training or inference.

The function of the query may be a request for deriving a result value that the user wants to obtain through deep learning. For example, a query to recognize text may be a function to classify content of text by detecting text from image data.

The function of the query is not limited to the user's request. For example, a user's request may be one, but multiple detailed functions may be required to carry it out. The query analyzer 110 may analyze a query to extract a plurality of detailed functions necessary for performing deep learning.

A plurality of detailed functions can be divided into upper categories and lower categories. For example, in the case of a query to distinguish the gender of a person, a detailed function for detecting a face in image data of an upper category and a detailed function for classifying the gender of a detected face in a lower category may be extracted. In this case, a detailed function of a higher category may be performed first, and then a detailed function of a lower category may be performed.
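The ordering rule described above, in which upper-category detailed functions run before lower-category ones, can be sketched with a small lookup and a stable sort. The catalog contents, the function identifiers, and the numeric levels (0 for the upper category, 1 for the lower) are assumptions made for this sketch and are not defined in the specification.

```python
from typing import Dict, List, Tuple

# Hypothetical catalog: query function -> (detailed function, category level).
# Level 0 is the upper category; level 1 is the lower category.
DETAILED_FUNCTIONS: Dict[str, List[Tuple[str, int]]] = {
    "gender_classification": [
        ("gender_classification", 1),
        ("face_detection", 0),
    ],
    "gender_age_classification": [
        ("gender_classification", 1),
        ("age_classification", 1),
        ("face_detection", 0),
    ],
}


def plan(query_function: str) -> List[str]:
    """Return the detailed functions in execution order (upper category first)."""
    detailed = DETAILED_FUNCTIONS[query_function]
    # sorted() is stable, so functions at the same level keep their listed order.
    return [name for name, _level in sorted(detailed, key=lambda item: item[1])]


order = plan("gender_age_classification")
```

For the gender and age query, face detection is scheduled first, followed by the two classification functions that depend on its output.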

The contents of the query may be additional various things other than functions. For example, it may be selecting a specific learning model, or designating a dataset for training or inference.

The dataset 220 managed by the dataset management module 120 refers to a set of information or data having the same format to be used for learning and reasoning. Information or data includes numbers, texts, images, videos, and voices, and may be any type of information or data used in machine learning.

The same format of data that can be clustered into the dataset 220 can be defined based on the extension. For example, in the case of image information, if the extension indicates an image, all of them are clustered in a dataset of the same category.
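Grouping incoming files into datasets by extension can be sketched as below. The set of image extensions, the category names, and the function name `cluster_by_extension` are illustrative assumptions.

```python
from collections import defaultdict
from pathlib import PurePath
from typing import Dict, Iterable, List

# Assumed set of extensions treated as "image"; not taken from the text.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp"}


def cluster_by_extension(filenames: Iterable[str]) -> Dict[str, List[str]]:
    """Group file names into dataset categories based on their extension."""
    datasets: Dict[str, List[str]] = defaultdict(list)
    for name in filenames:
        ext = PurePath(name).suffix.lower()  # "" when the file has no extension
        if ext in IMAGE_EXTENSIONS:
            category = "image"
        else:
            category = ext.lstrip(".") or "unknown"
        datasets[category].append(name)
    return dict(datasets)


clusters = cluster_by_extension(["a.JPG", "b.png", "c.csv", "README"])
```

Here every file whose extension indicates an image lands in the same `image` category regardless of the specific image format.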

Here, image information is described as an example, but as described above, the data used may be any type of data that can be used for machine learning, such as numbers, text, images, video, and voice.

The dataset management module 120 may cluster information or data (hereinafter referred to as 'data') received from the outside into the same dataset according to its format (e.g., extension), or classify it according to its content. When the data is classified according to its content, the dataset management module 120 may use a data classification learning model that classifies data into the same data format. The data classification learning model can be stored in the DB server 10 and called and used when necessary.

The dataset management module 120 may preprocess data so that the dataset 220 is well applied to the learning model 230 . Data preprocessing can transform the data to fit the tensors (vectors) of the learning model. As an example of data preprocessing, there may be an example of converting words into index numbers of a dictionary used for deep learning.
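The word-to-index preprocessing mentioned above can be sketched in a few lines. The function names, the whitespace tokenization, and the reserved `<unk>` entry for unseen words are assumptions of this sketch.

```python
from typing import Dict, Iterable, List


def build_vocab(sentences: Iterable[str]) -> Dict[str, int]:
    """Assign each distinct word an index; index 0 is reserved for unknown words."""
    vocab: Dict[str, int] = {"<unk>": 0}
    for sentence in sentences:
        for word in sentence.lower().split():
            vocab.setdefault(word, len(vocab))  # next free index for a new word
    return vocab


def encode(sentence: str, vocab: Dict[str, int]) -> List[int]:
    """Convert a sentence into dictionary index numbers."""
    return [vocab.get(word, 0) for word in sentence.lower().split()]


vocab = build_vocab(["deep learning", "deep query"])
encoded = encode("query deep inference", vocab)
```

The resulting integer lists are the kind of input that can then be packed into the tensors a learning model expects.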

The dataset management module 120 may convert data of a first format into data of a second format, and may manage the data of the second format as one group of datasets. For example, the dataset management module 120 may extract the images of video data frame by frame and convert (decode) them into a group of datasets. Conversely, the dataset management module 120 may encode a series of images into a video. The series of images may be processed images. That is, the dataset management module 120 may convert video data into a group of image datasets, and convert a group of processed (for example, mosaicked) image datasets into a video.

The dataset management module 120 may provide a video streaming service. For example, the dataset management module 120 may provide the streaming service by encoding a series of images or by using a stored video file.

When a new dataset is created, the dataset management module 120 creates a new table (dataset table), and searches or modifies data or adds new data in the dataset table.

The dataset management module 120 may access a database table and retrieve data. The dataset management module 120 may show the result of searching for data in the database through a query written by the user to the user. The dataset management module 120 may limit the level at which data can be modified according to the authority granted to the user. The dataset management module 120 may receive numerical data from a user or read one or more files to perform data upload. The dataset management module 120 may provide a tagging function capable of labeling training data.

In this specification, a dataset table and a dataset may be used as the same meaning. In particular, in a relational database, a dataset refers to a data set in relational data format stored as a dataset table. Relational data format refers to a model that defines and describes data using a tabular format. This can be equally applied to a learning model, a learning model table, a learning result, and a learning result table, which will be described later. However, the substance and/or format of both may be different.

The learning model (LM) management module 130 may manage the learning model table 230 used for machine learning (deep learning, etc.).

In this embodiment, the learning model table 230 may include an architecture table and a learning parameter table. The architecture table may include a network table and a hyperparameter table.

The learning model table 230 may correspond to a learning model used by the framework unit 300 .

In this embodiment, the learning model (learning network model) 230 is a judgment model that can be learned based on a data set based on an artificial intelligence algorithm, and may be a model based on a neural network. This judgment model can be designed to simulate human brain structure on a computer.

The judgment model may include a plurality of network nodes having weights that simulate neurons of a human neural network. A plurality of network nodes may each form a connection relationship to simulate synaptic activity of neurons that transmit and receive signals through synapses. This judgment model may include a machine learning model, a neural network model, and/or a deep learning model.

The learning network model may be implemented as at least one of models such as an Artificial Neural Network (ANN) model, a Deep Neural Network (DNN) model, a Convolutional Neural Network (CNN) model, and a Recurrent Neural Network (RNN) model. The models are not limited to these examples. For example, a Long Short-Term Memory (LSTM) network, Gated Recurrent Units (GRU), Generative Adversarial Networks (GAN), or a Super-Resolution GAN (SRGAN) model may also be used, and the models are not limited to these names.

In general, the learning model 230 may include an architecture and parameters.

Architecture (model architecture) refers to the structure of a machine learning model. The architecture may include the number of layers corresponding to the structure of the learning model, the number of units, types of layers, and how units are connected. This can be represented as an architectural structure.

The architecture may correspond to the architecture table of the learning model table. The structure of an architecture may be referred to as a network model or network. The architecture structure may correspond to the network table of the learning model table. Architecture may mean that hyperparameters are assigned to an architecture structure. To build an architecture, you may need network tables and hyperparameter tables.

Parameters may include hyperparameters and learning parameters.

Hyperparameters define the input/output and the inside of the model, and may include a learning rate, an optimization method (learning method; optimizer), a type of layer, an input/output size, parameters required for calculation, and the like. Hyperparameters allow architectures to be implemented. Hyperparameters can act as a component of an architecture. Hyperparameters are heuristic-based, that is, they can be set directly by humans. Also, hyperparameter optimization may be implemented as a separate optimizer module.

Learning parameters may include weights and/or biases. A weight is a value used for interaction with input data, and a model weight corresponding to a model architecture may exist. A value of the learning parameter may be changed by an optimizer. Learning parameters may simply be referred to as 'parameters'.

The optimizer may change the learning parameters so that the learning model has a desired function. Learning (deep learning) or training can mean changing these learning parameters. The optimizer may be implemented by the framework unit 300 or a separate element.

The hyperparameter and learning parameter may correspond to the hyperparameter table and learning parameter table described above.

The learning model management module 130 may create a new network model by adding a supported layer and adjusting layer parameters (type of layer, input/output size, parameter required for calculation).

The learning model management module 130 may query a network model list previously created. The learning model management module 130 may create a new network model by adding a new layer to an existing network model. This can be implemented through tuning of hyperparameters. These series of tasks may be initiated by a user's query.

The learning model management module 130 may provide a function of visualizing and displaying the network model. Through this, the user can easily look at the structure of the hidden layer.

In addition, the learning model 230 may further include a loss function defining a feedback signal to be used for learning and a separate optimizer module for determining a learning progress method. The loss function and optimizer may be included in the framework unit 300 .

The learning model 230 may be stored in a database in a learning model table format, which is a relational data format.

Examples of functions of the learning model include recognizing text input by a user, recognizing voice or text included in an image, audio, or video, and analyzing a user's intention from the recognized voice or text.

The learning model management module 130 may select a specific learning model table suitable for a query from among a plurality of learning model tables. The learning model management module 130 may select the learning model table 230 according to any one of the contents of a query or a model selection policy.

If there is a specific learning model designated by the user in the content of the query, the learning model management module 130 selects the corresponding learning model table. For example, in the case of a query requesting inference with a learned learning model according to a training query, the learning model management module 130 preferably selects a corresponding learning model table.

A model selection policy may be a guideline for selecting a learning model table based on features of the query and/or dataset tables associated with the query. For example, according to a model selection policy, the learning model management module 130 may select a learning model table similar to the query function from among functions of a plurality of learning model tables. Also, according to the model selection policy, the learning model management module 130 may select a learning model table having a data structure similar to the query function and a data set table associated with the query.

The main techniques of learning models may include binary classification, multiclass classification, regression analysis, numerical prediction, time series prediction, sentiment analysis, clustering, anomaly detection, resource reduction, reinforcement learning, and the like. According to the model selection policy, the learning model management module 130 may select a learning model table having a technology suitable for the function of the query.
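
As a non-limiting illustration, the selection order described above (a model designated in the query first, otherwise a model selection policy) can be sketched as follows. The dictionary keys `model` and `technique` and the function name `select_model_table` are assumptions for this sketch only.

```python
from typing import Dict, Optional


def select_model_table(query: Dict[str, str],
                       tables: Dict[str, Dict[str, str]]) -> Optional[str]:
    """Return the name of a learning model table for the query, or None."""
    # 1. A learning model explicitly designated in the query takes precedence.
    designated = query.get("model")
    if designated in tables:
        return designated
    # 2. Otherwise apply a (hypothetical) policy: match the query's technique.
    for name, info in tables.items():
        if info.get("technique") == query.get("technique"):
            return name
    return None


tables = {
    "gender_model": {"technique": "binary classification"},
    "age_model": {"technique": "multiclass classification"},
}
by_name = select_model_table({"model": "age_model"}, tables)
by_policy = select_model_table({"technique": "binary classification"}, tables)
```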

The learning model management module 130 may select a previously trained learning model table. In this case, the learning model management module 130 verifies and tests whether the existing learning model works properly, and if it does, it can output the existing learning model table as the result of deep learning training. If verification and testing show that the model does not work properly, or if the format or number of input data items differs, a new learning model table can be produced by performing deep learning training on the selected learning model table.

Referring to FIG. 4 , the learning model table may include a network table (qml_network_t). The architecture can be converted and stored in the network table (qml_network_t) format, which is a relational data format, in the database. The network table (qml_network_t) may be converted into an architecture of the learning model 230 . This may be converted by the conversion unit 360.

The network table may include a plurality of sub-network tables (qml_s_network_t). For example, in the case of learning a network model with Multi GPU (N), N sub-network tables may be provided. In the case of inferring a network model, one sub-network table may be provided.

The network table or sub-network table may include a plurality of layer tables (qml_layer_t) related to layers constituting the network. Layers constituting the architecture of the learning model 230 may be converted into a layer table (qml_layer_t) and stored. The layer table (qml_layer_t) may be converted into a layer of the learning model 230 .

The layer table (qml_layer_t) may include a plurality of tensor tables (qml_tensor_t). The tensor table may be a 4-dimensional tensor in NCHW format. A tensor table may include dtype, qml_shape_t, data, name, and the like. The tensor table and the tensors of the learning model 230 may be converted to each other.
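
As a non-limiting illustration, the nesting of network, layer, and tensor tables described above can be sketched with plain data classes. The class names and all fields other than dtype, shape, data, and name are assumptions for this sketch; the actual columns of qml_network_t, qml_layer_t, and qml_tensor_t are not specified here.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TensorRow:
    """One row of a tensor table: a 4-dimensional tensor in NCHW format."""
    name: str
    dtype: str
    shape: Tuple[int, int, int, int]  # (N, C, H, W)
    data: bytes = b""


@dataclass
class LayerRow:
    """One row of a layer table, holding its tensors (layer_type is assumed)."""
    name: str
    layer_type: str
    tensors: List[TensorRow] = field(default_factory=list)


@dataclass
class NetworkRow:
    """One row of a network table, holding its layers."""
    name: str
    layers: List[LayerRow] = field(default_factory=list)


def count_elements(net: NetworkRow) -> int:
    """Total number of tensor elements stored across all layers."""
    total = 0
    for layer in net.layers:
        for tensor in layer.tensors:
            n, c, h, w = tensor.shape
            total += n * c * h * w
    return total


conv_weight = TensorRow("conv1.weight", "float32", (8, 3, 3, 3))
net = NetworkRow("demo", [LayerRow("conv1", "Conv", [conv_weight])])
```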

Parameters of the learning model 230 may be stored as a parameter table. Parameters and parameter tables of the learning model 230 may be converted to each other. This may be converted by the conversion unit 360.

According to the DB schema designed in advance in the present invention, the model architecture and model weight may be stored in the DB table. The pre-designed DB schema can easily classify dataset tables and learning model tables that are similar to each other. When the DB server 10 receives a new data set, it can call a similar learning model among stored relational data format learning models and apply it to the new data set.

For example, the similarity between the input dataset and a pre-stored learning model can be determined according to the similarity of the degree, which is the external form, and of the attribute and domain, which are the content, among table elements such as 'attribute, domain, degree, tuple, cardinality, relation, key, candidate key, and primary key'. The similarity determination may be performed by the learning model management module 130.

That is, after a first relational data format learning model has been created, used, and stored in the database, when a dataset of a similar format is input in order to create a relational data format learning model, a model with high similarity can be searched for among the existing relational data format models stored in the database, called, and applied. As a result, the time needed to generate a suitable learning model can be shortened and computing resources can be used efficiently.
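
As a non-limiting illustration, a similarity determination based on degree (number of attributes) and on attribute and domain (content) could be sketched as follows. The equal weighting of the two scores and the use of a Jaccard ratio are assumptions for this sketch; the embodiment does not prescribe a specific formula.

```python
from typing import Dict

Schema = Dict[str, str]  # attribute name -> domain (type name)


def schema_similarity(a: Schema, b: Schema) -> float:
    """Score in [0, 1] combining degree similarity and attribute/domain overlap."""
    if not a or not b:
        return 0.0
    # External form: ratio of the degrees (number of attributes).
    degree_score = min(len(a), len(b)) / max(len(a), len(b))
    # Content: Jaccard overlap of (attribute, domain) pairs.
    pairs_a, pairs_b = set(a.items()), set(b.items())
    content_score = len(pairs_a & pairs_b) / len(pairs_a | pairs_b)
    return 0.5 * degree_score + 0.5 * content_score  # assumed equal weights


def most_similar(new: Schema, stored: Dict[str, Schema]) -> str:
    """Name of the stored model whose input schema is most similar to `new`."""
    return max(stored, key=lambda name: schema_similarity(new, stored[name]))


stored = {
    "face_model": {"image": "blob", "age": "int", "gender": "text"},
    "sales_model": {"date": "text", "amount": "real"},
}
new_dataset = {"image": "blob", "age": "int", "gender": "text"}
best = most_similar(new_dataset, stored)
```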

The learning model table can serve as a guide so that users or administrators do not omit components when performing tasks, as components are linked in relational data format.

The framework unit 300 may use the elements stored as tables of the database structure as they are or may be used after being manipulated to be suitable for use in the framework unit 300 . This manipulation may be performed by the framework unit 300 or the conversion unit 360 .

The result management module 160 outputs each layer generated during machine learning, intermediate output values, parameter values, evaluation index values (learning loss values of deep learning functions) of models in which calculations are performed, and machine inference result values. Such learning results 260 may be stored in a database or managed so that the user can check them by calling them.

In addition to the dataset 220 table, the learning model 230 table, and the learning result 260 table, the storage unit 200 may further store a project table, a job table, and a common table.

The job table may include user information, project status, logs, and the like. The common table may include lookup tables such as layer types and error codes.

The project table may store the actual learning model copied from the learning model table, or project information for inference. Once a project is created, it has a structure separate from the learning model table, so even if the base network used in the project is modified, the learning model already established is not affected.

The storage unit 200 may store a large number of variable data (input/output data and weight information) in a BLOB (Binary Large Object) or text type. The storage unit 200 may divide and store records for a small number of variable data (e.g., parameters for each layer).
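
As a non-limiting illustration, storing bulky variable data as a BLOB while keeping small per-layer parameters as separate records can be sketched with an in-memory SQLite database. SQLite, the table names `weights` and `layer_params`, and the float32 little-endian packing are assumptions for this sketch.

```python
import sqlite3
import struct
from typing import List

conn = sqlite3.connect(":memory:")
# Large variable data (weights) goes into a single BLOB column.
conn.execute("CREATE TABLE weights (layer_name TEXT PRIMARY KEY, data BLOB)")
# Small variable data (per-layer parameters) is split into one record each.
conn.execute("CREATE TABLE layer_params (layer_name TEXT, key TEXT, value TEXT)")


def save_weights(layer_name: str, values: List[float]) -> None:
    """Pack the values as little-endian float32 and store them as one BLOB."""
    blob = struct.pack(f"<{len(values)}f", *values)
    conn.execute("INSERT INTO weights VALUES (?, ?)", (layer_name, blob))


def load_weights(layer_name: str) -> List[float]:
    """Read the BLOB back and unpack it into a list of floats."""
    (blob,) = conn.execute(
        "SELECT data FROM weights WHERE layer_name = ?", (layer_name,)
    ).fetchone()
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


save_weights("fc1", [0.5, -1.25, 2.0])
conn.executemany(
    "INSERT INTO layer_params VALUES (?, ?, ?)",
    [("fc1", "units", "3"), ("fc1", "activation", "relu")],
)
restored = load_weights("fc1")
```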

The control unit 100 may store all input/output data used for machine learning (training) and machine inference, and may store the models used for machine learning and machine inference. The control unit 100 may provide procedures corresponding to a user's query request so that machine learning is performed according to the user's request.

Procedures include Insert Network, Insert Layer, Make Project, Input Data Loader, Init Network, Train, Save Model, and Test.

Insert Network may create a network table including network (architecture) name, network type, dataset name, optimizer type, optimizer parameters, learning rate, batch size, number of trainings, and output layer index.

Insert Layer may register a layer table including network ID, layer name, layer type, layer index, layer parameter, and input layer index.

Make Project can create a project that includes the project name, dataset name, network name, a training or inference flag, and the number of GPUs.

Input Data Loader may load data according to the user's input selection (layer index and query type: learning table, learning data, verification table, or verification data).

Init Network may construct a network model.

Train can start training, taking the project ID, number of training epochs, batch size, whether to train later, storage interval, verification interval, and GPU synchronization interval.

Save Model can copy the network information of the project table to the network table (project name, network name).

Test can initiate inference, taking the project ID and a flag indicating whether to save results from all layers.
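
As a non-limiting illustration, the Insert Network and Insert Layer procedures can be sketched against an in-memory SQLite database. Only the table names qml_network_t and qml_layer_t come from the description; SQLite, the column names, the reduced column set, and the function signatures are assumptions for this sketch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE qml_network_t (
    network_id INTEGER PRIMARY KEY, name TEXT, network_type TEXT,
    dataset_name TEXT, optimizer_type TEXT, learning_rate REAL,
    batch_size INTEGER);
CREATE TABLE qml_layer_t (
    network_id INTEGER, layer_index INTEGER, name TEXT,
    layer_type TEXT, layer_params TEXT, input_layer_index INTEGER);
""")


def insert_network(name: str, network_type: str, dataset_name: str,
                   optimizer_type: str, learning_rate: float,
                   batch_size: int) -> int:
    """Register a network row and return its generated network ID."""
    cur = conn.execute(
        "INSERT INTO qml_network_t (name, network_type, dataset_name, "
        "optimizer_type, learning_rate, batch_size) VALUES (?, ?, ?, ?, ?, ?)",
        (name, network_type, dataset_name, optimizer_type,
         learning_rate, batch_size),
    )
    return cur.lastrowid


def insert_layer(network_id: int, layer_index: int, name: str,
                 layer_type: str, layer_params: str,
                 input_layer_index: int) -> None:
    """Register one layer row belonging to the given network."""
    conn.execute(
        "INSERT INTO qml_layer_t VALUES (?, ?, ?, ?, ?, ?)",
        (network_id, layer_index, name, layer_type, layer_params,
         input_layer_index),
    )


net_id = insert_network("age_gender_net", "CNN", "faces", "SGD", 0.01, 32)
insert_layer(net_id, 0, "conv1", "Conv", "kernel=3", -1)  # -1: no input layer
insert_layer(net_id, 1, "relu1", "Relu", "", 0)
layer_count = conn.execute(
    "SELECT COUNT(*) FROM qml_layer_t WHERE network_id = ?", (net_id,)
).fetchone()[0]
```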

The framework unit 300 may perform machine learning using various machine learning frameworks or deep learning frameworks.

A framework may be a kind of package in which various libraries or modules for application program development are bundled into one for efficient use. Developers or administrators can quickly and easily use numerous libraries that have already been verified and various deep learning algorithms that have been pre-trained through the framework.

Deep learning frameworks may include TensorFlow, Torch/PyTorch, Deeplearning4j, CNTK (Microsoft Cognitive Toolkit), Keras, ONNX (Open Neural Network Exchange), MXNet, Caffe, QML (Quantum Machine Learning), and the like.

The framework unit 300 may be a deep learning framework installed as a plug-in in the DB server 10. This can be expressed as a database interworking framework (deep learning framework) and a database application framework (deep learning framework).

The framework unit 300 may be executed when called by the control unit 100 of the DB server 10. When called, the framework unit 300 may receive various data as arguments from the control unit 100 and return execution results. The framework unit 300 may construct a network within the framework by interpreting a network model defined in a relational data format. This interpretation may be performed by the conversion unit 360.

The framework unit 300 may receive learning parameters and learning data from the control unit 100 as arguments, train the network configured inside the framework, and return a learning result. The framework unit 300 may receive input data from the control unit 100 as an argument, perform machine inference using the network configured inside the framework, and return a result.

When a query is input, the framework unit 300 may check and modify the learning model stored in the DB server 10 and create a learning model for new learning. The framework unit 300 may perform machine learning by selecting information or data and a learning model according to an input query and setting learning parameters. The framework unit 300 may provide intermediate results and final results of learning. The framework unit 300 may execute machine inference by selecting data and a pre-learned learning network model through an input query, and provide the inference result.

In this embodiment, the framework unit 300 may include the QML module 310 as an internal framework. The internal framework may include other frameworks in addition to the QML module 310. This may provide the user with various options.

The QML module 310 may implement QML plug-in functions. The QML module 310 may be equipped with QML, which is a framework capable of performing deep learning. The QML module 310 is connected to the database through a User Defined Function (UDF) and can be executed by a call.

Each function defined in the framework is registered in the database through UDF, and the framework can be executed through the registered UDF call.

The types of argument variables that can be used in UDF are defined as integer, real number, and string. Each of these variables can be used in QML. For example, the integer type can be used as an integer value among essential parameters constituting a network model, an address value of a structure memory defined inside QML, and the like. The real number type can be used for real values among essential parameters constituting the network model, and the string type can be used for parameters with a variable number and blob data that is binary data.
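
As a non-limiting illustration, registering a framework function in a database as a UDF and executing it through a query can be sketched with SQLite's `create_function`. SQLite stands in for the database here, and the function `qml_scale` is a hypothetical placeholder that merely shows integer, real, and string arguments being passed; it is not a function of the QML framework.

```python
import sqlite3


def qml_scale(count: int, rate: float, label: str) -> str:
    """Hypothetical framework function taking integer, real, and string arguments."""
    return f"{label}:{count * rate}"


conn = sqlite3.connect(":memory:")
# Register the Python function under an SQL name with three arguments.
conn.create_function("qml_scale", 3, qml_scale)

# The registered function is now executed through an ordinary query.
result = conn.execute("SELECT qml_scale(4, 0.5, 'lr')").fetchone()[0]
```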

The QML framework may follow the NCHW (N:batch, C:channel, H:height, W:width) format, which is a channel-first data format. The layer type supports layers used in ONNX, and parameters defined in each layer may also follow the ONNX format.
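
As a non-limiting illustration of the channel-first layout, the position of element (n, c, h, w) in a flat NCHW buffer can be computed as follows. The function name `nchw_offset` is an assumption for this sketch.

```python
def nchw_offset(n: int, c: int, h: int, w: int, C: int, H: int, W: int) -> int:
    """Flat index of element (n, c, h, w) in a contiguous NCHW buffer."""
    return ((n * C + c) * H + h) * W + w


# A tensor of shape (N=2, C=3, H=4, W=5) holds 120 elements,
# so its last element sits at flat index 119.
offset = nchw_offset(1, 2, 3, 4, C=3, H=4, W=5)
```

In this layout, neighboring pixels of the same channel are adjacent in memory, and channels of the same image follow one another.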

The QML framework can be equipped with a back-propagation algorithm to learn the network model. The QML framework can be loaded with gradient calculation algorithms and optimization algorithms to update model parameters (weights, biases).
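
As a non-limiting illustration of gradient calculation followed by a parameter update, plain gradient descent on a single linear unit with a mean squared error loss can be sketched as follows. This is a generic textbook procedure chosen for the sketch, not the QML framework's own algorithm, and the learning rate and epoch count are arbitrary.

```python
from typing import List, Tuple


def train_linear(xs: List[float], ys: List[float],
                 lr: float = 0.05, epochs: int = 500) -> Tuple[float, float]:
    """Fit y = w * x + b by gradient descent on the mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            err = (w * x + b) - y        # forward pass and error
            grad_w += 2.0 * err * x / n  # d(loss)/dw
            grad_b += 2.0 * err / n      # d(loss)/db
        w -= lr * grad_w                 # optimizer step: update weight
        b -= lr * grad_b                 # optimizer step: update bias
    return w, b


# Data generated from y = 2x + 1.
w, b = train_linear([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
```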

Among methods of training the network model (architecture), the QML module 310 may support a train-from-scratch technique, in which the network model is trained from the beginning and the initial weights of each layer are determined through an initialization algorithm, and a fine-tuning technique, in which the initial weights of each layer are set by reading the weights of a previously trained model (obtained through an import function or through previous training attempts and stored in the database).
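
As a non-limiting illustration, the difference between the two techniques lies in how the initial weights of a layer are obtained, which can be sketched as follows. The uniform initialization range and the function name `init_weights` are assumptions for this sketch; the initialization algorithm actually used is not specified.

```python
import random
from typing import List, Optional


def init_weights(n: int, stored: Optional[List[float]] = None,
                 seed: int = 0) -> List[float]:
    """Return n initial weights for a layer.

    Fine-tuning: reuse previously learned weights read from storage.
    Train from scratch: draw weights from an initialization algorithm
    (here an assumed uniform range scaled by the layer size).
    """
    if stored is not None:
        if len(stored) != n:
            raise ValueError("stored weights do not match the layer size")
        return list(stored)
    rng = random.Random(seed)
    limit = (1.0 / n) ** 0.5
    return [rng.uniform(-limit, limit) for _ in range(n)]


scratch = init_weights(4)
fine_tuned = init_weights(4, stored=[0.1, -0.2, 0.3, 0.0])
```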

The QML module 310 may perform learning and inference through information received from a database (DB server 10, the control unit 100 or the storage unit 200 of the server, and the same below). Information received from the database can be obtained through data combinations received through user queries.

The conversion unit 360 may convert a specific learning model into another type of learning model. Specifically, the conversion unit 360 may convert a specific learning model into a relational data format of a database. The conversion unit 360 may convert a learning model in relational data format into a specific learning model or another learning model. For example, the conversion unit 360 converts a learning model table stored in a table type in a database into a QML framework, which is an internal framework, or vice versa. The conversion unit 360 may convert the architecture, layers, and parameters of the learning model 230 into relational data formats such as a network table, a layer table, and a parameter table, or vice versa.

Referring to FIG. 6 , the conversion unit 360 may convert the QML learning model table into a learning model suitable for the QML module 310 . The conversion unit 360 may convert the dataset table to be suitable for use in the QML module 310, if necessary. The QML module 310 (or the framework unit 300) may perform learning and/or inference using the dataset and the converted QML learning model, and output a learning result. The conversion unit 360 may convert the learning result output from the QML module 310 into a relational data format and store it as a learning result (output) table. These functions may be performed by at least one of the QML module 310 and/or the dataset management module 120 instead, or may be performed separately from each other.

The conversion unit 360 may be used for compatibility with an external framework. The conversion unit 360 may convert a pretrained model of an existing framework into another framework format such as an ONNX (Open Neural Network Exchange) model format when exporting information or data from a database to the outside.

Referring to FIG. 7 , the conversion unit 360 may convert (import) a network structure and model data defined in the ONNX model format into a network model format of a database. Conversely, the conversion unit 360 may convert (export) the network model of the database into a structured format or CSV file including the ONNX model.

The conversion unit 360 may convert the network model into structured formats such as Open Neural Network Exchange (ONNX), Neural Network Exchange Format (NNEF), and hyperparameter and learning parameter files.

The user can convert the converted ONNX model and structured format into the target framework desired by the user and use it.

The network model can be applied to other types of deep learning frameworks through a conversion operation through the conversion unit 360 . Through this, the DB server 10 can call a relational data type model stored in the database and apply it to a data set of a similar type.

The conversion unit 360 can minimize the time required for the work through this conversion work.

Referring to FIG. 8, the query-based machine learning technology according to an embodiment of the present invention converts an ONNX format model, or a pre-trained model converted to the ONNX format, into the QML format through a converter, receives a training or inference query from the terminal 20, transmits information from the database to the QML module 310, and performs training and inference in the QML module 310. When training or inference results are stored in the database, the terminal 20 can check the results stored in the database. This is described in detail below.

The terminal 20 may input (Import) a learning model or receive an output (Export) from a database (①).

When inputting or outputting a learning model, it can be converted to suit the schema structure of the database through the conversion unit 360 (②).

The database can interpret the query and take appropriate action (③).

The control unit 100 may analyze the QML type of the query input from the terminal 20 and transmit a result thereof to the QML module 310 . In more detail, it is possible to perform operations such as analyzing the language type of the input query and determining compatibility or whether similar work details are stored in the storage unit 200 .

The control unit 100 may select a program capable of delivering optimal performance for each operating system or machine learning framework (S/W), and may request training and inference from the QML module 310. For example, if a dataset requiring training consists of images, the control unit 100 may select the machine learning S/W capable of exhibiting optimal performance for image training and request training from the selected S/W.

In addition, the control unit 100 may check the resources of the server currently in use for training, and may apply a training framework according to the scale of the resources or selectively apply components when the framework is applied.

The QML module 310 may be plugged into the database and perform training and inference using information received from the database (④).

The terminal 20 may request training or inference to the database through a query (⑤).

The terminal 20 may search a table of the database to search learning-related information (⑥).

Learning model data can be stored as a QML schema in a database (⑦).

Referring to FIG. 9, in the query-based deep learning inference system according to an embodiment of the present invention, the query-based deep learning inference method can be executed in the framework unit 300, which works with the terminal 20 and the DB server 10.

The control unit 100 may receive an input of a learning query (Call Train) or an inference query (Call Inference) from the user terminal (S410).

The control unit 100 may analyze the query and transmit a dataset and a suitable learning model to the framework unit 300 .

The framework unit 300 may execute network initialization (Init Network), network configuration (Construct Network), and network update (Update Network) according to the learning query or inference query (S420).

When all layers are initialized, the framework unit 300 may execute training or inference (Test) (S430).

The framework unit 300 may acquire batch data (Get Batch Data) and store results and models (Store Result & Model) by repeating (Iteration) until the end of learning.

The framework unit 300 may execute tests, obtain test data (Get Test Data), feed forward, and store inference results (Store Result).

The framework unit 300 may provide the learning result or inference result to the user terminal 20 when learning or inference is finished (S440).
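
As a non-limiting illustration, the order of the steps S410 to S440 described above can be sketched as a simple dispatcher that records each step. The function name `run_query`, the query kinds `"train"` and `"inference"`, and the step log are assumptions for this sketch; no actual training or inference is performed.

```python
from typing import List, Sequence


def run_query(kind: str, batches: Sequence[object]) -> List[str]:
    """Return the ordered list of steps executed for a training or inference query."""
    steps = ["Init Network", "Construct Network", "Update Network"]  # S420
    if kind == "train":                                              # S430
        for _ in batches:  # iterate until the end of learning
            steps.append("Get Batch Data")
            steps.append("Store Result & Model")
    elif kind == "inference":
        steps += ["Get Test Data", "Feed Forward", "Store Result"]
    else:
        raise ValueError(f"unknown query kind: {kind}")
    steps.append("Return Result")                                    # S440
    return steps


train_steps = run_query("train", batches=[1, 2])
infer_steps = run_query("inference", batches=[])
```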

Meanwhile, the query-based deep learning inference system 1 according to an embodiment of the present invention may manage clients, members, datasets, networks, learning models, and learning execution as follows.

[Client Management]

The query-based deep learning inference system 1 according to an embodiment of the present invention may provide the user terminal 20 with functions to manage datasets and the machine learning process and to check results.

[Member Management]

The query-based deep learning inference system 1 may grant authority to create and modify data in the database 110 and network models through member management, and may leave a history of changes.

[Dataset Management]

The query-based deep learning inference system 1 can create a new table to manage datasets and provide functions for searching, modifying, and uploading data. When you create a new dataset, you can automatically create a new table and upload the data. You can view data by accessing a table in the database or display the result of searching the database data through a query written by the user. Data can be modified according to authority. Data upload may be performed by receiving numerical data from the user or by reading one or more files. A function of labeling training data may be provided.

[Network Management]

The query-based deep learning inference system 1 may provide functions for managing network models as follows. New network models can be created by adding supported layers and adjusting layer parameters. A list of previously created network models can be queried. A new network model can be created by adding a new layer to an existing network model. A function to visualize and show the network model can be provided.

[Manage Learning Model]

The query-based deep learning inference system 1 may provide functions for managing learning as follows. You can create or modify a learning model by adjusting the network model, dataset, and learning parameters. The trained network model can be output through the converter function. You can check the resources of the server currently in use.

[Manage Learning Run]

The query-based deep learning inference system 1 may provide functions for performing learning and inference and checking results as follows. You can check server resources. The user may be notified whether learning and inference can be performed. You can search the list of currently running or waiting learning plans. You can create a learning plan by setting the registered network model, dataset, and learning parameters. You can check the learning parameters of the currently running or waiting learning plan. You can check the intermediate and final results of the currently running learning plan. You can stop the currently running learning plan. You can start a waiting learning plan. An inference plan can be created by setting the registered network model and dataset. You can check the results of an executed inference plan.

As described above, according to the present invention, the deep learning framework is connected to the information database in the form of a plug-in, so that necessary information can be provided without difficulty even to a user without expert knowledge of deep learning. It is thus possible to realize a query-based deep learning inference system that learns the data stored in the information database using a deep learning method and infers data corresponding to a user's query.

FIG. 10 is a schematic configuration diagram of a database-linked deep learning distribution system according to another embodiment of the present invention. FIG. 11 is a block diagram of a main server and distributed servers according to FIG. 10. FIG. 12 shows a dataset of a main server and a dataset for training of a distributed server. FIG. 13 is a flowchart of a training method of the system of FIG. 10. FIG. 14 is a flowchart of an inference method of the system of FIG. 10. FIGS. 15 to 17 are signal flow diagrams according to different embodiments of the asynchronous distributed server of FIG. 13. FIGS. 18 and 19 are signal flow diagrams according to different embodiments of the synchronous distributed server of FIG. 13. FIG. 20 is a signal flow diagram according to the distributed inference of FIG. 14. FIG. 21 schematically illustrates the learning model. FIG. 22 shows part of the intermediate result table according to FIG. 20. FIG. 23 shows part of the network table. Reference is also made to FIGS. 1 to 9.

Hereinafter, for convenience of description, the learning model will be defined as follows. A learning model (learning network model) can be implemented by an architecture (model architecture) and learning parameters assigned to it. An architecture can be built by an architectural structure and hyperparameters assigned to it. The learning model and learning model table, architecture and architecture table, architecture structure and network table, hyperparameter and hyperparameter table, and learning parameter and learning parameter table may respectively correspond to each other. And, the learning model table may include an architecture table and a learning parameter table. The architecture table may include a network table and a hyperparameter table. The architectural structure may mean the number of layers, the number of units, the type of layers, and how units are connected.

A unit may also be referred to as a node. Values to be entered into the first nodes may be input dataset tables. Values to be entered into the last nodes may be output values. Input values and output values input to nodes of the middle layer (hidden layer) may be managed and stored by the dataset management module 120 or a separate module.

The framework unit or internal framework (hereinafter referred to as 'framework unit') may build a model architecture based on the architecture table (network table and hyperparameter table) of the selected learning model table, and may create a learning model corresponding to the selected learning model table by assigning learning parameters, based on the learning parameter table, to the model architecture. The framework unit may perform deep learning training or inference by using the generated learning model and inputting a dataset table for training or inference. The learning model table and the learning model may be described as being interlocked with each other, or as having a correspondence relationship, a conversion relationship, and the like, but are not limited to these terms.

Referring to FIG. 10, a database-linked deep learning distribution system (hereinafter referred to as 'training distribution system') according to an embodiment of the present invention may include a query-based deep learning framework application database server (hereinafter referred to as 'main server') 40 and a plurality of distributed servers 41 to 43.

The main server 40 and the plurality of distributed servers 41 to 43 may have at least some of the functions of the DB server 10 of FIGS. 1 to 9. For those components of the main server 40 and the plurality of distributed servers 41 to 43 that correspond to components of the DB server 10, reference is made to the above description.

The main server 40 and the plurality of distributed servers 41 to 43 are connected through a network and can communicate with each other.

The main server 40 manages the plurality of distributed servers 41 to 43 and can have deep learning training performed in a distributed manner.

Referring to FIG. 11 (a), the main server 40 may include a control unit 100, a storage unit 200, and an input/output unit 370. The main server 40 may further include a conversion unit 360 . The main server 40 may further include a framework unit 300 .

Referring to FIG. 11(b), the distributed servers 41 to 43 may include a control unit 100-N, a storage unit 200-N, a framework unit 300-N, and an input/output unit 370-N. The distributed servers 41 to 43 may further include a conversion unit 360-N. N is a natural number and is used to distinguish a specific distributed server among the plurality of distributed servers 41 to 43 from the other distributed servers.

For each component of the main server 40 and distributed servers 41 to 43, the descriptions in FIGS. 1 to 9 are referred to.

The main server 40 implements the functions of the database server 10 of FIGS. 1 to 9 and may additionally implement a distributed function. For example, the main server 40 functions to manage the entire distributed system and may additionally perform a distributed function. However, for convenience of description, the distribution function of the main server 40 is treated as being performed by any one distribution server.

The main framework unit 50 of the main server shown in FIG. 10 and the first to third framework units 51 to 53 of the respective distribution servers each correspond to the above-described framework unit 300; the separate reference numerals are used only to distinguish them.

Any one of the plurality of distributed servers may be implemented as a plurality of computer systems.

The main server 40 may set the plurality of distributed servers 41 to 43 so that each of the plurality of distributed servers 41 to 43 performs deep learning training in the same environment. The main server 40 may make at least a part of a dataset, a learning model, and a framework identical to the plurality of distributed servers 41 to 43 .

The first to third distribution servers 41 to 43 may include first to third framework units 51 to 53, respectively. The first to third framework units 51 to 53 have frameworks (QML modules) to be trained, and can perform machine learning with the same learning model. That the learning models of the first to third distribution servers 41 to 43 are the same may mean that at least their architectures are the same. The learning parameters p1, p2, and p3 of the respective distribution servers 41 to 43 may be different.

Each of the distribution servers 41 to 43 may initialize its learning parameters independently and thus have different initial learning parameters. Alternatively, the main server 40 may cause the plurality of distributed servers 41 to 43 to have the same initial learning parameters. That is, the initial value of the learning parameter may be determined by the main server 40 or independently in each of the plurality of distributed servers 41 to 43. How the initial value is determined may be optional, or may depend on various factors such as the type and number of datasets, the purpose of deep learning, and the like.

The first to third distribution servers 41 to 43 may have the same dataset as the dataset provided in the main server 40. This may be achieved by transmitting the dataset from the main server 40 to the plurality of distributed servers 41 to 43, or by synchronizing specific data of the main server 40 and the plurality of distributed servers 41 to 43 using a synchronization method such as mirroring. This data movement (spreading) method may apply to other data (learning parameters, etc.) as well as to the dataset.

The dataset of each of the plurality of distributed servers 41 to 43 may be converted into a learning dataset (DS) suitable for learning. This can be more transmission-efficient than having the main server 40 prepare a separate learning dataset (DS) for each of the plurality of distributed servers 41 to 43 and transmit it, because the same dataset can be transmitted by broadcasting.

After receiving the same dataset as the dataset of the main server shown in FIG. 12, each distributed server may divide it into mini-batches (b1 to b10) and convert it into a learning dataset (DS) as shown in (b-1) to (b-3) of FIG. 12. The batch size may be received from the main server 40.

The framework unit 300 may further include an integration unit 320 in addition to the above-described QML module 310 .

The integration unit 320 may integrate the learning parameters derived during the distributed learning process into one learning parameter. A function used by the integration unit 320 for integration may be various. For example, the integration function may multiply each of a plurality of derived learning parameters by a weight and then take an average of these as an output.
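As one possible reading of the example integration function above, the following minimal sketch multiplies each derived learning parameter by a weight and returns the weighted average. The function name `integrate` and the flat-list representation of a learning parameter are assumptions made for illustration, not part of the specification.

```python
from typing import List, Sequence


def integrate(params: Sequence[Sequence[float]],
              weights: Sequence[float]) -> List[float]:
    """Weighted average of several learning-parameter vectors."""
    if not params or len(params) != len(weights):
        raise ValueError("need one weight per parameter vector")
    total = sum(weights)
    size = len(params[0])
    # Element-wise: sum(w_j * p_j[i]) / sum(w_j)
    return [
        sum(w * p[i] for p, w in zip(params, weights)) / total
        for i in range(size)
    ]


# Two parameter vectors, the first weighted three times as heavily.
merged = integrate([[1.0, 2.0], [3.0, 4.0]], [3.0, 1.0])
print(merged)  # [1.5, 2.5]
```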

When applying the learning parameter derived in the corresponding distributed server and the learning parameters shared from the other distributed servers as arguments of the integration function F, the integration unit 320 may follow one of various argument policies (or 'integration policies'). Where a policy is optional, it may be selected by the user's settings.

Examples of integration policies include:

The integration unit 320 may use the latest learning parameter as an argument of the integration function (F). That is, the learning parameter once used cannot be used as an argument of the integration function (F) unless it is newly shared.

When the integration unit 320 does not receive learning parameters from other distributed servers, it may execute the integration function F without other learning parameters.

The integration unit 320 may not execute the integration function F when its own learning parameter would be the only argument of the integration function F. In this case, the learning parameter derived in the batch learning of the current step may be used as the learning parameter in the batch learning of the next step.

The integration unit 320 may execute the integration function (F) only when all learning parameters corresponding to the number of arguments are up to date, or may execute the integration function (F) even when only at least one of the other learning parameters is up to date. Assuming that there are three arguments, in the former case the three learning parameters are used as arguments when all three are up to date, and otherwise only the unit's own learning parameter is used. In the latter case, if only two learning parameters are up to date, only those two learning parameters are used as arguments. The user can set whether the integration function (F) is executed only when all arguments are up to date or also when only some of them are.
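The two argument policies just described can be sketched as follows. This is only an illustration under assumed names: `select_arguments`, the `require_all` flag, and the use of `None` to mark "nothing new shared" are hypothetical and not taken from the specification.

```python
from typing import Dict, List, Optional


def select_arguments(own: List[float],
                     others: Dict[str, Optional[List[float]]],
                     require_all: bool) -> List[List[float]]:
    """Choose the arguments of the integration function F.

    `others` maps a server name to its latest, not yet used learning
    parameter, or to None when that server has shared nothing new.
    """
    fresh = [p for p in others.values() if p is not None]
    if require_all and len(fresh) < len(others):
        # Not every argument is up to date: fall back to own parameter only.
        return [own]
    # Use the own parameter plus whichever other parameters are up to date.
    return [own] + fresh


own = [0.5]
others = {"server2": [0.7], "server3": None}
strict = select_arguments(own, others, require_all=True)    # [[0.5]]
relaxed = select_arguments(own, others, require_all=False)  # [[0.5], [0.7]]
```

When the returned list contains only the unit's own parameter, the integration function need not be executed and the parameter can be carried over to the next batch learning as it is.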

Hereinafter, deep learning in a distributed environment will be described in detail.

The first to third framework units 51 to 53 may create a learning model architecture with the same architecture structure and hyperparameters, and prepare for machine learning using each learning dataset DS. The first to third framework units 51 to 53 may respectively set initial values of the learning parameters p1, p2, and p3 of the learning model, such as weights and biases.

When preparation for deep learning training is completed, each of the plurality of framework units 51 to 53 may perform deep learning training. Each of the plurality of framework units 51 to 53 may repeat deep learning training using each training dataset DS. Each of the plurality of framework units 51 to 53 may update (derive) a parameter, in particular, a learning parameter after training for each mini-batch (b1 to b10). Throughout this specification, learning or training of each mini-batch will be referred to as batch learning or batch training.

For example, the first framework unit 51 may train using the initial learning parameter p1-1 and the first mini-batch b1 to derive the updated (converted) learning parameter p1-1'. The derived learning parameter p1-1' may be transmitted to the second and third distributed servers 42 and 43, or may be spread by synchronization.

Learning parameters derived in each framework unit may be spread (or 'shared') in various ways, which may vary by policy or user setting. Examples include: an immediate sharing policy, in which the latest learning parameter is spread to the other framework units whenever a batch learning is completed in a framework unit; a sharing policy by time period, in which the latest learning parameter is spread to the other framework units after a certain period of time has elapsed; a sharing policy by learning period, in which the latest learning parameter is spread when a certain number of batch learnings have been completed; and other rule-based policies, in which the learning parameter is spread according to a rule set by the main server 40 or by a random instruction.
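The sharing policies listed above can be expressed as a simple decision made each time a batch learning finishes. The sketch below is illustrative only: the policy names, the function `should_share`, and the default periods are assumptions, and rule-based policies set by the main server are omitted.

```python
def should_share(policy: str,
                 batches_done: int,
                 seconds_since_share: float,
                 batch_period: int = 2,
                 time_period: float = 10.0) -> bool:
    """Decide, after a batch learning ends, whether to spread parameters."""
    if policy == "immediate":
        # Spread after every completed batch learning.
        return True
    if policy == "time_period":
        # Spread once the configured time has elapsed since the last spread.
        return seconds_since_share >= time_period
    if policy == "learning_period":
        # Spread after every `batch_period` completed batch learnings.
        return batches_done % batch_period == 0
    raise ValueError(f"unknown sharing policy: {policy}")


print(should_share("immediate", batches_done=1, seconds_since_share=0.0))        # True
print(should_share("learning_period", batches_done=3, seconds_since_share=0.0))  # False
print(should_share("learning_period", batches_done=4, seconds_since_share=0.0))  # True
```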

The integration unit 320 of the first framework unit 51 may integrate the first learning parameter p1-1' derived in the first framework unit 51 together with the second and third learning parameters derived in the second and third framework units 52 and 53 into one learning parameter (p1-2). It is desirable that the integration unit 320 of the first framework unit 51 apply a weight to the first learning parameter p1-1' calculated in the first framework unit 51 so that the first learning parameter p1-1' has more influence on the output of the integration function.

The first framework unit 51 may update the learning parameters of the learning model to the integrated learning parameter (p1-2), and then perform machine learning using the second mini-batch (b2) and the integrated learning parameter (p1-2). When learning for one epoch, that is, for all mini-batches (the training dataset (DS)), is completed, the first framework unit 51 may repeat learning until a predetermined number of epochs is reached or a condition according to a preset policy is satisfied. During one epoch, learning parameter updates (iterations) may be performed as many times as the total data size divided by the batch size. Referring to (b-1) to (b-3) of FIG. 12, since the data size is 80 and the batch size is 8, 10 iterations occur during one epoch.
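The iteration count in the FIG. 12 example can be checked with a one-line calculation. The helper name is hypothetical, and rounding up when the data size is not a multiple of the batch size is an assumption; the text only covers the evenly divisible case.

```python
import math


def iterations_per_epoch(data_size: int, batch_size: int) -> int:
    """Number of learning-parameter updates in one epoch."""
    # Rounding up for a partial last batch is an assumption, not from the text.
    return math.ceil(data_size / batch_size)


print(iterations_per_epoch(80, 8))  # 10
```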

The first framework unit 51 may shuffle the training data set DS when one epoch ends.

The first framework unit 51 may tune the architecture structure or hyperparameters when the sub-process of deep learning training is finished. A training dataset may be divided into a training dataset, a validation dataset, and a test dataset. As an example of a sub-process of deep learning training, there may be a learning process (training, verification, testing) of the above classified dataset.

Hyperparameters tuned in the first framework unit 51 may be spread to other distributed servers. Other distributed servers can rebuild the learning model architecture with tuned hyperparameters. It is desirable to perform hyperparameter tuning only on one distributed server.

Before the next sub-process of deep learning training is newly started, each learning parameter may be readjusted, such as initialization, or may maintain the previous value.

Referring to FIG. 13, the main server 40 may receive a deep learning training query for a specific function from a user (S610). The main server 40 may receive the query directly through the input/output unit 370 or through the terminal 20.

The main server 40 may select a learning model table suitable for the learning query (S620). The main server 40 may analyze the query and select an appropriate learning model table (hereinafter referred to as 'learning model table (Tt)') from a plurality of learning model tables. The learning model table may be selected by the learning model management module 130 of the main server 40 according to the above-described model selection policy.

The learning model table Tt may be one obtained by the conversion unit 360 converting a learning model that was generated in an external framework and imported.

The main server 40 may have a dataset table for learning. The main server 40 may receive data of the training dataset through a query or from another device.

The main server 40 may cause the plurality of distributed servers 41 to 43 to perform an initialization operation (S630).

The initialization operation may refer to a series of processes of setting up a distributed environment suitable for distributing deep learning training and preparing the plurality of distributed servers 41 to 43 for distributed training.

The initialization operation may include selecting appropriate distributed servers from among the plurality of available distributed servers 41 to 43. The initialization operation may also include connecting the first to third distribution servers 41 to 43 over the network and spreading data to the first to third distribution servers 41 to 43 through synchronous transfer, asynchronous transfer, and/or mirroring.

The distributed environment may have a batch size of the learning dataset DS. The main server 40 may determine an appropriate batch size based on the number of distributed servers, the specifications of the distributed servers, the training dataset (DS), and/or the query.

The distributed environment may further have an appropriate number of epochs. A distributed environment may further include a learning query. The learning query provided in the distributed environment may be an analyzed content, for example, a query function.

The main server 40 may spread the distributed environment, the learning model table (Tt), and/or the training dataset (DS) table to the first to third distribution servers 41 to 43. The distributed environment may be a relational data structure. The distributed environment may belong to the learning model table.

After the data has been spread, the first to third distribution servers 41 to 43 may have the same distributed environment, learning model table (Tt), and learning dataset (DS) table.

Each of the first to third distribution servers 41 to 43 may change their respective training dataset (DS) tables according to learning. For example, the first distribution server 41 may randomly change the order of the data of the learning dataset DS and divide the data according to the batch size. Shuffled and partitioned datasets can be stored as batch dataset tables. A dataset divided into each batch size of the batch data set table may be referred to as 'batch data' or 'mini-batch'.
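A minimal sketch of the shuffle-and-partition step described above, using the sizes from FIG. 12 (80 rows, batch size 8). The function name, the integer stand-ins for dataset rows, and the per-server seed are assumptions made for illustration.

```python
import random
from typing import List, Sequence


def make_mini_batches(dataset: Sequence[int],
                      batch_size: int,
                      seed: int) -> List[List[int]]:
    """Randomly reorder the dataset and split it by batch size."""
    rows = list(dataset)
    # A per-server seed (hypothetical) gives each server its own ordering.
    random.Random(seed).shuffle(rows)
    return [rows[i:i + batch_size] for i in range(0, len(rows), batch_size)]


# 80 rows with batch size 8 yield the ten mini-batches b1 to b10.
batches = make_mini_batches(range(80), batch_size=8, seed=41)
print(len(batches), len(batches[0]))  # 10 8
```

The resulting list of lists plays the role of the batch dataset table, with each inner list being one piece of 'batch data' or 'mini-batch'.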

It is preferable that the first to third distributed servers 41 to 43 set the initial values of the respective learning parameter tables to be different from each other. This is because deep learning training can be performed with various learning parameters. To this end, the first to third distributed servers 41 to 43 may randomly set initial values of learning parameters. For initialization of the learning parameters, various initialization techniques may be used.
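Random initialization that differs between servers could look like the sketch below. The uniform range, the seed values, and the function name are assumptions; the specification leaves the initialization technique open.

```python
import random
from typing import List, Optional


def initial_parameters(size: int, seed: Optional[int] = None) -> List[float]:
    """Draw initial learning parameters uniformly from a small range."""
    rng = random.Random(seed)
    return [rng.uniform(-0.1, 0.1) for _ in range(size)]


# Different seeds (hypothetical, one per distributed server) give
# different starting points p1, p2, p3 for the same architecture.
p1 = initial_parameters(4, seed=41)
p2 = initial_parameters(4, seed=42)
p3 = initial_parameters(4, seed=43)
```

If the main server instead dictates a common starting point, the same seed (or the same parameter table) can simply be spread to every server.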

Each of the first to third distribution servers 41 to 43 that has completed the initialization operation may build a model architecture suited to the first to third framework units 51 to 53 installed as plug-ins, using the architecture table belonging to the selected learning model table (S640).

The first to third framework units 51 to 53 may prepare to train a learning model by allocating initial learning parameters to each constructed model architecture (S650).

The first to third framework units 51 to 53 (e.g., each QML module 310) may perform training using a mini-batch for learning and the model architecture to which the learning parameters are assigned (S660).

For integration of each learning parameter updated in each distributed server, there may be an asynchronous learning method in which batch learning is performed independently in each distributed server and a synchronous learning method in which batch learning is periodically started.

Depending on computing resources or specifications, the time required for each batch learning of distributed servers is inevitably different.

The asynchronous learning method enables continuous batch learning without a break regardless of the timing of batch learning in the other distributed servers, so that computing resources can be used efficiently. In addition, by using a policy in which the other distributed servers are also terminated once one of the distributed servers finishes its machine learning, the total learning time can be further reduced compared to the synchronous method.

Since the synchronous learning method shares updated final learning parameters after the same number of batch learning in each distributed server, the degree or efficiency of distributed learning may be better than that of the asynchronous learning method.

Users can select one of synchronous and asynchronous learning methods according to the type or target of machine learning. Hereinafter, synchronous and asynchronous learning methods will be described in detail.

Referring to FIG. 15, an embodiment of an asynchronous learning method will be described. FIG. 15 is an embodiment according to the immediate sharing policy, among the above-mentioned 'spreading policies', in which the latest learning parameter is spread to the other framework units whenever a batch learning is completed in a framework unit. The integration policy in this embodiment is that other learning parameters are used as arguments when at least one of them is up to date.

Each of the first to third framework units 51 to 53 may acquire each batch data (mini-batch b1 to b10) (Get Batch Data) and perform iteration learning until the learning ends. Each repeated learning is referred to as 'batch learning' (batch TR).

The first framework unit 51 may perform 1.1 batch learning in the model architecture to which the 1.1 parameter p1.1 is assigned. When the 1.1 batch learning is completed, the first framework unit 51 may derive the learned 1.1′ parameter p1.1′.

The first framework unit 51 may spread the learned 1.1' parameter (p1.1') to the second and third distributed servers 42 and 43 (S810). The parameter may be transmitted directly from the first distribution server 41 to the remaining distribution servers 42 and 43, or may be synchronized or mirrored through the main server 40. For efficiency and consistency of data management, spreading through synchronization or mirroring is desirable. In this embodiment, the learning parameters of the first distribution server 41 are shown as being spread to the second and third distribution servers 42 and 43 after its learning is finished, but the embodiment is not limited thereto. For example, after learning is finished in the third framework unit 53, which takes the most time for learning, each derived learning parameter (p1.1', p2.1', p3.1') may be spread to the other distribution servers 41, 42, and 43.

The integration unit 320 of the first framework unit 51 may integrate the latest learning parameters derived after batch learning in the other distributed servers 42 and 43 (other learning parameters) and the learning parameter derived in the first framework unit 51, through an appropriate integration function (F), into the learning parameter to be used in the next batch learning.

Among the other learning parameters, those already used by the integration unit 320 before the most recently completed batch learning may be excluded. That is, only the latest learning parameters are used.

The first framework unit 51 may update the integrated learning parameters to learning parameters to be applied to the next batch learning and perform the next batch learning.

For example, when the 1.1 batch learning (TR) is completed in the first framework unit 51, no learning parameter has yet been spread from the other distributed servers 42 and 43, so integration is not performed in the first framework unit 51. The integration unit 320 may designate the 1.1' parameter (p1.1') as the 1.2 parameter (p1.2) used for the next batch learning, the 1.2 batch learning.

The first framework unit 51 may calculate the 1.3 parameter (p1.3) by integrating the 2.1' and 3.1' parameters spread from the second and third distribution servers 42 and 43, with emphasis on the 1.2' parameter (p1.2') derived after the 1.2 batch learning is completed.

It is preferable that the function F used in the integration process place emphasis on the learning parameter derived in the corresponding framework unit and integrate the other parameters in an auxiliary role. For example, the integration unit 320 of the first framework unit 51 may multiply its own parameter p1.2' by a high weight and the other parameters p2.1' and p3.1' by low weights to derive the 1.3 parameter (p1.3). The sum of the weights is preferably 1. The size of each weight may vary depending on the number of arguments (learning parameters) of the integration function (F) or the progress of learning.
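The own-weighted integration described above can be sketched as a convex combination whose weights sum to 1. The function name, the default own weight of 0.5, and the equal split of the remaining weight among the other parameters are assumptions chosen for illustration.

```python
from typing import List, Sequence


def integrate_own_weighted(own: Sequence[float],
                           others: Sequence[Sequence[float]],
                           own_weight: float = 0.5) -> List[float]:
    """Combine own and other parameters; all weights sum to 1."""
    if not others:
        # Nothing received: carry the own parameter over unchanged.
        return list(own)
    # The remaining weight is shared equally by the other parameters.
    other_weight = (1.0 - own_weight) / len(others)
    return [
        own_weight * own[i] + other_weight * sum(o[i] for o in others)
        for i in range(len(own))
    ]


# Own parameter weighted 0.5, the two others 0.25 each.
print(integrate_own_weighted([1.0], [[2.0], [3.0]]))  # [1.75]
print(integrate_own_weighted([1.0], []))              # [1.0]
```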

The first framework unit 51 may derive the 1.3' parameter (p1.3') after the 1.3 batch learning using the 1.3 parameter (p1.3). After the 1.3 batch learning, the first framework unit 51 holds the 1.3', 2.2', and 3.1' parameters (p1.3', p2.2', p3.1') as the most recent parameters. Of these, the 3.1' parameter (p3.1') was already used after completion of the 1.2 batch learning and is therefore excluded. Accordingly, the first framework unit 51 may calculate the 1.4 parameter (p1.4) by integrating the 1.3' and 2.2' parameters (p1.3' and p2.2').

An example of which of the spread learning parameters counts as the latest can be seen in the integration stage after the 2.4 batch learning of the second distribution server 42. After the 2.3 batch learning and before the end of the 2.4 batch learning, the 1.3' learning parameter (p1.3') and the 1.4' learning parameter (p1.4') may both be spread from the first distributed server 41 to the second framework unit 52. Since the 1.4' learning parameter (p1.4') is the latest, the integration unit 320 of the second framework unit 52 may use the 1.4' learning parameter (p1.4') for integration instead of the 1.3' learning parameter (p1.3').
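The "latest only, used once" rule illustrated above can be modeled with a small inbox that keeps only the newest unused parameter per peer. The class `ParameterInbox` and its methods are hypothetical names introduced for this sketch.

```python
from typing import Dict, List


class ParameterInbox:
    """Holds, per peer server, only the newest not-yet-used parameter."""

    def __init__(self) -> None:
        self._pending: Dict[str, List[float]] = {}
        self._newest_step: Dict[str, int] = {}

    def receive(self, server: str, step: int, params: List[float]) -> None:
        # A newer step replaces an older pending one (1.4' replaces 1.3').
        if step > self._newest_step.get(server, 0):
            self._newest_step[server] = step
            self._pending[server] = params

    def take_fresh(self) -> Dict[str, List[float]]:
        # Hand out pending parameters once; they are not reused afterwards.
        fresh, self._pending = self._pending, {}
        return fresh


inbox = ParameterInbox()
inbox.receive("server1", 3, [0.3])   # 1.3' arrives
inbox.receive("server1", 4, [0.4])   # 1.4' arrives before integration
first = inbox.take_fresh()           # {'server1': [0.4]}
second = inbox.take_fresh()          # {} because nothing new was shared
```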

According to this embodiment, the first to third distributed servers 41 to 43 may perform batch learning and integration of learning parameters asynchronously. That is, the first distribution server 41 may proceed with the next batch learning regardless of whether batch learning in the other distribution servers 42 and 43 has ended. Accordingly, computing resources of the plurality of distributed servers 41 to 43 can be used efficiently. Although the time required for each batch learning inevitably differs owing to different server specifications or operating environments, the present asynchronous learning method does not need to wait for batch learning in the other distributed servers to end.

The final trained learning parameter (p_last) may be calculated by the integration unit 320 of the first framework unit 51. The integration unit 320 of the first framework unit 51 may calculate the trained learning parameter (p_last) by integrating (F') at least one of the 1.l', 2.m', and 3.n' learning parameters (p1.l', p2.m', p3.n') (l, m, and n are natural numbers). The function (F') used in this final integration may be different from the integration function (F) used during training.

The final integration function (F') preferably does not weight its arguments differently. Even if the final integration function (F') does weight its arguments differently, the difference is preferably smaller than in the integration function (F). In that case, it is preferable to assign weights from high to low in the order in which learning was completed.
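One way to picture the final integration function F' is the sketch below, which supports both equal weights and weights that decrease in completion order. The linear weights n, n-1, ..., 1 are an assumed scheme used only to make the example concrete; the specification does not fix how the weights decrease.

```python
from typing import List, Sequence


def final_integrate(params_by_completion: Sequence[Sequence[float]],
                    equal_weights: bool = True) -> List[float]:
    """Combine final parameters listed in the order training finished."""
    n = len(params_by_completion)
    if n == 0:
        raise ValueError("at least one parameter vector is required")
    # Equal weights, or linearly decreasing weights n, n-1, ..., 1 (assumed).
    raw = [1.0] * n if equal_weights else [float(n - i) for i in range(n)]
    total = sum(raw)
    size = len(params_by_completion[0])
    return [
        sum(w * p[k] for w, p in zip(raw, params_by_completion)) / total
        for k in range(size)
    ]


p_equal = final_integrate([[1.0], [3.0]])                        # [2.0]
p_ranked = final_integrate([[1.0], [3.0]], equal_weights=False)  # (2*1 + 1*3) / 3
```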

As shown in this embodiment, it is preferable to derive (integrate) the final learning parameter (p_last) in the first distribution server 41, where batch learning ends earliest. That is, the value of 'l' is larger than the values of 'm' and 'n'. When the batch learning of the first framework unit 51 ends (all epochs are finished), the batch learning of the second and third framework units 52 and 53 may end regardless of the remaining iterations. In this case, the time can be shortened compared with the synchronous learning method in the same environment.

Another embodiment of the asynchronous learning method will be described with reference to FIG. 16. FIG. 16 is an embodiment according to the sharing policy by time period, among the above-mentioned 'spreading policies', in which the most recently updated learning parameter is spread to the other framework units after a certain period of time has elapsed. The integration policy in this embodiment is that the learning parameters are used as arguments only if all of them are up to date. For the remainder, reference is made to the description of FIG. 15.

Each of the first to third framework units 51 to 53 may obtain each batch data (mini-batch b1 to b10) and repeat batch learning (batch TR) until learning ends.

The first framework unit 51 may perform 1.1 batch learning in the model architecture to which the 1.1 parameter p1.1 is assigned. When the 1.1 batch learning is completed, the first framework unit 51 may derive the learned 1.1' parameter p1.1'. The derived 1.1' parameter (p1.1') may be used as it is for the 1.2 batch learning. That is, the integration unit 320 of the first framework unit 51 may not execute the integration function (F), and the 1.2 parameter p1.2 has the same value as the 1.1' parameter p1.1'.

In this way, each of the distributed servers 41 to 43 may perform batch learning independently until the learning parameters are spread.

All of the framework units 51 to 53 may spread their latest learning parameters at a specific period or at a specific time instructed by the main server 40 (S820). In this embodiment, the first framework unit 51 may spread the 1.3' learning parameter p1.3', the second framework unit 52 the 2.2' learning parameter p2.2', and the third framework unit 53 the 3.1' learning parameter p3.1' to the other distributed servers.

After the learning parameters have been spread, each framework unit may integrate the learning parameters before the next batch learning.

For example, the first framework unit 51 may calculate the 1.4 parameter (p1.4) by integrating the 2.2' and 3.1' parameters spread from the second and third distribution servers 42 and 43, with emphasis on the 1.3' parameter (p1.3') derived after the 1.3 batch learning is completed. The second framework unit 52 may calculate the 2.4 parameter (p2.4) by integrating the 3.1' and 1.3' parameters spread from the third and first distributed servers 43 and 41, with emphasis on the 2.3' parameter (p2.3') derived after the 2.3 batch learning is completed. The third framework unit 53 may calculate the 3.3 parameter (p3.3) by integrating the 1.3' and 2.2' parameters spread from the first and second distribution servers 41 and 42, with emphasis on the 3.2' parameter (p3.2') derived after the 3.2 batch learning is completed.

The final trained learning parameter (p_last) may be calculated by the integration unit 320 of the first framework unit 51. The integration unit 320 of the first framework unit 51 may calculate the trained learning parameter (p_last) by integrating (F') the 1.l', 2.m', and 3.n' learning parameters (p1.l', p2.m', p3.n') (l, m, and n are natural numbers). The function (F') used in this final integration may be different from the integration function (F) used during training.

Another embodiment of the asynchronous learning method will be described with reference to FIG. 17. FIG. 17 is an embodiment according to the sharing policy by learning period, among the above-mentioned 'spreading policies', in which the latest learning parameter is spread when a certain number of batch learnings have been completed. The integration policy in this embodiment is that the learning parameters are used as arguments only if all of them are up to date. For the remainder, reference is made to the description of FIG. 15.

Each of the framework units 51 to 53 may spread the latest learning parameter in a specific cycle of the number of times of batch learning (S830). When batch learning marked in bold in the drawing is completed, the latest learning parameter may be diffused.

In this embodiment, it is assumed that the learning parameters are spread after every two batch learnings. The first framework unit 51 may spread the 1.2' learning parameter (p1.2'), the second framework unit 52 the 2.2' learning parameter (p2.2'), and the third framework unit 53 the 3.2' learning parameter (p3.2') to the other distributed servers.

After spreading the learning parameters, each framework unit may integrate the learning parameters when all the latest learning parameters are received from other distributed servers.

For example, the first framework unit 51 may calculate the 1.4 parameter (p1.4) by integrating its own learning parameter with the 2.2' and 3.2' parameters spread from the second and third distribution servers 42 and 43. The first framework unit 51 does not execute the integration function (F) after the 1.2 batch learning, because no other learning parameters have been received, nor after the 1.3 batch learning, because it holds only one other latest learning parameter.

The second framework unit 52 may calculate the 2.4 parameter (p2.4) by integrating the 3.2' and 1.2' parameters spread from the third and first distributed servers 43 and 41, with emphasis on the 2.3' parameter (p2.3') derived after the 2.3 batch learning is completed. The third framework unit 53 may calculate the 3.3 parameter (p3.3) by integrating the 1.2' and 2.2' parameters spread from the first and second distribution servers 41 and 42, with emphasis on the 3.2' parameter (p3.2') derived after the 3.2 batch learning is completed.

An embodiment of a synchronous learning method will be described with reference to FIG. 18. FIG. 18 is an embodiment according to the immediate sharing policy, among the above-mentioned 'spreading policies', in which the latest learning parameter is spread to the other framework units whenever a batch learning is completed in a framework unit. The integration policy in this embodiment is that the learning parameters are used as arguments only if all of them are up to date. For the remainder, reference is made to the description of FIG. 15.

Each of the first to third framework units 51 to 53 may perform batch learning (batch TR) for each mini-batch b1 to b10 until the training ends.

The first framework unit 51 may spread the learned parameter 1.1' (p1.1') to the second and third distribution servers 42 and 43 (S840).

The first framework unit 51 may determine whether the learning parameters (other learning parameters) derived after the batch learning of the same step (1.1 batch learning) in the other distributed servers 42 and 43 have been updated in the first distributed server 41.

When all the other learning parameters have been updated in the first distribution server 41, the integration unit 320 of the first framework unit 51 may integrate all the learning parameters (p1.1', p2.1', p3.1') derived after the 1.1 batch learning into the learning parameter (p1.2) to be used in the next batch learning, using an appropriate integration function (F).

It is preferable that the integration function (F) treats the learning parameter derived in the corresponding framework unit as the main term and integrates the other parameters as auxiliary terms. For example, the integration unit 320 of the first framework unit 51 may multiply the 1.1' parameter (p1.1') by a high weight and the other parameters (p2.1' and p3.1') by low weights, and combine the weighted terms to derive the 1.2 parameter (p1.2). In this case, the sum of the weights is preferably 1. The weights may be determined by the degree of learning progress or by other factors, and each weight may have a different magnitude.
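The weighting described above can be sketched as follows. This is only an illustration under stated assumptions: the learning parameters are modelled as flat lists of floats, the weighted terms are added together, the low weight is shared equally among the other parameters, and the name `integrate` and the default weight of 0.6 are hypothetical and do not appear in the specification.

```python
from typing import List, Sequence


def integrate(own: Sequence[float],
              others: Sequence[Sequence[float]],
              own_weight: float = 0.6) -> List[float]:
    """Hypothetical integration function F: weighted sum whose weights add up to 1.

    `own` is the parameter derived in this framework unit (e.g. p1.1'),
    `others` are the parameters received from the other servers (e.g. p2.1', p3.1').
    """
    if not 0.0 < own_weight <= 1.0:
        raise ValueError("own_weight must be in (0, 1]")
    if not others:
        # Nothing received: the derived parameter is used as it is.
        return list(own)
    # Assumption: the remaining weight is split evenly across the other parameters.
    other_weight = (1.0 - own_weight) / len(others)
    merged = [own_weight * value for value in own]
    for params in others:
        if len(params) != len(own):
            raise ValueError("all parameter vectors must have the same length")
        for i, value in enumerate(params):
            merged[i] += other_weight * value
    return merged


# p1.1', p2.1', p3.1' as toy two-element parameter vectors
p1_1 = [1.0, 2.0]
p2_1 = [3.0, 4.0]
p3_1 = [5.0, 6.0]
p1_2 = integrate(p1_1, [p2_1, p3_1], own_weight=0.5)
print(p1_2)  # [2.5, 3.5]
```

With `own_weight=0.5`, each of the two other parameters receives a weight of 0.25, so the three weights sum to 1 as the text prefers.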

The first framework unit 51 may update the integrated learning parameter p1.2 as a learning parameter to be applied to the next batch learning, and perform the next batch learning.

The integration units 320 of the second and third framework units 52 and 53 of the second and third distribution servers 42 and 43 may likewise integrate all the learning parameters (p1.1', p2.1', p3.1') derived after the 1.1 batch learning into the 2.2 and 3.2 parameters (p2.2, p3.2), respectively, update them as the learning parameters to be applied to the next batch learning, and then perform the next batch learning.

Through this process, the first to third framework units 51 to 53 may continue learning until all epochs are completed.

When all epochs are finished, any one of the plurality of distribution servers 41 to 43 or the integration unit 320 of the main server 40 may integrate the last learning parameters (p1.n', p2.n', p3.n') to derive the final learning parameter (p_last), where n is a natural number. The final integration function (F') may be different from the integration function (F) used during learning. The final integration function (F') preferably does not give differential weights to its arguments.
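One way to read "no differential weight" is a plain element-wise average of the last parameters. The sketch below assumes that reading; the function name `final_integrate` and the list-of-floats representation are hypothetical.

```python
from typing import List, Sequence


def final_integrate(last_params: Sequence[Sequence[float]]) -> List[float]:
    """Hypothetical final integration function F': equal weight for every server."""
    if not last_params:
        raise ValueError("at least one parameter vector is required")
    count = len(last_params)
    # zip(*...) walks the vectors element by element.
    return [sum(column) / count for column in zip(*last_params)]


# p1.n', p2.n', p3.n' as toy two-element parameter vectors
p_last = final_integrate([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
print(p_last)  # [3.0, 4.0]
```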

The synchronous learning method may take more time to learn than the asynchronous learning method, but each learning parameter can be used in a balanced manner.

Another embodiment of the synchronous learning method will be described with reference to FIG. 19. FIG. 19 shows an embodiment according to any one of a sharing policy for each time period, a sharing policy for each learning period, and other rule policies among the above-mentioned 'diffusion policies'. The integration policy of this embodiment is that the learning parameters are used as arguments once all of them are up to date. Reference is also made to FIGS. 15, 16, and 18.

Each of the first to third framework units 51 to 53 may repeat batch learning (batch TR) for each mini-batch b1 to b10 until the learning ends.

The first framework unit 51 may perform 1.1 batch learning in the model architecture to which the 1.1 parameter p1.1 is assigned. When the 1.1 batch learning is completed, the first framework unit 51 may derive the learned 1.1′ parameter p1.1′. The derived 1.1′ parameter (p1.1′) may be used as it is for the 1.2 batch learning. That is, the integration unit 320 of the first framework unit 51 may not execute the integration function (F), and the 1.2 parameter p1.2 has the same value as the 1.1′ parameter p1.1′. In this way, each of the distributed servers DS1 to DS3 may perform batch learning independently until the learning parameters are spread. The start of each batch learning does not need to be synchronized.

All of the framework units 51 to 53 may spread their latest learning parameters at a specific time period, learning period, or specific time (S820). In this embodiment, the framework units 51 to 53 may spread the 1.3', 2.3', and 3.3' learning parameters (p1.3', p2.3', p3.3'), respectively, to the other distributed servers.

After spreading the learning parameters, each framework unit may integrate the learning parameters before the next batch learning and update them to the 1.4, 2.4, and 3.4 learning parameters (p1.4, p2.4, p3.4), respectively. Thereafter, each of the framework units 51 to 53 may proceed with batch learning until the next spreading of learning parameters.
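The periodic sharing policy can be illustrated with a toy simulation. Everything here is an assumption made for illustration: each framework unit holds a single scalar parameter, batch learning is replaced by a caller-supplied `step` function, sharing happens every `share_every` batches, and the integration is a weighted sum in which the unit's own parameter gets `own_weight`.

```python
from typing import Callable, List, Sequence, Tuple


def train_with_periodic_sharing(
    initial: Sequence[float],
    step: Callable[[int, float], float],
    num_batches: int,
    share_every: int,
    own_weight: float = 0.5,
) -> Tuple[List[float], List[Tuple[int, List[float]]]]:
    """Simulate units that learn independently and integrate every `share_every` batches."""
    if len(initial) < 2:
        raise ValueError("at least two framework units are required")
    params = list(initial)
    history = []  # (batch number, integrated parameters) at each sharing point
    for batch in range(1, num_batches + 1):
        # Independent batch learning: p_k.b -> p_k.b' (no integration function call).
        params = [step(unit, p) for unit, p in enumerate(params)]
        if batch % share_every == 0:
            # Spread the latest parameters, then integrate before the next batch.
            other_weight = (1.0 - own_weight) / (len(params) - 1)
            total = sum(params)
            params = [own_weight * p + other_weight * (total - p) for p in params]
            history.append((batch, list(params)))
    return params, history


# Stand-in for batch learning: unit k moves its parameter by k + 1 per batch.
final, history = train_with_periodic_sharing(
    [0.0, 0.0, 0.0], lambda unit, p: p + (unit + 1), num_batches=3, share_every=3
)
print(final)  # [5.25, 6.0, 6.75]
```

Between sharing points the derived parameter is carried into the next batch unchanged, which mirrors the statement that p1.2 has the same value as p1.1′.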

Referring to FIG. 13, any one of the plurality of distributed servers 41 to 43, for example, the first distributed server 41, may convert the trained model architecture and the trained learning parameter p_last into an architecture table and a learning parameter table, and store them as a trained learning model table (model table for inference (Ti)) (S670). The stored data may be transmitted to or synchronized with the main server 40.

Referring to FIG. 14, the distribution of deep learning inference will be described. Hereinafter, reference is also made to FIGS. 10 and 11. However, the environment of the main server 40 and the plurality of distributed servers 41 to 43 of the deep learning inference distribution system may be different from that of the main server 40 and the plurality of distributed servers 41 to 43 of the deep learning training distribution system. The main server 40 and the plurality of distributed servers 41 to 43 may not include the integration units 320 and 320-N, respectively. The main server 40 and the plurality of distributed servers 41 to 43 may be compatible with each other.

Referring to FIG. 14 , a deep learning inference query having the same function as the learning query may be input from the terminal 20 (S710). In this embodiment, assuming that the main server 40 is the same as the first distributed server 41 capable of deep learning, the main server 40 can receive the deep learning inference query. Hereinafter, it is assumed that deep learning inference is performed in the main server 40 .

The main server 40 may have a dataset table for inference in the storage unit 200. The main server 40 may receive the data of the dataset for inference through a query or from another device.

The main server 40 may analyze a deep learning inference query having the same function as the input learning query and select a pre-learned model table Ti for inference (S715). The model table (Ti) for inference may be described in the contents of a deep learning inference query.

The main framework unit 50 of the main server 40 may build the architecture table of the model table for inference (Ti) into a model architecture for inference suitable for the main framework unit 50, and generate a learning model for inference by assigning the learning parameters to the model architecture for inference (S720).

The main server 40 may determine whether inference distribution is necessary (S725).

Inference distribution may mean performing some of the plurality of tasks for deep learning on another device. The plurality of tasks may be a series of processes for one learning model or a set of tasks for each of a plurality of learning models. In the former case (one learning model), the tasks must be performed in sequence, so some tasks may be performed on one device and the remaining tasks on another device. In the latter case (a plurality of learning models), the tasks of each learning model may be performed on a different device for each task group. In this case, the task group of a learning model belonging to a higher group may be performed first, followed by the task group of a learning model belonging to a lower group. The latter may include the former concept of distribution. This embodiment is described on the basis of the former, but the latter concept is naturally included. Basically, the tasks processed in a distributed manner must be a series of tasks. The tasks in a series are in a precedence relationship, meaning that they are directly connected to each other. For example, the output value of one task in the series must be the input value of the next task in the series.

Inference distribution requirements can vary.

As a first example of an environment that requires inference distribution, the execution time of a series of tasks in the first distribution server 41 may be shorter than the execution time of the same tasks in the main server 40. In this case, the time needed to transmit the inference distribution environment and the last result value of the series of tasks may also be considered. To this end, it is preferable that the main server 40 and the first distribution server 41 be connected through a high-speed network such as high-speed Wi-Fi or a 5G or 6G mobile communication network. If the main server 40 has a low computing specification, such as a mobile device or an embedded device, and/or the first distribution server 41 has a high computing specification, inference distribution may be required. Such an environment may be particularly suitable for edge computing (mobile edge computing). In an edge computing environment, when the main server 40 is an edge device and the first distribution server 41 is an edge server, distributed inference may be preferable. In particular, since the communication speed between the edge device and the edge server is very fast in an edge computing environment, it may be well suited to this kind of inference distribution.
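The trade-off in this first example can be written as a simple comparison. The specification gives no formula, so the sketch below is an assumption: distribution is treated as worthwhile when the remote execution time plus the transfer time is shorter than the local execution time. The function name and all parameters are hypothetical.

```python
def should_distribute(local_time_s: float,
                      remote_time_s: float,
                      payload_bytes: int,
                      bandwidth_bytes_per_s: float,
                      round_trip_s: float = 0.0) -> bool:
    """Return True when running the task series remotely is expected to be faster.

    Assumption: transfer time = payload / bandwidth + a fixed round-trip overhead.
    """
    if bandwidth_bytes_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    transfer_s = payload_bytes / bandwidth_bytes_per_s + round_trip_s
    return remote_time_s + transfer_s < local_time_s


# Slow edge device, fast edge server, fast link: distribution pays off.
print(should_distribute(2.0, 0.5, 1_000_000, 10_000_000))  # True
# Slow link: the transfer time outweighs the faster server.
print(should_distribute(0.3, 0.1, 1_000_000, 1_000_000))   # False
```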

As a second example of an environment requiring inference distribution, each device may have computing specifications suited to different kinds of processing. For example, if the processing speed of specific tasks is faster in the first distribution server 41 than in the main server 40, those specific tasks are preferably distributed to and processed in the first distribution server 41.

As a third example of a distributed inference environment, a series of tasks that can be separated and processed among a plurality of tasks are processed in different distributed servers. For example, in the case of a learning model that classifies the gender and age of a person, it may be separated into a gender classification learning model and an age classification learning model. If the gender classification is processed in the first distributed server 41 and the age classification is processed in the second distributed server 42, the total time for performing deep learning can be reduced.

As a fourth example of an environment requiring distributed reasoning, when there is a lot of input data, the input data may be divided and each divided input data may be distributed and processed. This example can be used in combination with other examples. For example, if there is a process of pre-processing input data with uncomplicated tasks, the main server 40 may perform the pre-processing and transmit data requiring post-processing to other distributed servers after the pre-processing. This example can be especially useful when the communication environment is not fast. This is because only the data that requires post-processing deep learning needs to be transmitted to the distributed server.

If it is determined that inference distribution is necessary, the main server 40 may transmit the inference model table Ti to at least one of the plurality of distribution servers 41 to 43 (S730).

The main server 40 may instruct the plurality of distributed servers 41 to 43 to generate a learning model for inference based on the model table Ti for inference.

The main server 40 may determine an inference distribution environment including an inference distribution range, which is the set of tasks to be distributed, and the distribution servers that will take part in the inference distribution (S735). Based on the inference distribution environment, the main server 40 may instruct the distribution servers that will take part in the inference distribution to perform distributed processing (S740). The distributed processing instruction is described in detail later.

If inference distribution is not required, or if some tasks of the deep learning inference need to be performed by the main server 40 even though inference distribution is instructed, the main server 40 may perform all or part of the tasks of the deep learning inference (S750). The main server 40 may perform all or part of the deep learning of the query function on the data of the dataset table for inference using the generated learning model for inference.

The main server 40 may acquire inference results according to deep learning inference completed by itself or other distributed servers, and store them (S760) or notify the user.

Referring to FIG. 20 , if it is determined that inference distribution is necessary, the main server 40 may transmit the learning model table Ti for inference to the first and second distribution servers 41 and 42 (S730).

The first and second distributed servers 41 and 42 may generate a learning model for inference using the learning model table Ti for inference according to instructions from the main server 40.

Referring to FIG. 21 , the network structure (neural network) of the schematic learning model may include an input layer (L_I), hidden layers (L1 to L_N), and an output layer (L_O).

The input layer may receive input from a dataset for inference.

The hidden layer is the layer where the computation takes place. The hidden layers (L1 to L_N) may be composed of a single layer or a plurality of layers. A circle shape represents each node, and each layer may be composed of a set of nodes. The beginning of an arrow may be an output of a node, and the end of an arrow may be an input of a node.

The output layer is a result value of deep learning, and may have as many output nodes as the number of values to be classified.

Performing deep learning may include multiple tasks. That is, deep learning can be performed in stages by a plurality of tasks. Each of the plurality of tasks may have a unique number distinguished from other tasks. Each of a plurality of layers described later may have a unique number distinguished from other layers.

A plurality of tasks may refer to all tasks for performing a query function without being dependent on any one learning model. The function of the query is classified into a plurality of detailed functions, and deep learning inference can be performed with a plurality of learning models. In this case, the plurality of tasks may include both a first task group of a first learning model and a second task group of a second learning model among the plurality of learning models.

Referring to FIGS. 21 and 22 , operations performed at nodes of a specific layer among a plurality of layers of a learning model may correspond to one task.

For example, the first output values R1 of layer 1 (L1) may become the second input values of layer 2 (L2). In layer 2 (L2), the first output values R1 may be computed and output as the second output values R2. The process of computing the second output values R2 from the second input values R1 may be referred to as task 2 (T2). The second output values may be referred to as result value list 2 (R2) of task 2 (T2). Each result value list corresponds to a task-specific number and may have a unique number distinguished from other result value lists.

If there are N layers, there can be N tasks. As shown on the right side of FIG. 22 , the controller 100 may store the plurality of result value lists R1 to R_N of the plurality of tasks T1 to T_N as an intermediate result value table T_R having a relational structure.
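The one-layer-per-task correspondence and the intermediate result value table can be sketched as below. This is an illustration only: the layers are hypothetical stand-in functions on lists, and the intermediate result value table T_R is modelled as a dictionary keyed by task number rather than as an actual relational table.

```python
from typing import Callable, Dict, List, Sequence

Layer = Callable[[List[float]], List[float]]


def run_tasks(layers: Sequence[Layer], inputs: List[float]) -> Dict[int, List[float]]:
    """Run task k on layer k and keep every result value list R_k, keyed by task number."""
    result_table: Dict[int, List[float]] = {}
    values = inputs
    for task_no, layer in enumerate(layers, start=1):
        # The output of task k-1 is the input of task k.
        values = layer(values)
        result_table[task_no] = values
    return result_table


# Three stand-in layers L1..L3 (not real neural network layers)
layers = [
    lambda xs: [x * 2 for x in xs],   # L1 / T1
    lambda xs: [x + 1 for x in xs],   # L2 / T2
    lambda xs: [sum(xs)],             # L3 / T3
]
t_r = run_tasks(layers, [1, 2])
print(t_r)  # {1: [2, 4], 2: [3, 5], 3: [8]}
```

Keeping each R_k addressable by its task number is what lets a later task be resumed elsewhere from a stored intermediate result.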

Referring to FIG. 23 , each layer L1 to L_N of the hidden layer may correspond to rows H1 to H_N of the network table T_NW. Accordingly, a plurality of layer-specific numbers, a plurality of task-specific numbers, and a plurality of row numbers may correspond to each other. For example, layer 2 (L2), task 2 (T2), and row number 2 (H2) may correspond to each other.

The inference learning model table (Ti) may have a unique number distinguished from other learning model tables.

Referring to FIG. 20, the main server 40 may decide to perform some of the tasks (T1 to T5) in the main server 40 and to have the first to second tasks (T6 to T10) processed in a distributed manner in the first distribution server 41. The second task may be task N (T_N).

To this end, the main server 40 may perform some of the tasks (T1 to T5) (S810) and transmit, to the first distribution server 41, the unique number (M_ID) of the learning model table (Ti) for inference, the result value list R5 of the third task (T5) immediately preceding the first task (T6), and the second row number H10 of the network table T_NW corresponding to the unique number of the second task (T10) (S812).

The user's request query may be analyzed into an upper first detailed function and a lower second detailed function. For example, when the request query is a gender classification of a person, it can be analyzed into an upper-level function of detecting a person in the input data and a lower-level function of classifying the gender of the detected person. In this case, the deep learning inference for the person detection function may be performed in the main server 40, and the deep learning inference for the gender classification function of the detected person may be performed in the first distributed server 41.

As another example, the main server 40 may perform deep learning of a pre-processing detection function on the data for inference, and the first distributed server 41 may perform deep learning of the query function only when deep learning of the main function is required for that data. The pre-processing detection function may refer to a function of detecting whether the deep learning function of the query requested by the user, which is the main function, is required. For example, if the request query is a gender classification of a person, the pre-processing detection function may be a function of detecting whether a person exists in an input image. Since deep learning of the request query is not performed for images without people, processing time and communication bandwidth can be reduced.
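The gating effect of the pre-processing detection function can be sketched as follows. The detector and classifier are passed in as plain callables, and the images are toy dictionaries; none of these names come from the specification, and in a real deployment the second callable would stand for the deep learning inference run on the distributed server.

```python
from typing import Any, Callable, Dict, List, Tuple


def infer_with_gate(
    images: Dict[str, Any],
    person_exists: Callable[[Any], bool],
    classify_gender: Callable[[Any], str],
) -> Tuple[Dict[str, str], List[str]]:
    """Run the main function only on inputs that pass the pre-processing detection."""
    results: Dict[str, str] = {}
    skipped: List[str] = []
    for image_id, image in images.items():
        if not person_exists(image):
            # No person: the main function is never invoked and nothing is transmitted.
            skipped.append(image_id)
            continue
        results[image_id] = classify_gender(image)
    return results, skipped


# Toy inputs standing in for image files (hypothetical structure)
images = {
    "img1": {"person": True, "gender": "female"},
    "img2": {"person": False, "gender": None},
    "img3": {"person": True, "gender": "male"},
}
results, skipped = infer_with_gate(
    images, lambda im: im["person"], lambda im: im["gender"]
)
print(results)  # {'img1': 'female', 'img3': 'male'}
print(skipped)  # ['img2']
```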

When the first distribution server 41 receives the unique number (M_ID) of the learning model table (Ti) for inference and the third result value list (R5) from the main server 40, it may determine that the third result value list (R5) is the list of result values obtained by performing tasks T1 to T5 of the plurality of tasks.

In the network table (T_NW), the sixth row number (H6) immediately follows the third row number (H5).

When the first distribution server 41 further receives the second row number H10 of the network table T_NW, which is the task instruction end row, the first framework unit 51 of the first distribution server 41 may use the third result value list R5 as an input and perform the operations of the sixth row number H6 to the second row number H10 of the network table T_NW (S814). That is, the first distribution server 41 may process, in a distributed manner, the first to second tasks T6 to T10 of the pre-generated learning model for inference (Ti).
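The hand-off in steps S810 to S814 can be sketched as a small message exchange. This is an illustration under assumptions: the network table is modelled as a dictionary from row number to a stand-in layer function, the message is a plain dictionary, and the field names (`model_id`, `result_no`, `results`, `end_row`) as well as the model identifier "M1" are hypothetical.

```python
from typing import Callable, Dict, List

NetworkTable = Dict[int, Callable[[List[int]], List[int]]]


def build_network_table(num_rows: int) -> NetworkTable:
    """Stand-in network table: row i simply adds i to every value."""
    return {
        row: (lambda values, row=row: [v + row for v in values])
        for row in range(1, num_rows + 1)
    }


def run_rows(table: NetworkTable, first_row: int, last_row: int,
             values: List[int]) -> List[int]:
    """Perform the operations of rows first_row..last_row in order."""
    for row in range(first_row, last_row + 1):
        values = table[row](values)
    return values


# Both servers hold a model generated from the same model table, keyed by its unique number.
MODELS: Dict[str, NetworkTable] = {"M1": build_network_table(10)}


def handle_instruction(message: dict) -> List[int]:
    """Distributed server side: resume right after the row of the received result list."""
    table = MODELS[message["model_id"]]
    first_row = message["result_no"] + 1  # R5 received, so start at H6
    return run_rows(table, first_row, message["end_row"], message["results"])


# Main server: perform T1..T5, then send M_ID, R5, and the end row H10.
r5 = run_rows(MODELS["M1"], 1, 5, [0, 1])
r10 = handle_instruction(
    {"model_id": "M1", "result_no": 5, "results": r5, "end_row": 10}
)
print(r5)   # [15, 16]
print(r10)  # [55, 56]
```

The split result matches what a single server would produce by running rows 1 to 10 in one pass, which is the property the series-of-tasks requirement is meant to guarantee.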

When the operations of the sixth row number (H6) to the second row number (H10) of the network table (T_NW) are completed, the first distribution server 41 may transmit the second result value list (R10) to the main server 40 (S816-1). In this embodiment, this procedure may be performed when the second task T10 is the final task or when another task needs to be performed in the main server 40.

In another embodiment, when the main server 40 has instructed the first distribution server 41 to transmit, to the second distribution server 42, the network table identification number (M_ID), the second result value list (R10) of the second task (T10), and the seventh row number (H_N), which is the task instruction end row, the first distribution server 41 may perform the operations of the sixth row number (H6) to the second row number (H10) (S814) and then carry out the instruction of the main server 40 (S816-2). The query analyzer 110 of the main server 40 may analyze and/or extract a user-requested query into detailed functions in three stages.

For example, when the query is a gender classification function for a person, the query analyzer 110 may extract the requested query into a higher-level person presence detection function, a next-level person detection function, and a lower-level function of classifying the gender of the detected person. The main server 40 may perform deep learning inference on whether a person exists in the image file that is the input data, the first distributed server 41 may perform deep learning inference of the person detection function on image files in which a person exists, and the second distributed server 42 may perform deep learning inference of the function of classifying the gender of the detected person. Through this, processing time can be reduced by reducing the amount of processing in each server, and it can be shortened further because only images containing people are processed in the distributed servers.

When the second distribution server 42 receives the unique number (M_ID) of the learning model table (Ti) for inference, the second result value list (R10), and the seventh row number (H_N) of the network table (T_NW), it may use the second result value list (R10) as an input and perform the operations of row number 11 (H11) to row number N (H_N) of the network table (T_NW) (S818). That is, the second distribution server 42 may process, in a distributed manner, tasks 11 to N (T11 to T_N) of the pre-generated learning model table for inference (Ti).

The second distribution server 42 may determine a seventh result value list (R_N) as a result of the distribution processing (S818) and transmit it to the main server 40 (S820).

Fig. 26 shows the intermediate data of Figs. 24 and 25;

See Figures 1 to 23. For the contents of the same name as the above-mentioned components, refer to the above-mentioned contents.

The deep learning framework application database server for classifying gender and age according to the present embodiment may include a control unit 100, a storage unit 200, a framework unit 300, a conversion unit 360, and an input/output unit 370.

Referring to FIG. 24, the control unit 100 may include a query analysis unit 110, a dataset management module 120, a learning model management module 130, a result management module 160, and an auxiliary management module 170. The storage unit 200 may include a query analysis value 210, a dataset table 220, a learning model table 230, a learning result 260, mapping information 520, and a face image box 510.

Referring to FIGS. 25 and 26 , deep learning inference for classifying gender and age of the deep learning framework application database server for classifying gender and age will be described.

The input/output unit 370 may receive an inference query for the gender and age classification function of the dataset for inference from the user (S830). A dataset for inference may be stored as a dataset table for inference. A dataset table for inference may be a set of images in which a person exists. Hereinafter, it is assumed that the original image 500 is input as input data.

The query analysis unit 110 may analyze the user-requested inference query and extract a plurality of detailed functions to achieve the function of the query (S832). The query analysis unit 110 may extract the inference query as a plurality of detailed functions, such as a face detection function of an upper group and gender and age classification functions of a lower group. The gender and age classification functions may be extracted either as a single function or as separate functions that classify gender and age respectively. Since the gender and age classifications are groups of the same rank, either form may be used. In this embodiment, it is assumed that two functions are extracted: a gender classification function and an age classification function. Although the face detection function is described in this embodiment, it may be adjusted to detect the entire person rather than the face, if necessary.

The query analysis unit 110 may further extract a preprocessing detection function of the highest group for detecting whether deep learning inference of a user-requested query function, which is a main function, is required for an inference dataset. The main function may mean all of a plurality of detailed functions.

The storage unit 200 may include a pre-learned face detection learning model table associated with the face detection function, a gender classification learning model table associated with the gender classification function, an age classification learning model table associated with the age classification function, and a preprocessing detection learning model table associated with the preprocessing detection function. If a learned learning model table does not exist, a learning model table learned through deep learning training may be added.

The learning model management module 130 may select a previously learned pre-processing detection learning model table, a face detection learning model table, a gender classification learning model table, and an age classification learning model table from among a plurality of learning model tables.

The framework unit 300 may link the preprocessing detection learning model table, the face detection learning model table, the gender classification learning model table, and the age classification learning model table, respectively, with the dataset table for inference, and perform deep learning inference of the preprocessing detection function and of each of the plurality of detailed functions. The framework unit 300 preferably performs deep learning inference of the learning models according to group rank. In this embodiment, the groups may be ranked in the order of the highest group, the upper group, and the lower group. That is, the preprocessing detection function may have first priority, the face detection function second priority, and the gender and age classification functions third priority.
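The rank-based ordering can be sketched by grouping the detailed functions by priority. The function names and the numeric ranks below are hypothetical labels chosen for illustration; the specification only states the relative order of the groups.

```python
from itertools import groupby
from typing import List, Sequence, Tuple


def group_by_rank(functions: Sequence[Tuple[str, int]]) -> List[List[str]]:
    """Order detailed functions by group rank; functions of equal rank share a stage."""
    ordered = sorted(functions, key=lambda item: item[1])  # stable sort by rank
    return [
        [name for name, _ in stage]
        for _, stage in groupby(ordered, key=lambda item: item[1])
    ]


# (detailed function, priority): 1 = highest group, 2 = upper group, 3 = lower group
detailed_functions = [
    ("age_classification", 3),
    ("face_detection", 2),
    ("gender_classification", 3),
    ("preprocessing_detection", 1),
]
stages = group_by_rank(detailed_functions)
print(stages)
# [['preprocessing_detection'], ['face_detection'],
#  ['age_classification', 'gender_classification']]
```

Functions that land in the same stage, such as gender and age classification, have no ordering constraint between them, so they are natural candidates for processing on different distributed servers.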

The framework unit 300 may perform deep learning inference for detecting whether there is a face in the original image 500 by using the preprocessing detection learning model, which is the first-rank learning model (S834).

If there is no face in the original image 500 (S836), deep learning on the original image 500 may be stopped, and deep learning inference of a preprocessing detection function may be performed on the next input data.

If there is a face in the original image 500 (S836), deep learning inference of the remaining plurality of detailed functions may be performed.

The framework unit 300 may perform deep learning inference for detecting a face in the original image 500 using a face detection learning model, which is a second rank learning model (S836). Face detection may mean defining a region such as a rectangle including a face portion and extracting position coordinates of the rectangle region from the original image 500 .

The auxiliary management module 170 may generate a face image box 510 by cropping the detected face part (S840). The face image box 510 may be an input value for deep learning inference of a detailed function of the next priority.

The auxiliary management module 170 may generate mapping information 520 mapping a relationship between the original image 500 and the face image box 510 (S842).

The mapping information 520 may include a box ID 523 , an image ID 521 , and location information 525 . The mapping information 520 may be in the form of a relational database.

The box ID 523 is a unique number of the face image box 510 and may function as an identifier. The image ID 521 is related to the box ID 523 and is a unique number of the original image 500. The location information 525 is information about the location of the face image box 510 mapped to the original image 500 with respect to the box ID 523 . If the face image box 510 is a rectangular area including a face, the location information 525 may be x and y coordinates of two diagonal vertices of the rectangle.

The framework unit 300 may classify the gender of the face image box 510 using the gender classification learning model that is the third ranking learning model (S844).

The auxiliary management module 170 may update the mapping information 520 by mapping information about the gender of the face image box 510 to the mapping information 520 (S846). Mapping can mean associating the gender value 527 to the box ID 523 of the face image box 510 .

The framework unit 300 may classify the age of the face image box 510 using an age classification learning model that is a third ranking learning model (S848).

The auxiliary management module 170 may update the mapping information 520 by mapping information about the age of the face image box 510 to the mapping information 520 (S850). Mapping can mean associating the age value 529 to the box ID 523 of the face image box 510 .
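Since the mapping information may take the form of a relational database, its life cycle can be sketched with an in-memory SQLite table. The table name, column names, and sample values are assumptions made for illustration; only the fields themselves (box ID, image ID, two diagonal vertices, gender value, age value) come from the description.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE mapping_info (
        box_id   INTEGER PRIMARY KEY,   -- identifier of the face image box
        image_id INTEGER NOT NULL,      -- identifier of the original image
        x1 INTEGER, y1 INTEGER,         -- first diagonal vertex of the rectangle
        x2 INTEGER, y2 INTEGER,         -- second diagonal vertex of the rectangle
        gender TEXT,                    -- filled in after gender classification
        age    INTEGER                  -- filled in after age classification
    )
    """
)

# Mapping information is generated when the face image box is cropped.
conn.execute(
    "INSERT INTO mapping_info (box_id, image_id, x1, y1, x2, y2) VALUES (?, ?, ?, ?, ?, ?)",
    (1, 7, 40, 30, 120, 130),
)
# The gender value is associated with the box ID after gender classification.
conn.execute("UPDATE mapping_info SET gender = ? WHERE box_id = ?", ("female", 1))
# The age value is associated with the box ID after age classification.
conn.execute("UPDATE mapping_info SET age = ? WHERE box_id = ?", (34, 1))

row = conn.execute(
    "SELECT image_id, x1, y1, x2, y2, gender, age FROM mapping_info WHERE box_id = ?",
    (1,),
).fetchone()
print(row)  # (7, 40, 30, 120, 130, 'female', 34)
```

Looking rows up by image ID then gives everything needed to draw each box and its labels back onto the original image.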

The dataset management module 120 may generate the resulting image 540 by adding the person's gender and age to the original image 500, based on the original image 500 and on the location information 525, the gender value 527, and the age value 529 of the face image box 510 in the mapping information 520 having the image ID 521 of the original image 500 (S852).

The deep learning framework application database server for classifying gender and age according to the present embodiment performs a series of tasks, which are part of a plurality of tasks according to the above-described deep learning inference, on at least one of the plurality of distributed servers 41 to 43. By doing so, deep learning inference can be distributed. In addition, the deep learning framework application database server that classifies gender and age can conduct deep learning training for an untrained learning model by distributing it with a plurality of distributed servers 41 to 43.

The present invention may be implemented in hardware or software. The present invention can also be implemented as computer readable code on a computer readable recording medium. That is, it may be implemented in the form of a recording medium including instructions executable by a computer. Computer readable media include all types of media in which data that can be read by a computer system is stored. Computer readable media may include computer storage media and communication storage media. Computer storage media include all storage media implemented by any method or technology for storing information such as computer readable instructions, data structures, program modules, and other data, and are not limited as to whether they are volatile, nonvolatile, or hybrid memory, or separable or non-separable. Communication storage media include modulated data signals or transmission mechanisms such as carrier waves, any information delivery media, and the like. In addition, functional programs, codes, and code segments for implementing the present invention can be easily inferred by programmers in the technical field to which the present invention belongs.

In addition, although preferred embodiments of the present invention have been shown and described above, the present invention is not limited to the specific embodiments described above. Various modifications can of course be made by those skilled in the art to which the present invention belongs without departing from the gist of the present invention claimed in the claims, and these modifications should not be understood separately from the technical spirit or perspective of the present invention.

10: DB server 20: terminal

40: main server 41: first distributed server

42: second distributed server 43: third distributed server

51: first framework unit 52: second framework unit

53: third framework unit 100: control unit

110: query analysis unit 120: dataset management module

130: learning model management module 160: result management module

170: auxiliary management module 200: storage unit

300: framework unit 310: QML module

320: integration unit 360: conversion unit

Claims

A deep learning framework application database server for classifying gender and age, comprising:

an input/output unit that receives, from a user, an inference query for a gender and age classification function on an inference dataset;

a query analysis unit that analyzes the query and extracts a plurality of detailed functions for achieving the function of the query;

a storage unit that serves as a database and has a plurality of learning model tables and a dataset table; and

a framework unit that interworks with the database and performs deep learning of the plurality of detailed functions using the learning model tables and the dataset table,

wherein the query analysis unit extracts, from the inference query, a face detection function of an upper group and a gender and age classification function of a lower group as the plurality of detailed functions,

the server further comprises a learning model management module that selects a pre-learned face detection learning model table, gender classification learning model table, and age classification learning model table from among the plurality of learning model tables, and

the framework unit performs deep learning inference of the plurality of detailed functions in conjunction with the face detection learning model table, the gender classification learning model table, and the age classification learning model table, respectively, and, based on priority, first performs deep learning inference with the face detection learning model according to the face detection learning model table of the upper group.
The deep learning framework application database server according to claim 1, further comprising:

a dataset management module that converts the dataset for inference into a dataset table for inference; and

an auxiliary management module that, when the framework unit infers using the face detection learning model that a face is present in an original image included in the dataset table for inference, crops the face portion to create a face image box, associates a box ID, which is a unique number of the face image box, with an image ID, which is a unique number of the original image, and generates mapping information to which location information of the face image box is mapped.
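
The mapping information described in this claim can be pictured with the minimal sketch below. It assumes an image stored as a nested list of pixel values and a box given as `(left, top, right, bottom)`; the `Mapping` record, the `crop` and `build_mappings` helpers, and the ID source are hypothetical names introduced only for illustration, and the face detector itself is represented simply by the list of boxes it would return.

```python
from dataclasses import dataclass
from itertools import count
from typing import Dict, Iterator, List, Tuple

Image = List[List[int]]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), assumed layout


@dataclass
class Mapping:
    box_id: int      # unique number of the face image box
    image_id: int    # unique number of the original image
    location: Box    # where the box sits in the original image


def crop(image: Image, box: Box) -> Image:
    """Cut the face portion out of the original image."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def build_mappings(
    image_id: int, image: Image, detected_boxes: List[Box], ids: Iterator[int]
) -> Tuple[Dict[int, Image], List[Mapping]]:
    """Create one face image box and one mapping record per detected face."""
    crops: Dict[int, Image] = {}
    mappings: List[Mapping] = []
    for box in detected_boxes:
        box_id = next(ids)
        crops[box_id] = crop(image, box)
        mappings.append(Mapping(box_id, image_id, box))
    return crops, mappings
```

Keeping the location in the mapping record is what later allows classification results for a box to be drawn back onto the original image.
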
The deep learning framework application database server according to claim 2,

wherein, when the framework unit performs deep learning inference of the gender and age of the person in the face image box using a gender classification learning model and an age classification learning model according to the gender classification learning model table and the age classification learning model table, the auxiliary management module maps the inferred gender and age to the box ID.
The deep learning framework application database server according to claim 3,

wherein the dataset management module creates a result image by adding the gender and age of the person to the original image, using the original image and the location information of the face image box in the mapping information having the image ID of the original image.
The deep learning framework application database server according to claim 1,

wherein the query analysis unit further extracts a preprocessing detection function for detecting whether deep learning inference of the function of the query, which is a main function, is necessary for the inference dataset,

the storage unit includes a preprocessing detection learning model table associated with the preprocessing detection function, and

the server further comprises a control unit that performs deep learning inference of the preprocessing detection function on the inference dataset and, when the inference dataset is classified as requiring deep learning inference of the main function, causes deep learning inference of the plurality of detailed functions to be performed on the inference dataset.
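
One way to read this gating step is sketched below: a cheap preprocessing check decides, item by item, whether the more expensive main inference runs at all. The function name `gated_inference` and its two callable parameters are hypothetical stand-ins for the preprocessing detection model and the main function; the claim does not specify what happens to data classified as not requiring inference, so returning `None` for such items is an assumption.

```python
from typing import Callable, Iterable, List, Optional, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def gated_inference(
    items: Iterable[T],
    needs_main: Callable[[T], bool],  # stand-in for the preprocessing detector
    main: Callable[[T], R],           # stand-in for the main function
) -> List[Optional[R]]:
    """Run the main function only on items the preprocessing check flags."""
    return [main(item) if needs_main(item) else None for item in items]
```
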
The deep learning framework application database server according to claim 1,

wherein, among the plurality of learning models according to the plurality of detailed functions, a first learning model includes a plurality of tasks,

the plurality of tasks correspond to a plurality of rows of a network table provided in a first learning model table,

each of the plurality of tasks has a unique number corresponding to a row number of the network table,

the server further comprises a control unit that distributes deep learning inference by having a first distributed server among a plurality of distributed servers perform a series of tasks from a first task to a second task among the plurality of tasks,

each of the plurality of distributed servers has a deep learning framework interworking with a database and has the first learning model table, and

the control unit transmits, to the first distributed server, a unique number of the first learning model table, a third result value list of a third task immediately preceding the first task, and a second row number of the network table corresponding to the unique number of the second task.
The deep learning framework application database server according to claim 6,

wherein, when the control unit receives, from a second distributed server among the plurality of distributed servers, a unique number of the learning model table and a fourth result value list of a fourth task among the plurality of tasks, the control unit determines the fourth result value list to be the list of result values obtained by performing the series of tasks from the first task to the fourth task among the plurality of tasks,

a fifth row number of the network table immediately follows a fourth row number corresponding to the fourth task, and

when the control unit further receives, from the second distributed server, a sixth row number, which is a task instruction end row, the control unit causes the framework unit to take the fourth result value list as input and perform the operations from the fifth row number to the sixth row number.
The deep learning framework application database server according to claim 1,

wherein, when there is no pre-learned first learning model table related to a first detailed function among the plurality of detailed functions, the learning model management module selects a second learning model table suitable for the first detailed function,

the second learning model table is converted into the first learning model table through deep learning training of the first detailed function based on an associated training dataset table,

the deep learning training is distributed among and processed by a plurality of distributed servers, and

each of the plurality of distributed servers has a deep learning framework interworking with a database.
The deep learning framework application database server according to claim 8,

further comprising a control unit that spreads the batch size of the training dataset, the second learning model table, and the training dataset table to the plurality of distributed servers,

wherein the deep learning framework application database server functions as a first distributed server among the plurality of distributed servers,

the control unit randomly changes the data order of the training dataset table and divides it according to the batch size to convert it into a batch dataset table, and

the framework unit builds a model architecture using an architecture table belonging to the second learning model table, initializes a learning parameter table belonging to the second learning model table and assigns it to the model architecture to generate a second learning model, and performs deep learning training on the second learning model using a plurality of mini-batches of the batch dataset table.
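
The conversion of a training dataset table into a batch dataset table can be sketched as a shuffle followed by a fixed-size split. The helper name `make_batch_table` and the `seed` parameter are hypothetical; the claim only requires that the data order be changed randomly and that the table be divided according to the batch size, so the handling of a smaller final batch shown here is an assumption.

```python
import random
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def make_batch_table(rows: Sequence[T], batch_size: int, seed: int = 0) -> List[List[T]]:
    """Shuffle the rows, then split them into mini-batches of batch_size."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    shuffled = list(rows)                  # copy so the source table is untouched
    random.Random(seed).shuffle(shuffled)  # randomly change the data order
    # The last mini-batch may be smaller when the row count is not a multiple.
    return [shuffled[i:i + batch_size] for i in range(0, len(shuffled), batch_size)]
```

Using a seeded generator keeps the shuffle reproducible, which is convenient when several servers must agree on the same batch layout.
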
The deep learning framework application database server according to claim 9,

wherein the framework unit derives a new learning parameter when batch learning for one mini-batch of the plurality of mini-batches ends,

the control unit spreads the new learning parameter to the remaining distributed servers of the plurality of distributed servers,

the server further comprises an integration unit that, when the new learning parameter is derived, integrates the new learning parameter with at least one learning parameter spread from the remaining distributed servers and updates the learning parameter to be applied to the next batch learning,

when all assigned epochs are completed, the integration unit derives a final learning parameter by integrating the last learning parameter derived by the framework unit with at least one learning parameter last spread from the remaining distributed servers, and

the control unit converts the trained model architecture and the final learning parameter into the learned first learning model table.
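
The claim does not state how learning parameters are integrated. Purely as an assumption for illustration, the sketch below uses an element-wise mean of the locally derived parameters and those received from the other servers, which is one common choice; the `Params` type and the `integrate` function are hypothetical names.

```python
from typing import Dict, List

Params = Dict[str, List[float]]  # parameter name -> flat list of values


def integrate(local: Params, received: List[Params]) -> Params:
    """Element-wise mean of local and received parameters (assumed rule)."""
    all_params = [local] + received
    n = len(all_params)
    return {
        name: [sum(p[name][i] for p in all_params) / n for i in range(len(values))]
        for name, values in local.items()
    }
```

Under this assumption, each server would call `integrate` after every mini-batch and once more after the final epoch to obtain the final learning parameter.
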
An inference method of a deep learning framework application database server that classifies gender and age, the method comprising:

receiving, from a user, an inference query for a gender and age classification function on an inference dataset;

extracting, from the inference query, a plurality of detailed functions, namely a face detection function of an upper group and a gender and age classification function of a lower group;

selecting a pre-learned face detection learning model table, gender classification learning model table, and age classification learning model table; and

performing deep learning inference of the function corresponding to the upper group and then deep learning inference of the function corresponding to the lower group, in that order.
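
The ordering step of this method can be sketched as a lookup followed by a sort on group priority. The table `DETAILED_FUNCTIONS`, the numeric priorities, and the function name `plan_inference` are hypothetical and chosen only to show the upper group running before the lower group.

```python
from typing import Dict, List, Tuple

# Hypothetical decomposition of a query function into detailed functions.
# The number is the group priority: 0 = upper group, 1 = lower group.
DETAILED_FUNCTIONS: Dict[str, List[Tuple[str, int]]] = {
    "gender_age_classification": [
        ("gender_classification", 1),
        ("face_detection", 0),
        ("age_classification", 1),
    ]
}


def plan_inference(query_function: str) -> List[str]:
    """Return the detailed functions in the order they should be inferred."""
    functions = DETAILED_FUNCTIONS[query_function]
    # sorted() is stable, so functions in the same group keep their listed order.
    return [name for name, _ in sorted(functions, key=lambda f: f[1])]
```
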
The inference method according to claim 11, further comprising:

converting the dataset for inference into a dataset table for inference;

performing deep learning inference of the face detection function on an original image provided in the dataset table for inference, using a face detection learning model according to the face detection learning model table;

generating a face image box by cropping the face portion if it is inferred that the original image has a face;

generating mapping information by associating an image ID, which is a unique number of the original image, with a box ID, which is a unique number of the face image box; and

adding location information of the face image box to the mapping information having the box ID.
The inference method according to claim 12, further comprising:

performing deep learning inference of the gender and age of the person in the face image box using a gender classification learning model and an age classification learning model according to the gender classification learning model table and the age classification learning model table;

adding the gender and age inferred by deep learning to the mapping information having the box ID; and

generating a result image by adding the gender and age of the person to the original image, using the original image and the location information of the face image box in the mapping information having the image ID of the original image.