WO2022030054A1

WO2022030054A1 - Value range determination program, value range determination system, and value range determination method for deep learning model

Info

Publication number: WO2022030054A1
Application number: PCT/JP2021/016965
Authority: WO
Inventors: 希武田中; ルクオックヴォン
Original assignee: コニカミノルタ株式会社
Priority date: 2020-08-06
Filing date: 2021-04-28
Publication date: 2022-02-10
Also published as: JPWO2022030054A1

Abstract

[Problem] To provide a value range determination program for a deep learning model with which a lightweight and highly precise quantization deep learning model can be obtained through quantization of parameters. [Solution] A program for determining a value range in the quantization of parameters in a deep learning model, said value range determination program having a procedure (a) for acquiring input data and correct answer data corresponding to the input data, and a procedure (b) for quantizing parameters while sequentially changing the value range in the quantization of the parameters, and determining a value range when the loss is the smallest for an estimation result by a deep learning model based on the input data with respect to the correct answer data.

Description

Range determination program, range determination system, and range determination method for deep learning models

The present invention relates to a range determination program, a range determination system, and a range determination method of a deep learning model.

In recent years, edge computing technology has been developed to bring AI (Artificial Intelligence) technology into a wide range of everyday devices.

Edge computing requires low cost and low power consumption, so quantized deep learning models, which can be computed with high-speed, small-scale circuits, are attracting attention.

A quantized deep learning model is made lightweight by quantization, which converts parameters expressed as 16- to 32-bit floating-point numbers in an ordinary deep learning model into fixed-point numbers of 8 bits or fewer.

However, the estimation accuracy of a quantized deep learning model may decrease as a result of quantizing the parameters, and it varies considerably depending on how the value range is set. It is therefore difficult to quantize the parameters with a range set for each layer while suppressing the deterioration of the estimation accuracy.

The present invention has been made to solve this problem. That is, an object of the present invention is to provide a range determination program, a range determination system, and a range determination method for a deep learning model with which a highly accurate and lightweight quantized deep learning model can be obtained by quantizing parameters.

The above problem of the present invention is solved by the following means.

(1) A value range determination program for a deep learning model, the program determining a value range used in quantization of parameters of the deep learning model and causing a computer to execute processing comprising: a procedure (a) of acquiring input data and correct answer data corresponding to the input data; and a procedure (b) of quantizing the parameters while sequentially changing the value range in the quantization of the parameters, calculating a loss, with respect to the correct answer data, of an estimation result by the deep learning model based on the input data, and determining the value range at which the loss is minimized.

(2) The value range determination program for a deep learning model according to (1) above, wherein, in the procedure (b), the parameters are quantized while the value range in the quantization of the parameters is sequentially changed, the deep learning model is trained using learning teacher data, the loss, with respect to the correct answer data, of the estimation result based on the input data by the trained deep learning model is calculated, and the value range at which the loss is minimized is determined.

(3) The range determination program for a deep learning model according to (2) above, wherein the procedure (b) comprises: a procedure (b1) of provisionally setting the range in the quantization of the parameters of a predetermined target layer of the deep learning model; a procedure (b2) of quantizing the parameters of the target layer in the range provisionally set in the procedure (b1) and then training the target layer or a designated layer using the learning teacher data; a procedure (b3) of calculating the loss, with respect to the correct answer data, of the estimation result based on the input data by the deep learning model trained in the procedure (b2); and a procedure (b4) of executing the procedure (b2) and the procedure (b3) while sequentially changing the provisionally set range, and determining, among the provisionally set ranges, the range at which the loss calculated in the procedure (b3) is minimized.

(4) The range determination program for a deep learning model according to (3) above, wherein the procedure (b) further comprises: a procedure (b5) of executing the procedures (b1) to (b4) with the layer one below the target layer as a new target layer, thereby determining the range of the parameters of the new target layer; and a procedure (b6) of executing the procedure (b5) until the new target layer is the lowest layer.

(5) The range determination program of the deep learning model according to (4) above, wherein the initial value of the target layer in (b1) is the uppermost layer.

(6) The range determination program of the deep learning model according to (4) above, wherein the initial value of the target layer in the above (b1) is a layer other than the uppermost layer.

(7) The range determination program for a deep learning model according to any one of (3) to (6) above, wherein the parameters of each layer of the deep learning model include a weight and a bias, and the procedure (b2) comprises a procedure (b21) of fixing the bias and learning the weight, and a procedure (b22) of fixing the weight and learning the bias.

(8) The range determination program for a deep learning model according to any one of (3) to (7) above, wherein the parameters of each layer of the deep learning model include a weight, a bias, and an output, and in the procedure (b4), the range of the bias is calculated based on the range of the weight of the same layer as the bias and the range of the output of the layer above the layer of the bias.

(9) The range determination program for a deep learning model according to any one of (3) to (8) above, wherein, when the layer below a convolution layer in the deep learning model is a relu layer, the procedure (b4) makes the range of the convolution layer the same as the range of the relu layer.

(10) A value range determination system for a deep learning model, the system determining a value range used in quantization of parameters of the deep learning model and comprising: an acquisition unit that acquires input data and correct answer data corresponding to the input data; and a calculation unit that quantizes the parameters while sequentially changing the value range in the quantization of the parameters, calculates a loss, with respect to the correct answer data, of an estimation result by the deep learning model based on the input data, and determines the value range at which the loss is minimized.

(11) A value range determination method for a deep learning model, the method determining, by a value range determination system, a value range used in quantization of parameters of the deep learning model and comprising: a step (a) of acquiring input data and correct answer data corresponding to the input data; and a step (b) of quantizing the parameters while sequentially changing the value range in the quantization of the parameters, calculating a loss, with respect to the correct answer data, of an estimation result by the deep learning model based on the input data, and determining the value range at which the loss is minimized.

The parameters are quantized while the range in the quantization of the parameters of the deep learning model is sequentially changed, the loss, with respect to the correct answer data, of the estimation result by the deep learning model based on the input data is calculated, and the range at which the loss is minimized is determined. As a result, a highly accurate and lightweight quantized deep learning model can be obtained by quantizing the parameters.

FIG. 1 is a diagram showing the schematic configuration of the range determination system for a deep learning model. FIG. 2 is a block diagram showing the hardware configuration of the range determination device. FIG. 3 is a block diagram showing an example of the functions of the control unit of the range determination device. FIG. 4 is an explanatory diagram showing the input image. FIG. 5 is an explanatory diagram showing the joint points. FIG. 6 is an explanatory diagram for explaining the ranges in the quantization of the model parameters that are provisionally set in sequence. FIG. 7 is an explanatory diagram showing how the range in the quantization of the model parameters is determined for each layer of the deep learning model. FIG. 8 is a block diagram showing another example of the functions of the control unit of the range determination device. FIG. 9 is a flowchart showing the operation of the range determination device.

Hereinafter, the range determination program, the range determination system, and the range determination method of the deep learning model according to the embodiment of the present invention will be described with reference to the drawings. In the drawings, the same elements are designated by the same reference numerals, and duplicate description will be omitted. In addition, the dimensional ratios in the drawings are exaggerated for convenience of explanation and may differ from the actual ratios.

FIG. 1 is a diagram showing a schematic configuration of a range determination system 10 of a deep learning model according to an embodiment.

The range determination system 10 includes a range determination device 100, a photographing device 200, and a communication network 300. The range determination device 100 is communicably connected to the photographing device 200 via the communication network 300. In the present specification, an object is a thing whose position or partial position is estimated by the estimation unit 112 (see FIGS. 3 and 8) of the range determination device 100. Objects can belong to multiple categories. A category is a type of object, such as a person, a dog, or a cat. Hereinafter, for simplicity, the object is described as being the subject 500 (that is, a "person"). The range determination device 100 estimates the position or partial position of an object in an image input to the range determination device 100 (hereinafter also simply referred to as the "input image 250"), and determines the range in the quantization of the parameters of the deep learning model of the estimation unit 112 (hereinafter also referred to as "model parameters"). The input image 250 may be an image taken by the photographing device 200. The input image 250 constitutes the input data. Quantization means, for example, converting a model parameter expressed as a 16- to 32-bit floating-point number into a fixed-point number of 8 bits or fewer. Hereinafter, for simplicity, quantization is described as converting a model parameter expressed as a 32-bit floating-point number into a value that can be expressed as an 8-bit fixed-point number. Note that the quantized value may itself be held as a floating-point number wider than 8 bits, as long as it is a value that can be expressed as an 8-bit fixed-point number.
Further, the description assumes that the model parameters expressed as 32-bit floating-point numbers are converted in advance, by quantization, into model parameters that can be expressed as 8-bit fixed-point numbers. The model parameters include the input, weight, bias, and output of each layer of the deep learning model. Further, for simplicity, the description takes as an example the case in which the estimation unit 112 of the range determination device 100 estimates the position of the joint point 251 of the subject 500, which is an example of a partial position of an object, as coordinates on the input image 250. Hereinafter, the positions of the joint points 251 and the like are also simply referred to as the joint points 251 and the like.
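The quantization described above can be illustrated with a minimal Python sketch. The function name `quantize`, the `range_width` argument, and the round-to-nearest and clipping behavior are assumptions made for illustration; the specification does not prescribe a rounding rule. The result is kept as a Python float, which matches the remark that a quantized value may be held in a wider floating-point type as long as it lies on the 8-bit fixed-point grid.

```python
def quantize(value, range_width, bits=8):
    """Snap a float onto the grid of a signed fixed-point number (illustrative)."""
    step = range_width / (2 ** bits)       # spacing between representable values
    lowest = -range_width / 2              # e.g. -128 when range_width is 256
    highest = range_width / 2 - step       # e.g. 127 when range_width is 256
    snapped = round(value / step) * step   # nearest grid point, still a float
    return min(max(snapped, lowest), highest)  # clip values outside the range

params = [0.3, 1.7, 200.0, -300.0]
wide = [quantize(p, 256) for p in params]    # range 2^8, step 1
narrow = [quantize(p, 128) for p in params]  # range 2^7, step 0.5
print(wide)
print(narrow)
```

In this sketch a wider range clips less but rounds more coarsely, while a narrower range rounds more finely but clips large values.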

The range determination device 100 can be included in the photographing device 200 by being integrated with the photographing device 200.

The photographing device 200 is installed, for example, on the ceiling or the upper part of a wall of the living room of the subject 500, or on the ceiling or the upper part of a wall of a store or the like. The photographing device 200 photographs a predetermined photographing range from a position overlooking the subject 500, and outputs an input image 250 (image data). The input image 250 includes an image of the subject 500. The photographing device 200 includes a wide-angle camera. The photographing device 200 may photograph the photographing range as an input image 250 that is a moving image with a frame rate of, for example, 15 fps to 30 fps. The input image 250 may be a moving image or a still image. The input image 250 is, for example, a black-and-white image and may be an image of 128 pixels × 128 pixels. The input image 250 may be a color image. The photographing device 200 transmits the input image 250 to the range determination device 100 and the like.

For the communication network 300, a network interface based on a wired communication standard such as Ethernet (registered trademark) can be used. For the communication network 300, a network interface based on a wireless communication standard such as Bluetooth (registered trademark) or 802.11 may be used.

FIG. 2 is a block diagram showing a hardware configuration of the range determination device 100. The range determination device 100 includes a control unit 110, a storage unit 120, a display unit 130, an input unit 140, and a communication unit 150. These components are connected to each other via the bus 160. The range determination device 100 may be configured by a computer.

The control unit 110 is configured by a CPU (Central Processing Unit), and controls and performs arithmetic processing of each unit of the range determination device 100 according to a program. The details of the operation of the control unit 110 will be described later.

The storage unit 120 may be composed of a RAM (Random Access Memory), a ROM (Read Only Memory), and an SSD (Solid State Drive). The RAM temporarily stores programs and data as a work area of the control unit 110. The ROM stores various programs and various data in advance. The SSD stores various programs including an operating system and various data.

The display unit 130 is, for example, a liquid crystal display and displays various information.

The input unit 140 is composed of, for example, a touch panel and various keys. The input unit 140 is used for various operations and inputs.

The communication unit 150 is an interface for communicating with an external device. For communication, a network interface according to standards such as Ethernet (registered trademark), SATA, PCI Express, USB, and IEEE 1394 can be used. In addition, a wireless communication interface such as Bluetooth (registered trademark), 802.11, or 4G may be used for communication.

The details of the operation of the control unit 110 will be described.

FIG. 3 is a block diagram showing an example of the function of the control unit 110 of the range determination device 100. The control unit 110 functions as an acquisition unit 111, an estimation unit 112, a range temporary setting unit 113, a loss calculation unit 114, and a range determination unit 115. The estimation unit 112 can be configured by using a deep learning model, which is a neural network. The estimation unit 112, the range temporary setting unit 113, the loss calculation unit 114, and the range determination unit 115 constitute a calculation unit.

The acquisition unit 111 acquires the input image 250 and the correct answer data of the joint point 251 corresponding to the input image 250.

FIG. 4 is an explanatory diagram showing the input image 250. FIG. 5 is an explanatory diagram showing a joint point 251.

In the example of FIG. 4, the input image 250 includes an image of the subject 500.

In the example of FIG. 5, 14 joint points 251 are shown. The joint points 251 include, for example, the right ankle, right knee, right waist, left waist, left knee, left ankle, right wrist, right elbow, right shoulder, left wrist, left elbow, left shoulder, neck, and head.

The estimation unit 112 quantizes a model parameter expressed as a 32-bit floating-point number by converting it, within the range provisionally set by the range temporary setting unit 113, into a value that can be expressed as an 8-bit fixed-point number. As will be described later, the range temporary setting unit 113 provisionally sets different ranges in sequence. The estimation unit 112 sequentially quantizes the model parameters in each range provisionally set by the range temporary setting unit 113. The model parameters expressed as 32-bit floating-point numbers before quantization can be model parameters generated in advance by training on a relatively high-performance computer using a large amount of teacher data (combinations of an image and the correct answer data corresponding to the image).

The estimation unit 112 can quantize the model parameters for each layer in the range temporarily set for each layer of the deep learning model by the range temporary setting unit 113. The estimation unit 112 may quantize the model parameters for each model parameter in the range temporarily set for each model parameter of the deep learning model by the range temporary setting unit 113. Hereinafter, for the sake of simplicity, the estimation unit 112 will be described as quantizing the model parameters for each layer in the range temporarily set for each layer of the deep learning model.

The estimation unit 112 quantizes the model parameters for each layer of the deep learning model in the range temporarily set by the range temporary setting unit 113, and estimates the joint point 251 by the deep learning model based on the input image 250.

The range temporary setting unit 113 sequentially changes and provisionally sets the range in the quantization of the model parameters.

FIG. 6 is an explanatory diagram for explaining the range in the quantization of the model parameters, which are temporarily set sequentially. The horizontal axis of each graph in FIG. 6 shows the values of the model parameters, and the vertical axis shows the frequency. The frequency on the vertical axis is the frequency of the values taken by the model parameters when learning the deep learning model.

In FIG. 6, $2^{n}$, $2^{n-1}$, and $2^{n-2}$ shown by broken lines each represent a range (more specifically, the upper limit of the range). For example, in the 8-bit fixed-point representation, the exponent n is 8, and the range $2^{n}$ is 256 (decimal) = 100000000 (binary). In the range $2^{n}$, for example, values from −128 to 127 (decimal) can be expressed in increments of 1 (decimal) = 1 (binary). In the range $2^{n-1}$, for example, values from −64 to 63.5 (decimal) can be expressed in increments of 0.5 (decimal) = 0.1 (binary). Therefore, if the range is made larger, the span of values that the model parameters can take becomes larger, but the quantization error also becomes larger. Conversely, if the range is made smaller, the span of values that the model parameters can take becomes smaller, but the quantization error also becomes smaller.

As shown in the example of FIG. 6, the range temporary setting unit 113 can provisionally set the range in the quantization of the model parameters in the order of $2^{n}$, $2^{n-1}$, and $2^{n-2}$.
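The relationship between a range and its increment can be tabulated with a short Python sketch. The helper name `describe_range` and the dictionary layout are hypothetical; the arithmetic simply divides the range into 2^n signed levels, which reproduces the values −128 to 127 and −64 to 63.5 quoted above for n = 8.

```python
BITS = 8  # the exponent n in the text for an 8-bit fixed-point representation

def describe_range(exponent, bits=BITS):
    """Return the step and the representable span for the range 2**exponent."""
    width = 2 ** exponent
    step = width / 2 ** bits              # increment between adjacent values
    return {"range": width, "step": step,
            "min": -width / 2, "max": width / 2 - step}

# Candidate ranges in the order 2^n, 2^(n-1), 2^(n-2)
table = [describe_range(e) for e in (BITS, BITS - 1, BITS - 2)]
for row in table:
    print(row)
```

Each halving of the range halves the increment, which is the trade-off between representable span and quantization error described above.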

The loss calculation unit 114 calculates the loss, with respect to the correct answer data, of the estimation result of the joint points 251 produced by the estimation unit 112 based on the input image 250. The loss can be, for example, the mean of the squared differences (distances) between each estimated joint point 251 and the correct answer data, calculated over all the joint points 251. The loss calculation unit 114 calculates the loss for each provisionally set range, for each layer of the deep learning model.
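As a minimal sketch of one such loss, the following Python function averages the squared Euclidean distance between estimated and correct joint coordinates. The name `joint_loss`, the (x, y) tuple layout, and the coordinates are illustrative assumptions; the specification gives this form only as one example of a loss.

```python
def joint_loss(estimated, correct):
    """Mean of squared distances between estimated and correct joint points."""
    if len(estimated) != len(correct):
        raise ValueError("joint lists must have the same length")
    squared = [(ex - cx) ** 2 + (ey - cy) ** 2
               for (ex, ey), (cx, cy) in zip(estimated, correct)]
    return sum(squared) / len(squared)

# Two hypothetical joint points given as (x, y) coordinates on the input image
estimated = [(10.0, 12.0), (30.0, 41.0)]
correct = [(10.0, 10.0), (33.0, 45.0)]
loss = joint_loss(estimated, correct)
print(loss)
```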

The range determination unit 115 determines the range when the loss calculated by the loss calculation unit 114 is minimized for each layer of the deep learning model, and stores it in the storage unit 120 as a parameter of the range.
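The selection of the loss-minimizing range for a single layer can be sketched as follows. The names `choose_range` and `stand_in_loss` are hypothetical, and the loss used here is only the squared quantization error of a few made-up parameter values, standing in for the model's estimation loss, which would require the full deep learning model.

```python
def quantize(value, range_width, bits=8):
    """Round to the signed fixed-point grid for the given range (illustrative)."""
    step = range_width / 2 ** bits
    snapped = round(value / step) * step
    return min(max(snapped, -range_width / 2), range_width / 2 - step)

def choose_range(candidate_ranges, loss_for_range):
    """Try each provisional range and keep the one with the smallest loss."""
    best_range, best_loss = None, float("inf")
    for value_range in candidate_ranges:
        loss = loss_for_range(value_range)
        if loss < best_loss:
            best_range, best_loss = value_range, loss
    return best_range, best_loss

layer_params = [0.3, -1.7, 40.2]  # made-up parameter values for one layer

def stand_in_loss(value_range):
    # Assumption: squared quantization error replaces the estimation loss here.
    return sum((p - quantize(p, value_range)) ** 2 for p in layer_params)

best_range, best_loss = choose_range([256, 128, 64], stand_in_loss)
print(best_range, best_loss)
```

With these values the middle range is selected: the widest range rounds too coarsely and the narrowest clips the largest parameter.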

FIG. 7 is an explanatory diagram showing the transition in which the range in the quantization of the model parameters is determined for each layer of the deep learning model.

The figure on the left side of FIG. 7 shows the state in which Layer1, the uppermost layer of the deep learning model, has been quantized and its range determined. The figure on the right shows the state in which, after Layer1 has been quantized and its range determined, Layer2, which is one layer below, is quantized and its range determined. In this way, the deep learning model can be quantized layer by layer from the top layer to the bottom layer, with the range determined for each. At this time, the quantization of a lower layer is executed in the state where the quantization of the upper layers has already been executed.

The deep learning model may be quantized and the range may be determined only for a predetermined target layer. The predetermined target layer may be a plurality of target layers. Further, the range may be determined by quantizing only the target layer set by inputting from the input unit 140 by the user or the like.

The deep learning model may be quantized one layer at a time from a predetermined target layer toward the lowest layer, and the range may be determined. In this case, the predetermined target layer may be the uppermost layer of the deep learning model, or may be a layer other than the uppermost layer. That is, the initial value of the predetermined target layer may be the uppermost layer or may be a layer other than the uppermost layer.

The range determination unit 115 can calculate (determine) the range of bias, which is a model parameter, based on the range of weight of the same layer as the layer of bias and the range of output of one layer above the layer of bias.
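The specification does not give the formula for the bias range. The sketch below is therefore only one possible reading, based on a common fixed-point convention in which the bias shares the grid of the weight-times-input products, so that the bias increment is the product of the weight increment and the increment of the upper layer's output. The function name `bias_range`, the `bias_bits` parameter, and the formula itself are assumptions.

```python
def bias_range(weight_range, upper_output_range, bits=8, bias_bits=8):
    """Derive a bias range from the weight and upper-layer output ranges.

    Assumption: the bias step equals weight_step * input_step, so that the
    bias can be added directly to the accumulated weight * input products.
    """
    weight_step = weight_range / 2 ** bits
    input_step = upper_output_range / 2 ** bits
    bias_step = weight_step * input_step
    return bias_step * 2 ** bias_bits  # span covered by bias_bits levels

# Hypothetical ranges: weights in the range 2, upper-layer outputs in the range 128
derived = bias_range(2, 128)
print(derived)
```

Under this convention no separate search is needed for the bias, which is consistent with the stated effect of not changing the existing hardware configuration.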

In the deep learning model, when the lower layer of the convolution layer is the relu layer, the range determination unit 115 can determine the range of the convolution layer to be the same as the range of the relu layer.

FIG. 8 is a block diagram showing another example of the function of the control unit 110 of the range determination device 100. The control unit 110 can function as an acquisition unit 111, an estimation unit 112, a range temporary setting unit 113, a loss calculation unit 114, a range determination unit 115, and a learning loss calculation unit 116.

The acquisition unit 111 acquires the input image 250 and the correct answer data of the joint point 251 corresponding to the input image 250. The acquisition unit 111 further acquires a learning input image and learning correct answer data of the joint point 251 corresponding to the learning input image. To prevent over-learning of the deep learning model, it is preferable that the learning input image and the learning correct answer data differ from the input image 250 and the correct answer data, respectively. Alternatively, the input image 250 and the correct answer data themselves may be used as the learning input image and the learning correct answer data, respectively. The combination of the learning input image and the learning correct answer data constitutes the learning teacher data.

The estimation unit 112 quantizes a model parameter expressed as a 32-bit floating-point number by converting it, within the range provisionally set by the range temporary setting unit 113, into a value that can be expressed as an 8-bit fixed-point number. The estimation unit 112 sequentially quantizes the model parameters in each range provisionally set for each layer of the deep learning model by the range temporary setting unit 113, and trains the deep learning model using the learning teacher data as follows.

The learning loss calculation unit 116 calculates the learning loss for the learning correct answer data of the estimation result of the joint point 251 based on the learning input image by the estimation unit 112. The learning loss is the difference (distance) between the joint point 251 and the correct answer data for learning, and is calculated for each joint point 251. The learning loss calculation unit 116 calculates the learning loss in the temporarily set range for each layer of the deep learning model.

The estimation unit 112 learns the deep learning model by backpropagation so that the learning loss calculated for each joint point 251 by the learning loss calculation unit 116 becomes 0.

The estimation unit 112 quantizes the model parameters for each layer of the deep learning model in the range provisionally set by the range temporary setting unit 113, trains the deep learning model using the learning teacher data (that is, trains all layers of the deep learning model), and then estimates the joint points 251 based on the input image 250 with the trained deep learning model. As a result, the model parameters are fine-tuned for each layer during the layer-by-layer quantization of the deep learning model. Alternatively, the estimation unit 112 may quantize the model parameters for each layer of the deep learning model in the range provisionally set by the range temporary setting unit 113 and train (update the model parameters of) only each quantized layer. That is, quantization and learning may be performed for each predetermined layer. In this case, the model parameters of the layers other than the predetermined layer are not updated during learning.

The range determination unit 115 determines the range when the loss calculated by the loss calculation unit 114 is minimized for each layer of the deep learning model.

The deep learning model may be quantized and trained to determine the range only for a predetermined target layer.

The deep learning model may be quantized and trained layer by layer from a predetermined target layer toward the bottom layer to determine a range. In this case, the predetermined target layer may be the uppermost layer of the deep learning model, or may be a layer other than the uppermost layer. That is, the initial value of the predetermined target layer may be the uppermost layer or may be a layer other than the uppermost layer.

Learning for each layer of the deep learning model can be performed in the following order. The model parameters of bias are fixed and weight is learned. After that, the weight is fixed and the bias is learned.
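The ordering described above can be sketched on a toy one-weight, one-bias unit. This is not the deep learning model of the embodiment: the function name `train_alternating`, the data, the learning rate, and the use of plain gradient descent on a mean squared error are all illustrative assumptions, and quantization is omitted to keep the focus on the two phases.

```python
def train_alternating(xs, ys, weight, bias, lr=0.05, steps=200):
    """Fit y = weight * x + bias in two phases: weight first, then bias."""
    n = len(xs)
    # Phase 1: the bias is fixed and only the weight is updated.
    for _ in range(steps):
        grad_w = sum(2 * (weight * x + bias - y) * x
                     for x, y in zip(xs, ys)) / n
        weight -= lr * grad_w
    # Phase 2: the weight is fixed and only the bias is updated.
    for _ in range(steps):
        grad_b = sum(2 * (weight * x + bias - y)
                     for x, y in zip(xs, ys)) / n
        bias -= lr * grad_b
    return weight, bias

xs = [-2.0, -1.0, 1.0, 2.0]
ys = [2.0 * x + 1.0 for x in xs]   # toy targets generated from y = 2x + 1
weight, bias = train_alternating(xs, ys, weight=0.0, bias=0.0)
print(weight, bias)
```

Because the toy inputs have zero mean, the two phases do not interfere here and the fit recovers the generating weight and bias; in a real network the two phases would generally interact.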

The range of the model parameters is determined in this way, and the estimation unit 112 is trained so that the joint point 251 is detected with high accuracy based on the input image 250. At this time, the estimation unit 112 estimates the joint point 251 by reflecting the trained model parameters stored in the storage unit 120 in the deep learning model. Further, the estimation unit 112 reads out the range parameters for each layer stored in the storage unit 120, and quantizes the model parameters for each layer based on the range parameters.

The operation of the range determination device 100 will be described.

FIG. 9 is a flowchart showing the operation of the range determination device 100. This flowchart is executed by the control unit 110 according to the program stored in the storage unit 120.

The control unit 110 acquires the input image 250 and the correct answer data of the joint point 251 corresponding to the input image 250 (S101). The control unit 110 can acquire the input image 250 and the correct answer data of the joint point 251 by reading them from the storage unit 120. The control unit 110 may instead acquire the input image 250 and the correct answer data of the joint point 251 by receiving them via the communication unit 150.

The control unit 110 temporarily sets the range in the quantization of the target layer of the deep learning model (S102).

The control unit 110 learns the target layer using the teacher data for learning with the model parameters quantized in the temporarily set range (S103).

The control unit 110 estimates the joint point 251 from the input image in the deep learning model after learning (S104).

The control unit 110 calculates the loss of the estimated joint point 251 with respect to the correct answer data (S105).

The control unit 110 determines whether or not the loss has been calculated for all the ranges (S106). If the control unit 110 determines that the loss has been calculated for all the ranges (S106: YES), the control unit 110 executes step S107. If the control unit 110 determines that the loss has not been calculated for all the ranges (S106: NO), the control unit 110 returns to step S102.

The control unit 110 determines the temporarily set range when the loss is minimized as the range of the target layer (S107).

The control unit 110 determines whether or not the range has been determined for all layers of the deep learning model (S108). When the control unit 110 determines that the range has been determined for all the layers of the deep learning model (S108: YES), the control unit 110 ends the process. When the control unit 110 determines that the range has not been determined for all the layers of the deep learning model (S108: NO), the control unit 110 sets the target layer to the layer one below and executes steps S102 to S107 again.
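The flow of steps S102 to S108 can be sketched as a pair of nested loops. Everything in this sketch is a stand-in chosen for illustration: the toy model is a chain of scalar layers with a relu, index 0 is assumed to be the uppermost layer, the training step S103 is an optional hook that is left unused, and the parameter values are picked so that each layer lands exactly on one candidate grid.

```python
def quantize(value, range_width, bits=8):
    """Round to the signed fixed-point grid for the given range (illustrative)."""
    step = range_width / 2 ** bits
    snapped = round(value / step) * step
    return min(max(snapped, -range_width / 2), range_width / 2 - step)

def predict(layers, x):
    """Toy model: each layer computes max(0, weight * x + bias)."""
    for weight, bias in layers:
        x = max(0.0, weight * x + bias)
    return x

def model_loss(layers, inputs, correct):
    return sum((predict(layers, x) - y) ** 2
               for x, y in zip(inputs, correct)) / len(inputs)

def determine_ranges(layers, inputs, correct, candidate_ranges, train=None):
    layers = list(layers)                      # upper layers stay quantized
    chosen = []
    for index in range(len(layers)):           # S108: move down one layer
        best = None
        for value_range in candidate_ranges:   # S102: provisional range
            trial = list(layers)
            trial[index] = tuple(quantize(p, value_range)
                                 for p in layers[index])
            if train is not None:              # S103: optional training hook
                trial = train(trial, index)
            loss = model_loss(trial, inputs, correct)   # S104 and S105
            if best is None or loss < best[0]:
                best = (loss, value_range, trial)
        chosen.append(best[1])                 # S107: keep the minimum-loss range
        layers = best[2]
    return chosen, layers

float_layers = [(0.75, 0.25), (40.5, -2.0)]    # made-up float parameters
inputs = [1.0, 2.0, 3.0]
correct = [predict(float_layers, x) for x in inputs]   # stand-in answers
chosen, quantized_layers = determine_ranges(
    float_layers, inputs, correct, candidate_ranges=[256, 128, 64])
print(chosen, quantized_layers)
```

The lower layer is searched with the upper layer already quantized at its determined range, which mirrors the top-to-bottom order of the flowchart.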

This embodiment has the following effects.

Furthermore, the parameters are quantized while the value range in the quantization of the model parameters is sequentially changed, the deep learning model is trained using the learning teacher data, the loss, with respect to the correct answer data, of the estimation result based on the input data by the trained deep learning model is calculated, and the value range at which the loss is minimized is determined. As a result, the estimation accuracy of the quantized deep learning model can be further improved.

Furthermore, for a predetermined target layer, the parameters are quantized while the range in the quantization of the parameters of the deep learning model is sequentially changed, the target layer or a designated layer is trained, the loss, with respect to the correct answer data, of the estimation result by the deep learning model based on the input data is calculated, and the range at which the loss is minimized is determined. As a result, a highly accurate and lightweight quantized deep learning model can be obtained more easily and flexibly.

Furthermore, the parameters are quantized while the range in the quantization of the parameters of the deep learning model is sequentially changed for each layer from the predetermined target layer toward the bottom layer, the loss, with respect to the correct answer data, of the estimation result by the deep learning model based on the input data is calculated, and the range at which the loss is minimized is determined. As a result, the quantization error of an upper layer is absorbed in the lower layers of the deep learning model, so that a highly accurate and lightweight quantized deep learning model can be obtained more effectively.

Furthermore, the initial value of the target layer is set to the top layer. This makes it possible to obtain a highly accurate and lightweight quantized deep learning model more effectively.

Furthermore, the initial value of the target layer is set to a layer other than the top layer. This makes it possible to obtain a highly accurate and lightweight quantized deep learning model effectively and flexibly.

Furthermore, learning for each layer of the deep learning model is performed in the following order: first, the bias is fixed and the weight is learned; then, the weight is fixed and the bias is learned. As a result, the estimation accuracy of the quantized deep learning model can be further improved.
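The two-phase learning order can be sketched as follows. This is a simplified illustration, assuming a single linear layer trained by plain gradient descent on mean squared error in floating point; the function name `train_alternating`, the learning rate, and the step count are hypothetical choices, and quantization of the parameters is omitted.

```python
import numpy as np

def train_alternating(x, y, w, b, lr=0.1, steps=200):
    """Fit y ~ x @ w + b in two phases: weight first, then bias."""
    w, b = w.copy(), b.copy()
    n = len(x)
    # Phase 1: the bias is fixed and only the weight is updated.
    for _ in range(steps):
        err = x @ w + b - y
        w -= lr * (2.0 / n) * (x.T @ err)
    # Phase 2: the weight is fixed and only the bias is updated.
    for _ in range(steps):
        err = x @ w + b - y
        b -= lr * (2.0 / n) * err.sum(axis=0)
    return w, b

rng = np.random.default_rng(2)
x = rng.normal(size=(64, 3))
y = x @ np.array([[1.0], [-2.0], [0.5]]) + 0.5
w0, b0 = np.zeros((3, 1)), np.zeros(1)
w1, b1 = train_alternating(x, y, w0, b0)
print(w1.ravel(), b1)
```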

Furthermore, the range of the bias is calculated based on the range of the weight of the same layer as the bias and the range of the output of the layer one layer above that layer. This makes it possible to obtain a highly accurate and lightweight quantized deep learning model more easily without changing the existing hardware configuration.
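This passage does not state the formula used. As a purely illustrative sketch, one common fixed-point convention is assumed here: the product of a quantized input and a quantized weight has a step equal to the product of their steps, so a bias stored at that same step can be added to the accumulator without rescaling. The function names and the 32-bit bias width are assumptions, not taken from the embodiment.

```python
def step_size(value_range, bits):
    """Quantization step for a symmetric signed range [-value_range, value_range]."""
    return value_range / (2 ** (bits - 1) - 1)

def bias_range(weight_range, input_range, weight_bits=8, input_bits=8, bias_bits=32):
    """Range of a bias stored at the accumulator's step (assumed convention).

    input_range is the output range of the layer one layer above,
    which is the input of the layer that owns the bias.
    """
    bias_step = step_size(weight_range, weight_bits) * step_size(input_range, input_bits)
    return bias_step * (2 ** (bias_bits - 1) - 1)

print(bias_range(weight_range=1.0, input_range=2.0))
```

Under this assumption the bias range follows from two ranges that are already determined, so no separate search over bias ranges is needed.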

Furthermore, in a deep learning model in which the layer below a convolution layer is a ReLU layer, the range of the convolution layer is made the same as the range of the ReLU layer. This makes it possible to obtain a highly accurate and lightweight quantized deep learning model more easily in this specific case.
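One way to see why sharing the range is harmless is sketched below. It assumes that the range in question is the output range, that the ReLU output is quantized onto an unsigned range [0, r], and that the values shown are hypothetical. Under these assumptions, quantizing the convolution output with the ReLU range gives the same result as quantizing the ReLU output, because negative values are clipped to zero either way.

```python
import numpy as np

def quantize_unsigned(x, value_range, bits=8):
    """Uniform quantization onto [0, value_range] with 2**bits levels."""
    levels = 2 ** bits - 1
    step = value_range / levels
    return np.clip(np.round(x / step), 0, levels) * step

conv_out = np.array([-2.0, -0.3, 0.0, 0.4, 1.7, 5.0])   # hypothetical convolution outputs
relu_range = 4.0                                         # hypothetical range of the ReLU layer

# Convolution output quantized with the ReLU layer's range.
shared = quantize_unsigned(conv_out, relu_range)
# ReLU applied first, then quantized with the same range.
reference = quantize_unsigned(np.maximum(conv_out, 0.0), relu_range)

print(np.array_equal(shared, reference))
```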

The configurations of the range determination program, the range determination system 10, and the range determination method for a deep learning model described above are the main configurations used to explain the features of the embodiment. The invention is not limited to these configurations, and various modifications can be made within the scope of the claims. Further, configurations provided in a general position estimation system or range determination device are not excluded.

For example, the photographing device 200 having a built-in computer may have the function of the range determination device 100.

Further, the range determination device 100 and the photographing device 200 may each be configured by a plurality of devices, or any plurality of the devices may be configured as a single device.

Further, in the above-mentioned flowchart, some steps may be omitted or other steps may be added. Further, a part of each step may be executed at the same time, or one step may be divided into a plurality of steps and executed.

Further, in the embodiment, conversion of model parameters represented as 32-bit floating point numbers into 8-bit fixed point numbers has been described as an example of quantization, but quantization also includes converting model parameters represented as 32-bit floating point numbers into floating point numbers of fewer than 32 bits.
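Both kinds of conversion can be illustrated with a short sketch. It assumes a signed 8-bit fixed-point format whose range is set by the number of fractional bits, and uses 16-bit floating point as the example of a narrower floating point format; the helper names and sample values are hypothetical.

```python
import numpy as np

def to_fixed_point(x, frac_bits):
    """Convert floats to signed 8-bit fixed point with frac_bits fractional bits."""
    scale = 2 ** frac_bits
    return np.clip(np.round(x * scale), -128, 127).astype(np.int8)

def from_fixed_point(q, frac_bits):
    """Recover the real values represented by the fixed-point integers."""
    return q.astype(np.float32) / (2 ** frac_bits)

params = np.array([0.1, -0.75, 1.5, 3.0], dtype=np.float32)

# With 5 fractional bits the representable range is [-4, 3.96875] in steps of 1/32.
fixed = to_fixed_point(params, frac_bits=5)
restored = from_fixed_point(fixed, frac_bits=5)

# Conversion to a floating point format of fewer than 32 bits.
half = params.astype(np.float16)

print(fixed, restored, half)
```

In the fixed-point case, choosing fewer fractional bits widens the range and coarsens the step, which is the trade-off the range determination resolves.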

Further, the means and methods for performing various processes in the range determination system 10 and the range determination device 100 described above can be realized by either a dedicated hardware circuit or a programmed computer. The program may be provided on a computer-readable recording medium such as a USB memory or a DVD (Digital Versatile Disc)-ROM, or may be provided online via a network such as the Internet. In this case, the program recorded on the computer-readable recording medium is usually transferred to and stored in a storage unit such as a hard disk. Further, the program may be provided as standalone application software, or may be incorporated as a function into the software of a device such as a photographing device.

This application is based on a Japanese patent application (Japanese Patent Application No. 2020-133972) filed on August 6, 2020, the disclosure of which is incorporated herein by reference in its entirety.

10 range determination system,
100 range determination device,
110 control unit,
111 acquisition unit,
112 estimation unit,
113 range temporary setting unit,
114 loss calculation unit,
115 range determination unit,
116 learning loss calculation unit,
120 storage unit,
130 communication unit,
140 input unit,
150 communication unit,
200 photographing device,
250 input image,
251 joint point,
500 target person.

Claims

A range determination program for a deep learning model, the program determining a range in quantization of parameters of the deep learning model and causing a computer to execute a process comprising:
a procedure (a) of acquiring input data and correct answer data corresponding to the input data; and
a procedure (b) of quantizing the parameters while sequentially changing the range in the quantization of the parameters, calculating a loss of an estimation result by the deep learning model based on the input data with respect to the correct answer data, and determining the range at which the loss is smallest.
The range determination program for a deep learning model according to claim 1, wherein, in the procedure (b), the parameters are quantized while the range in the quantization of the parameters is sequentially changed, the deep learning model is trained using teacher data for learning, the loss of the estimation result by the trained deep learning model based on the input data with respect to the correct answer data is calculated, and the range at which the loss is smallest is determined.
The range determination program for a deep learning model according to claim 2, wherein the procedure (b) comprises:
a procedure (b1) of provisionally setting the range in the quantization of the parameters of a predetermined target layer of the deep learning model;
a procedure (b2) of quantizing the parameters of the target layer with the range provisionally set in the procedure (b1) and training the target layer or a designated layer using the teacher data for learning;
a procedure (b3) of calculating the loss, with respect to the correct answer data, of the estimation result based on the input data by the deep learning model trained in the procedure (b2); and
a procedure (b4) of sequentially changing the provisionally set range, executing the procedure (b2) and the procedure (b3) for each provisionally set range, and determining, among the provisionally set ranges, the range at which the loss calculated in the procedure (b3) is smallest.
The range determination program for a deep learning model according to claim 3, wherein the procedure (b) further comprises:
a procedure (b5) of executing the procedures (b1) to (b4) with the layer one layer below the target layer as a new target layer to determine the range of the parameters of the new target layer; and
a procedure (b6) of executing the procedure (b5) until the new target layer becomes the lowest layer.
The range determination program for a deep learning model according to claim 4, wherein the initial value of the target layer in the procedure (b1) is the uppermost layer.
The range determination program for a deep learning model according to claim 4, wherein the initial value of the target layer in the procedure (b1) is a layer other than the uppermost layer.
The range determination program for a deep learning model according to any one of claims 3 to 6, wherein the parameters of each layer of the deep learning model include a weight and a bias, and the procedure (b2) comprises:
a procedure (b21) of fixing the bias and learning the weight; and
a procedure (b22) of fixing the weight and learning the bias.
The range determination program for a deep learning model according to any one of claims 3 to 7, wherein the parameters of each layer of the deep learning model include a weight, a bias, and an output, and
the procedure (b4) calculates the range of the bias based on the range of the weight of the same layer as the bias and the range of the output of the layer one layer above that layer.
The range determination program for a deep learning model according to any one of claims 3 to 8, wherein, when the layer below a convolution layer in the deep learning model is a ReLU layer, the procedure (b4) makes the range of the convolution layer the same as the range of the ReLU layer.
A range determination system for a deep learning model, the system determining a range in quantization of parameters of the deep learning model and comprising:
an acquisition unit that acquires input data and correct answer data corresponding to the input data; and
an arithmetic unit that quantizes the parameters while sequentially changing the range in the quantization of the parameters, calculates a loss of an estimation result by the deep learning model based on the input data with respect to the correct answer data, and determines the range at which the loss is smallest.
A range determination method for a deep learning model, the method determining, by a range determination system, a range in quantization of parameters of the deep learning model and comprising:
a step (a) of acquiring input data and correct answer data corresponding to the input data; and
a step (b) of quantizing the parameters while sequentially changing the range in the quantization of the parameters, calculating a loss of an estimation result by the deep learning model based on the input data with respect to the correct answer data, and determining the range at which the loss is smallest.