WO2021166347A1

WO2021166347A1 - Information processing device, information processing method, and program

Info

Publication number: WO2021166347A1
Application number: PCT/JP2020/042909
Authority: WO
Inventors: 亮志池谷
Original assignee: ソニーグループ株式会社
Priority date: 2020-02-17
Filing date: 2020-11-18
Publication date: 2021-08-26
Also published as: JP7447529B2; JP2021128674A

Abstract

The present invention improves design efficiency in an information processing device that aids neural network-related hardware design.　In the present invention, an analysis unit obtains, on the basis of connection relations of a plurality of nodes that constitute a neural network, a plurality of calculation orders for calculating functions which are used respectively at the plurality of nodes. A cost acquisition unit obtains, for each of the plurality of calculation orders, the cost necessary for calculating the functions. A display control unit displays on a display unit the plurality of calculation orders and the costs in a manner associated with each other.

Description

Information processing equipment, information processing methods, and programs

This technology is related to information processing equipment. More specifically, the present invention relates to an information processing apparatus that performs processing related to a neural network, an information processing method, and a program that causes a computer to execute the information processing method.

In recent years, in fields such as image recognition and voice recognition, development and research on neural networks that reproduce the mechanism of the human cranial nerve system on a computer are underway. For example, an information processing device has been proposed in which the amount of memory and the amount of calculation required for processing a neural network are obtained and displayed on a screen (see, for example, Patent Document 1).

International Publication No. 2017/138220

In the above-mentioned conventional technology, design support is provided when implementing a neural network in hardware by displaying the amount of memory and the amount of calculation. However, in the above-mentioned conventional technique, the design efficiency may decrease. For example, if the setting related to the calculation order of the functions in the neural network is changed, the amount of memory and the amount of calculation may fluctuate. In this case, in order to optimize the amount of memory and the like, it is necessary to repeat the operation of changing the setting, which may reduce the design efficiency due to the complicated operation.

This technology was created in view of this situation, and aims to improve design efficiency in information processing devices that support hardware design related to neural networks.

The present technology has been made to solve the above-mentioned problems, and the first aspect thereof is used in each of the plurality of nodes based on the mutual connection relationship of the plurality of nodes forming the neural network. An analysis unit that obtains a plurality of operation sequences for calculating a function to be performed, a cost acquisition unit that obtains a cost required for the operation of the function for each of the plurality of operation orders, and each of the plurality of operation orders and the above cost. This is an information processing device including a display control unit that displays the information in association with the display unit, an information processing method, and a program that causes a computer to execute the method. This has the effect of improving the design efficiency of hardware related to neural networks.

Further, in the first aspect, the display control unit selects one of the plurality of operation sequences according to the operation of the user, and the selected operation order, the cost corresponding to the operation order, and the connection relationship. May be displayed. This has the effect of displaying the selected arithmetic order, cost, and coupling relationship.

Further, in the first aspect, the display control unit may display a table in which each of the plurality of calculation sequences and the cost are associated with each other on the display unit. This has the effect of displaying a table in which each of the plurality of operation sequences is associated with the cost.

Also, in this first aspect, the cost may include at least one of memory capacity, number of memory accesses and memory bandwidth. This has the effect of displaying at least one of the memory capacity, the number of memory accesses, and the memory bandwidth for each calculation order.

Further, in this first aspect, the above cost may include the processing time. This has the effect of displaying the processing time for each calculation order.

Further, in this first aspect, the cost may include at least one of the usage efficiency of the arithmetic unit and the degree of parallelism of the arithmetic. This brings about the effect that at least one of the usage efficiency of the arithmetic unit and the degree of parallelism of the above arithmetic is displayed for each arithmetic order.

It is a block diagram which shows one configuration example of the design support system in the 1st Embodiment of this technique. It is a block diagram which shows one configuration example of the information processing apparatus in the 1st Embodiment of this technique. It is a figure which shows an example of the structure of the neural network in the 1st Embodiment of this technique. It is a figure which shows an example of the connection relation of layers in a neural network in 1st Embodiment of this technique. It is a figure for demonstrating the number of data held at the time of calculating the function of layer 1 in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 2 is calculated second in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 3 is calculated third in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 4 is calculated fourth in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 5 is calculated fifth in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 4 is calculated secondly in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 2 is calculated third in the 1st Embodiment of this technique. It is a figure for demonstrating the number of data held when the function of layer 3 is calculated fourth in the 1st Embodiment of this technique. It is a figure which shows an example of the screen before the display of the calculation result in the 1st Embodiment of this technique. It is a figure which shows an example of the display screen of the calculation result in the 1st Embodiment of this technique. It is a figure which shows an example of the display screen of another calculation result in the 1st Embodiment of this technique. It is a flowchart which shows an example of the operation of the information processing apparatus in the 1st Embodiment of this technique. It is a figure which shows an example of the display screen of the calculation result in the 2nd Embodiment of this technique.

Hereinafter, embodiments for carrying out the present technology (hereinafter referred to as embodiments) will be described. The explanation will be given in the following order.
1. 1. First Embodiment (Example of displaying the cost for each calculation order)
2. Second embodiment (example of displaying the cost for each calculation order in a table)

<1. First Embodiment>
[Configuration example of design support system]
FIG. 1 is a block diagram showing a configuration example of a design support system according to the first embodiment of the present technology. This design support system is a system for supporting the design when implementing the neural network in hardware. The design support system includes a learning framework 110, an information processing device 200, a conversion unit 120, and a hardware design tool 130.

The learning framework 110 generates an inference model based on the input data set. Data sets such as image data and audio data are input to the learning framework 110. The learning framework 110 performs machine learning based on the dataset and generates an inference model for predicting which class the input data belongs to. This inference model includes, for example, a program for reproducing a neural network on a computer and a coefficient file. The coefficient file sets the weighting coefficient for the data input / output to the nodes in the neural network. Further, as the neural network, for example, a convolutional neural network (CNN) is used.

Then, the learning framework 110 supplies the inference model (program and coefficient file) to the information processing device 200.

The information processing device 200 obtains the calculation order of the functions in the neural network based on the inference model. The information processing device 200 analyzes the inference model and obtains all the plurality of calculation sequences for calculating the function. Then, the information processing apparatus 200 obtains and displays the cost required for the calculation for each calculation order. The user examines the display contents, determines the optimum calculation order, operates the information processing apparatus 200, and causes the conversion unit 120 to output the calculation order sequence that defines the calculation order together with the program.

The conversion unit 120 converts the code into machine language. As the conversion unit 120, for example, a compiler is used. The conversion unit 120 generates a code for performing an operation in the operation order defined in the operation order sequence based on the program and the operation order sequence, and converts the code into a machine language. Then, the conversion unit 120 supplies the machine language to the hardware design tool 130.

The hardware design tool 130 generates a circuit in which a neural network is hardware-mounted based on a machine language.

Although the learning framework 110 and the conversion unit 120 are also provided outside the information processing device 200, these can also be arranged inside the information processing device 200.

[Configuration example of information processing device]
FIG. 2 is a block diagram showing a configuration example of the information processing apparatus 200 according to the first embodiment of the present technology. The information processing device 200 includes an analysis unit 210, a cost acquisition unit 220, a display control unit 230, a storage unit 240, a calculation order determination unit 250, and a display unit 260.

The storage unit 240 stores data used in the information processing device 200. A memory or the like is used as the storage unit 240.

The learning framework 110 outputs the neural network program 241 and the coefficient file 242, which are stored in the storage unit 240.

The analysis unit 210 analyzes the neural network program 241 and obtains all the calculation orders of the functions. The analysis unit 210 obtains a plurality of calculation sequences for calculating the function used in each of the nodes based on the mutual connection relationship of the plurality of nodes forming the neural network. Then, the analysis unit 210 supplies the data indicating the calculation order to the cost acquisition unit 220.

The cost acquisition unit 220 obtains the cost required for the operation of the function for each operation order. Here, the cost includes, for example, at least one of a memory capacity, a number of memory accesses, a memory bandwidth, a processing time, an arithmetic unit usage efficiency, and a degree of parallelism of arithmetic operations. The cost acquisition unit 220 reads out the neural network program 241 and the coefficient file 242 from the storage unit 240, and obtains the cost when each of the calculation orders is calculated by the predetermined hardware in that order. Then, the cost acquisition unit 220 supplies each of the calculation sequences to the display control unit 230 together with the corresponding cost.

The display control unit 230 causes the display unit 260 to display each of the calculation sequences and the corresponding costs.

The display unit 260 displays the screen according to the control of the display control unit 230. A liquid crystal monitor or the like is used as the display unit 260.

The calculation order determination unit 250 determines the calculation order according to the operation of the user. The user refers to and examines the cost for each calculation order displayed on the display unit 260, operates the information processing apparatus 200, and specifies the optimum calculation order. The operation order determination unit 250 generates an operation order sequence definition file 243 indicating a specified operation order according to a user operation, and stores the operation order sequence definition file 243 in the storage unit 240. The neural network program 241 and the arithmetic sequence sequence definition file 243 are output to the conversion unit 120.

Although the storage unit 240 and the display unit 260 are arranged in the information processing device 200, at least one of these can be provided outside the information processing device 200. When the storage unit 240 is provided externally, for example, the storage unit 240 may be arranged in the server via a network.

[Neural network configuration example]
FIG. 3 is a diagram showing an example of the structure of the neural network according to the first embodiment of the present technology. The figure shows an example of a neural network reproduced on a computer by the neural network program 241. This neural network is formed by a plurality of layers such as layers 1 to 6. In addition, a plurality of nodes such as node 301 are provided in each layer.

Also, each of the layers is combined with at least one other layer. For example, layer 1 is combined with

layers

2 and 4. Layer 2 is combined with

layers

1 and 3. Layer 3 is combined with

layers

2 and 5. Layer 4 is combined with

layers

1 and 5. Layer 5 is combined with

layers

3 and 6. The connection relationship between the layers is not limited to that illustrated in the figure.

Also, for example, it is assumed that the data set is input to layer 1 and the result is output from layer 6.

FIG. 4 is a diagram showing an example of the connection relationship of layers in the neural network in the first embodiment of the present technology. In this figure, the nodes in the layer of the neural network illustrated in FIG. 3 are omitted, and the functions in the layer are shown.

"Input Dataset" indicates that the data set is input to layer 1. In layer 1, "Convolution Kernels hepe: 5,5" indicates a convolution operation using a kernel. The kernel is also called a filter. "5,5" indicates the size of the kernel (filter). "Stride = 2" indicates the interval at which the kernel is applied. A neural network that performs a convolutional operation in this way is called a CNN.

In addition, "Batch Normalization" indicates a batch normalization process that forcibly optimizes the output of the layer so that the distribution of the parameters of the intermediate layer becomes appropriate. "ReLU (Rectified Linear Unit)" indicates a unit that uses a rectified linear function. Further, "64,14,14" indicates that the number of maps is 64 and the individual sizes of the feature maps are 14 × 14.

In layer 4, "Average Pooling" indicates an average pooling process in which the input feature map is divided into a plurality of pooling areas and the average value of each area is calculated.

In layer 5, "Add" indicates addition processing. In layer 6, "Affine" indicates an affine transformation process for obtaining the inner product of a matrix performed by forward propagation.

Note that the processing executed in each layer is not limited to the one illustrated in the figure. For example, in layer 1, an activation function other than the normalized linear function, such as a step function or a linear combination, can be used. Further, in layer 3, a pooling process other than the average pooling process, such as a max pooling process, can be performed.

As illustrated in the figure, a function is used in each layer. For example, a function representing the kernel or a rectified linear function is used. The order of operations of these functions can be changed based on the connection relationships between the layers. As described above, in the figure, the layer 2 of the subsequent stage is combined with the layer 3 of the subsequent stage, and the layer 5 of the subsequent stage is combined with the

layers

3 and 4. Therefore, if the operation of the function of the layer 2 is executed before the layer 3 and the operation of the functions of the

layers

3 and 4 is performed before the layer 5, the operation order of the layers 2 to 4 is arbitrary. When the operation order is changed, the cost required for the operation of the operation order may change.

First, consider the case where the calculation is performed in the order of

layers

2, 3 and 4 after layer 1.

FIG. 5 is a diagram for explaining the number of data held when the layer 1 function in the first embodiment of the present technology is calculated. In the figure, "Conv" indicates a convolution operation. "Act" indicates batch normalization processing and normalization linear function operations. It is assumed that 200 pieces of data are output by the calculation of layer 1. Since these data are used in

layers

2 and 4, they are stored in the storage unit 240.

FIG. 6 is a diagram for explaining the number of data held when the second layer 2 function is calculated in the first embodiment of the present technology. It is assumed that, for example, 150 pieces of data are output by the calculation of the function of the layer 2. Since these data are used in layer 3, they are stored in the storage unit 240. By adding 150 pieces of data, the number of data in the storage unit 240 becomes 350 pieces.

FIG. 7 is a diagram for explaining the number of data held when the third layer 3 function is calculated in the first embodiment of the present technology. It is assumed that 75 pieces of data are output by the calculation of the function of the layer 3. Since these data are used in layer 5, they are stored in the storage unit 240. By adding 75 data, the number of data in the storage unit 240 becomes 425. Then, the 150 data output in the layer 2 are deleted from the storage unit 240 because they are no longer needed. By deleting 150 data, the number of data in the storage unit 240 becomes 275.

FIG. 8 is a diagram for explaining the number of data held when the fourth layer 4 function is calculated in the first embodiment of the present technology. It is assumed that 50 pieces of data are output by the calculation of the function of the layer 4. Since these data are used in layer 5, they are stored in the storage unit 240. By adding 50 data, the number of data in the storage unit 240 becomes 325. Then, since the 200 data output in layer 1 are no longer needed, they are deleted from the storage unit 240. By deleting 200 pieces of data, the number of data in the storage unit 240 becomes 125 pieces.

FIG. 9 is a diagram for explaining the number of data held when the fifth layer 5 function is calculated in the first embodiment of the present technology. It is assumed that 75 pieces of data are output by the calculation of the function of the layer 5. These data are stored in the storage unit 240 for later calculation. By adding 75 data, the number of data in the storage unit 240 becomes 200. Then, the 125 data output in the

layers

3 and 5 are deleted from the storage unit 240 because they are no longer needed. By deleting 125 data, the number of data in the storage unit 240 becomes 75.

As illustrated in FIGS. 5 to 9, when the operations are performed in the order of

layers

2, 3 and 4, the maximum number of data in the storage unit 240 is 425 when the data of layer 3 is added in FIG. Is.

Next, consider the case where the calculation is performed in the order of

layers

4, 2, and 3 after layer 1.

FIG. 10 is a diagram for explaining the number of data held when the second layer 4 function is calculated in the first embodiment of the present technology. It is assumed that, for example, 50 pieces of data are output by the calculation of the function of the layer 4. Since these data are used in layer 5, they are stored in the storage unit 240. By adding 50 data, the number of data in the storage unit 240 becomes 250.

FIG. 11 is a diagram for explaining the number of data held when the third layer 2 function is calculated in the first embodiment of the present technology. It is assumed that 150 pieces of data are output by the calculation of the function of the layer 2. Since these data are used in layer 3, they are stored in the storage unit 240. By adding 150 pieces of data, the number of data in the storage unit 240 becomes 400 pieces. Then, since the 200 data output in layer 1 are no longer needed, they are deleted from the storage unit 240. By deleting 200 pieces of data, the number of data in the storage unit 240 becomes 200 pieces.

FIG. 12 is a diagram for explaining the number of data held when the fourth layer 3 function is calculated in the first embodiment of the present technology. It is assumed that 75 pieces of data are output by the calculation of the function of the layer 3. Since these data are used in layer 5, they are stored in the storage unit 240. By adding 75 data, the number of data in the storage unit 240 becomes 275. Then, the 150 data output in the layer 2 are deleted from the storage unit 240 because they are no longer needed. By deleting 150 pieces of data, the number of data pieces in the storage unit 240 becomes 125 pieces.

As illustrated in FIGS. 10 to 12, when the operations are performed in the order of

layers

4, 2, and 3, the maximum number of data in the storage unit 240 is 400 when the data of layer 2 is added in FIG. Is.

As illustrated in FIGS. 5 to 12, in the case where the calculation is performed in the order of

layers

2, 3 and 4, and the case where the calculation is performed in the order of

layers

4, 2 and 3, the storage unit 240 (memory or the like) is used. The maximum number of data to be retained is different. If the maximum value of the number of data is different, the memory capacity required for the calculation will also be different. For example, the memory capacity can be obtained by multiplying the maximum value of the number of data by the individual data sizes. In this way, the required memory capacity may change due to the change in the calculation order. The same applies to costs other than memory capacity (processing time, operating efficiency of arithmetic units, etc.).

Therefore, the information processing apparatus 200 obtains all the plurality of calculation orders for calculating the function used in the node based on the connection relationship between the layers (in other words, the connection relationship between the nodes). Then, as illustrated in FIGS. 5 to 12, the information processing apparatus 200 obtains the cost for each calculation order and displays it in association with the calculation order.

[Display example of information processing device]
FIG. 13 is a diagram showing an example of a screen before displaying the calculation result in the first embodiment of the present technology. In the display screen, a box 400 including the layer name and the function in the layer is displayed for each layer. When the layers are connected to each other, a line 401 indicating a connection relationship is wired between the corresponding boxes 400. Further, the operation button 421 for switching the calculation order and the operation button 422 for displaying the calculation result are displayed at predetermined positions.

The information processing device 200 can also edit the structure of the neural network according to the operation of the user. In editing, layers are added or deleted, the connection relationship between layers is changed, and functions in the layers are set. In addition, the information processing device 200 can also perform machine learning in the displayed neural network according to the user's operation. The GUI (Graphical User Interface) for these operations is omitted in the figure.

FIG. 14 is a diagram showing an example of a display screen of a calculation result according to the first embodiment of the present technology. FIG. 13 is a screen when the operation button 422 is operated in FIG. After the operation, a numerical mark 410 indicating the operation order of the corresponding function is displayed in the vicinity of each of the boxes 400. Further, a table 430 showing a cost name and a cost value is displayed at a predetermined position for each cost.

In the figure, numerical marks 410 of 1, 2, 3, 4, 5 and 6, respectively, are displayed in the vicinity of the boxes 400 of

layers

1, 2, 3, 4, 5 and 6. This means that the operations were performed in the order of

layers

1, 2, 3, 4, 5 and 6. The cost value (X1, etc.) in the table 430 indicates the value when the calculation is performed in this calculation order.

FIG. 15 is a diagram showing an example of another calculation result display screen according to the first embodiment of the present technology. FIG. 6 is a screen that can be switched when the user operates the operation button 421 on the display screen of FIG. The numerical marks in the thick frame in FIG. 15 indicate the places where the numerical values are switched. Numerical marks 410 of 1, 3, 4, 2, 5 and 6, respectively, are displayed in the vicinity of the boxes 400 of

layers

1, 4, 2, 3, 5 and 6. The cost value in the table 430 is switched to the value (X2, etc.) when the calculation is performed in this calculation order.

As illustrated in FIGS. 14 and 15, the information processing apparatus 200 selects one of a plurality of calculation sequences according to the user's operation, and the calculation sequence and the corresponding cost, and the connection relationship between the layers (nodes). Is displayed.

Then, the user compares the display screens of FIGS. 14 and 15 and determines the optimum calculation order. For example, when the memory capacity of the hardware to be mounted is relatively small, the operation order with the smaller required memory capacity is preferentially selected. Then, the user operates the information processing apparatus 200 to output an operation order sequence that defines the operation order.

Since the information processing device 200 covers all of the calculation order and displays the cost for each calculation order, the user can determine the optimum calculation order by referring to the display screen. Thereby, the design efficiency can be improved.

[Operation example of information processing device]
FIG. 16 is a flowchart showing an example of the operation of the information processing apparatus 200 according to the first embodiment of the present technology. This operation is started, for example, when a predetermined application for displaying the cost is executed.

The information processing device 200 analyzes the program and obtains the entire calculation order of the functions (step S901). Then, the information processing apparatus 200 obtains the cost for each calculation order (step S902). Subsequently, the information processing apparatus 200 displays the cost together with the calculation order and the connection relationship (step S903). The information processing device 200 outputs a sequence of calculation sequences according to a user operation (step S904). After step S904, the information processing apparatus 200 ends the operation for display.

As described above, according to the first embodiment of the present technology, since the information processing apparatus 200 seeks and displays the cost for each of the plurality of calculation sequences, the user refers to the display screen and displays the optimum calculation sequence. Can be determined. As a result, the design efficiency at the time of hardware mounting can be improved.

<2. Second Embodiment>
In the first embodiment described above, the information processing apparatus 200 displays the calculation order selected according to the operation of the user and the corresponding cost, but in this configuration, when comparing a plurality of calculation orders, the calculation order is displayed. It is necessary to perform an operation to switch the screen. The information processing apparatus 200 of the second embodiment is different from the first embodiment in that the information processing device 200 of the second embodiment displays a table in which each of the plurality of calculation sequences is associated with the cost to improve convenience.

FIG. 17 is a diagram showing an example of a display screen of a calculation result in the second embodiment of the present technology. As illustrated in the figure, a table in which each of the plurality of calculation sequences and the cost are associated with each other is displayed on the display screen. If run on the command line, this table will be displayed with standard output.

In the calculation order column, layers 1 to 6 are arranged in the calculation order. In addition, the cost values of the memory capacity, the processing time, and the usage efficiency of the arithmetic unit are displayed for each arithmetic order. For example, when performing calculations in the order of

layers

1, 2, 3, 4, 5, and 6, the memory capacity, processing time, and usage efficiency of the calculator are X1, Y1, and Z1, respectively. Further, when the calculation is performed in the order of

layers

1, 4, 2, 3, 5, and 6, the memory capacity, the processing time, and the usage efficiency of the calculator are X2, Y2, and Z2, respectively. When performing calculations in the order of

layers

1, 2, 4, 3, 5, and 6, the memory capacity, processing time, and usage efficiency of the calculator are X3, Y3, and Z3, respectively.

The information processing device 200 can also sort a plurality of calculation orders in ascending or descending order of costs specified by the user according to the user's operation. For example, when the user specifies the memory capacity among the memory capacity, the processing time, and the usage efficiency of the arithmetic unit, the information processing apparatus 200 sorts the arithmetic order in ascending order of the memory capacity.

Also, the contents of the table can be displayed as text. The information processing device 200 can also save the execution result in a file.

As illustrated in the figure, the information processing apparatus 200 displays a table in which each of the plurality of calculation sequences is associated with the cost, so that the user switches the screen when comparing the plurality of calculation sequences. It is not necessary and convenience can be improved.

The information processing device 200 adds an operation button for switching to the table display of the second embodiment on the display screens of FIGS. 13 to 16 of the first embodiment, and displays the information according to the user's operation. You can also switch.

As described above, according to the second embodiment of the present technology, the information processing apparatus 200 displays a table in which each of the plurality of calculation sequences is associated with the cost, so that the convenience of the user can be improved. can.

Note that the above-described embodiment shows an example for embodying the present technology, and the matters in the embodiment and the matters specifying the invention in the claims have a corresponding relationship with each other. Similarly, the matters specifying the invention within the scope of claims and the matters in the embodiment of the present technology having the same name have a corresponding relationship with each other. However, the present technology is not limited to the embodiment, and can be embodied by applying various modifications to the embodiment without departing from the gist thereof.

Further, the processing procedure described in the above-described embodiment may be regarded as a method having these series of procedures, or as a program for causing a computer to execute these series of procedures or as a recording medium for storing the program. You may catch it. As this recording medium, for example, a CD (Compact Disc), MD (MiniDisc), DVD (Digital Versatile Disc), memory card, Blu-ray Disc (Blu-ray (registered trademark) Disc) and the like can be used.

It should be noted that the effects described in the present specification are merely examples and are not limited, and other effects may be obtained.

The present technology can have the following configurations.
(1) An analysis unit that obtains a plurality of calculation sequences for calculating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network.
A cost acquisition unit for obtaining the cost required for the operation of the function for each of the plurality of operation sequences,
An information processing device including a display control unit that displays each of the plurality of calculation sequences and the cost in association with each other on the display unit.
(2) The display control unit selects one of the plurality of operation sequences according to the operation of the user and displays the selected operation order, the cost corresponding to the operation order, and the connection relationship (1). ) The information processing device described.
(3) The information processing device according to (1) or (2), wherein the display control unit displays a table in which each of the plurality of calculation sequences and the cost are associated with each other on the display unit.
(4) The information processing apparatus according to any one of (1) to (3) above, wherein the cost includes at least one of a memory capacity, a number of memory accesses, and a memory bandwidth.
(5) The information processing apparatus according to any one of (1) to (4), wherein the cost includes a processing time.
(6) The information processing apparatus according to any one of (1) to (5), wherein the cost includes at least one of the usage efficiency of the arithmetic unit and the degree of parallelism of the arithmetic operation.
(7) An analysis procedure for obtaining a plurality of operation sequences for calculating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network, and an analysis procedure.
A cost acquisition procedure for obtaining the cost required for the operation of the function for each of the plurality of operation sequences, and
An information processing method including a display control procedure for displaying each of the plurality of calculation sequences and the cost in association with each other on the display unit.
(8) An analysis procedure for obtaining a plurality of operation sequences for calculating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network, and an analysis procedure.
A cost acquisition procedure for obtaining the cost required for the operation of the function for each of the plurality of operation sequences, and
A program for causing a computer to execute a display control procedure for displaying each of the plurality of calculation sequences and the cost in association with each other on the display unit.

110 Learning framework 120 Conversion unit 130 Hardware design tool 200 Information processing device 210 Analysis unit 220 Cost acquisition unit 230 Display control unit 240 Storage unit 241 Neural network program 242 Coefficient file 243 Operation order sequence definition file 250 Operation order determination unit 260 Display Department

Claims

An analysis unit that obtains a plurality of operation orders for calculating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network.
A cost acquisition unit for obtaining the cost required for the operation of the function for each of the plurality of operation sequences,
An information processing device including a display control unit that displays each of the plurality of calculation sequences and the cost in association with each other on the display unit.
The information according to claim 1, wherein the display control unit selects one of the plurality of operation sequences according to a user operation and displays the selected operation order, the cost corresponding to the operation order, and the connection relationship. Processing equipment.
The information processing device according to claim 1, wherein the display control unit displays a table in which each of the plurality of calculation sequences and the cost are associated with each other on the display unit.
The information processing device according to claim 1, wherein the cost includes at least one of a memory capacity, a number of memory accesses, and a memory bandwidth.
The information processing device according to claim 1, wherein the cost includes processing time.
The information processing device according to claim 1, wherein the cost includes at least one of the usage efficiency of the arithmetic unit and the degree of parallelism of the arithmetic.
An analysis procedure for obtaining a plurality of operation sequences for operating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network, and an analysis procedure.
A cost acquisition procedure for obtaining the cost required for the operation of the function for each of the plurality of operation sequences, and
An information processing method including a display control procedure for displaying each of the plurality of calculation sequences and the cost in association with each other on the display unit.
An analysis procedure for obtaining a plurality of operation sequences for operating a function used in each of the plurality of nodes based on the mutual connection relationship of a plurality of nodes forming a neural network, and an analysis procedure.
A cost acquisition procedure for obtaining the cost required for the operation of the function for each of the plurality of operation sequences, and
A program for causing a computer to execute a display control procedure for displaying each of the plurality of calculation sequences and the cost in association with each other on the display unit.