JPWO2020202313A1

JPWO2020202313A1 - Neural network data compression device and data compression method

Info

Publication number: JPWO2020202313A1
Application number: JP2021511693A
Authority: JP
Inventors: 誠也柴田; 博昭五十嵐; 芙美代鷹野; 崇竹中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2019-03-29
Filing date: 2019-03-29
Publication date: 2021-12-16
Anticipated expiration: 2039-03-29
Also published as: JP7218796B2; WO2020202313A1

Abstract

データ圧縮装置７０１は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路７１１と、読み出されたデータのデータ圧縮を行うデータ圧縮回路７１２とを備え、読出し回路７１１は、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す。The data compression device 701 is a data compression device for a neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and is a storage in which the output data of the partial neural network is stored. The read circuit 711 is provided with a read circuit 711 that reads data from the unit and a data compression circuit 712 that compresses the read data. Read the data of.

Description

本発明は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置およびデータ圧縮方法に関する。 The present invention relates to a data compression device and a data compression method for a neural network that reduces the amount of data of the output data of a partial neural network created by dividing one neural network.

ニューラルネットワークは、多くの分野で使用されている。ニューラルネットワークにおける演算量、特に、多数の隠れ層を有する多層ニューラルネットワークにおける演算量は膨大であるため、計算能力が高いデータセンタに設置されたサーバや、クラウドサーバで演算が実行される。 Neural networks are used in many fields. Since the amount of calculation in a neural network, particularly in a multi-layer neural network having a large number of hidden layers, is enormous, the calculation is executed by a server installed in a data center having high computing power or a cloud server.

しかし、ユーザとサーバとの間のトラフィックに輻輳が生じたり、サーバの消費電力が高くなるといった問題が生ずる。そこで、エッジコンピューティングを利用することが考案されている。具体的には、ニューラルネットワークを分割し、分割で生成された１つの部分ニューラルネットワーク（前半ニューラルネットワークという。）の演算をエッジデバイスが実行し、例えばクラウドサーバが他の部分ニューラルネットワーク（後半ニューラルネットワークという。）の演算を実行する（例えば、非特許文献１参照）。その場合、前半ニューラルネットワークにおける最終層の出力である特徴量データが、通信ネットワークを介して、エッジデバイスからクラウドサーバに転送される。なお、非特許文献１では、ニューラルネットワークは畳み込みニューラルネットワーク（ＣＮＮ：Convolutional Neural Network）であり、前半ニューラルネットワークにおける最終層は、あるプーリング層である。 However, there are problems such as congestion in the traffic between the user and the server and high power consumption of the server. Therefore, it is devised to use edge computing. Specifically, the neural network is divided, and the edge device executes the calculation of one partial neural network (referred to as the first half neural network) generated by the division. For example, the cloud server performs the operation of the other partial neural network (second half neural network). (See, for example, Non-Patent Document 1). In that case, the feature amount data which is the output of the final layer in the first half neural network is transferred from the edge device to the cloud server via the communication network. In Non-Patent Document 1, the neural network is a convolutional neural network (CNN), and the final layer in the first half neural network is a certain pooling layer.

非特許文献１には、エッジデバイスからクラウドサーバに転送されるデータ量を削減するために、特徴量データに対してデータ圧縮処理を施すことが記載されている。データ圧縮処理では、まず、ＨＥＶＣ（High Efficiency Video Coding）規格に基づく符号化が実行可能になるように、小さいサイズのチャネル画像データが複数チャネル存在する特徴量データをタイル状に並べて、大きいサイズのチャネル画像データを１チャネルまたは３チャネルもつ画像データに変換する処理が行われる。その後、ＨＥＶＣ規格に基づく符号化が行われる。 Non-Patent Document 1 describes that the feature amount data is subjected to data compression processing in order to reduce the amount of data transferred from the edge device to the cloud server. In the data compression process, first, feature data in which multiple channels of small-sized channel image data exist are arranged in a tile so that coding based on the HEVC (High Efficiency Video Coding) standard can be performed, and the large-sized channel image data is arranged in a tile. A process of converting channel image data into image data having one channel or three channels is performed. After that, coding based on the HEVC standard is performed.

図１５は、非特許文献１に記載されている特徴量データのデータ圧縮処理を説明するための説明図である。図１５（ａ）には、複数チャネルからなる特徴量データを、フレームfi, fi+1, fi+2の３フレームそれぞれについて時系列順に並べた様子が示されている。例えば、出力ウィンドウのサイズが縦横ｎ×ｎ（ｎ：２以上の自然数）であり、チャネル数がＫ（Ｋ：２以上の自然数）であるとする。ここで、出力ウィンドウのサイズｎがデータ圧縮処理方式にとって不適である可能性がある。例えば、非特許文献１では、ＨＥＶＣエンコーダはＣＴＵサイズを符号化の最小単位とするため、特徴量データのサイズｎがＣＴＵサイズ以下である場合にはそのままではＨＥＶＣエンコーダを適用できない問題が指摘されている。そのような場合への対応方法として、非特許文献１には、複数チャネル画像をタイル状にまとめて１枚の画像にする方法が示されている。 FIG. 15 is an explanatory diagram for explaining the data compression process of the feature amount data described in Non-Patent Document 1. FIG. 15A shows how the feature data composed of a plurality of channels are arranged in chronological order for each of the three frames fi, fi + 1, and fi + 2. For example, it is assumed that the size of the output window is n × n (n: a natural number of 2 or more) and the number of channels is K (K: a natural number of 2 or more). Here, the size n of the output window may be unsuitable for the data compression processing method. For example, in Non-Patent Document 1, since the HEVC encoder uses the CTU size as the minimum unit for coding, it has been pointed out that the HEVC encoder cannot be applied as it is when the feature amount data size n is smaller than the CTU size. There is. As a method for dealing with such a case, Non-Patent Document 1 discloses a method of combining a plurality of channel images into a single image in a tile shape.

図１５（ｂ）には、各フレームについて、９チャネル分の出力ウィンドウがタイル状に並べられた状態が示されている。タイル状に並べられた複数の出力ウィンドウが１つのフレームと見なされる。そして、そのフレームを対象として符号化が実行される。以下、符号化される前のデータ、すなわち、前半ニューラルネットワークの出力データを中間データという。 FIG. 15B shows a state in which output windows for 9 channels are arranged in tiles for each frame. Multiple output windows arranged in tiles are considered as one frame. Then, the coding is executed for the frame. Hereinafter, the data before being encoded, that is, the output data of the first half neural network is referred to as intermediate data.

T. Mitani et al.,"Compression and Aggregation for Optimizing Information Transmission in Distributed CNN", IEEE, 2017 fifth International Symposium on Computing and Networking, 19-22 Nov. 2017T. Mitani et al., "Compression and Aggregation for Optimizing Information Transmission in Distributed CNN", IEEE, 2017 fifth International Symposium on Computing and Networking, 19-22 Nov. 2017

図１６は、９チャネル分の出力ウィンドウがタイル状に並べられた状態の１フレームを示す説明図である。図１６において、矢印は、符号化順を示す。ただし、図１６には、第１〜第３出力チャネル（図１６（ｂ）において、１〜３の数字が付された出力チャネル）のみについての符号化順が示されている。また、出力ウィンドウのサイズが３×３である場合が例示されている。 FIG. 16 is an explanatory diagram showing one frame in which output windows for 9 channels are arranged in a tile. In FIG. 16, the arrows indicate the coding order. However, FIG. 16 shows the coding order only for the first to third output channels (output channels with numbers 1 to 3 in FIG. 16B). Further, the case where the size of the output window is 3 × 3 is illustrated.

例えば、符号化順は、第１出力チャネルの１行目→第２出力チャネルの１行目→第３出力チャネルの１行目→第１出力チャネルの２行目→・・・である。ＣＮＮを例にすると、畳み込み層の出力およびプーリング層の出力は、次層に対する特徴量データとして、メモリに格納される。メモリとしてＲＡＭ（Random Access Memory）が用いられる。一般に、格納アドレスの順序は、第１出力チャネルのデータ→第２出力チャネルのデータ→・・・→第Ｋ出力チャネルのデータである。 For example, the coding order is the first line of the first output channel → the first line of the second output channel → the first line of the third output channel → the second line of the first output channel → ... Taking CNN as an example, the output of the convolution layer and the output of the pooling layer are stored in the memory as feature data for the next layer. RAM (Random Access Memory) is used as the memory. Generally, the order of the storage addresses is the data of the first output channel → the data of the second output channel → ... → the data of the Kth output channel.

すなわち、メモリにおけるデータの格納順と符号化順とは、同じではない。すると、メモリへのデータの書き込み順またはメモリからのデータの読み出し順を、符号化順に合わせる処理の実行が求められる。そのような処理が実行されるので、前半ニューラルネットワークにおける処理時間が増加する。また、メモリ管理が複雑になる。換言すれば、前半ニューラルネットワークの処理（符号化処理を含む。）の効率が低下する。 That is, the data storage order and the coding order in the memory are not the same. Then, it is required to execute the process of matching the order of writing data to the memory or the order of reading data from the memory to the coding order. Since such processing is executed, the processing time in the first half neural network increases. In addition, memory management becomes complicated. In other words, the efficiency of the processing (including the coding processing) of the first half neural network is reduced.

本発明は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を効率よく削減することを目的とする。 An object of the present invention is to efficiently reduce the amount of output data of a partial neural network created by dividing one neural network.

本発明によるニューラルネットワークのデータ圧縮装置は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路と、読み出されたデータのデータ圧縮を行うデータ圧縮回路とを含み、読出し回路は、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す。 The data compression device of the neural network according to the present invention is a data compression device of the neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and is the output data of the partial neural network. The read circuit includes a read circuit for reading data from the storage unit in which the data is stored and a data compression circuit for compressing the read data. Read the data of the channel of.

本発明による他の態様のニューラルネットワークのデータ圧縮装置は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路と、読み出されたデータのデータ圧縮を行うデータ圧縮回路とを含み、データ圧縮回路は、類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行う符号化部と、他の類似チャネルのデータの、一の類似チャネルのデータとの差分を符号化する差分符号化部とを有する。 The data compression device of the neural network of another aspect according to the present invention is a data compression device of the neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and is a partial neural. The data compression circuit includes a read circuit for reading data from a storage unit in which output data of the network is stored and a data compression circuit for compressing the read data. Encodes the difference between the data of one similar channel and the data of another similar channel and the coding unit that compresses the data of one of the two or more similar channels that are similar. It has a difference coding unit.

本発明によるニューラルネットワークのデータ圧縮方法は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮方法であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出し、読み出されたデータのデータ圧縮を行い、記憶部からデータを読み出すときに、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す。 The data compression method of the neural network according to the present invention is a data compression method of the neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and is the output data of the partial neural network. Data is read from the storage unit in which the data is stored, data is compressed from the read data, and when the data is read from the storage unit, after all the data in one channel has been read out, the data in the other channel is completed. Is read.

本発明による他の態様のニューラルネットワークのデータ圧縮方法は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮方法であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出し、読み出されたデータのデータ圧縮を行い、データ圧縮を行うときに、類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行い、他の類似チャネルのデータの、一の類似チャネルのデータとの差分を符号化する。 Another aspect of the data compression method for a neural network according to the present invention is a data compression method for a neural network that reduces the amount of output data of a partial neural network created by dividing one neural network, and is a partial neural network. When data is read from the storage unit where the output data of the network is stored, the data of the read data is compressed, and the data is compressed, two or more of the output data are similar, which is indicated by the similar channel information. Data compression of the data of one of the similar channels in is performed, and the difference between the data of the other similar channels and the data of one similar channel is encoded.

本発明によるニューラルネットワークのデータ圧縮プログラムは、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮プログラムであって、コンピュータに、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す処理と、読み出されたデータのデータ圧縮を行う処理とを実行させ、記憶部からデータを読み出すときに、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出させる。 The data compression program of the neural network according to the present invention is a data compression program of the neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and is a data compression program of the partial neural network to a computer. When the process of reading data from the storage unit in which the output data of the above is stored and the process of compressing the read data are executed and the data is read from the storage unit, all the data in one channel is read out. Is completed, and then the data of other channels is read out.

本発明による他の態様のニューラルネットワークのデータ圧縮プログラムは、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮プログラムであって、コンピュータに、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す処理と、読み出されたデータのデータ圧縮を行う処理とを実行させ、データ圧縮を行うときに、類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行う処理と、他の類似チャネルのデータの、一の類似チャネルのデータとの差分を符号化する処理とを実行させる。 Another aspect of the neural network data compression program according to the present invention is a neural network data compression program that reduces the amount of output data of a partial neural network created by dividing one neural network into a computer. , The process of reading data from the storage unit in which the output data of the partial neural network is stored and the process of compressing the read data are executed, and when the data is compressed, it is indicated by similar channel information. Encoding the difference between the process of compressing the data of one of the two or more similar channels whose output data is similar and the data of the other similar channel with the data of one similar channel. To execute the processing to be performed.

本発明によれば、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量が効率よく削減される。 According to the present invention, the amount of data of the output data of the partial neural network created by dividing one neural network is efficiently reduced.

ニューラルネットワークシステムの一例を示すブロック図である。It is a block diagram which shows an example of a neural network system. ＣＮＮの一例を示す説明図である。It is explanatory drawing which shows an example of CNN. 第１の実施形態におけるデータ圧縮部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the data compression part in 1st Embodiment. 第１の実施形態における符号化順序を説明するための説明図である。It is explanatory drawing for demonstrating the coding order in 1st Embodiment. 第１の実施形態におけるデータ圧縮部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of the operation of the data compression part in 1st Embodiment. 第２の実施形態におけるデータ圧縮部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the data compression part in 2nd Embodiment. 第２の実施形態における符号化順序を説明するための説明図である。It is explanatory drawing for demonstrating the coding order in 2nd Embodiment. 第２の実施形態におけるデータ圧縮部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of the operation of the data compression part in 2nd Embodiment. 第３の実施形態におけるデータ圧縮部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the data compression part in 3rd Embodiment. 第３の実施形態におけるデータ圧縮部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of the operation of the data compression part in 3rd Embodiment. ニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。It is a block diagram which shows the main part of the data compression apparatus of a neural network. 他の態様のニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。It is a block diagram which shows the main part of the data compression apparatus of the neural network of another aspect. の態様のニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。It is a block diagram which shows the main part of the data compression apparatus of the neural network of the aspect of. ＣＰＵを有するコンピュータの一例を示すブロック図である。It is a block diagram which shows an example of the computer which has a CPU. 従来の特徴量データのデータ圧縮処理を説明するための説明図である。It is explanatory drawing for demonstrating the data compression processing of the conventional feature quantity data. ９チャネル分の出力ウィンドウがタイル状に並べられた状態の１フレームを示す説明図である。It is explanatory drawing which shows one frame in the state that the output windows for 9 channels are arranged in a tile shape.

図１は、ニューラルネットワークシステムの一例を示すブロック図である。ニューラルネットワークシステムは、ニューラルネットワーク装置、通信ネットワーク５００およびクラウドサーバ６００を含む。ニューラルネットワーク装置は、前半ニューラルネットワーク２００、データ圧縮部１００、および送信部４００を含む。 FIG. 1 is a block diagram showing an example of a neural network system. The neural network system includes a neural network device, a communication network 500 and a cloud server 600. The neural network device includes a first half neural network 200, a data compression unit 100, and a transmission unit 400.

前半ニューラルネットワーク２００は、１つ以上の畳み込み層を含む。なお、ここでは、ニューラルネットワークとして、ＣＮＮを例にする。データ圧縮部１００は、前半ニューラルネットワーク２００が作成した中間データのデータ量を削減するための処理を実行する。送信部４００は、データ圧縮部１００が出力するデータを、通信ネットワーク５００を介してクラウドサーバ６００に送信する。 The first half neural network 200 includes one or more convolution layers. Here, CNN is taken as an example as a neural network. The data compression unit 100 executes a process for reducing the amount of intermediate data created by the first half neural network 200. The transmission unit 400 transmits the data output by the data compression unit 100 to the cloud server 600 via the communication network 500.

なお、後半ニューラルネットワークの演算を実行する演算手段としてクラウドサーバ６００を例にするが、演算手段は、クラウドサーバ６００に限られない。演算手段は、例えば、オンプレミスのサーバであってもよい。 The cloud server 600 is taken as an example of the calculation means for executing the calculation of the latter half neural network, but the calculation means is not limited to the cloud server 600. The arithmetic means may be, for example, an on-premises server.

前半ニューラルネットワーク２００は、積和演算部２１０、記憶部２２０および選択部２３０を含む。積和演算部２１０は、記憶部２２０に記憶されている特徴量データと重み係数とを用いて畳み込み層の演算を実行し、演算結果を記憶部２２０に格納する。なお、記憶部２２０として、ＲＡＭが用いられる。 The first half neural network 200 includes a product-sum calculation unit 210, a storage unit 220, and a selection unit 230. The product-sum calculation unit 210 executes the operation of the convolution layer using the feature amount data stored in the storage unit 220 and the weighting coefficient, and stores the calculation result in the storage unit 220. A RAM is used as the storage unit 220.

選択部２３０は、プーリング層を実現する。すなわち、選択部２３０は、記憶部２２０に格納された積和演算部２１０の所定数の演算結果毎に、１つの演算結果を選択する。選択部２３０は、例えば、所定数の演算結果のうちの最大値を選択する。なお、選択部２３０は、例えば、所定数の演算結果の平均値を選択してもよい。 The selection unit 230 realizes a pooling layer. That is, the selection unit 230 selects one calculation result for each predetermined number of calculation results of the product-sum calculation unit 210 stored in the storage unit 220. The selection unit 230 selects, for example, the maximum value among a predetermined number of calculation results. The selection unit 230 may select, for example, an average value of a predetermined number of calculation results.

図２は、ＣＮＮの一例を示す説明図である。図２に示す例では、ＣＮＮは、入力データを対象とする第１畳み込み層、第１プーリング層、第２畳み込み層、第２プーリング層、第３畳み込み層、第４畳み込み層、第５畳み込み層、および第５プーリング層を含む。 FIG. 2 is an explanatory diagram showing an example of CNN. In the example shown in FIG. 2, the CNN is a first convolution layer, a first pooling layer, a second convolution layer, a second pooling layer, a third convolution layer, a fourth convolution layer, and a fifth convolution layer for input data. , And a fifth pooling layer.

そして、前半ニューラルネットワーク２００は、一例として、第１畳み込み層、第１プーリング層、第２畳み込み層、第２プーリング層、第３畳み込み層、第４畳み込み層、第５畳み込み層、および第５プーリング層で構成される。また、後半ニューラルネットワーク３００は、第５プーリング層よりも後の複数の層を含む。 The first half neural network 200, for example, has a first convolution layer, a first pooling layer, a second convolution layer, a second pooling layer, a third convolution layer, a fourth convolution layer, a fifth convolution layer, and a fifth pooling. Consists of layers. Further, the latter half neural network 300 includes a plurality of layers after the fifth pooling layer.

実施形態１．
図３は、第１の実施形態におけるデータ圧縮部の構成例を示すブロック図である。図３に示すデータ圧縮部１００Ａは、読出し回路１１１と符号化部１１２とを有する。なお、データ圧縮部１００Ａは、図１に示されたニューラルネットワーク装置におけるデータ圧縮部１００に相当する。Embodiment 1.
FIG. 3 is a block diagram showing a configuration example of the data compression unit according to the first embodiment. The data compression unit 100A shown in FIG. 3 has a read circuit 111 and a coding unit 112. The data compression unit 100A corresponds to the data compression unit 100 in the neural network device shown in FIG.

読出し回路１１１は、記憶部２２０から特徴量データを読み出す。符号化部１１２は、読み出された特徴量データを符号化する。符号化部１１２は、例えば、ＨＥＶＣ規格に基づく符号化を行う。なお、本実施形態では、符号化部１１２は、ＨＥＶＣ規格に基づいて符号化処理を実行するが、ＨＥＶＣ規格に基づく符号化は一例であり、符号化部１１２は、他の符号化方式を用いてもよい。例えば、ＪＶＥＴ（Joint Video Experts Team）で検討されている次世代符号化方式が用いられてもよい。 The read circuit 111 reads the feature amount data from the storage unit 220. The coding unit 112 encodes the read feature amount data. The coding unit 112 performs coding based on, for example, the HEVC standard. In the present embodiment, the coding unit 112 executes the coding process based on the HEVC standard, but the coding based on the HEVC standard is an example, and the coding unit 112 uses another coding method. You may. For example, the next-generation coding method studied by JVET (Joint Video Experts Team) may be used.

図４は、本実施形態における符号化順序を説明するための説明図である。本実施形態では、前半ニューラルネットワーク２００の出力データを、チャネル別に独立した動画と見なす。図４（ａ）には、複数チャネルからなる特徴量データが、フレームfi, fi+1, fi+2の３フレームそれぞれについて時系列順に並べられた様子が示されている。例えば、出力ウィンドウのサイズが縦横ｎ×ｎ（ｎ：２以上の自然数）であり、チャネル数がＫ（Ｋ：２以上の自然数）であるとする。ここで、出力ウィンドウのサイズｎがデータ圧縮処理方式にとって不適である可能性がある。上述したように、ＨＥＶＣエンコーダはＣＴＵサイズを符号化の最小単位とするため、特徴量データのサイズｎがＣＴＵサイズ以下である場合にはそのままではＨＥＶＣエンコーダを適用できない。 FIG. 4 is an explanatory diagram for explaining the coding order in the present embodiment. In the present embodiment, the output data of the first half neural network 200 is regarded as an independent moving image for each channel. FIG. 4A shows how the feature data composed of a plurality of channels are arranged in chronological order for each of the three frames fi, fi + 1, and fi + 2. For example, it is assumed that the size of the output window is n × n (n: a natural number of 2 or more) and the number of channels is K (K: a natural number of 2 or more). Here, the size n of the output window may be unsuitable for the data compression processing method. As described above, since the HEVC encoder uses the CTU size as the minimum unit for coding, the HEVC encoder cannot be applied as it is when the size n of the feature amount data is smaller than or equal to the CTU size.

そこで、図４（ｂ）に示すように、記憶部２２０から読み出されたデータが短冊状に横方向に並べられる。なお、短冊は複数個ある（図４（ｂ）に示す例では４つ）。図４（ｂ）において、数字は、短冊の番号を示す。１つの短冊には、あるフレーム中の同一チャネルのデータが設定される。また、ある短冊へのデータの配列が終了すると、次のチャネルデータが配列される。 Therefore, as shown in FIG. 4B, the data read from the storage unit 220 are arranged in a strip shape in the horizontal direction. There are a plurality of strips (4 in the example shown in FIG. 4B). In FIG. 4B, the numbers indicate the strip numbers. Data of the same channel in a certain frame is set in one strip. Also, when the arrangement of data in a strip is completed, the next channel data is arranged.

次に、データ圧縮部１００Ａの動作を説明する。図５は、データ圧縮部１００Ａの動作の一例を示すフローチャートである。なお、フレーム数をＦ（Ｆ：２以上の自然数）とする。 Next, the operation of the data compression unit 100A will be described. FIG. 5 is a flowchart showing an example of the operation of the data compression unit 100A. The number of frames is F (F: a natural number of 2 or more).

読出し回路１１１は、まず、フレーム数に関する変数ｆに１を設定し（ステップＳ１０１）、チャネル数に関する変数ｋに１を設定する（ステップＳ１０２）。 First, the read circuit 111 sets the variable f regarding the number of frames to 1 (step S101) and sets the variable k regarding the number of channels to 1 (step S102).

読出し回路１１１は、第ｆフレームの第ｋチャネルのデータを記憶部２２０から読み出す（ステップＳ１０３）。なお、出力ウィンドウのサイズが３×３である場合を例にすると、データの読出し順は、第ｆフレームの第１出力チャネルの１行目→第１出力チャネルの２行目→第１出力チャネルの３行目→・・・である。 The read circuit 111 reads the data of the kth channel of the fth frame from the storage unit 220 (step S103). Taking the case where the size of the output window is 3 × 3 as an example, the data reading order is as follows: 1st line of the 1st output channel of the fth frame → 2nd line of the 1st output channel → 1st output channel. The third line of → ...

そして、読出し回路１１１は、記憶回路としての一時記憶部（図示せず）においてデータが短冊状に横方向に並べられるように、読み出されたデータを格納する。そして、読出し回路１１１は、ｋの値を１増やす（ステップＳ１０４）。なお、一時記憶部は、読出し回路１１１に内蔵されるか、または、読出し回路１１１と符号化部１１２との間に設けられる。 Then, the read circuit 111 stores the read data so that the data can be arranged in a strip shape in the temporary storage unit (not shown) as a storage circuit. Then, the read circuit 111 increments the value of k by 1 (step S104). The temporary storage unit is built in the reading circuit 111, or is provided between the reading circuit 111 and the coding unit 112.

ｋの値がＫを越えている場合には（ステップＳ１０５）、全チャネルのデータについて読出しが完了したことになるので、符号化部１１２は、一時記憶部に格納されているデータについて符号化処理を実行する（ステップＳ１０６）。ｋの値がＫを越えていないときには、ステップＳ１０３に戻る。 If the value of k exceeds K (step S105), the reading of the data of all channels is completed, so that the coding unit 112 encodes the data stored in the temporary storage unit. Is executed (step S106). If the value of k does not exceed K, the process returns to step S103.

ステップＳ１０６の処理が実行された後、読出し回路１１１は、ｆの値を１増やす（ステップＳ１０７）。ｆの値がＦを越えていないときには（ステップＳ１０８）、ステップＳ１０２に戻る。 After the process of step S106 is executed, the read circuit 111 increments the value of f by 1 (step S107). When the value of f does not exceed F (step S108), the process returns to step S102.

ｆの値がＦを越えている場合には、処理を終了する。 If the value of f exceeds F, the process ends.

本実施形態では、フレーム毎に複数チャネルのデータがまとめて１つの画像と見なされる。なお、図５に示された例では、フレーム毎に全チャネルのデータがまとめて１つの画像と見なされるが、データ圧縮部１００Ａは、全チャネル数よりも少ない数の所定数のチャネルのデータをまとめて１つの画像と見なすようにしてもよい。 In the present embodiment, the data of a plurality of channels are collectively regarded as one image for each frame. In the example shown in FIG. 5, the data of all channels are collectively regarded as one image for each frame, but the data compression unit 100A collects data of a predetermined number of channels smaller than the total number of channels. You may consider them together as one image.

そして、本実施形態では、データを記憶部２２０から読み出された同じチャネルのデータが横方向に短冊状に配置された後に、符号化処理が実行される。そして、記憶部２２０からのデータの読出し順は、記憶部２２０におけるデータの格納順（アドレス順）と整合している。したがって、記憶部２２０へのデータの書き込み順または記憶部２２０からのデータの読み出し順を、符号化順に合わせる処理の実行が不要になる。その結果、前半ニューラルネットワークの処理（符号化処理を含む。）の効率の低下が抑制される。 Then, in the present embodiment, the coding process is executed after the data of the same channel read from the storage unit 220 is arranged in a strip shape in the horizontal direction. The order of reading data from the storage unit 220 is consistent with the order of storing data (address order) in the storage unit 220. Therefore, it is not necessary to execute the process of matching the order of writing data to the storage unit 220 or the order of reading data from the storage unit 220 to the coding order. As a result, the decrease in efficiency of the processing (including the coding processing) of the first half neural network is suppressed.

実施形態２．
図６は、第２の実施形態におけるデータ圧縮部の構成例を示すブロック図である。図６に示すデータ圧縮部１００Ｂは、読出し回路１１３と符号化部１１２とを有する。なお、データ圧縮部１００Ｂは、図１に示されたニューラルネットワーク装置におけるデータ圧縮部１００に相当する。Embodiment 2.
FIG. 6 is a block diagram showing a configuration example of the data compression unit according to the second embodiment. The data compression unit 100B shown in FIG. 6 has a read circuit 113 and a coding unit 112. The data compression unit 100B corresponds to the data compression unit 100 in the neural network device shown in FIG.

読出し回路１１３は、記憶部２２０から特徴量データを読み出す。符号化部１１２は、読み出された特徴量データを符号化する。符号化部１１２は、例えば、ＨＥＶＣ規格に基づく符号化を行う。なお、本実施形態では、符号化部１１２は、ＨＥＶＣ規格に基づいて符号化処理を実行するが、ＨＥＶＣ規格に基づく符号化は一例であり、符号化部１１２は、他の符号化方式を用いてもよい。 The read circuit 113 reads the feature amount data from the storage unit 220. The coding unit 112 encodes the read feature amount data. The coding unit 112 performs coding based on, for example, the HEVC standard. In the present embodiment, the coding unit 112 executes the coding process based on the HEVC standard, but the coding based on the HEVC standard is an example, and the coding unit 112 uses another coding method. You may.

図７は、本実施形態における符号化順序を説明するための説明図である。本実施形態では、前半ニューラルネットワーク２００の出力データを、チャネル別に独立した動画と見なす。図７（ａ）には、１フレームの出力がチャネル別に独立した動画と見なされる様子が示されている。例えば、出力ウィンドウのサイズが縦横ｎ×ｎ（ｎ：２以上の自然数）であり、チャネル数がＫ（Ｋ：２以上の自然数）であるとすると、ｎ×ｎの動画がＫ種類あると見なされる。上述したように、ＨＥＶＣエンコーダを適用するためにはｎがＣＴＵサイズ以上である必要がある。本実施形態では、ｎがＣＴＵサイズ以上であることが前提である。 FIG. 7 is an explanatory diagram for explaining the coding order in the present embodiment. In the present embodiment, the output data of the first half neural network 200 is regarded as an independent moving image for each channel. FIG. 7A shows how the output of one frame is regarded as an independent moving image for each channel. For example, if the size of the output window is n × n (n: a natural number of 2 or more) and the number of channels is K (K: a natural number of 2 or more), it is considered that there are K types of n × n videos. Is done. As mentioned above, n needs to be CTU size or larger in order to apply the HEVC encoder. In this embodiment, it is premised that n is CTU size or more.

本実施形態では、図７（ｂ）に示すように、全てのフレームの同一チャネルの出力データが、１つの映像と見なされる。そして、各映像の各々が独立して符号化される。なお、図７（ｂ）には、各々の画像の符号化データが、第ｋビットストリーム（ｋ：１〜Ｋ）として表現されている。 In this embodiment, as shown in FIG. 7B, the output data of the same channel of all frames is regarded as one video. Then, each of the images is independently encoded. In FIG. 7B, the coded data of each image is represented as a k-th stream (k: 1 to K).

次に、データ圧縮部１００Ｂの動作を説明する。図８は、データ圧縮部１００Ｂの動作の一例を示すフローチャートである。なお、フレーム数をＦ（Ｆ：２以上の自然数）とする。また、前半ニューラルネットワーク２００が、Ｋチャネルのデータを出力する場合を例にする。 Next, the operation of the data compression unit 100B will be described. FIG. 8 is a flowchart showing an example of the operation of the data compression unit 100B. The number of frames is F (F: a natural number of 2 or more). Further, the case where the first half neural network 200 outputs K channel data is taken as an example.

読出し回路１１３は、まず、チャネル数に関する変数ｋに１を設定し（ステップＳ２０１）、フレーム数に関する変数ｆに１を設定する（ステップＳ２０２）。 First, the read circuit 113 sets the variable k related to the number of channels to 1 (step S201), and sets the variable f related to the number of frames to 1 (step S202).

読出し回路１１３は、第ｋチャネルの第ｆフレームのデータを記憶部２２０から読み出す（ステップＳ２０３）。なお、出力ウィンドウのサイズが３×３である場合を例にすると、データの読出し順は、第１の実施形態の場合と同様、第１フレームの第１出力チャネルの１行目→第１出力チャネルの２行目→第１出力チャネルの３行目→第２フレームの第１出力チャネルの１行目→・・・である。 The read circuit 113 reads the data of the fth frame of the kth channel from the storage unit 220 (step S203). Taking the case where the size of the output window is 3 × 3 as an example, the data reading order is the same as in the case of the first embodiment, from the first line of the first output channel of the first frame to the first output. The second line of the channel → the third line of the first output channel → the first line of the first output channel of the second frame → ...

そして、読出し回路１１３は、ｆの値を１増やす（ステップＳ２０４）。ｆの値がＦを越えていないときには（ステップＳ２０５）、ステップＳ２０３に戻る。 Then, the read circuit 113 increases the value of f by 1 (step S204). When the value of f does not exceed F (step S205), the process returns to step S203.

ｆの値がＦを越えている場合には、１チャネル分の全てのデータが一時記憶部に格納されたことになる。符号化部１１２は、一時記憶部に格納されているデータについて符号化処理を実行する（ステップＳ２０７）。すなわち、符号化部１１２は、１つのチャネルの全データ（１つの映像と見なされる。）を対象として符号化処理を実行する。 When the value of f exceeds F, it means that all the data for one channel is stored in the temporary storage unit. The coding unit 112 executes a coding process for the data stored in the temporary storage unit (step S207). That is, the coding unit 112 executes the coding process for all the data of one channel (which is regarded as one video).

次いで、読出し回路１１３は、ｋの値を１増やす（ステップＳ２０８）。ｋの値がＫを越えている場合には、全チャネルのデータについて処理が完了したことになるので、データ圧縮部１００Ｂは、処理を終了する（ステップＳ２０９）。ｋの値がＫを越えていないときには、ステップＳ２０２に戻る。 Next, the read circuit 113 increments the value of k by 1 (step S208). If the value of k exceeds K, the processing is completed for the data of all channels, and the data compression unit 100B ends the processing (step S209). If the value of k does not exceed K, the process returns to step S202.

本実施形態では、特徴量データがチャネル毎に別々の動画として扱われ、動画エンコーダ（例えば、ＨＥＶＣエンコーダ）により圧縮される。１つのチャネルは、入力画像のもつ特徴のうちの一つを抽出した画像であると捉えることができる。よって、各フレームから同一チャネルのみを抽出して得た動画（第ｋビットストリーム）は、入力画像列と同様に、時間的局所性（類似性）を持っていることが期待される。例えば、フレームfiのチャネル１の特徴量画像と、フレームfi+1のチャネル１の特徴量画像とは、大部分が同一であることが期待される。よって、ＨＥＶＣエンコーダが持つ、入力画像列の時間的局所性を活用して高効率に圧縮を行うアルゴリズムの効果を、有効に活用することができる。 In the present embodiment, the feature amount data is treated as a separate moving image for each channel and compressed by a moving image encoder (for example, HEVC encoder). One channel can be regarded as an image obtained by extracting one of the features of the input image. Therefore, it is expected that the moving image (k-th stream) obtained by extracting only the same channel from each frame has a temporal locality (similarity) as in the input image sequence. For example, it is expected that most of the feature image of channel 1 of frame fi and the feature image of channel 1 of frame fi + 1 are the same. Therefore, it is possible to effectively utilize the effect of the algorithm that performs compression with high efficiency by utilizing the temporal locality of the input image sequence possessed by the HEVC encoder.

実施形態３．
図９は、第３の実施形態におけるデータ圧縮部の構成例を示すブロック図である。図９に示すデータ圧縮部１００Ｃは、読出し回路１１４と符号化部１１５と差分符号化部１１６とを有する。なお、データ圧縮部１００Ｃは、図１に示されたニューラルネットワーク装置におけるデータ圧縮部１００に相当する。Embodiment 3.
FIG. 9 is a block diagram showing a configuration example of the data compression unit according to the third embodiment. The data compression unit 100C shown in FIG. 9 includes a read circuit 114, a coding unit 115, and a difference coding unit 116. The data compression unit 100C corresponds to the data compression unit 100 in the neural network device shown in FIG.

本実施形態では、データ圧縮部１００Ｃは、類似しているチャネルを指定する類似チャネル情報を入力する。ＣＮＮで取り扱うデータの種類等に応じて、あるチャネルの出力データに類似する出力データを含む他のチャネルは、学習済みのＣＮＮについて相当程度で把握可能である。以下、出力データが類似する複数のチャネルを、類似チャネルという。例えば、画像データが扱われる場合には、類似する画像特徴に相当する複数の類似チャネルは、事前に（ニューラルネットワークの処理が実行する前に）、決定可能である。類似チャネル情報は、事前に決定された２つ以上の類似チャネルを示す情報である。 In the present embodiment, the data compression unit 100C inputs similar channel information that specifies similar channels. Depending on the type of data handled by the CNN, other channels including output data similar to the output data of one channel can be grasped to a considerable extent with respect to the trained CNN. Hereinafter, a plurality of channels having similar output data will be referred to as similar channels. For example, when image data is handled, a plurality of similar channels corresponding to similar image features can be determined in advance (before the processing of the neural network is performed). Similar channel information is information indicating two or more predetermined similar channels.

符号化部１１５は、２つ以上の類似チャネルのうちの一のチャネルのデータを符号化する。差分符号化部１１６は、他の類似チャネルのデータの、一のチャネルのデータとの差分を符号化する。 The coding unit 115 encodes the data of one of two or more similar channels. The difference coding unit 116 encodes the difference between the data of another similar channel and the data of one channel.

次に、データ圧縮部１００Ｃの動作を説明する。図１０は、データ圧縮部１００Ｂの動作の一例を示すフローチャートである。 Next, the operation of the data compression unit 100C will be described. FIG. 10 is a flowchart showing an example of the operation of the data compression unit 100B.

読出し回路１１４は、外部から類似チャネル情報を入力する（ステップＳ３０１）。そして、読出し回路１１４は、類似チャネル情報が示す２つ以上の類似チャネルのうちの一のチャネルのデータを記憶部２２０から読み出す（ステップＳ３０２）。 The read circuit 114 inputs similar channel information from the outside (step S301). Then, the read circuit 114 reads the data of one of the two or more similar channels indicated by the similar channel information from the storage unit 220 (step S302).

符号化部１１５は、読み出された一のチャネルのデータを符号化する（ステップＳ３０３）。なお、符号化部１１５は、データ量を減らせるのであれば、どのような符号化方式を用いてもよい。例えば、データを２^ｎ（ｎ：自然数）になるように量子化した後周波数変換する符号化方式を使用できる。また、ＨＥＶＣなどの予測符号化方式を使用できる。The coding unit 115 encodes the data of one read channel (step S303). The coding unit 115 may use any coding method as long as the amount of data can be reduced. For example, ^{a coding method in which data is quantized to 2 n} (n: natural number) and then frequency-converted can be used. Further, a predictive coding method such as HEVC can be used.

読出し回路１１４は、２つ以上の類似チャネルのうちの他のチャネルのデータを記憶部２２０から読み出す（ステップＳ３０４）。差分符号化部１１６は、読み出された他のチャネルのデータと一のチャネルのデータとの差分を算出する。そして、差分符号化部１１６は、差分を符号化する。差分符号化部１１６は、データ量を減らせるのであれば、どのような符号化方式を用いてもよい。例えば、差分符号化部１１６は、ランレングス符号化またはハフマン符号化を使用できる。 The read circuit 114 reads the data of the other channel of the two or more similar channels from the storage unit 220 (step S304). The difference coding unit 116 calculates the difference between the read data of another channel and the data of one channel. Then, the difference coding unit 116 encodes the difference. The difference coding unit 116 may use any coding method as long as the amount of data can be reduced. For example, the difference coding unit 116 can use run-length coding or Huffman coding.

全てのチャネルの出力データについて符号化部１１５または差分符号化部１１６による符号化処理が実行されたら処理を終了する。符号化処理がなされていないチャネルが未だ残っている場合には、ステップＳ３０２に移行する。 When the coding processing by the coding unit 115 or the difference coding unit 116 is executed for the output data of all channels, the processing is terminated. If there is still a channel that has not been coded, the process proceeds to step S302.

本実施形態では、一のチャネルのデータに類似するデータのチャネルのデータに関して、一のチャネルのデータ（類似するチャネルのデータ）との差分のデータがクラウドサーバ６００に送信されるので、第１の実施形態および第２の実施形態の場合と同様、送信データ量を効果的に削減することができる。 In the present embodiment, with respect to the data of the channel of the data similar to the data of one channel, the data of the difference from the data of one channel (data of similar channels) is transmitted to the cloud server 600, so that the first method is made. As in the case of the embodiment and the second embodiment, the amount of transmitted data can be effectively reduced.

なお、上記の各実施形態では、ＣＮＮを例にしたが、各実施形態を、ＣＮＮ以外の多層のニューラルネットワークに適用することも可能である。 In each of the above embodiments, CNN is taken as an example, but each embodiment can be applied to a multi-layer neural network other than CNN.

図１１は、ニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。図１１に示すデータ圧縮装置７０１は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワーク（例えば、前半ニューラルネットワーク２００）の出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、部分ニューラルネットワークの出力データが格納された記憶部（例えば、記憶部２２０）からデータを読み出す読出し回路７１１（例えば、読出し回路１１１，１１３）と、読み出されたデータのデータ圧縮を行うデータ圧縮回路７１２（例えば、符号化部１１２）とを備え、読出し回路７１１は、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す。 FIG. 11 is a block diagram showing a main part of a data compression device of a neural network. The data compression device 701 shown in FIG. 11 is a data compression device for a neural network that reduces the amount of output data of a partial neural network (for example, the first half neural network 200) created by dividing one neural network. , Read circuit 711 (for example, read circuits 111, 113) that reads data from the storage unit (for example, storage unit 220) in which the output data of the partial neural network is stored, and data compression that performs data compression of the read data. A circuit 712 (for example, a coding unit 112) is provided, and the read circuit 711 reads the data of another channel after the read of all the data of one channel is completed.

図１２は、他の態様のニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。図１２に示すデータ圧縮装置７０１において、読出し回路７１１は、一のチャネルのデータを記憶回路７１３における短冊状の領域に格納し、データ圧縮回路７１２は、短冊状の領域に格納されたデータのデータ圧縮を行う。 FIG. 12 is a block diagram showing a main part of a data compression device of another aspect of a neural network. In the data compression device 701 shown in FIG. 12, the read circuit 711 stores the data of one channel in the strip-shaped area of the storage circuit 713, and the data compression circuit 712 stores the data of the data stored in the strip-shaped area. Perform compression.

図１３は、さらに他の態様のニューラルネットワークのデータ圧縮装置の主要部を示すブロック図である。図１３に示すデータ圧縮装置７０２は、１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路７１１（例えば、読出し回路１１４）と、読み出されたデータのデータ圧縮を行うデータ圧縮回路７１４とを備え、データ圧縮回路７１４は、類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行う符号化部７１５（例えば、符号化部１１５）と、他の類似チャネルのデータの、一の類似チャネルのデータとの差分を符号化する差分符号化部７１６（例えば、差分符号化部１１６）とを有する。 FIG. 13 is a block diagram showing a main part of the data compression device of the neural network of still another aspect. The data compression device 702 shown in FIG. 13 is a data compression device for a neural network that reduces the amount of data of the output data of the partial neural network created by dividing one neural network, and the output data of the partial neural network is A read circuit 711 (for example, a read circuit 114) that reads data from the stored storage unit and a data compression circuit 714 that compresses the read data are provided, and the data compression circuit 714 is indicated by similar channel information. The coding unit 715 (for example, the coding unit 115) that compresses the data of one of the two or more similar channels having similar output data, and the data of the other similar channels. , A difference coding unit 716 (for example, a difference coding unit 116) that encodes a difference from the data of one similar channel.

図１４は、ＣＰＵ（Central Processing Unit ）を有するコンピュータの一例を示すブロック図である。ＣＰＵ１０００は、記憶装置１００１に格納されデータ圧縮プログラムに従って処理を実行することによって、上記の実施形態における各機能を実現する。 FIG. 14 is a block diagram showing an example of a computer having a CPU (Central Processing Unit). The CPU 1000 realizes each function in the above embodiment by being stored in the storage device 1001 and executing the process according to the data compression program.

すなわち、ＣＰＵ１０００は、図３に示された読出し回路１１１および符号化部１１２の機能を実現する。また、ＣＰＵ１０００は、図６に示された読出し回路１１３および符号化部１１２の機能を実現可能である。さらに、ＣＰＵ１０００は、図９に示された読出し回路１１４、符号化部１１５および差分符号化部１１６の機能を実現可能である。また、図１１〜図１３に示されたデータ圧縮装置７０１，７０２の機能（記憶回路７１３を除く。）を実現可能である。 That is, the CPU 1000 realizes the functions of the read circuit 111 and the coding unit 112 shown in FIG. Further, the CPU 1000 can realize the functions of the read circuit 113 and the coding unit 112 shown in FIG. Further, the CPU 1000 can realize the functions of the read circuit 114, the coding unit 115, and the difference coding unit 116 shown in FIG. Further, the functions of the data compression devices 701 and 702 (excluding the storage circuit 713) shown in FIGS. 11 to 13 can be realized.

記憶装置１００１は、例えば、非一時的なコンピュータ可読媒体（non-transitory computer readable medium ）である。非一時的なコンピュータ可読媒体は、様々なタイプの実体のある記録媒体（tangible storage medium ）を含む。非一時的なコンピュータ可読媒体の具体例として、半導体メモリ（例えば、マスクＲＯＭ、ＰＲＯＭ（Programmable ROM）、ＥＰＲＯＭ（Erasable PROM ）、フラッシュＲＯＭ）がある。 The storage device 1001 is, for example, a non-transitory computer readable medium. Non-temporary computer-readable media include various types of tangible storage media. Specific examples of non-temporary computer-readable media include semiconductor memories (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM).

メモリ１００２は、例えばＲＡＭ（Random Access Memory）で実現され、ＣＰＵ１０００が処理を実行するときに一時的にデータを格納する記憶手段である。なお、図１２に示された記憶回路７１３は、メモリ１００２で実現可能である。 The memory 1002 is realized by, for example, a RAM (Random Access Memory), and is a storage means for temporarily storing data when the CPU 1000 executes processing. The storage circuit 713 shown in FIG. 12 can be realized by the memory 1002.

上記の実施形態の一部または全部は以下の付記のようにも記載されうるが、本発明の構成は以下の構成に限定されない。 Although some or all of the above embodiments may be described as in the appendix below, the configuration of the present invention is not limited to the following configurations.

（付記１）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路と、
読み出されたデータのデータ圧縮を行うデータ圧縮回路とを備え、
前記読出し回路は、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す
ことを特徴とするニューラルネットワークのデータ圧縮装置。(Appendix 1) A neural network data compression device that reduces the amount of output data of a partial neural network created by dividing one neural network.
A read circuit that reads data from the storage unit in which the output data of the partial neural network is stored, and
It is equipped with a data compression circuit that compresses the read data.
The read circuit is a data compression device for a neural network, characterized in that data of another channel is read after all data of one channel has been read.

（付記２）前記読出し回路は、一のチャネルのデータを記憶回路における短冊状の領域に格納し、
前記データ圧縮回路は、前記短冊状の領域に格納されたデータのデータ圧縮を行う
付記１のニューラルネットワークのデータ圧縮装置。(Appendix 2) The read-out circuit stores data of one channel in a strip-shaped area in the storage circuit.
The data compression circuit is a data compression device for a neural network according to Appendix 1, which compresses data stored in the strip-shaped region.

（付記３）前記データ圧縮回路は、複数チャネルの各々のチャネルの複数のデータを１つの映像と見なしてデータ圧縮を行う
付記１のニューラルネットワークのデータ圧縮装置。(Appendix 3) The data compression circuit is a data compression device for a neural network according to Appendix 1, wherein a plurality of data of each channel of a plurality of channels are regarded as one video and data compression is performed.

（付記４）前記データ圧縮回路は、ＨＥＶＣ規格に基づく符号化処理を実行する
付記２または付記３のニューラルネットワークのデータ圧縮装置。(Appendix 4) The data compression circuit is a neural network data compression device according to Appendix 2 or Appendix 3 that executes a coding process based on the HEVC standard.

（付記５）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮装置であって、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す読出し回路と、
読み出されたデータのデータ圧縮を行うデータ圧縮回路とを備え、
前記データ圧縮回路は、
類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行う符号化部と、
他の類似チャネルのデータの、前記一の類似チャネルのデータとの差分を符号化する差分符号化部とを含む
ことを特徴とするニューラルネットワークのデータ圧縮装置。(Appendix 5) A neural network data compression device that reduces the amount of output data of a partial neural network created by dividing one neural network.
A read circuit that reads data from the storage unit in which the output data of the partial neural network is stored, and
It is equipped with a data compression circuit that compresses the read data.
The data compression circuit is
A coding unit that compresses the data of one of two or more similar channels whose output data is similar, which is indicated by the similar channel information, and the data of the similar channel.
A data compression device for a neural network, comprising a difference coding unit that encodes a difference between data of another similar channel and data of the one similar channel.

（付記６）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮方法であって、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出し、
読み出されたデータのデータ圧縮を行い、
前記記憶部からデータを読み出すときに、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出す
ことを特徴とするニューラルネットワークのデータ圧縮方法。(Appendix 6) A neural network data compression method that reduces the amount of output data of a partial neural network created by dividing one neural network.
Data is read from the storage unit in which the output data of the partial neural network is stored, and the data is read.
Compress the read data and perform data compression.
A method for compressing data in a neural network, which comprises reading data from another channel after reading all data in one channel when reading data from the storage unit.

（付記７）一のチャネルのデータを記憶回路における短冊状の領域に格納し、
前記短冊状の領域に格納されたデータのデータ圧縮を行う
付記６のニューラルネットワークのデータ圧縮方法。(Appendix 7) The data of one channel is stored in a strip-shaped area in the storage circuit.
The data compression method for a neural network according to Appendix 6, which compresses the data stored in the strip-shaped area.

（付記８）複数チャネルの各々のチャネルの複数のデータを１つの映像と見なしてデータ圧縮を行う
付記６のニューラルネットワークのデータ圧縮方法。(Appendix 8) The data compression method of the neural network of Appendix 6 in which a plurality of data of each channel of a plurality of channels are regarded as one video and data compression is performed.

（付記９）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮方法であって、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出し、
読み出されたデータのデータ圧縮を行い、
前記データ圧縮を行うときに、
類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行い、
他の類似チャネルのデータの、前記一の類似チャネルのデータとの差分を符号化する
ことを特徴とするニューラルネットワークのデータ圧縮方法。(Appendix 9) A neural network data compression method that reduces the amount of output data of a partial neural network created by dividing one neural network.
Data is read from the storage unit in which the output data of the partial neural network is stored, and the data is read.
Compress the read data and perform data compression.
When performing the data compression,
Data compression of the data of one of two or more similar channels with similar output data, which is indicated by the similar channel information, is performed.
A data compression method for a neural network, characterized in that the difference between the data of another similar channel and the data of the one similar channel is encoded.

（付記１０）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮プログラムであって、
コンピュータに、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す処理と、
読み出されたデータのデータ圧縮を行う処理とを実行させ、
前記記憶部からデータを読み出すときに、一のチャネルの全てのデータの読み出しが完了してから他のチャネルのデータを読み出させる
ためのニューラルネットワークのデータ圧縮プログラム。(Appendix 10) A neural network data compression program that reduces the amount of output data of a partial neural network created by dividing one neural network.
On the computer
The process of reading data from the storage unit in which the output data of the partial neural network is stored, and
The process of compressing the read data is executed, and the data is compressed.
A neural network data compression program for reading data from another channel after the reading of all data in one channel is completed when reading data from the storage unit.

（付記１１）コンピュータに、
一のチャネルのデータを記憶回路における短冊状の領域に格納する処理と、
前記短冊状の領域に格納されたデータのデータ圧縮を行う処理とを実行させる
付記１０のニューラルネットワークのデータ圧縮プログラム。(Appendix 11) To the computer
The process of storing the data of one channel in a strip-shaped area in the storage circuit,
The data compression program of the neural network of Appendix 10 for executing the process of compressing the data stored in the strip-shaped area.

（付記１２）コンピュータに、
複数チャネルの各々のチャネルの複数のデータを１つの映像と見なしてデータ圧縮を行う処理を実行させる
付記１０のニューラルネットワークのデータ圧縮プログラム。(Appendix 12) To the computer
The data compression program of the neural network of Appendix 10 for executing a process of performing data compression by regarding a plurality of data of each channel of a plurality of channels as one video.

（付記１３）１つのニューラルネットワークが分割されて作成される部分ニューラルネットワークの出力データのデータ量を削減するニューラルネットワークのデータ圧縮プログラムであって、
コンピュータに、
前記部分ニューラルネットワークの出力データが格納された記憶部からデータを読み出す処理と、
読み出されたデータのデータ圧縮を行う処理とを実行させ、
前記データ圧縮を行うときに、
類似チャネル情報で示される、出力データが類似している２つ以上の類似チャネルのうちの一の類似チャネルのデータのデータ圧縮を行う処理と、
他の類似チャネルのデータの、前記一の類似チャネルのデータとの差分を符号化する処理とを実行させる
ためのニューラルネットワークのデータ圧縮プログラム。(Appendix 13) A neural network data compression program that reduces the amount of output data of a partial neural network created by dividing one neural network.
On the computer
The process of reading data from the storage unit in which the output data of the partial neural network is stored, and
The process of compressing the read data is executed, and the data is compressed.
When performing the data compression,
The process of compressing the data of one of two or more similar channels whose output data is similar, which is indicated by the similar channel information, and the data of the similar channel.
A neural network data compression program for executing a process of encoding the difference between the data of another similar channel and the data of the one similar channel.

１００，１００Ａ，１００Ｂ，１００Ｃデータ圧縮部
１１１，１１３，１１４読出し回路
１１２，１１５符号化部
１１６差分符号化部
２００前半ニューラルネットワーク
２１０積和演算部
２２０記憶部
２３０選択部
４００送信部
５００通信ネットワーク
６００クラウドサーバ
７０１，７０２データ圧縮装置
７１１読出し回路
７１２，７１４データ圧縮回路
７１３記憶回路
７１５符号化部
７１６差分符号化部
１０００ＣＰＵ
１００１記憶装置
１００２メモリ100, 100A, 100B, 100C Data compression unit 111, 113, 114 Read circuit 112, 115 Coding unit 116 Difference coding unit 200 First half Neural network 210 Product sum calculation unit 220 Storage unit 230 Selection unit 400 Transmission unit 500 Communication network 600 Cloud server 701,702 Data compression device 711 Read circuit 712,714 Data compression circuit 713 Storage circuit 715 Coding unit 716 Difference coding unit 1000 CPU
1001 storage device 1002 memory

Claims

A neural network data compression device that reduces the amount of output data of a partial neural network created by dividing one neural network.
A read circuit that reads data from the storage unit in which the output data of the partial neural network is stored, and
It is equipped with a data compression circuit that compresses the read data.
The read circuit is a data compression device for a neural network, characterized in that data of another channel is read after all data of one channel has been read.

The read circuit stores the data of one channel in a strip-shaped area in the storage circuit.
The data compression device for a neural network according to claim 1, wherein the data compression circuit compresses data stored in the strip-shaped region.

The data compression device for a neural network according to claim 1, wherein the data compression circuit regards a plurality of data of each channel of the plurality of channels as one video and performs data compression.

The data compression device for a neural network according to claim 2 or 3, wherein the data compression circuit executes a coding process based on the HEVC standard.

A neural network data compression device that reduces the amount of output data of a partial neural network created by dividing one neural network.
A read circuit that reads data from the storage unit in which the output data of the partial neural network is stored, and
It is equipped with a data compression circuit that compresses the read data.
The data compression circuit is
A coding unit that compresses the data of one of two or more similar channels whose output data is similar, which is indicated by the similar channel information, and the data of the similar channel.
A data compression device for a neural network, comprising a difference coding unit that encodes a difference between data of another similar channel and data of the one similar channel.

A neural network data compression method that reduces the amount of output data of a partial neural network created by dividing one neural network.
Data is read from the storage unit in which the output data of the partial neural network is stored, and the data is read.
Compress the read data and perform data compression.
A method for compressing data in a neural network, which comprises reading data from another channel after reading all data in one channel when reading data from the storage unit.

The data of one channel is stored in a strip-shaped area in the storage circuit,
The data compression method for a neural network according to claim 6, wherein the data stored in the strip-shaped area is compressed.

The data compression method for a neural network according to claim 6, wherein a plurality of data of each channel of a plurality of channels are regarded as one video and data compression is performed.

A neural network data compression method that reduces the amount of output data of a partial neural network created by dividing one neural network.
Data is read from the storage unit in which the output data of the partial neural network is stored, and the data is read.
Compress the read data and perform data compression.
When performing the data compression,
Data compression of the data of one of two or more similar channels with similar output data, which is indicated by the similar channel information, is performed.
A data compression method for a neural network, characterized in that the difference between the data of another similar channel and the data of the one similar channel is encoded.

A neural network data compression program that reduces the amount of output data of a partial neural network created by dividing one neural network.
On the computer
The process of reading data from the storage unit in which the output data of the partial neural network is stored, and
The process of compressing the read data is executed, and the data is compressed.
A neural network data compression program for reading data from another channel after the reading of all data in one channel is completed when reading data from the storage unit.

On the computer
The process of storing the data of one channel in a strip-shaped area in the storage circuit,
The data compression program for a neural network according to claim 10, wherein the process of compressing the data stored in the strip-shaped area is executed.

On the computer
The data compression program for a neural network according to claim 10, wherein a plurality of data of each channel of a plurality of channels are regarded as one video and a process of performing data compression is executed.

A neural network data compression program that reduces the amount of output data of a partial neural network created by dividing one neural network.
On the computer
The process of reading data from the storage unit in which the output data of the partial neural network is stored, and
The process of compressing the read data is executed, and the data is compressed.
When performing the data compression,
The process of compressing the data of one of two or more similar channels whose output data is similar, which is indicated by the similar channel information, and the data of the similar channel.
A neural network data compression program for executing a process of encoding the difference between the data of another similar channel and the data of the one similar channel.