JP2022526333A

JP2022526333A - Data processing methods and equipment, computer devices, recording media, and computer programs

Info

Publication number: JP2022526333A
Application number: JP2021557139A
Authority: JP
Inventors: 衡 ▲張▼
Original assignee: Shanghai Sensetime Intelligent Technology Co Ltd
Current assignee: Shanghai Sensetime Intelligent Technology Co Ltd
Priority date: 2019-12-30
Filing date: 2020-12-03
Publication date: 2022-05-24
Also published as: CN113128531B; KR20210130796A; CN113128531A; WO2021135810A1; TWI763168B; TW202125271A; SG11202110625XA

Abstract

本発明は、深層学習モデルのトレーニングに用いられる、データ処理方法と装置、コンピュータデバイス、記録媒体、及びコンピュータプログラムを提供し、当該方法は、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることと、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納することと、を含む。【選択図】図１The present invention provides a data processing method and apparatus, a computer device, a recording medium, and a computer program used for training a deep learning model, the method performing a first update process on a prefetch quantity of sample data. In response to obtaining the target prefetch quantity and responding that the quantity of sample data currently contained in the prefetch sample data queue has not reached the target prefetch quantity, the new sample data is read and read. Includes storing the prefetch sample data cune with new sample data. [Selection diagram] Fig. 1

Description

＜関連出願の相互引用＞
本発明は、出願番号が２０１９１１４０３６６９.４であり、出願日が２０１９年１２月３０日である中国特許出願の優先権を主張し、当該中国特許出願の全ての内容が援用により本願に組み入れられる。
本発明は、機械学習技術分野に関し、具体的には、データ処理方法と装置、コンピュータデバイス、記録媒体、及びコンピュータプログラムに関する。 <Mutual citation of related applications>
The present invention claims priority for a Chinese patent application with application number 200911403669.4 and filing date December 30, 2019, the entire contents of which are incorporated herein by reference.
The present invention relates to the field of machine learning technology, specifically to data processing methods and devices, computer devices, recording media, and computer programs.

深層学習モデルは、大量のサンプルデータに基づく複数回の反復トレーニングを必要とする。機械学習モデルのトレーニング過程での収束速度を向上させるために、通常、マルチプロセス並列トレーニングの方法を採用して実現している。マルチプロセス並列トレーニングの方法を採用して深層学習モデルに対してトレーニングを実行するときに、今回のトレーニングの計算タスクを実行する同時に、各々の並列プロセスで次回のトレーニングに必要なトレーニングデータを事前に読み取ることになる。各々の並列プロセスは、当該回のトレーニングを実行した後に、すべてのプロセス間で通信およびデータ同期化を実行する必要があり、あるプロセスで次回のトレーニングに利用するトレーニングデータを読み取る速度が遅すぎると、トレーニングプロセス全体が遅延され、トレーニング効率の低下につながる。 Deep learning models require multiple iterative trainings based on large amounts of sample data. In order to improve the convergence speed in the training process of the machine learning model, it is usually realized by adopting the method of multi-process parallel training. When training is performed on a deep learning model by adopting the method of multi-process parallel training, the calculation task of this training is executed, and at the same time, the training data required for the next training is prepared in advance in each parallel process. It will be read. Each parallel process must perform communication and data synchronization between all processes after performing that round of training, and if one process reads the training data used for the next training too slowly. , The entire training process is delayed, leading to reduced training efficiency.

本発明の実施例は、データ処理方法および装置を少なくとも提供する。 The embodiments of the present invention provide at least a data processing method and apparatus.

第１の態様によると、本発明の実施例は、１つまたは複数のプロセスを含む、深層学習モデルのトレーニングに、適用されるデータ処理方法を提供し、当該方法は、前記１つまたは複数のプロセスの中の１つの目標プロセスに対して、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることと、前記目標プロセスに対応するプリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納することと、を含む。 According to a first aspect, embodiments of the invention provide a data processing method that is applied to training a deep learning model that includes one or more processes, the method of which is said one or more. For one target process in the process, the first update process is executed for the prefetch quantity of sample data to obtain the target prefetch quantity, and it is currently included in the prefetch sample data queue corresponding to the target process. In response to the quantity of the sample data being read not reaching the target prefetch quantity, the new sample data is read and the read new sample data is stored in the prefetch sample data queue.

このようにして、メインプロセスは、プリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、データキューに現在含まれているサンプルデータ量が目標プリフェッチ数量に達してないときに、サンプルデータプールから新たなサンプルデータを読み取るため、メインプロセスは、１回の反復トレーニングを実行した後に、次の１回の反復トレーニングに必要なサンプルデータの読み取りがすでに完了されることになる。実際のほとんどの場合、メインプロセスがデータを読み取るのにかかる時間は、１つの反復トレーニングを実行するのにかかる時間よりも短いことが多いため、データキュー中に十分な数量のサンプルデータが常に格納されて後続のいくつかの反復トレーニングの使用を満たすように確保することができ、メインプロセスが特定のサンプルデータの読み取るのにかかる時間が長すぎても、サンプル数量が時間内に読み取られなくて反復トレーニングに遅延が発生されることを回避することができ、トレーニング効率を向上させた。 In this way, the main process performs the first update process on the prefetch quantity to obtain the target prefetch quantity, and when the sample data amount currently contained in the data queue has not reached the target prefetch quantity. To read new sample data from the sample data pool, the main process will perform one iterative training and then complete the reading of the sample data required for the next one iterative training. In most cases in practice, the time it takes for the main process to read the data is often shorter than the time it takes to perform one iterative training, so a sufficient amount of sample data is always stored in the data queue. It can be ensured to meet the use of some subsequent iterative training, and if the main process takes too long to read certain sample data, the sample quantity will not be read in time. It was possible to avoid delays in repetitive training and improve training efficiency.

可能な１実施形態において、前記サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることを含む。 In one possible embodiment, performing the first update process on the prefetch quantity of the sample data to obtain the target prefetch quantity is currently occupied by the prefetch sample data queue corresponding to the one or more processes. It includes executing the first update process for the prefetch quantity of the sample data to obtain the target prefetch quantity based on the total memory space and the memory usage upper limit threshold.

このようにして、プリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量を動的に更新することができ、プリフェッチサンプルデータの量を柔軟に割り当てることができて、トレーニングの要件を満たすことができる。 In this way, the prefetch quantity of the sample data can be dynamically updated based on the total memory space currently occupied by the prefetch sample data queue and the upper limit of memory usage, and the amount of prefetch sample data can be flexibly updated. Can be assigned to meet training requirements.

可能な１実施形態において、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペース、メモリ使用上限閾値、および、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, the first update to the prefetch quantity of the sample data is based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper memory usage threshold. Performing processing to obtain the target prefetch quantity is the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes, the memory usage limit, and the deep layer of the target process. It includes performing a first update process on the prefetch quantity of the sample data to obtain the target prefetch quantity based on the data throughput for performing training on the training model.

このようにして、プリフェッチサンプルデータキューが現在占有している合計メモリスペース、メモリ使用上限閾値、および、前記深層学習モデルをトレーニングするときのデータスループットに基づいて、プリフェッチ数量を動的に更新し、データスループットが増加されるときに、プリフェッチサンプルデータキュー中のデータ量がサンプルデータの消費に追いつくことができるようにし、データスループットが減少されるときに、プリフェッチサンプルデータキューによって占有されるメモリの量を可能な限り減少し、余剰のメモリを他の作業に使用することができ、調整がより柔軟になる。 In this way, the prefetch quantity is dynamically updated based on the total memory space currently occupied by the prefetch sample data queue, the upper memory usage threshold, and the data throughput when training the deep learning model. Allows the amount of data in the prefetch sample data queue to keep up with the consumption of sample data when the data throughput increases, and the amount of memory occupied by the prefetch sample data queue when the data throughput decreases. Can be reduced as much as possible and excess memory can be used for other tasks, making adjustments more flexible.

可能な１実施形態において、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ること、および／または、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達した場合、前記プリフェッチ数量を第２調節ステップサイズだけ減少して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, the first update to the prefetch quantity of the sample data is based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper limit memory usage threshold. Performing processing to obtain the target prefetch quantity is the prefetch if the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold. The quantity is increased by the first adjustment step size to obtain the target prefetch quantity, and / or the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes is the memory usage. When the upper limit threshold is reached, the prefetch quantity is reduced by the second adjustment step size to obtain the target prefetch quantity.

このようにして、プリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合には、可能な限りプリフェッチサンプルデータを増加し、プリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達した場合には、サンプルデータのプリフェッチ数量を減少することによって、プリフェッチサンプルデータキューの長さを柔軟に調整することができる。 In this way, if the total memory space currently occupied by the prefetch sample data queue does not reach the memory usage upper limit threshold, the prefetch sample data is increased as much as possible and the prefetch sample data queue is currently occupied. When the total memory space is reached the upper limit of memory usage, the length of the prefetch sample data queue can be flexibly adjusted by reducing the prefetch quantity of the sample data.

可能な１実施形態において、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ることは、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットが所定のデータスループット条件を満たす場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, if the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the memory usage upper bound threshold, the prefetch quantity is adjusted in the first adjustment step. Increasing by size to obtain the target prefetch quantity means that the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the memory usage upper limit threshold and When the data throughput for executing training for the deep learning model of the target process satisfies a predetermined data throughput condition, the prefetch quantity may be increased by the first adjustment step size to obtain the target prefetch quantity. include.

可能な１実施形態において、前記方法は、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記データスループットが前記所定のデータスループット条件を満たさない場合、前記プリフェッチ数量を第３調節ステップサイズだけ減少して前記目標プリフェッチ数量を得ることをさらに含む。 In one possible embodiment, the method is such that the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit and the data throughput. Further includes reducing the prefetch quantity by a third adjustment step size to obtain the target prefetch quantity if does not meet the predetermined data throughput condition.

可能な１実施形態において、前記所定のデータスループット条件は、前記データスループットの現在数値が履歴数値よりも大きいことと、前記データスループットの現在数値がデータスループット閾値よりも大きいことと、の中の少なくとも１つを含み、ここで、前記履歴数値は、現在の反復トレーニングの前の複数の履歴反復トレーニングのときの前記データスループットの平均値、または、現在の反復トレーニングの前の１回の反復トレーニングのときの前記データスループットの数値である。 In one possible embodiment, the predetermined data throughput condition is such that the current value of the data throughput is larger than the historical value and the current value of the data throughput is larger than the data throughput threshold. Including one, where the historical figures are the average value of the data throughput at the time of multiple historical iterative trainings prior to the current iterative training, or the one-time iterative training prior to the current iterative training. It is a numerical value of the said data throughput at the time.

可能な１実施形態において、前記方法は、前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得ることをさらに含み、ここで、前記目標調節ステップサイズは、前記プリフェッチ数量の次の１回の更新処理に使用される。 In one possible embodiment, the method further comprises performing a second update process on the prefetch quantity adjustment step size to obtain a target adjustment step size, wherein the target adjustment step size is: It is used for the next one-time update process of the prefetch quantity.

可能な１実施形態において、前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得ることは、前記第１更新処理中に前記プリフェッチ数量を増加する場合、前記プリフェッチ数量の調節ステップサイズを増加すること、および／または、前記第１更新処理中に前記プリフェッチ数量を減少する場合、前記プリフェッチ数量の調節ステップサイズを減少することを含む。 In one possible embodiment, performing the second update process for the adjustment step size of the prefetch quantity to obtain the target adjustment step size is the case where the prefetch quantity is increased during the first update process. Increasing the prefetch quantity adjustment step size and / or decreasing the prefetch quantity adjustment step size during the first update process includes decreasing the prefetch quantity adjustment step size.

このようにして、プリフェッチ数量を増加する必要があるときに、プリフェッチ数量をより速く増加することによって、プリフェッチサンプルデータキューに格納したサンプルデータがより多い数量により速く達するようにより速く保証し、後続のトレーニング反復サイクルの使用の要件を満たすことができ、プリフェッチ数量が少なすぎてモデルトレーニング過程が遅延されることを回避する同時に、プリフェッチ数量を減少する必要があるときに、プリフェッチ数量をより緩やかに減少して、プリフェッチサンプルデータキューの長さの変化がよりスムーズになるように保証することができ、プリフェッチサンプルデータの数量の急激な減少によるトレーニング過程のショックを回避することができる。 In this way, when the prefetch quantity needs to be increased, increasing the prefetch quantity faster ensures that the sample data stored in the prefetch sample data queue reaches the larger quantity faster and subsequent. Decrease the prefetch quantity more slowly when the requirements for using the training iteration cycle can be met and the prefetch quantity is not too small to delay the model training process, while at the same time the prefetch quantity needs to be reduced. Therefore, it is possible to guarantee that the change in the length of the prefetch sample data queue becomes smoother, and it is possible to avoid the shock of the training process due to the sudden decrease in the quantity of the prefetch sample data.

第２の態様によると、本発明の実施例は、１つまたは複数のプロセスを含む、深層学習モデルのトレーニングに、適用されるデータ処理装置をさらに提供し、当該装置は、前記１つまたは複数のプロセスの中の１つの目標プロセスに対して、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得るための第１更新モジュールと、前記目標プロセスに対応するプリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納するための読取りモジュールと、を備える。 According to a second aspect, embodiments of the invention further provide a data processing apparatus applicable to training a deep learning model, comprising one or more processes, the apparatus being said one or more. For one target process in the process, the first update module for executing the first update process for the prefetch quantity of sample data to obtain the target prefetch quantity, and the prefetch sample corresponding to the target process. In response to the quantity of sample data currently contained in the data queue not reaching the target prefetch quantity, new sample data is read and the read new sample data is stored in the prefetch sample data queue. It is equipped with a read module for.

第３の態様によると、本発明の実施例は、コンピュータデバイスをさらに提供し、当該コンピュータデバイスは、プロセッサと、記録媒体と、バスと、を備え、前記記録媒体には、前記プロセッサによって実行可能な機械可読命令が記録されており、コンピュータデバイスが運行されるときに、前記プロセッサと前記記録媒体との間はバスを介して通信し、前記機械可読命令が前記プロセッサによって実行されると、上述した第１の態様または第１の態様の任意の可能な１実施形態のステップが実行される。 According to a third aspect, an embodiment of the present invention further provides a computer device, wherein the computer device comprises a processor, a recording medium, and a bus, the recording medium being feasible by the processor. A machine-readable instruction is recorded, and when the computer device is operated, the processor and the recording medium communicate with each other via a bus, and the machine-readable instruction is executed by the processor. The steps of the first aspect or any possible embodiment of the first aspect are carried out.

第４の態様によると、本発明の実施例は、コンピュータ可読記録媒体をさらに提供し、当該コンピュータ可読記録媒体には、コンピュータプログラムが記録されており、当該コンピュータプログラムがプロセッサによって運行されるときに、上述した第１の態様または第１の態様の任意の可能な１実施形態のステップが実行される。 According to a fourth aspect, an embodiment of the present invention further provides a computer-readable recording medium, in which a computer program is recorded and the computer program is operated by a processor. , The steps of the first aspect described above or any possible embodiment of the first aspect are performed.

以下、本発明の上記の目的、特徴、および、利点をより明白かつ理解可能にするために、好ましい実施例を挙げて、図面を参照して詳細に説明する。 Hereinafter, in order to make the above-mentioned object, features, and advantages of the present invention more clear and understandable, preferred embodiments will be given and described in detail with reference to the drawings.

以下、本発明の実施例の技術的解決策をより明確に説明するために、実施例に必要な図面を簡単に紹介する。ここでの図面は、明細書に組み込まれて本明細書の一部を構成し、これら図面は本発明に合致する実施例を示し、明細書と一緒に本発明の技術的解決策を説明する。以下の図面は、本発明のいくつかの実施例を示すだけであり、範囲を限定するものと見なされるべきではないと理解すべきである。当業者は、創造的な作業なしで、これら図面に基づいて他の関連する図面を得ることができる。
本発明の実施例によって提供されるデータ処理方法を示すフローチャートである。本発明の実施例によって提供されるデータ処理装置を示す模式図である。本発明の実施例によって提供されるコンピュータデバイスを示す模式図である。 Hereinafter, in order to more clearly explain the technical solution of the embodiment of the present invention, the drawings required for the embodiment will be briefly introduced. The drawings herein are incorporated herein to form a portion of the specification, which drawings show examples in line with the invention and together with the specification describe technical solutions of the invention. .. It should be understood that the drawings below show only some embodiments of the present invention and should not be considered as limiting the scope. One of ordinary skill in the art can obtain other related drawings based on these drawings without any creative work.
It is a flowchart which shows the data processing method provided by the Example of this invention. It is a schematic diagram which shows the data processing apparatus provided by the Example of this invention. It is a schematic diagram which shows the computer device provided by the Example of this invention.

以下、本発明の実施例の目的、技術的解決策、および、利点をより明確にするために、本発明の実施例の図面を参照して、本発明の実施例の技術的解決策を明確かつ完全に説明する。勿論ながら、説明する実施例は、全部の実施例ではなく、本発明の一部の実施例に過ぎない。通常、ここでの図面で記載および表示する本発明の実施例のコンポーネントは、さまざまな異なる構成で配置および設計することができる。したがって、以下の図面に提供される本発明の実施例の詳細な説明は、請求された本開示の範囲を限定することを意図するものではなく、単に本開示の選択された実施形態を表す。本発明の実施例に基づいて、創造的な作業なしに当業者によって得られた他のすべての実施形態は、本開示の保護範囲に含まれるものとする。 Hereinafter, in order to further clarify the purpose, technical solution, and advantage of the embodiment of the present invention, the technical solution of the embodiment of the present invention is clarified with reference to the drawings of the embodiment of the present invention. And I will explain it completely. Of course, the examples described are not all examples, but only a part of the present invention. Typically, the components of the embodiments of the invention described and displayed in the drawings herein can be arranged and designed in a variety of different configurations. Accordingly, the detailed description of the embodiments of the invention provided in the following drawings is not intended to limit the scope of the claimed disclosure, but merely represents a selected embodiment of the present disclosure. All other embodiments obtained by one of ordinary skill in the art based on the embodiments of the present invention without creative work shall be included in the scope of protection of the present disclosure.

調査の結果、マルチプロセス並列トレーニングの方法を採用して深層学習モデルをトレーニングするときに、今回のトレーニングの計算を実行する同時に、各々のプロセスはいずれも次回のトレーニングに必要なトレーニングデータを事前に読み取ることになる。各々の並列プロセスは、今回のトレーニングを実行した後に、他のプロセスと通信およびデータ同期化を実行し、すべてのプロセスの通信およびデータ同期化が完了された後に、次回のトレーニングタスクを開始する必要がある。ここで任意のプロセスのトレーニングタスクに時間の遅延が発生されると、たとえば次回のトレーニングに使用するトレーニングデータを事前に読み取るときに、読み取る時間が今回のトレーニングタスクの実行時間を超えると、すべてのプロセスのトレーニングタスクにいずれも時間の遅延が発生されることになり、トレーニング効率の低下につながる。 As a result of the research, when training the deep learning model by adopting the method of multi-process parallel training, the calculation of this training is performed, and at the same time, each process has the training data necessary for the next training in advance. It will be read. Each parallel process should perform communication and data synchronization with other processes after performing this training, and start the next training task after all processes have completed communication and data synchronization. There is. If there is a time delay in the training task of any process here, for example, when pre-reading the training data to be used for the next training, if the reading time exceeds the execution time of this training task, all All training tasks in the process will be delayed in time, leading to reduced training efficiency.

上述した調査に鑑みて、本発明は、深層学習モデルトレーニングに適用されるデータ処理方法および装置を提供する。当該データ処理方法は、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納することによって、メインプロセスが１回の反復トレーニングを実行した後に、次の１回の反復トレーニングに必要なサンプルデータの読み取りがすでに完了されるようにすることができる。メインプロセスは、プリフェッチ数量を動的に更新して、目標プリフェッチ数量を得、データキューに現在含まれているサンプルデータ量が目標プリフェッチ数量に達してないときに、サンプルデータプールから新たなサンプルデータを読み取る。ほとんどの場合、メインプロセスが新たなサンプルデータを読み取るのみかかる時間が１つの反復トレーニングを実行するのにかかる時間よりも短いことが多いため、データキュー中に十分な数量のサンプルデータが常に格納されて後続のいくつかの反復トレーニングの使用を満たすように確保することができ、メインプロセスが特定のサンプルデータの読み取るのにかかる時間がながすぎても、サンプル数量が足りないで反復トレーニングに遅延が発生することがなく、トレーニング効率を向上させた。 In view of the investigations described above, the present invention provides data processing methods and devices applicable to deep learning model training. In the data processing method, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity, and the quantity of the sample data currently contained in the prefetch sample data queue reaches the target prefetch quantity. In response to the absence, the new sample data is read and the read new sample data is stored in the prefetch sample data queue so that the main process performs one iterative training and then the next one. It is possible to ensure that the reading of the sample data required for the iterative training of is already completed. The main process dynamically updates the prefetch quantity to get the target prefetch quantity, and when the amount of sample data currently contained in the data queue does not reach the target prefetch quantity, new sample data from the sample data pool. To read. In most cases, the data queue always stores a sufficient amount of sample data because the main process only takes less time to read new sample data than it takes to perform one iterative training. Can be ensured to meet the use of several subsequent iterative trainings, and if the main process takes too long to read certain sample data, the sample quantity is insufficient and the iterative training is delayed. The training efficiency was improved without the occurrence of.

従来の解決策に存在する欠陥は、発明者の実践と注意深い研究後の結果であり、したがって、上記の問題の発見過程および本発明で提案された解決策は、すべて本開示のプロセス中の本開示への発明者の貢献であるべきである。 The deficiencies present in conventional solutions are the result of the inventor's practice and careful study, and therefore the process of finding the above problems and the solutions proposed in the present invention are all books in the process of this disclosure. It should be the inventor's contribution to the disclosure.

以下、本発明の図面を参照して、本発明中の技術的解決策を明確かつ完全に説明する。勿論ながら、説明する実施例は、全部の実施例ではなく、本発明の一部の実施例に過ぎない。通常、ここでの図面で記載および表示する本発明の実施例のコンポーネントは、さまざまな異なる構成で配置および設計することができる。したがって、以下の図面に提供される本発明の実施例の詳細な説明は、請求された本開示の範囲を限定することを意図するものではなく、単に本開示の選択された実施形態を表す。本発明の実施例に基づいて、創造的な作業なしに当業者によって得られた他のすべての実施形態は、本開示の保護範囲に含まれるものとする。 Hereinafter, the technical solutions in the present invention will be described clearly and completely with reference to the drawings of the present invention. Of course, the examples described are not all examples, but only a part of the present invention. Typically, the components of the embodiments of the invention described and displayed in the drawings herein can be arranged and designed in a variety of different configurations. Accordingly, the detailed description of the embodiments of the invention provided in the following drawings is not intended to limit the scope of the claimed disclosure, but merely represents a selected embodiment of the present disclosure. All other embodiments obtained by one of ordinary skill in the art based on the embodiments of the present invention without creative work shall be included in the scope of protection of the present disclosure.

以下の図面では、類似の符号と文字が類似の項目を示していることに注意すべきである。したがって、1つの図面で項目を定義すると、後続の図面でさらに定義および解析する必要はない。 It should be noted that in the drawings below, similar signs and letters indicate similar items. Therefore, defining an item in one drawing does not require further definition and analysis in subsequent drawings.

本実施例に対する理解を容易にするために、まず、本発明の実施例によって開示されるデータ処理方法を詳細に説明する。本発明の実施例によって提供されるデータ処理方法は、深層学習モデルのトレーニングに適用される。その実行主体は、一般的に、深層学習モデルをトレーニングするメインプロセスまたはサブプロセスである。可能な一部の実現形態において、当該データ処理方法は、プロセッサによりメモリに格納されているコンピュータ可読命令を呼び出す方式によって実現され得る。 In order to facilitate understanding of the present embodiment, first, the data processing method disclosed by the embodiment of the present invention will be described in detail. The data processing methods provided by the embodiments of the present invention apply to training deep learning models. Its execution subject is generally the main process or subprocess that trains the deep learning model. In some possible implementations, the data processing method may be implemented by a method of calling a computer-readable instruction stored in memory by a processor.

以下、実行主体が少なくとも１つのメインプロセスの中の任意のメインプロセスである例をとって、本発明の実施例によって提供されるデータ処理方法を説明する。 Hereinafter, the data processing method provided by the embodiment of the present invention will be described by taking an example in which the execution subject is an arbitrary main process in at least one main process.

図１は、本発明の実施例によって提供されるデータ処理方法のフローチャートであり、当該方法は、ステップＳ１０１～Ｓ１０２を含み、ここで、
Ｓ１０１において、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
Ｓ１０２において、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った新たなサンプルデータをプリフェッチサンプルデータキュー中に格納する。 FIG. 1 is a flowchart of a data processing method provided by an embodiment of the present invention, which method comprises steps S101 to S102, wherein the method comprises steps S101 to S102.
In S101, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
In S102, in response to the fact that the quantity of sample data currently contained in the prefetch sample data queue has not reached the target prefetch quantity, new sample data is read and the read new sample data is used in the prefetch sample data queue. Store inside.

以下、上述したＳ１０１～Ｓ１０２をそれぞれ詳細に説明する。 Hereinafter, the above-mentioned S101 to S102 will be described in detail.

Ｉ：上述したＳ１０１において、１つのメインプロセスがある場合、１つのメインプロセスは、深層学習モデルをトレーニングし、かつ、当該メインプロセスは、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることができる。 I: In S101 described above, when there is one main process, one main process trains the deep learning model, and the main process executes the first update process for the prefetch quantity of the sample data. The target prefetch quantity can be obtained.

複数のメインプロセスがある場合、複数のメインプロセスは、深層学習モデルに対して並列トレーニングを実行し、各々のメインプロセスは、それぞれサンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることができる。ここで、異なるメインプロセスに対応するプリフェッチ数量は、異なることができ、異なるメインプロセスに対応する目標プリフェッチ数量も、異なることができる。 If there are multiple main processes, the multiple main processes perform parallel training on the deep learning model, and each main process performs the first update process on the prefetch quantity of the sample data. You can get the prefetch quantity. Here, the prefetch quantities corresponding to different main processes can be different, and the target prefetch quantities corresponding to different main processes can also be different.

各メインプロセスは、１つのプリフェッチサンプルデータキューに対応し、任意のメインプロセスに対応するプリフェッチサンプルデータキューには、複数のサンプルデータが格納されており、かつ、各メインプロセスは、対応するプリフェッチサンプルデータキューに格納されているサンプルデータに基づいて深層学習モデルをトレーニングする。 Each main process corresponds to one prefetch sample data queue, and the prefetch sample data queue corresponding to any main process contains a plurality of sample data, and each main process has a corresponding prefetch sample. Train a deep learning model based on the sample data stored in the data queue.

当該プリフェッチサンプルデータキューは、たとえば、先入れ先出しキューである。メインプロセスは、１つの新たな反復トレーニングを開始するときに、まず当該メインプロセスに対応するプリフェッチサンプルデータキューから１組のサンプルデータを読み取る。当該組のサンプルデータは、読み取られた後に、プリフェッチサンプルデータキューから削除されて、新たなサンプルデータのために格納位置を確保する。 The prefetch sample data queue is, for example, a first-in first-out queue. When starting a new iterative training, the main process first reads a set of sample data from the prefetch sample data queue corresponding to the main process. After being read, the set of sample data is deleted from the prefetch sample data queue to reserve a storage position for new sample data.

ここで、１つの反復トレーニングで、メインプロセスは１組のサンプルデータに基づいて深層学習モデルをトレーニングすることになり、１組のサンプルデータには少なくとも１つのサンプルデータが含まれることに注意すべきである。本発明の実施例において、プリフェッチ数量とは、サンプルデータ組の数量であり得る。 Note that in one iterative training, the main process will train the deep learning model based on a set of sample data, and one set of sample data will contain at least one sample data. Is. In the embodiment of the present invention, the prefetch quantity can be the quantity of the sample data set.

事前に決定されたプリフェッチ数量更新条件に達してない場合、メインプロセスは、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が現在のプリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取る。 If the predetermined prefetch quantity update condition is not reached, the main process responds that the quantity of sample data currently contained in the prefetch sample data queue has not reached the current prefetch quantity. Read the sample data.

事前に決定されたプリフェッチ数量更新条件に達した場合、メインプロセスは、サンプルデータの現在のプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、メインプロセスは、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取る。ここで、サンプルデータのプリフェッチ数量と目標プリフェッチ数量は、同じでも異なっていてもよい。 When the predetermined prefetch quantity update condition is reached, the main process performs the first update process on the current prefetch quantity of the sample data to obtain the target prefetch quantity, and the main process receives the prefetch sample data queue. Read new sample data in response to the fact that the quantity of sample data currently contained in is not reaching the target prefetch quantity. Here, the prefetch quantity and the target prefetch quantity of the sample data may be the same or different.

具体的に、プリフェッチ数量更新条件は、たとえば以下のａ１～ａ３の中の１つまたは複数を含む。 Specifically, the prefetch quantity update condition includes, for example, one or more of the following a1 to a3.

ａ１は、所定の更新サイクルに達したことである a1 means that a predetermined update cycle has been reached.

ここで、更新サイクルは、プリフェッチ数量を更新するサイクルである。 Here, the update cycle is a cycle for updating the prefetch quantity.

当該更新サイクルは、たとえば時間サイクルであり、たとえば、更新サイクルが１時間であると、１時間ごとに、１回のプリフェッチ数量に対する第１更新処理をトリガーすることができる。 The update cycle is, for example, a time cycle. For example, if the update cycle is one hour, the first update process for one prefetch quantity can be triggered every hour.

当該更新サイクルは、たとえば所定の数量の反復トレーニングであり、たとえば、メインプロセスは、深層学習モデルに対して５回の反復トレーニングを実行するたびに、１回のプリフェッチ数量に対する第１更新処理をトリガーすることができる。ここで、異なる回の反復トレーニングにかかる時間が互いに異なる可能性があるため、異なる更新サイクルの持続時間も異なることになる。 The update cycle is, for example, a predetermined quantity of iterative training, for example, the main process triggers the first update process for one prefetch quantity for every five iterative trainings performed on the deep learning model. can do. Here, the duration of different update cycles will also be different, as the time required for different repetitive trainings may differ from each other.

ａ２は、サンプルデータに基づいて深層学習モデルに対してトレーニングするときのデータスループットが第１閾値よりも大きいことである。 A2 is that the data throughput when training the deep learning model based on the sample data is larger than the first threshold value.

ここで、データスループットは、メインプロセスが深層学習モデルに対してトレーニングを実行するときのサンプルデータ処理速度を示すために使用される。メインプロセスが深層学習モデルに対してトレーニングを実行するときのデータスループットが第１閾値よりも大きいと、プリフェッチサンプルデータキューに格納されているサンプルデータに対する消費速度がより速いと見なされる。ここで、より小さいプリフェッチ数量を維持すると、プリフェッチサンプルデータキューに格納されているサンプルデータの数量が、時間内のトレーニングの消費に追いつかない可能性がある。したがって、プリフェッチサンプルデータキューにプリフェッチされたサンプルデータの数量を増加し、サンプルデータのプリフェッチ数量に対する第１更新処理をトリガーすることが考えられる。 Here, the data throughput is used to indicate the speed of sample data processing as the main process trains on the deep learning model. If the data throughput when the main process trains on the deep learning model is greater than the first threshold, it is considered to consume faster for the sample data stored in the prefetch sample data queue. Here, if a smaller prefetch quantity is maintained, the quantity of sample data stored in the prefetch sample data queue may not keep up with the consumption of training in time. Therefore, it is conceivable to increase the quantity of sample data prefetched in the prefetch sample data queue and trigger the first update process for the prefetch quantity of the sample data.

ａ２.１：ここで、たとえば、以下の方法を採用してデータスループットを得る。
プリフェッチ数量更新条件に達したことに応答して、プリフェッチ数量更新条件に達したときの深層学習モデルに対するトレーニング進行状況に基づいて、複数の履歴反復トレーニングから、少なくとも１つの目標反復トレーニングを確定し、また、各々の目標反復トレーニングに使用するサンプルデータ組に含まれているサンプルデータの数量、および、各々の目標反復トレーニングにかかる時間に基づいて、サンプルデータに基づいて深層学習モデルに対してトレーニングを実行するときのデータスループットを確定する。 a2.1: Here, for example, the following method is adopted to obtain the data throughput.
In response to reaching the prefetch quantity update condition, at least one target iterative training is determined from multiple historical iterative trainings based on the training progress for the deep training model when the prefetch quantity update condition is reached. Also, based on the quantity of sample data contained in the sample data set used for each target iterative training and the time taken for each target iterative training, the deep learning model is trained based on the sample data. Determine the data throughput when running.

ここで、目標反復トレーニングは、たとえば、プリフェッチ数量更新条件に達したタイミングに一番近い少なくとも１つの反復トレーニングである。 Here, the target iterative training is, for example, at least one iterative training closest to the timing when the prefetch quantity update condition is reached.

たとえば、あるメインプロセスは、すでに深層学習モデルに対して５回の反復トレーニングを実行し、かつ、プリフェッチ数量更新条件に達したときに、深層学習モデルに対して６回目の反復トレーニングを実行している。ここで、１つの目標反復トレーニングがある場合、５番目の反復トレーニングを目標反復トレーニングとして確定することができる。当該５番目の反復トレーニングにかかる時間が１５分であると、使用するサンプルデータの数量は６４個であり、データスループットはたとえば６４÷１５である。 For example, one main process has already performed 5 iterative trainings on a deep learning model, and when the prefetch quantity update condition is reached, it has performed a 6th iterative training on the deep learning model. There is. Here, if there is one target repetitive training, the fifth repetitive training can be determined as the target repetitive training. If the time required for the fifth iterative training is 15 minutes, the quantity of sample data used is 64, and the data throughput is, for example, 64/15.

３つの目標反復トレーニングがある場合、３番目、４番目、及び５番目の反復トレーニングを目標反復トレーニングとして確定することができる。３番目、４番目、及び５番目の反復トレーニングにそれぞれかかる時間が１２分、１４分、及び１５分であり、各反復トレーニングに使用するサンプルデータの数量がいずれも６４個であると、データスループットはたとえば６４×３÷（１２＋１４＋１５）であり、単位は個／分である。 If there are three target repetitive trainings, the third, fourth, and fifth repetitive trainings can be defined as target repetitive trainings. Data throughput when the third, fourth, and fifth iterative trainings take 12 minutes, 14 minutes, and 15 minutes, respectively, and the quantity of sample data used for each iterative training is 64, respectively. Is, for example, 64 × 3 ÷ (12 + 14 + 15), and the unit is pieces / minute.

ａ２.２：もう１実施例において、さらに、現在実行している反復トレーニングを目標反復トレーニングとして確定し、現在実行している反復トレーニング中のすでにトレーニングが完了したサンプルの数量および通じた持続時間に基づいて、データスループットを確定することができる。 a2.2: In another embodiment, further, the repetitive training currently being performed is determined as the target repetitive training, and the quantity and duration of the samples already completed during the repetitive training currently being performed are used. Based on this, the data throughput can be determined.

たとえば、あるメインプロセスがすでに深層学習モデルに対して５回の反復トレーニングを実行し、かつ、プリフェッチ数量更新条件に達したときに、深層学習モデルに対して６番目の反復トレーニングを実行している。６番目の反復トレーニングを目標反復トレーニングとして確定することができる。６番目の反復トレーニングで、１つのサンプルデータ組中の６４個のサンプルを使用して深層学習モデルに対してトレーニングを実行する必要がある。現在すでにトレーニングが完了したサンプルデータの数量は３０個であり、現在トレーニング反復が通じた持続時間は４分であると、データスループットはたとえば３０÷４である。 For example, a main process has already performed 5 iterative trainings on a deep learning model, and when the prefetch quantity update condition is reached, it is performing a 6th iterative training on the deep learning model. .. The sixth repetitive training can be established as the target repetitive training. In the sixth iterative training, it is necessary to train the deep learning model using 64 samples in one sample data set. If the quantity of sample data that has already been trained is 30 and the duration of the training iteration is currently 4 minutes, the data throughput is, for example, 30/4.

ａ３は、サンプルデータに基づいて深層学習モデルに対してトレーニングを実行するときのデータスループットが第２閾値未満であることである。 A3 is that the data throughput when training the deep learning model based on the sample data is less than the second threshold value.

ここで、第２閾値は、第１閾値未満である。 Here, the second threshold value is less than the first threshold value.

メインプロセスが深層学習モデルに対して実行トレーニングときのデータスループットが第２閾値未満であると、プリフェッチサンプルデータキューに格納されているサンプルデータに対する消費速度が遅すぎると見なされる。ここで、より大きいプリフェッチ数量を維持すると、プリフェッチサンプルデータキューに格納されているサンプルデータが常に蓄積され、より大きなメモリを占有する可能性がある。したがって、プリフェッチサンプルデータキューにプリフェッチされたサンプルデータの数量を減少し、サンプルデータのプリフェッチ数量に対する第１更新処理をトリガーすることが考えられる。 If the data throughput during execution training of the main process for the deep learning model is less than the second threshold, it is considered too slow to consume the sample data stored in the prefetch sample data queue. Here, if a larger prefetch quantity is maintained, the sample data stored in the prefetch sample data queue is always accumulated and may occupy a larger memory. Therefore, it is conceivable to reduce the quantity of sample data prefetched in the prefetch sample data queue and trigger the first update process for the prefetch quantity of the sample data.

ここで、データスループットの確定方法は、上記のａ２と類似であり、繰り返して説明しない。 Here, the method of determining the data throughput is similar to the above-mentioned a2, and will not be described repeatedly.

プリフェッチ数量更新条件を満たした後に、たとえば、以下の方法を採用してサンプルデータのプリフェッチ数量に対して第１更新処理を実行することができ、すなわち、
プリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得る。 After satisfying the prefetch quantity update condition, for example, the first update process can be executed for the prefetch quantity of the sample data by adopting the following method, that is,
Based on the total memory space currently occupied by the prefetch sample data queue and the upper limit threshold for memory usage, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.

例示的に、たとえば、プリフェッチサンプルデータキューが占有している合計メモリスペースがメモリ使用上限閾値に達したか否かを検出し、検出結果に基づいてサンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることができる。 Illustratively, for example, it is detected whether or not the total memory space occupied by the prefetch sample data queue has reached the memory usage upper limit threshold value, and the first update process is performed for the prefetch quantity of the sample data based on the detection result. Can be executed to obtain the target prefetch quantity.

ここで、プリフェッチサンプルデータキューが占有している合計メモリスペースとは、すべてのメインプロセスに対応するサンプルデータキューが占有している合計メモリスペースを表す。 Here, the total memory space occupied by the prefetch sample data queue represents the total memory space occupied by the sample data queue corresponding to all the main processes.

具体的に、プリフェッチサンプルデータキューが現在占有している合計メモリスペースがメモリ使用上限閾値に達してない場合、プリフェッチ数量を第１調節ステップサイズだけ増加して、目標プリフェッチ数量を得るし、および／または、
プリフェッチサンプルデータキューが現在占有している合計メモリスペースがメモリ使用上限閾値を達した場合、プリフェッチ数量を第２調節ステップサイズだけ減少して、目標プリフェッチ数量を得る。 Specifically, if the total memory space currently occupied by the prefetch sample data queue has not reached the upper memory usage threshold, the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity, and / or,
When the total memory space currently occupied by the prefetch sample data queue reaches the upper memory usage threshold, the prefetch quantity is reduced by the second adjustment step size to obtain the target prefetch quantity.

ここで、第１調節ステップサイズとは、サンプルデータのプリフェッチ数量を増加するときの調節ステップサイズを表し、第２調節ステップサイズとは、サンプルデータのプリフェッチ数量を減少するときの調節ステップサイズを表す。 Here, the first adjustment step size represents the adjustment step size when the prefetch quantity of the sample data is increased, and the second adjustment step size represents the adjustment step size when the prefetch quantity of the sample data is decreased. ..

ここで、第１調節ステップサイズと第２調節ステップサイズは、同じサイズまたは異なるサイズを有し得る。 Here, the first adjustment step size and the second adjustment step size may have the same size or different sizes.

例示的に、第１調節ステップサイズは、たとえば第２調節ステップサイズよりも大きいし、このような場合、プリフェッチ数量を増加する必要があるときに、プリフェッチ数量をより速く増加することによって、プリフェッチサンプルデータキューに格納したサンプルデータがより多い数量により速く達するようにより速く保証し、後続のトレーニング反復サイクルの使用の要件を満たすことができ、プリフェッチ数量が少なすぎてモデルトレーニング過程が遅延されることを回避することができる。同時に、プリフェッチ数量を減少する必要があるときに、プリフェッチ数量をより緩やかに減少して、プリフェッチサンプルデータキューの長さの変化がよりスムーズになるように保証することができ、プリフェッチサンプルデータの数量の急激な減少によるトレーニング過程のショックを回避することができる。 Illustratively, the first adjustment step size is larger than, for example, the second adjustment step size, and in such cases, when the prefetch quantity needs to be increased, the prefetch sample is increased by increasing the prefetch quantity faster. It guarantees that the sample data stored in the data queue reaches a larger quantity faster, can meet the requirements for use in subsequent training iteration cycles, and the prefetch quantity is too small to delay the model training process. It can be avoided. At the same time, when the prefetch quantity needs to be reduced, the prefetch quantity can be reduced more slowly to ensure smoother changes in the length of the prefetch sample data queue, and the quantity of prefetch sample data. It is possible to avoid the shock of the training process due to the sudden decrease of.

さらに、もう１実施例において、さらに、プリフェッチサンプルデータキューが現在占有している合計メモリスペース、メモリ使用上限閾値、および、深層学習モデルをトレーニングするためのデータスループットに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることができる。 Furthermore, in another embodiment, the prefetch quantity of the sample data is further based on the total memory space currently occupied by the prefetch sample data queue, the upper limit of memory usage, and the data throughput for training the deep learning model. The target prefetch quantity can be obtained by executing the first update process for the data.

ここで、上記の実施例に基づいて、たとえば、プリフェッチサンプルデータキューが現在占有している合計メモリスペースがメモリ使用上限閾値に達してない場合、
深層学習モデルをトレーニングするためのデータスループットが所定のデータスループット条件を満たす場合、プリフェッチ数量を第１調節ステップサイズだけ増加して、目標プリフェッチ数量を得る。 Here, based on the above embodiment, for example, if the total memory space currently occupied by the prefetch sample data queue has not reached the memory usage upper threshold.
If the data throughput for training the deep learning model satisfies a predetermined data throughput condition, the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity.

もう１実施例において、プリフェッチサンプルデータキューが現在占有している合計メモリスペースがメモリ使用上限閾値に達してないし、かつ、データスループット未が所定のデータスループット条件を満たす場合、プリフェッチ数量を第３調節ステップサイズだけ減少して目標プリフェッチ数量を得ることをさらに含む。 In another embodiment, if the total memory space currently occupied by the prefetch sample data queue does not reach the memory usage upper limit threshold and the data throughput not yet meets the predetermined data throughput condition, the prefetch quantity is adjusted to the third adjustment. It further includes reducing the step size by the step size to get the target prefetch quantity.

具体的に、第１調節ステップサイズと第３調節ステップサイズは、同じサイズまたは異なるサイズを有し得る。類似に、第２調節ステップサイズと第３調節ステップサイズは、同じサイズまたは異なるサイズを有し得る。 Specifically, the first adjustment step size and the third adjustment step size may have the same size or different sizes. Similarly, the second adjustment step size and the third adjustment step size may have the same size or different sizes.

もう１実施例において、プリフェッチサンプルデータキューが現在占有している合計メモリスペースがメモリ使用上限閾値に達していると、深層学習モデルをトレーニングするためのデータスループットが所定のデータスループット条件を満たすか否かに関わらず、プリフェッチ数量を第２調節ステップサイズだけ減少して、目標プリフェッチ数量を得る。 In another embodiment, if the total memory space currently occupied by the prefetch sample data queue has reached the upper memory usage threshold, whether the data throughput for training the deep learning model meets a predetermined data throughput condition. Regardless, the prefetch quantity is reduced by the second adjustment step size to get the target prefetch quantity.

上記の所定のデータスループット条件は、以下のｂ１～ｂ２の中の少なくとも１つを含む。
ｂ１は、データスループットの現在数値が履歴数値よりも大きいことであり、ここで、履歴数値は、現在の反復トレーニングの前の複数の履歴反復トレーニングに対応するデータスループットの平均値、または、現在の反復トレーニングの前の１回の反復トレーニングのデータスループットの数値である。 The above predetermined data throughput condition includes at least one of the following b1 to b2.
b1 is that the current value of the data throughput is larger than the historical value, where the historical value is the average value of the data throughput corresponding to the plurality of historical iterative trainings before the current iterative training, or the current value. It is a numerical value of the data throughput of one iterative training before the iterative training.

具体的な確定方法は、たとえば、上記のａ２.１を参照すればよく、ここでは繰り返して説明しない。 For a specific determination method, for example, a2.1 above may be referred to, and the description will not be repeated here.

ｂ２は、データスループットの現在数値がデータスループット閾値よりも大きいことである。 b2 is that the current value of the data throughput is larger than the data throughput threshold.

ここで、データスループットの現在数値は、たとえば、上記のａ２.２を参照すればよく、ここでは繰り返して説明しない。 Here, the current numerical value of the data throughput may be referred to, for example, a2.2 above, and will not be repeatedly described here.

また、本発明の実施例によって提供されるデータ処理方法は、上記の実施例に基づいて、
プリフェッチ数量に対する調節ステップサイズ第２更新処理を実行して目標調節ステップサイズを得ることをさらに含み、ここで、目標調節ステップサイズは、プリフェッチ数量の次の１回の更新処理に使用される。 Further, the data processing method provided by the embodiment of the present invention is based on the above embodiment.
It further includes performing an adjustment step size second update process for the prefetch quantity to obtain the target adjustment step size, where the target adjustment step size is used for the next one update process of the prefetch quantity.

ここで、たとえば、第１更新処理でプリフェッチ数量を増加する場合、プリフェッチ数量の調節ステップサイズを増加して目標調節ステップサイズを得るし、および／または、
在第１更新処理中減少プリフェッチ数量の場合、減少プリフェッチ数量の調節ステップサイズ以目標調節ステップサイズを得る。 Here, for example, when the prefetch quantity is increased in the first update process, the adjustment step size of the prefetch quantity is increased to obtain the target adjustment step size, and / or.
In the case of the reduced prefetch quantity during the first update process, the target adjustment step size after the adjustment step size of the reduced prefetch quantity is obtained.

具体的な例は、以下のとおりである。 Specific examples are as follows.

Ｍ１、Ｍ２、Ｍ３、Ｍ４、及びＭ５のような合計５個のプロセスが同一の深層学習モデルのトレーニングタスクを並列に実行している。 A total of five processes, such as M1, M2, M3, M4, and M5, are executing training tasks of the same deep learning model in parallel.

ここで、Ｍ１、Ｍ２、Ｍ３、Ｍ４、及びＭ５は、それぞれ本発明の実施例によって提供されるデータ処理方法を実行する。 Here, M1, M2, M3, M4, and M5 each execute the data processing method provided by the embodiment of the present invention.

Ｍ１が当該データ処理方法を実行する例をとる。 Take an example in which M1 executes the data processing method.

例１：メモリ使用上限閾値に基づいてサンプルデータのプリフェッチ数量に対して第１更新処理を実行する。 Example 1: The first update process is executed for the prefetch quantity of the sample data based on the memory usage upper limit threshold.

１.１：Ｍ１により、Ｍ１に対応するプリフェッチサンプルデータキューＬ１、Ｍ２に対応するプリフェッチサンプルデータキューＬ２、Ｍ３に対応するプリフェッチサンプルデータキューＬ３、Ｍ４に対応するプリフェッチサンプルデータキューＬ４、および、Ｍ５に対応するプリフェッチサンプルデータキューＬ５が占有している合計メモリスペースがメモリ使用上限閾値に達したか否かを検出し、達してない場合には、１.２（ａ）と１.２（ｂ）に遷移し、達した場合またはＭ１が操作システムのメインプロセスにメモリを申し込むときに申し込み失敗になった場合には、１.３に遷移する。 1.1 By M1, the prefetch sample data queue L1 corresponding to M1, the prefetch sample data queue L2 corresponding to M2, the prefetch sample data queue L3 corresponding to M3, the prefetch sample data queue L4 corresponding to M4, and M5 Detects whether or not the total memory space occupied by the prefetch sample data queue L5 corresponding to the above has reached the upper limit of memory usage, and if not, 1.2 (a) and 1.2 (b). ), And if the application fails when M1 applies for memory to the main process of the operation system, the transition to 1.3 occurs.

１.２（ａ）：Ｍ１により、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量＋第１調節ステップサイズ」であり、ここで、第１調節ステップサイズは、前の１回の調節ステップサイズに対して第２更新処理を実行して得た目標調節ステップサイズである。 1.2 (a): By M1, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
"Target prefetch quantity = prefetch quantity + first adjustment step size", where the first adjustment step size is the target adjustment obtained by executing the second update process for the previous one adjustment step size. The step size.

１.２（ｂ）：Ｍ１により、第１調節ステップサイズに対して第２更新処理を実行する。
「今回の第２更新処理の後に得られた目標調節ステップサイズ＝第１調節ステップサイズ*２」であり、すなわち、次の１回の第１更新処理に使用する第１調節ステップサイズは、今回の第１更新処理に使用する調節ステップサイズの２倍である。 1.2 (b): The second update process is executed for the first adjustment step size by M1.
"Target adjustment step size obtained after this second update process = first adjustment step size * 2", that is, the first adjustment step size used for the next one first update process is this time. It is twice the adjustment step size used for the first update process of.

１.３：Ｍ１により、第２調節ステップサイズが１よりも大きいか否かを検出し、第２調節ステップサイズが１よりも大きいと、１.４（ａ）と１.４（ｂ）に遷移し、大きくないと、１.５に遷移する。 1.3: M1 detects whether the second adjustment step size is larger than 1, and if the second adjustment step size is larger than 1, it becomes 1.4 (a) and 1.4 (b). It transitions, and if it is not large, it transitions to 1.5.

１.４（ａ）：Ｍ１により、第２調節ステップサイズに対して第２更新処理を実行し、
「調整後の第２調節ステップサイズ＝調整前の第２調節ステップサイズ／２」である。 1.4 (a): The second update process is executed for the second adjustment step size by M1.
"Second adjustment step size after adjustment = second adjustment step size before adjustment / 2".

１.４（ｂ）：Ｍ１により、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量－第２調節ステップサイズ」である。ここで、第２調節ステップサイズは、１.４（ａ）中の調整後の調節ステップサイズである。 1.4 (b): By M1, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
"Target prefetch quantity = prefetch quantity-second adjustment step size". Here, the second adjustment step size is the adjusted adjustment step size in 1.4 (a).

１.５：Ｍ１により、第２調節ステップサイズをそのまま維持し、そのまま維持された第２調節ステップサイズに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量－そのまま維持された第２調節ステップサイズ」である。 1.5: By M1, the second adjustment step size is maintained as it is, and based on the maintained second adjustment step size, the first update process is executed for the prefetch quantity of the sample data to set the target prefetch quantity. Get,
"Target prefetch quantity = prefetch quantity-second adjustment step size maintained as it is".

例２：Ｍ１により、メモリ使用上限閾値、および、深層学習モデルをトレーニングするためのデータスループットに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行する。 Example 2: According to M1, the first update process is executed for the prefetch quantity of the sample data based on the memory usage upper limit threshold value and the data throughput for training the deep learning model.

２.１：Ｍ１により、Ｍ１に対応するプリフェッチサンプルデータキューＬ１、Ｍ２に対応するプリフェッチサンプルデータキューＬ２、Ｍ３に対応するプリフェッチサンプルデータキューＬ３、Ｍ４に対応するプリフェッチサンプルデータキューＬ４、および、Ｍ５に対応するプリフェッチサンプルデータキューＬ５が占有している合計メモリスペースがメモリ使用上限閾値に達したか否かを検出し、達してない場合には、２.２に遷移し、達した場合またはＭ１が操作システムのメインプロセスにメモリを申し込むときに申し込み失敗になった場合には、２.７に遷移する。 2.1 According to M1, the prefetch sample data queue L1 corresponding to M1, the prefetch sample data queue L2 corresponding to M2, the prefetch sample data queue L3 corresponding to M3, the prefetch sample data queue L4 corresponding to M4, and M5 Detects whether or not the total memory space occupied by the prefetch sample data queue L5 corresponding to the above has reached the memory usage upper limit threshold, and if not, transitions to 2.2, and if it is reached or M1. If the application fails when applying for memory to the main process of the operation system, the transition to 2.7 occurs.

２.２：Ｍ１により、深層学習モデルをトレーニングするときのデータスループットが所定のデータスループット条件を満たすか否かを検出し、満たすと、２.３（ａ）と２.３（ｂ）に遷移し、満たさないと、２.４（ａ）と２.４（ｂ）に遷移する。 2.2: M1 detects whether or not the data throughput when training a deep learning model satisfies a predetermined data throughput condition, and if it is satisfied, it transitions to 2.3 (a) and 2.3 (b). However, if it is not satisfied, it transitions to 2.4 (a) and 2.4 (b).

２.３（ａ）：Ｍ１により、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量＋第１調節ステップサイズ」であり、ここで、第１調節ステップサイズは、前の１回の調節ステップサイズに対して第２更新処理を実行して得た目標調節ステップサイズである。 2.3 (a): By M1, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
"Target prefetch quantity = prefetch quantity + first adjustment step size", where the first adjustment step size is the target adjustment obtained by executing the second update process for the previous one adjustment step size. The step size.

２.３（ｂ）：Ｍ１により、第１調節ステップサイズに対して第２更新処理を実行し、
「今回の第２更新処理の後に得られた目標調節ステップサイズ＝第１調節ステップサイズ*２」である。 2.3 (b): The second update process is executed for the first adjustment step size by M1.
"Target adjustment step size obtained after this second update process = first adjustment step size * 2".

２.４（ａ）：Ｍ１により、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量－第３調節ステップサイズ」である。 2.4 (a): By M1, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
"Target prefetch quantity = prefetch quantity-third adjustment step size".

２.４（ｂ）：Ｍ１により、第３調節ステップサイズが１よりも大きいか否かを検出し、第３調節ステップサイズが１よりも大きいと、２.５に遷移し、大きくないと、２.６に遷移する。 2.4 (b): M1 detects whether or not the third adjustment step size is larger than 1, and if the third adjustment step size is larger than 1, it transitions to 2.5, and if it is not large, it transitions to 2.5. Transition to 2.6.

２.５：Ｍ１により、第３調節ステップサイズに対して第２更新処理を実行し、
「調整後の第３調節ステップサイズ＝調整前の第３調節ステップサイズ／２」である。 2.5: By M1, the second update process is executed for the third adjustment step size.
"Third adjustment step size after adjustment = third adjustment step size before adjustment / 2".

２.６：Ｍ１により、第３調節ステップサイズをそのまま維持する。当該第３調節ステップサイズは、次の１回のプリフェッチ数量に対して第１更新処理を実行するときに使用される。 2.6: By M1, the third adjustment step size is maintained as it is. The third adjustment step size is used when the first update process is executed for the next one prefetch quantity.

２.７：Ｍ１により、第２調節ステップサイズが１よりも大きいか否かを検出し、第２調節ステップサイズが１よりも大きいと、２.８（ａ）と２.８（ｂ）に遷移し、大きくないと、２.９に遷移する。 2.7: M1 detects whether the second adjustment step size is larger than 1, and if the second adjustment step size is larger than 1, it becomes 2.8 (a) and 2.8 (b). It transitions, and if it is not large, it transitions to 2.9.

２.８（ａ）：Ｍ１により、第２調節ステップサイズに対して第２更新処理を実行し、
「調整後の第２調節ステップサイズ＝調整前の第２調節ステップサイズ／２」である。 2.8 (a): The second update process is executed for the second adjustment step size by M1.
"Second adjustment step size after adjustment = second adjustment step size before adjustment / 2".

２.８（ｂ）：Ｍ１により、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量－第２調節ステップサイズ」である。ここで、第２調節ステップサイズは、２.８（ａ）中の調整後の調節ステップサイズである。 2.8 (b): By M1, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
"Target prefetch quantity = prefetch quantity-second adjustment step size". Here, the second adjustment step size is the adjusted adjustment step size in 2.8 (a).

２.９：Ｍ１により、第２調節ステップサイズをそのまま維持し、そのまま維持された第２調節ステップサイズに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
「目標プリフェッチ数量＝プリフェッチ数量－そのまま維持された第２調節ステップサイズ」である。 2.9: By M1, the second adjustment step size is maintained as it is, and based on the maintained second adjustment step size, the first update process is executed for the prefetch quantity of the sample data to set the target prefetch quantity. Get,
"Target prefetch quantity = prefetch quantity-second adjustment step size maintained as it is".

上記の例中の各々のステップを通じて、サンプルデータのプリフェッチ数量に対する第１更新処理の実行を実行する。 Through each step in the above example, the first update process is executed for the prefetch quantity of the sample data.

ＩＩ：上記のＳ１０２において、メインプロセスは、プリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が目標プリフェッチ数量に達してないときに、直接サンプルデータベースから新たなサンプルデータを読み取ってもよいし、サブプロセスと通信してサブプロセスを制御することによってサンプルデータベースから新たなサンプルデータを読み取ってもよい。 II: In S102 above, the main process may read new sample data directly from the sample database when the quantity of sample data currently contained in the prefetch sample data queue has not reached the target prefetch quantity. , You may read new sample data from the sample database by communicating with the subprocess and controlling the subprocess.

メインプロセスが直接サンプルデータベースから新たなサンプルデータを読み取る場合、メインプロセスは、プリフェッチサンプルデータキューから抽出したサンプルデータの数量、および、読み取ったプリフェッチサンプルデータキュー中のサンプルデータの数量に基づいて、プリフェッチサンプルデータキューに現在格納されているサンプルデータの数量を確定してから、当該数量と目標プリフェッチ数量とを比較し、当該数量が目標プリフェッチ数量未満である場合、直接サンプルデータベースから新たなサンプルデータを読み取って、プリフェッチサンプルデータキューに格納することができる。 If the main process reads new sample data directly from the sample database, the main process prefetches based on the quantity of sample data extracted from the prefetch sample data queue and the quantity of sample data in the read prefetch sample data queue. Determine the quantity of sample data currently stored in the sample data queue, then compare the quantity with the target prefetch quantity, and if the quantity is less than the target prefetch quantity, pull the new sample data directly from the sample database. It can be read and stored in the prefetch sample data queue.

メインプロセスがサブプロセスを制御してサンプルデータベースから新たなサンプルデータを読み取る場合、メインプロセスは、サブプロセスとの通信によって、プリフェッチサンプルデータキューに現在格納されているサンプルデータの数量を確定してから、当該数量と目標プリフェッチ数量とを比較し、当該数量が目標プリフェッチ数量未満である場合、サブプロセスにサンプルデータ読み取り命令を送信し、ここで、当該サンプルデータ読み取る命令には、読み取る必要があるサンプルデータの数量情報が含まれている。サブプロセスは、メインプロセスが送信したサンプルデータ読み取り命令を受信した後、サンプルデータ読み取り命令に含まれている数量情報に基づいて、新たなサンプルデータを読み取ってプリフェッチサンプルデータキューに格納することができる。 If the main process controls the subprocess to read new sample data from the sample database, the main process communicates with the subprocess to determine the quantity of sample data currently stored in the prefetch sample data queue. , Compare the quantity with the target prefetch quantity, and if the quantity is less than the target prefetch quantity, send a sample data read instruction to the subprocess, where the instruction to read the sample data is a sample that needs to be read. Contains quantity information for the data. After receiving the sample data read instruction sent by the main process, the subprocess can read new sample data and store it in the prefetch sample data queue based on the quantity information contained in the sample data read instruction. ..

本発明の実施例において、メインプロセスは、プリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、データキューに現在含まれているサンプルデータ量が目標プリフェッチ数量に達してないときに、サンプルデータプールから新たなサンプルデータを読み取るため、メインプロセスは、１回の反復トレーニングを実行した後に、次の１回の反復トレーニングに必要なサンプルデータの読み取りがすでに完了されることになる。実際のほとんどの場合、メインプロセスがデータを読み取るのにかかる時間は、１つの反復トレーニングを実行するのにかかる時間よりも短いことが多いため、データキュー中に十分な数量のサンプルデータが常に格納されて後続のいくつかの反復トレーニングの使用を満たすように確保することができ、メインプロセスが特定のサンプルデータの読み取るのにかかる時間がながすぎても、サンプル数量が時間内に読み取られなくて反復トレーニングに遅延が発生されることを回避することができ、トレーニング効率を向上させた。 In the embodiment of the present invention, the main process executes the first update process on the prefetch quantity to obtain the target prefetch quantity, and when the sample data amount currently contained in the data queue has not reached the target prefetch quantity. In addition, to read new sample data from the sample data pool, the main process will have already completed the reading of the sample data required for the next one iterative training after performing one iterative training. .. In most cases in practice, the time it takes for the main process to read the data is often shorter than the time it takes to perform one iterative training, so a sufficient amount of sample data is always stored in the data queue. Can be ensured to meet the use of several subsequent iterative trainings, and the sample quantity will not be read in time even if the main process takes too long to read certain sample data. It was possible to avoid delays in repetitive training and improve training efficiency.

当業者は、具体的な実施形態に説明した方法で、各ステップの書き込み順序は、厳密な実行順序を意味して実施過程に対する制限を構成するのではなく、各ステップの具体的な実行順序はその機能および可能の内部ロジックによって決定されることを理解すべきである。 Those skilled in the art will appreciate that, in the manner described in the specific embodiments, the writing order of each step does not constitute a restriction on the implementation process, meaning a strict execution order, but the specific execution order of each step. It should be understood that it is determined by its function and possible internal logic.

同じ発明概念に基づいて、本発明の実施例は、データ処理方法に対応するデータ処理装置をさらに提供し、本発明の実施例における装置が解決しようとする問題の原理は、本発明の実施例の上記のデータ処理方法と同様であるため、装置の実施が方法の実施を参照することができ、繰り返された部分は繰り返して説明しない。 Based on the same concept of the invention, the embodiments of the present invention further provide a data processing apparatus corresponding to the data processing method, and the principle of the problem to be solved by the apparatus in the embodiments of the present invention is the embodiment of the present invention. Since it is the same as the above-mentioned data processing method in the above, the implementation of the device can refer to the implementation of the method, and the repeated part is not described repeatedly.

図２は、本発明の実施例によって提供されるデータ処理装置の模式図であり、前記装置は、１つまたは複数のプロセスを含む、深層学習モデルのトレーニングに適用され、前記装置は、第１更新モジュール２１と、読取りモジュール２２と、を備え、ここで、
第１更新モジュール２１は、前記１つまたは複数のプロセスの中の１つの目標プロセスに対して、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、
読取りモジュール２２は、前記目標プロセスに対応するプリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納する。 FIG. 2 is a schematic diagram of a data processing apparatus provided by an embodiment of the present invention, wherein the apparatus is applied to training a deep learning model including one or more processes, and the apparatus is the first. The update module 21 and the read module 22 are provided here.
The first update module 21 executes the first update process on the prefetch quantity of the sample data for one target process in the one or more processes to obtain the target prefetch quantity.
The read module 22 reads and reads new sample data in response to the fact that the quantity of sample data currently contained in the prefetch sample data queue corresponding to the target process has not reached the target prefetch quantity. The new sample data is stored in the prefetch sample data queue.

本発明の実施例において、メインプロセスは、プリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得、プリフェッチサンプルデータキューに現在含まれているサンプルデータ量が目標プリフェッチ数量に達してないときに、サンプルデータプールから新たなサンプルデータを読み取るため、メインプロセスは、１回の反復トレーニングを実行した後に、次の１回の反復トレーニングに必要なサンプルデータの読み取りがすでに完了されることになる。実際のほとんどの場合、メインプロセスがデータを読み取るのにかかる時間は、１つの反復トレーニングを実行するのにかかる時間よりも短いことが多いため、データキュー中に十分な数量のサンプルデータが常に格納されて後続のいくつかの反復トレーニングの使用を満たすように確保することができ、メインプロセスが特定のサンプルデータの読み取るのにかかる時間がながすぎても、サンプル数量が時間内に読み取られなくて反復トレーニングに遅延が発生されることを回避することができ、トレーニング効率を向上させた。 In the embodiment of the present invention, the main process performs the first update process on the prefetch quantity to obtain the target prefetch quantity, and the sample data amount currently contained in the prefetch sample data queue reaches the target prefetch quantity. In order to read new sample data from the sample data pool when it is not available, the main process should have performed one iterative training and then already read the sample data required for the next one iterative training. become. In most cases in practice, the time it takes for the main process to read the data is often shorter than the time it takes to perform one iterative training, so a sufficient amount of sample data is always stored in the data queue. Can be ensured to meet the use of several subsequent iterative trainings, and the sample quantity will not be read in time even if the main process takes too long to read certain sample data. It was possible to avoid delays in repetitive training and improve training efficiency.

可能な１実施形態において、前記第１更新モジュール２１は、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得るときに、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得る。 In one possible embodiment, when the first update module 21 executes the first update process on the prefetch quantity of the sample data to obtain the target prefetch quantity,
Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper memory usage threshold, the target prefetch is performed by performing the first update process on the prefetch quantity of the sample data. Get the quantity.

可能な１実施形態において、前記第１更新モジュール２１は、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得るときに、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペース、メモリ使用上限閾値、および、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して前記目標プリフェッチ数量を得る。 In one possible embodiment, the first update module 21 is based on the total memory space and memory usage limit threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. When the first update process is executed for the prefetch quantity to obtain the target prefetch quantity,
The total memory space currently occupied by the prefetch sample data queue for the one or more processes, the upper memory usage threshold, and the data throughput to perform training on the deep learning model of the target process. The first update process is executed for the prefetch quantity of the sample data based on the above, and the target prefetch quantity is obtained.

可能な１実施形態において、前記第１更新モジュール２１は、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得るときに、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得るし、および／または、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達した場合、前記プリフェッチ数量を第２調節ステップサイズだけ減少して前記目標プリフェッチ数量を得る。 In one possible embodiment, the first update module 21 is based on the total memory space and memory usage limit threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. When the first update process is executed for the prefetch quantity to obtain the target prefetch quantity,
If the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper threshold, the prefetch quantity is increased by the first adjustment step size to the target. Get the prefetch quantity and / or
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes reaches the upper memory usage threshold, the prefetch quantity is reduced by the second adjustment step size to achieve the target prefetch. Get the quantity.

可能な１実施形態において、前記第１更新モジュール２１は、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得るときに、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットが所定のデータスループット条件を満たす場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得る。 In one possible embodiment, if the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold. When the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity,
The total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the upper memory usage threshold and training is performed on the deep learning model of the target process. When the data throughput for this is satisfied with a predetermined data throughput condition, the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity.

可能な１実施形態において、前記第１更新モジュール２１は、さらに、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記データスループットが前記所定のデータスループット条件を満たさない場合、前記プリフェッチ数量を第３調節ステップサイズだけ減少して前記目標プリフェッチ数量を得る。 In one possible embodiment, the first update module 21 further
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold, and the data throughput does not meet the predetermined data throughput condition. , The prefetch quantity is reduced by the third adjustment step size to obtain the target prefetch quantity.

可能な１実施形態において、前記所定のデータスループット条件は、
前記データスループットの現在数値が履歴数値よりも大きいことと、
前記データスループットの現在数値がデータスループット閾値よりも大きいことと、の中の少なくとも１つを含み、ここで、前記履歴数値は、現在の反復トレーニングの前の複数の履歴反復トレーニングのときの前記データスループットの平均値、または、現在の反復トレーニングの前の１回の反復トレーニングのときの前記データスループットの数値である。 In one possible embodiment, the predetermined data throughput condition is
The current value of the data throughput is larger than the historical value, and
The current value of the data throughput is greater than the data throughput threshold and includes at least one of, where the historical value is the data from a plurality of historical iterative training prior to the current iterative training. It is the average value of the throughput, or the numerical value of the data throughput at the time of one repetitive training before the current repetitive training.

可能な１実施形態において、前記装置は、前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得るための第２更新モジュール２３をさらに備え、ここで、前記目標調節ステップサイズは、前記プリフェッチ数量の次の１回の更新処理に使用される。 In one possible embodiment, the apparatus further comprises a second update module 23 for performing a second update process on the prefetch quantity adjustment step size to obtain a target adjustment step size, wherein said. The target adjustment step size is used for the next one-time update process of the prefetch quantity.

可能な１実施形態において、前記第２更新モジュール２３は、前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得るときに、
前記第１更新処理中に前記プリフェッチ数量を増加する場合、前記プリフェッチ数量の調節ステップサイズを増加し、および／または、
前記第１更新処理中に前記プリフェッチ数量を減少する場合、前記プリフェッチ数量の調節ステップサイズを減少する。 In one possible embodiment, the second update module 23 performs a second update process on the adjustment step size of the prefetch quantity to obtain a target adjustment step size.
When the prefetch quantity is increased during the first update process, the adjustment step size of the prefetch quantity is increased and / or.
When the prefetch quantity is reduced during the first update process, the adjustment step size of the prefetch quantity is reduced.

装置の各モジュールの処理フローおよび各モジュール間の相互作用フローの記載は、上述した方法の実施例の関連する説明を参照することができ、ここでは繰り返して説明しない。 The description of the processing flow of each module of the apparatus and the interaction flow between each module can refer to the relevant description of the embodiments of the above method and will not be repeated herein.

本発明の実施例は、コンピュータデバイス３０をさらに提供する。図３は、本発明の実施例によって提供されるコンピュータデバイス３０の構成を示す模式図であり、当該デバイスは、
プロセッサ３１と、メモリ３２と、バス３３と、を備える。メモリ３２は、命令を格納し、当該メモリ３２は、内部メモリ３２１と外部メモリ３２２とを備え、ここでの内部メモリ３２１はメモリとも呼ばれ、プロセッサ３１の演算データおよびハードディスクなどの外部メモリ３２２と交換するデータを一時的に格納する。プロセッサ３１は、内部メモリ３２１を介して外部メモリ３２２とデータ交換を実行し、前記コンピュータデバイス３００が運行されると、前記プロセッサ３１と前記メモリ３２との間はバス３３を介して通信することによって、前記プロセッサ３１が以下の命令を実行するようにし、当該命令は、
前記１つまたは複数のプロセスの中の１つの目標プロセスに対して、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることと、
前記目標プロセスに対応するプリフェッチサンプルデータキューに現在含まれているサンプルデータの数量が前記目標プリフェッチ数量に達してないことに応答して、新たなサンプルデータを読み取って、読み取った前記新たなサンプルデータを前記プリフェッチサンプルデータキューン格納することと、を含む。 The embodiments of the present invention further provide the computer device 30. FIG. 3 is a schematic view showing the configuration of the computer device 30 provided by the embodiment of the present invention.
It includes a processor 31, a memory 32, and a bus 33. The memory 32 stores instructions, and the memory 32 includes an internal memory 321 and an external memory 322. The internal memory 321 here is also referred to as a memory, and includes arithmetic data of the processor 31 and an external memory 322 such as a hard disk. Temporarily store the data to be exchanged. The processor 31 executes data exchange with the external memory 322 via the internal memory 321, and when the computer device 300 is operated, the processor 31 and the memory 32 communicate with each other via the bus 33. , The processor 31 executes the following instruction, and the instruction is
For one target process in the one or more processes, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
The new sample data is read and read in response to the fact that the quantity of sample data currently contained in the prefetch sample data queue corresponding to the target process has not reached the target prefetch quantity. To store the prefetch sample data queue.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることを含む。 In one possible embodiment, in the instruction executed by the processor 31, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper memory usage threshold, the target prefetch is performed by performing the first update process on the prefetch quantity of the sample data. Includes getting quantity.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペース、メモリ使用上限閾値、および、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットに基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, in an instruction executed by processor 31, based on the total memory space and memory usage upper bound threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. To obtain the target prefetch quantity by executing the first update process for the prefetch quantity of the sample data,
The total memory space currently occupied by the prefetch sample data queue for the one or more processes, the upper memory usage threshold, and the data throughput to perform training on the deep learning model of the target process. The first update process is executed for the prefetch quantity of the sample data based on the above, and the target prefetch quantity is obtained.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースおよびメモリ使用上限閾値に基づいて、サンプルデータのプリフェッチ数量に対して第１更新処理を実行して目標プリフェッチ数量を得ることは、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ること、および／または、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達した場合、前記プリフェッチ数量を第２調節ステップサイズだけ減少して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, in an instruction executed by processor 31, based on the total memory space and memory usage upper bound threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. To obtain the target prefetch quantity by executing the first update process for the prefetch quantity of the sample data,
If the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper threshold, the prefetch quantity is increased by the first adjustment step size to the target. Get the prefetch quantity and / or
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes reaches the upper memory usage threshold, the prefetch quantity is reduced by the second adjustment step size to achieve the target prefetch. Includes getting quantity.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してない場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ることは、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記目標プロセスの前記深層学習モデルに対してトレーニングを実行するためのデータスループットが所定のデータスループット条件を満たす場合、前記プリフェッチ数量を第１調節ステップサイズだけ増加して前記目標プリフェッチ数量を得ることを含む。 In one possible embodiment, among the instructions executed by the processor 31, the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes reaches the memory usage upper limit threshold. If not, increasing the prefetch quantity by the first adjustment step size to obtain the target prefetch quantity
The total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the upper memory usage threshold and training is performed on the deep learning model of the target process. When the data throughput for the purpose satisfies a predetermined data throughput condition, the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記方法は、
前記１つまたは複数のプロセスに対応するプリフェッチサンプルデータキューが現在占有している合計メモリスペースが前記メモリ使用上限閾値に達してないし、かつ、前記データスループットが前記所定のデータスループット条件を満たさない場合、前記プリフェッチ数量を第３調節ステップサイズだけ減少して前記目標プリフェッチ数量を得ることをさらに含む。 In one possible embodiment, among the instructions executed by the processor 31, the method is:
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold, and the data throughput does not meet the predetermined data throughput condition. Further includes reducing the prefetch quantity by a third adjustment step size to obtain the target prefetch quantity.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記所定のデータスループット条件は、
前記データスループットの現在数値が履歴数値よりも大きいことと、
前記データスループットの現在数値がデータスループット閾値よりも大きいことと、の中の少なくとも１つを含み、ここで、前記履歴数値は、現在の反復トレーニングの前の複数の履歴反復トレーニングのときの前記データスループットの平均値、または、現在の反復トレーニングの前の１回の反復トレーニングのときの前記データスループットの数値である。 In one possible embodiment, in the instruction executed by the processor 31, the predetermined data throughput condition is
The current value of the data throughput is larger than the historical value, and
The current value of the data throughput is greater than the data throughput threshold and includes at least one of, where the historical value is the data from a plurality of historical iterative training prior to the current iterative training. It is the average value of the throughput, or the numerical value of the data throughput at the time of one repetitive training before the current repetitive training.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記方法は、
前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得ることをさらに含み、ここで、前記目標調節ステップサイズは、前記プリフェッチ数量の次の１回の更新処理に使用される。 In one possible embodiment, among the instructions executed by the processor 31, the method is:
Further including executing a second update process for the adjustment step size of the prefetch quantity to obtain a target adjustment step size, where the target adjustment step size is the next one update process of the prefetch quantity. Used for.

可能な１実施形態において、プロセッサ３１によって実行される命令の中で、前記プリフェッチ数量の調節ステップサイズに対して第２更新処理を実行して目標調節ステップサイズを得ることは、
前記第１更新処理中に前記プリフェッチ数量を増加する場合、前記プリフェッチ数量の調節ステップサイズを増加すること、および／または、
前記第１更新処理中に前記プリフェッチ数量を減少する場合、前記プリフェッチ数量の調節ステップサイズを減少することを含む。 In one possible embodiment, in the instruction executed by the processor 31, the second update process is executed for the adjustment step size of the prefetch quantity to obtain the target adjustment step size.
When the prefetch quantity is increased during the first update process, the adjustment step size of the prefetch quantity is increased and / or.
When the prefetch quantity is reduced during the first update process, it includes reducing the adjustment step size of the prefetch quantity.

本発明の実施例は、コンピュータ可読記録媒体をさらに提供し、当該コンピュータ可読記録媒体には、コンピュータプログラムが記録されており、当該コンピュータプログラムがプロセッサによって運行されるときに、上述した方法の実施例に記載のデータ処理方法のステップが実行される。ここで、当該記録媒体は、揮発性または不揮発性のコンピュータ可読記録媒体であり得る。 Embodiments of the present invention further provide a computer-readable recording medium, wherein a computer program is recorded in the computer-readable recording medium, and when the computer program is operated by a processor, an embodiment of the above-mentioned method is performed. The steps of the data processing method described in are performed. Here, the recording medium can be a volatile or non-volatile computer-readable recording medium.

本発明の実施例によって提供されるデータ処理方法のコンピュータプログラム製品は、プログラムコードを格納したコンピュータ可読記録媒体を含み、前記プログラムコードに含まれている命令は、上述した方法の実施例に記載のデータ処理方法のステップを実行し、具体的には、上述した方法の実施例を参照することができ、ここでは繰り返して説明しない。 The computer program product of the data processing method provided by the embodiment of the present invention includes a computer-readable recording medium containing the program code, and the instructions contained in the program code are described in the above-described embodiment of the method. The steps of the data processing method can be performed, specifically, examples of the above-mentioned method can be referred to, which are not repeated here.

本発明の実施例は、コンピュータプログラムをさらに提供し、当該コンピュータプログラムがプロセッサによって実行されるときに、前述した実施例の任意の方法が実現される。当該コンピュータプログラム製品は、具体的に、ハードウェア、ソフトウェア、または、その組み合わせの方式で実現され得る。オプションの１つの実施例において、前記コンピュータプログラム製品は、具体的に、コンピュータ記録媒体によって具体化される。オプションのもう１つの実施例において、コンピュータプログラム製品は、具体的に、ソフトウェア開発キット（ＳｏｆｔｗａｒｅＤｅｖｅｌｏｐｍｅｎｔＫｉｔ、ＳＤＫ）などのソフトウェア製品によって具体化される。 The embodiments of the present invention further provide a computer program, and when the computer program is executed by a processor, any method of the above-described embodiment is realized. The computer program product can be specifically realized by a method of hardware, software, or a combination thereof. In one example of the option, the computer program product is specifically embodied by a computer recording medium. In another embodiment of the option, the computer program product is specifically embodied by a software product such as a Software Development Kit (SDK).

当業者は、説明の便宜および簡素化のために、上述に記載のシステムおよび装置の具体的な作業過程は、前述した方法の実施例の対応する過程を参照することができ、ここでは繰り返して説明しないことを明確に了解すべきである。本発明によって提供されるいくつかの実施例において、開示したシステム、装置、および、方法は、他の方式によって実現され得ることを理解すべきである。上記に記載の装置の実施例は、単に例示的なものである。たとえば、前記ユニットの分割は、論理機能分割のみであり、実際の実装において他の分割方法があり得る。また、たとえば、複数のユニットまたはコンポーネントを、組み合わせることができるか、または、もう１つのシステムに統合することができるか、または、一部の特徴を無視するかまたは実行しないでもよい。さらに、表示または議論された相互間の結合または直接結合または通信接続は、いくつかの通信インターフェース、デバイスまたはユニットの間接結合または通信接続を介することができ、電気的、機械的または他の形態であり得る。 For convenience and simplification of description, those skilled in the art can refer to the corresponding processes of the embodiments of the above-mentioned methods for the specific working processes of the systems and devices described above, which are repeated herein. It should be clearly understood that it will not be explained. It should be understood that in some of the embodiments provided by the present invention, the disclosed systems, devices, and methods may be implemented by other methods. The embodiments of the device described above are merely exemplary. For example, the division of the unit is only a logical function division, and there may be other division methods in the actual implementation. Also, for example, multiple units or components may be combined, integrated into another system, or some features may be ignored or not performed. In addition, the coupling or direct coupling or communication connection between the displayed or discussed interactions can be via an indirect coupling or communication connection of several communication interfaces, devices or units, in electrical, mechanical or other forms. possible.

分離された部品として説明されたユニットは、物理的に分離されている場合とされていない場合があり、ユニットとして表示される部品物理ユニットである場合とそうでない場合がある。つまり、1つの場所に配置することも、複数のネットワークユニットに分散させることもできる。実際の必要に応じてここでの一部または全部のユニットを選択して本実施例の解決策の目的を達成することができる。 A unit described as a separated part may or may not be physically separated, and may or may not be a physical part physical unit displayed as a unit. That is, it can be located in one location or distributed across multiple network units. The objectives of the solution of this embodiment may be achieved by selecting some or all of the units here as needed in practice.

また、本発明の各々の実施例における各機能ユニットは、１つの処理ユニットに統合され得るか、または各ユニットが物理的に単独で存在し得るか、２つまたは２つ以上のユニットが１つのユニットに統合され得る。 Also, each functional unit in each embodiment of the invention may be integrated into one processing unit, or each unit may physically exist independently, or two or more units may be one. Can be integrated into a unit.

前記機能がソフトウェア機能ユニットの形で実装され、独立した製品として販売または使用される場合、プロセッサによって実行可能な不揮発性のコンピュータ可読記憶媒体に格納することができる。このような理解に基づいて、本発明の技術的解決策の本質、または先行技術に寄与する部分または技術的解決策の一部は、ソフトウェア製品の形で具体化することができる。当該コンピュータソフトウェア製品は、１つの記録媒体に格納され、コンピュータデバイス（パーソナルコンピュータ、サーバ、または、ネットワークデバイスなど）が本発明の各々の実施例に記載の方法の全部または一部のステップを実行できるようにするためのいくつかの命令を含む。前述した記録媒体は、Ｕディスク、モバイルハードディスク、読み取り専用メモリ（Ｒｅａｄ－ＯｎｌｙＭｅｍｏｒｙ、ＲＯＭ）、ランダムアクセスメモリ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ、ＲＡＭ）、磁気ディスク、または、光ディスクなどの、プログラムコードを格納できるさまざまな媒体を含み得る。 When the function is implemented in the form of a software functional unit and sold or used as a stand-alone product, it can be stored in a non-volatile computer-readable storage medium run by the processor. Based on this understanding, the essence of the technical solution of the invention, or any part of the prior art or part of the technical solution, can be embodied in the form of a software product. The computer software product is housed in one recording medium and the computer device (such as a personal computer, server, or network device) can perform all or part of the steps of the methods described in each embodiment of the invention. Includes some instructions to make it so. The recording media described above include U disks, mobile hard disks, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disks, optical disks, and various other recording media that can store program codes. Can include various media.

最後に、上記の実施例は、本発明の具体的な実施形態に過ぎず、それらを限定するのではなく、本発明の技術的解決策を説明するために使用されるものであり、本発明の保護範囲はこれに限定されない。前述した実施例を参照して本発明を詳細に説明したが、当業者は、本発明に開示した技術範囲内で当業者は前述した実施例に記載の技術的解決策を修正するか、または容易に変更を思い付くことができ、またはここでの一部の技術特徴を同等に置き換えられることができることを理解すべきである。これら修正、変更、または、置き換えは、該当する技術的解決策の本質が本発明の実施例の技術的解決策の精神および範囲から逸脱することを引き起こさず、いずれも本発明の保護範囲内にカバーされるべきである。したがって、本発明の保護範囲は、特許請求の範囲の保護範囲に従うべきである。 Finally, the above embodiments are merely specific embodiments of the invention and are used to illustrate the technical solutions of the invention rather than limiting them. The scope of protection is not limited to this. Although the invention has been described in detail with reference to the embodiments described above, those skilled in the art will modify or modify the technical solutions described in the embodiments described above within the technical scope disclosed in the present invention. It should be understood that changes can be easily conceived or some technical features here can be replaced equally. These modifications, changes, or replacements do not cause the essence of the applicable technical solution to deviate from the spirit and scope of the technical solution of the embodiments of the invention, and are all within the scope of the invention. Should be covered. Therefore, the scope of protection of the present invention should be in accordance with the scope of claims.

Claims

A data processing method that applies to training deep learning models, including one or more processes.
For one target process in the one or more processes, the first update process is executed for the prefetch quantity of the sample data to obtain the target prefetch quantity.
The new sample data is read and read in response to the fact that the quantity of sample data currently contained in the prefetch sample data queue corresponding to the target process has not reached the target prefetch quantity. A data processing method comprising storing the prefetch sample data queue.

To obtain the target prefetch quantity by executing the first update process for the prefetch quantity of the sample data.
Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper limit memory usage threshold, the target is to execute the first update process for the prefetch quantity of the sample data. The data processing method according to claim 1, wherein the prefetch quantity is obtained.

Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the memory usage upper limit threshold, the target is to execute the first update process for the prefetch quantity of the sample data. Getting the prefetch quantity is
The total memory space currently occupied by the prefetch sample data queue for the one or more processes, the upper memory usage threshold, and the data throughput to perform training on the deep learning model of the target process. The data processing method according to claim 2, further comprising performing a first update process on the prefetch quantity of the sample data to obtain the target prefetch quantity.

Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the memory usage upper limit threshold, the target is to execute the first update process for the prefetch quantity of the sample data. Getting the prefetch quantity is
If the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper threshold, the prefetch quantity is increased by the first adjustment step size to the target. Get the prefetch quantity and / or
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes reaches the memory usage upper limit threshold, the prefetch quantity is reduced by the second adjustment step size to achieve the target prefetch. The data processing method according to claim 2 or 3, wherein the data processing method comprises obtaining a quantity.

If the total memory space currently occupied by the prefetch sample data queue for the one or more processes does not reach the memory usage upper threshold, the prefetch quantity is increased by the first adjustment step size to the target. Getting the prefetch quantity is
The total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the upper memory usage threshold and training is performed on the deep learning model of the target process. The data processing according to claim 4, wherein when the data throughput for performing the data throughput satisfies a predetermined data throughput condition, the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity. Method.

When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold, and the data throughput does not meet the predetermined data throughput condition. The data processing method according to claim 5, further comprising reducing the prefetch quantity by a third adjustment step size to obtain the target prefetch quantity.

The predetermined data throughput condition is
The current value of the data throughput is larger than the historical value, and
The current value of the data throughput is greater than the data throughput threshold and includes at least one of them.
Here, the historical numerical value is the average value of the data throughput at the time of a plurality of historical iterative trainings before the current iterative training, or the data throughput at the time of one iterative training before the current iterative training. The data processing method according to claim 5 or 6, wherein the data is a numerical value of.

Further including executing a second update process for the adjustment step size of the prefetch quantity to obtain the target adjustment step size.
Here, the data processing method according to any one of claims 1 to 7, wherein the target adjustment step size is used for the next one-time update processing of the prefetch quantity.

To obtain the target adjustment step size by executing the second update process for the adjustment step size of the prefetch quantity.
When the prefetch quantity is increased during the first update process, the adjustment step size of the prefetch quantity is increased and / or.
The data processing method according to claim 8, wherein when the prefetch quantity is reduced during the first update process, the adjustment step size of the prefetch quantity is reduced.

A data processing device that is applied to the training of deep learning models, including one or more processes.
For one target process in the one or more processes, a first update module for executing a first update process for a prefetch quantity of sample data to obtain a target prefetch quantity, and
The new sample data is read and read in response to the fact that the quantity of sample data currently contained in the prefetch sample data queue corresponding to the target process has not reached the target prefetch quantity. A data processing device comprising a read module for storing the prefetch sample data queue.

When the first update module executes the first update process on the prefetch quantity of the sample data to obtain the target prefetch quantity,
Based on the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes and the upper limit memory usage threshold, the target is to execute the first update process for the prefetch quantity of the sample data. The data processing apparatus according to claim 10, wherein a prefetch quantity is obtained.

The first update module is first with respect to the prefetch quantity of the sample data based on the total memory space and memory usage upper bound threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. When performing the update process to get the target prefetch quantity
The total memory space currently occupied by the prefetch sample data queue for the one or more processes, the upper memory usage threshold, and the data throughput to perform training on the deep learning model of the target process. The data processing apparatus according to claim 11, wherein the first update process is performed on the prefetch quantity of the sample data to obtain the target prefetch quantity.

The first update module is first with respect to the prefetch quantity of the sample data based on the total memory space and memory usage upper bound threshold currently occupied by the prefetch sample data queue corresponding to the one or more processes. When performing the update process to get the target prefetch quantity
If the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper threshold, the prefetch quantity is increased by the first adjustment step size to the target. Get the prefetch quantity and / or
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes reaches the memory usage upper limit threshold, the prefetch quantity is reduced by the second adjustment step size to achieve the target prefetch. The data processing apparatus according to claim 11 or 12, wherein the quantity is obtained.

The first update module adjusts the prefetch quantity to the first adjustment step when the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold. When increasing by size to get the target prefetch quantity
The total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes has not reached the upper memory usage threshold and training is performed on the deep learning model of the target process. The data processing apparatus according to claim 13, wherein the prefetch quantity is increased by the first adjustment step size to obtain the target prefetch quantity when the data throughput for performing the data throughput satisfies a predetermined data throughput condition.

The first update module further
When the total memory space currently occupied by the prefetch sample data queue corresponding to the one or more processes does not reach the memory usage upper limit threshold, and the data throughput does not meet the predetermined data throughput condition. The data processing apparatus according to claim 14, wherein the prefetch quantity is reduced by a third adjustment step size to obtain the target prefetch quantity.

The predetermined data throughput condition is
The current value of the data throughput is larger than the historical value, and
The current value of the data throughput is greater than the data throughput threshold and includes at least one of them.
Here, the historical numerical value is the average value of the data throughput at the time of a plurality of historical iterative trainings before the current iterative training, or the data throughput at the time of one iterative training before the current iterative training. The data processing apparatus according to claim 14 or 15, characterized in that the numerical value of the above.

Further, a second update module for executing a second update process for the adjustment step size of the prefetch quantity to obtain a target adjustment step size is provided.
The data processing apparatus according to any one of claims 10 to 16, wherein the target adjustment step size is used for the next one-time update process of the prefetch quantity.

When the second update module executes the second update process for the adjustment step size of the prefetch quantity to obtain the target adjustment step size,
When the prefetch quantity is increased during the first update process, the adjustment step size of the prefetch quantity is increased and / or.
The data processing apparatus according to claim 17, wherein when the prefetch quantity is reduced during the first update process, the adjustment step size of the prefetch quantity is reduced.

It ’s a computer device,
Equipped with a processor, a recording medium, a bus,
Machine-readable instructions that can be executed by the processor are recorded on the recording medium.
When the computer device is operated, the processor and the recording medium communicate with each other via a bus, and the machine-readable instruction is executed by the processor according to any one of claims 1 to 9. A computer device characterized by performing the described data processing method.

A computer-readable recording medium
A computer program is recorded in the computer-readable recording medium, and when the computer program is operated by a processor, the data processing method according to any one of claims 1 to 9 is executed. A computer-readable recording medium characterized by that.

It ’s a computer program,
A computer program according to any one of claims 1 to 9, wherein when the computer program is executed by a processor, the data processing method according to any one of claims 1 to 9 is executed.