WO2021248791A1 - 数据增强策略的更新方法、装置、设备及存储介质 - Google Patents
数据增强策略的更新方法、装置、设备及存储介质 Download PDFInfo
- Publication number
- WO2021248791A1 WO2021248791A1 PCT/CN2020/125967 CN2020125967W WO2021248791A1 WO 2021248791 A1 WO2021248791 A1 WO 2021248791A1 CN 2020125967 W CN2020125967 W CN 2020125967W WO 2021248791 A1 WO2021248791 A1 WO 2021248791A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- strategy
- training
- data enhancement
- stage
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Definitions
- the embodiments of the present disclosure relate to the field of machine learning, and relate to an update method, device, device, and storage medium of a data enhancement strategy.
- the application effect of deep learning technology relies on a large amount of training data, and the data processing model trained on a limited amount of training data usually has an overfitting phenomenon.
- automatic data enhancement technology is gradually used to increase the amount and diversity of training data.
- Automatic data enhancement technology refers to the automatic data enhancement process through automatic machine learning technology. Therefore, it is very important to find a suitable data enhancement strategy.
- the data enhancement strategy can be optimized through the reinforcement learning algorithm based on the training effect of the data processing model.
- the embodiments of the present disclosure provide a method, device, device, and storage medium for updating a data enhancement strategy.
- the embodiments of the present disclosure provide a method for updating a data enhancement strategy, including:
- the data enhancement strategy and the preset training data perform the second-stage training on the preset data processing model that has undergone the first-stage training
- the data enhancement strategy is updated to obtain the updated data enhancement strategy.
- the method further includes:
- the data enhancement strategy is updated for the M+1th time.
- the number of the initial data enhancement strategies is multiple, and the updating of each of the data enhancement strategies is performed in parallel; the method further includes:
- each of the data enhancement strategies except for the optimal strategy is replaced with the optimal data enhancement strategy.
- the data enhancement strategy includes a plurality of preset data enhancement operations; according to the data enhancement strategy and the preset training data, the data that has undergone the first stage of training is preset Process the model for the second stage of training, including:
- the second-stage training is performed on the data processing model after the first-stage training through the training data after data enhancement.
- the updating the data enhancement strategy according to the data processing model trained in the second stage includes:
- the updated data enhancement strategy is selected from each of the preset strategies.
- the updating the preset strategy model according to the data processing model trained in the second stage includes:
- the data processing model trained in the second stage is tested, and the test result is obtained;
- the strategy model is updated according to the historical inspection result and the inspection result.
- the updating the strategy model according to the historical inspection result and the inspection result includes:
- the strategy parameter in the strategy model is updated.
- the method before the obtaining the initial data enhancement strategy, the method further includes:
- the data enhancement strategy in the first-stage training is uniformly and randomly selected;
- embodiments of the present disclosure provide a data processing method, including:
- the data to be processed is processed through a pre-trained data processing model.
- the data processing model sequentially undergoes the first stage of training and the second stage of training.
- a preset data enhancement strategy is adopted
- the data processing model is trained with preset training data, and the data enhancement strategy is generated using the method described in the first aspect or each possible implementation manner of the first aspect.
- the method further includes:
- the second stage training is performed on the data processing model trained in the first stage.
- the performing the first stage training on the data processing model according to the training data includes:
- the data enhancement strategy in the first-stage training is uniformly and randomly selected;
- the to-be-processed data and the training data are image data or text data.
- an update device for a data enhancement strategy including:
- the acquisition part is configured to acquire the initial data enhancement strategy
- the training part is configured to perform second-stage training on the preset data processing model that has undergone the first-stage training according to the data enhancement strategy and preset training data;
- the update part is configured to update the data enhancement strategy according to the data processing model trained in the second stage to obtain the updated data enhancement strategy.
- an embodiment of the present disclosure provides a data processing device, including:
- the obtaining part is configured to obtain the data to be processed
- the processing part is configured to process the to-be-processed data through a pre-trained data processing model.
- the data processing model sequentially undergoes the first stage of training and the second stage of training, and passes through the second training stage.
- the preset data enhancement strategy and preset training data train the data processing model, and the data enhancement strategy is generated using the method described in the first aspect or each possible implementation manner of the first aspect.
- an electronic device including:
- the memory is used to store program instructions
- the processor is configured to call program instructions in the memory to execute the method described in the first aspect, each possible implementation manner of the first aspect, the second aspect, or each possible implementation manner of the second aspect.
- the embodiments of the present disclosure provide a computer-readable storage medium on which a computer program is stored.
- the computer program When the computer program is executed, it implements the first aspect and the possible implementation manners of the first aspect, The method described in the second aspect, or each possible implementation manner of the second aspect.
- the embodiments of the present disclosure provide a computer program, including computer-readable code.
- the processor in the electronic device executes the Aspect, each possible implementation manner of the first aspect, the second aspect, or the method described in each possible implementation manner of the second aspect.
- the training phase of the data processing model is divided into the first stage and the second stage before and after the two phases.
- the data enhancement strategy and the training data are used to update the data enhancement strategy.
- the data processing model trained in the first stage is trained in the second stage, and then the data enhancement strategy is updated based on the data processing model trained in the second stage, so that the data processing model does not need to be trained from scratch during the update process of the data enhancement strategy. While ensuring the quality of the data enhancement strategy, the efficiency of generating the data enhancement strategy is improved.
- the generated data enhancement strategy can be applied to the same type of training data and is transferable.
- Figure 1 is an example diagram of the relationship between data enhancement and the training effect of an image classification model
- FIG. 2 is a schematic diagram of a network architecture provided by an embodiment of the disclosure.
- FIG. 3 is a schematic flowchart of a method for updating a data enhancement strategy provided by an embodiment of the present disclosure
- FIG. 4 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure
- FIG. 5 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure
- FIG. 6 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure.
- FIG. 7 is an example diagram of parallel updating of multiple data enhancement strategies provided by another embodiment of the present disclosure.
- FIG. 8 is a schematic flowchart of a data processing method provided by an embodiment of the present disclosure.
- FIG. 9 is a schematic structural diagram of an update device for a data enhancement strategy provided by an embodiment of the present disclosure.
- FIG. 10 is a schematic structural diagram of a data processing device provided by an embodiment of the disclosure.
- FIG. 11 is a schematic structural diagram of an electronic device provided by an embodiment of the disclosure.
- Fig. 12 is a block diagram of a device for updating a data enhancement strategy provided according to this embodiment.
- the first-stage training and the second-stage training refer to the total number of training of the data processing model, and the training of the data processing model is divided into the first-stage training and the second-stage training in the order of front and back.
- the total number of training times of the data processing model is preset to 300 times
- the first 100 times of training may be referred to as the first stage of training
- the last 200 times of training may be referred to as the second stage of training.
- the number of training times in the first phase of training and the number of training times in the second phase of training are not limited.
- Data enhancement operation refers to the operation of fine-tuning the training data to increase the data volume and diversity of the training data. For example, taking image data as an example, the size and color of the image data are adjusted.
- Data enhancement strategy refers to a data enhancement program for training data.
- the data enhancement strategy includes data enhancement operations.
- the data enhancement operation in the data enhancement strategy is horizontal cropping of the image, and the cropping amplitude corresponding to the horizontal cropping of the image is 0.1 width, that is, the width of each horizontal cropping of the image is 10% of the original width of the image.
- Deep learning technology is widely used in many fields and has achieved remarkable results.
- deep learning technology can handle tasks such as image classification, target detection, image segmentation, and human pose estimation.
- the data processing model using deep learning technology usually needs to be trained on a large amount of training data, otherwise the trained model will appear over-fitting. Therefore, data enhancement has become a common way to increase the amount and diversity of training data, and designing an appropriate data enhancement strategy has become a key factor in improving the training effect of data processing models.
- data enhancement strategies can be manually designed by professionals, but this method not only has high time and personnel costs, but also has low reusability of data enhancement strategies, and is usually only suitable for training specific data processing models.
- the method of automatically generating data enhancement strategies can not only improve the efficiency of data enhancement strategy generation, but also generate better data enhancement strategies.
- the data enhancement strategy can be optimized through a reinforcement learning algorithm based on the training effect of the data processing model.
- the inventor conjectures that the improvement of the training effect of the data processing model by data enhancement mainly occurs in the later training stage of the data processing model.
- the inventor has conducted an in-depth study of the model training process based on the data enhancement strategy to verify the above conjecture.
- FIG. 1 shows the relationship between data enhancement and the training effect of the image classification model.
- the abscissa is the number of data enhancement rounds in the 300 training of the image classification model, and the ordinate is the classification of the image classification model after 300 training.
- the dotted line is the relationship between the number of data enhancement rounds in the later stage of training and the classification accuracy of the image classification model, and the solid line is the relationship between the number of data enhancement rounds in the early stage of training and the classification accuracy of the image classification model.
- the number of data enhancement rounds in the post-training period is continuously calculated from the last training of the image classification model.
- the number of data enhancement rounds in the post-training period is 50, which means that data enhancement is performed in the last 50 trainings of the image classification model.
- the number of data enhancement rounds in the pre-training period is continuously calculated from the first training of the image classification model.
- the number of data enhancement rounds in the pre-training period is 50, which means that data enhancement is performed in the first 50 trainings of the image classification model.
- the method for updating the data enhancement strategy obtained by the embodiments of the present disclosure obtains the initial data enhancement strategy, and performs the first-stage training data processing model according to the data enhancement strategy and training data.
- the data enhancement strategy is updated according to the data processing model trained in the second stage, so that in the process of updating the data strategy model, only the second stage of training is required for the data processing model, which not only ensures the data enhancement strategy Quality has improved the efficiency of data enhancement strategy generation.
- the method for updating the data enhancement strategy provided by the embodiment of the present disclosure may be applicable to the network architecture shown in FIG. 2.
- the network architecture includes at least a terminal device 201 or a server 202.
- the terminal device 201 can store the data processing model trained in the first stage, and perform the second stage training of the data processing model and the data enhancement strategy.
- the data processing model trained in the first stage can also be stored on the server 202, and the second-stage training of the data processing model and the update of the data enhancement strategy can be performed;
- the terminal device 201 can also be stored in the first-stage training Data processing model, the second stage training of the data processing model and the update of the data enhancement strategy are performed on the server 202, or the data processing model trained in the first stage is stored on the server 202, and the data processing model is performed on the terminal device 201 The second phase of training and data enhancement strategy update.
- the foregoing terminal device may be a computer, a tablet computer, a smart phone, etc.
- the foregoing server may be a single server or a server group composed of multiple servers.
- FIG. 3 is a schematic flowchart of a method for updating a data enhancement strategy provided by an embodiment of the present disclosure. As shown in Figure 3, the method includes:
- an initial data enhancement strategy can be obtained from each preset data enhancement strategy.
- each preset data enhancement strategy is called each preset strategy, and the currently adopted data enhancement strategy is called data Enhanced strategy.
- the user may also set the initial data enhancement strategy in advance, and directly obtain the set data enhancement strategy.
- the initial data enhancement operation can also be obtained from each preset data enhancement operation, and then the initial data enhancement strategy can be obtained.
- S302 Perform a second-stage training on the preset data processing model that has undergone the first-stage training according to the data enhancement strategy and the preset training data.
- the data processing model may be trained in the first stage in advance to obtain the data processing model after the first stage of training.
- Training data can be collected in advance, and the training data can be stored in the form of a database.
- the training data can be enhanced through the data enhancement strategy.
- the training data after the data enhancement is used to perform the first step on the data processing model trained in the first stage.
- the second-stage training obtains the data processing model after the second-stage training, so that the training data is enhanced in the post-training of the data processing model, making full use of the feature that data enhancement has a greater impact on the post-training of the data processing model.
- the training effect of the data processing model trained in the second stage can be tested, and the test result can be obtained.
- the inspection result of the data processing model is the accuracy of image classification of the data processing model.
- the test result of the data processing model is obtained, and the training of the data processing model obtained by training based on the training data after the data enhancement can be understood in the case of data enhancement of the training data through the data enhancement strategy
- the test results of the data processing model reflect the quality of the data enhancement strategy.
- the higher the accuracy of the image classification of the data processing model the better the quality of the data enhancement strategy. Therefore, the data enhancement strategy can be updated based on the test results of the data processing model.
- the preset strategy in the strategy update space can be obtained as the updated data enhancement strategy.
- the data processing model trained in the first stage is trained in the second stage, and the data enhancement strategy is updated according to the data processing model trained in the second stage.
- FIG. 4 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure. As shown in Figure 4, the method includes:
- the initial data enhancement strategy can be obtained from each preset strategy.
- the user can also set the initial data enhancement strategy in advance, and directly obtain the set data enhancement strategy.
- the initial data enhancement operation can also be obtained from each preset data enhancement operation, and then the initial data enhancement strategy can be obtained.
- the data enhancement strategy includes multiple preset data enhancement operations to improve the quality of the data enhancement strategy.
- the training data can be enhanced in turn according to the data enhancement operations in the data enhancement strategy.
- the data processing model trained in the first stage is trained in the second stage.
- each data enhancement operation and each operation amplitude corresponding to each data enhancement operation as shown in Table 1 can be preset.
- the same data enhancement operations with different operation ranges can be used.
- Treated as different data enhancement operations so there are 36 data enhancement operations in Table 1.
- the data enhancement operations in Table 1 can be combined to obtain 36 ⁇ 36 data enhancement strategies. Therefore, according to Table 1, 36 ⁇ 36 preset strategies can be set.
- S402 Perform a second-stage training on the preset data processing model that has undergone the first-stage training according to the data enhancement strategy and the preset training data.
- the training data can be enhanced through the data enhancement strategy.
- the training data after the data enhancement is used to perform the first step on the data processing model trained in the first stage.
- the second-stage training obtains the data processing model after the second-stage training, so that the training data is enhanced in the post-training of the data processing model, making full use of the feature that data enhancement has a greater impact on the post-training of the data processing model.
- the training effect of the data processing model trained in the second stage can be tested, and the test result can be obtained.
- the test result of the data processing model is obtained, and the training of the data processing model obtained by training based on the training data after the data enhancement can be understood in the case of data enhancement of the training data through the data enhancement strategy.
- the test results of the data processing model reflect the quality of the currently adopted data enhancement strategy. Therefore, the data enhancement strategy can be updated based on the test results of the data processing model.
- the preset strategy in the strategy update space can be obtained as the updated data enhancement strategy.
- the initial data enhancement strategy is updated to the updated data enhancement strategy, that is, the currently adopted data enhancement strategy is updated to the updated data enhancement strategy, and step S402 is executed to correct
- the data enhancement strategy has been updated many times.
- the update of the data enhancement strategy is stopped, and the test results of the data processing model trained in the second stage are selected during all the update processes In the highest case, the adopted data enhancement strategy is used as the final data enhancement strategy, thereby effectively improving the quality of the data enhancement strategy.
- the data enhancement strategy of the Mth update is obtained, and M is greater than or equal to 1.
- the data processing model trained in the first stage is carried out. Two-stage training; according to the data enhancement model trained in the second stage, the data enhancement strategy is updated for the M+1th time.
- the number of updates reaches the threshold of the number of times, it is determined that the updated data enhancement strategy meets the preset condition; when the number of updates does not meet the threshold of the number of times, it is determined that the updated data enhancement strategy does not meet the preset condition, so as to pass the number of updates Control whether the update of the data enhancement strategy continues, and avoid always updating the data enhancement strategy.
- determining whether to stop the continuous update of the data enhancement strategy by determining whether the update times of the data enhancement strategy reaches the preset threshold value, it can also be determined by determining whether the data enhancement strategy has undergone the second phase of training. Whether the test result of the data processing model meets the preset conditions is used to determine whether to stop the continuous update of the data enhancement strategy.
- the test result of the data processing model can be compared with the preset test threshold. In the case that the test result of the data processing model is greater than the test threshold, it is determined that the data processing model that has passed the second training stage meets the preset conditions.
- the data enhancement strategy is set as the final data enhancement strategy; when the test result of the data processing model is less than or equal to the test threshold, it is determined that the data processing model after the second training phase does not meet the preset conditions, and the data enhancement strategy is continued Update.
- the number of data enhancement strategies in each update process is multiple, and the updates of each data enhancement strategy are performed in parallel, thereby effectively improving the generation efficiency of the data enhancement strategy.
- each data enhancement strategy except the optimal strategy is replaced with the optimal data enhancement strategy, thereby improving the convergence of the update process and the generation efficiency of the data enhancement strategy.
- the selection is made based on the test result obtained by testing the training effect of the data processing model trained in the second stage.
- the training data is image data or text data.
- the data processing model is an image processing model; when the training data is text data, the data processing model is Natural language processing model. Therefore, the method for updating the data enhancement strategy improved by the embodiment of the present disclosure is applicable to the generation of the data enhancement strategy in the image processing field and the generation of the data enhancement strategy in the natural language field.
- the data processing model trained in the first stage is trained in the second stage, and the data enhancement strategy is performed multiple times according to the data processing model trained in the second stage.
- Update make full use of the characteristics of data enhancement strategies that have greater influence on the later training of data processing models, and improve the efficiency of data enhancement strategies while ensuring the quality of data enhancement strategies.
- FIG. 5 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure. As shown in Figure 5, the method includes:
- the initial data enhancement strategy can be obtained from each preset strategy.
- the initial data enhancement strategy can also be pre-set by the user directly.
- the initial data enhancement operation can also be obtained from each preset data enhancement operation, and then the initial data enhancement strategy can be obtained.
- one or more preset strategies are selected uniformly and randomly from each preset strategy as the initial data enhancement Strategy to improve the fairness of the initial data enhancement strategy selection.
- one or more preset strategies are uniformly randomly selected from each preset strategy, which means that the probability of each preset strategy being selected is equal.
- each data enhancement strategy is updated synchronously to improve the efficiency of data enhancement strategy generation.
- S502 Perform a second-stage training on the preset data processing model that has undergone the first-stage training according to the data enhancement strategy and the preset training data.
- the training data is enhanced through the data enhancement operation in the data enhancement strategy.
- the data enhancement strategy includes multiple data enhancement operations
- the data enhancement strategy is used to enhance the training data. Operation, perform data enhancement on the training data one by one, and obtain the training data after the data enhancement.
- the data processing model trained in the first stage is trained in the second stage to obtain the data processing model trained in the second stage.
- the strategy model is a parameterized model, and its parameters are preset strategy parameters.
- the output of the strategy model can be adjusted.
- the output of the strategy model is the selection probability of each preset strategy, that is, when the data enhancement strategy is updated, the probability of each preset strategy being selected as the updated data enhancement strategy. Therefore, the strategy model can be understood as a polynomial distribution.
- preset verification data can be obtained, and the verification data includes input data and tag data corresponding to the input data.
- the verification data includes the input image and the classification label corresponding to the input image, where the classification label is the input data Category.
- the input data in the verification data is input to the data processing model trained in the second stage to obtain the output result of the data processing model, and the output result of the data processing model is processed with the label data corresponding to the input data.
- the test results of the data processing model can be obtained.
- testing the data processing model refers to testing the training effect of the data processing model. For example, taking image data as an example, when the verification data is image data and the task of the data processing model is an image classification task, the input image is input to the data processing model, and the output of the data processing model is assigned to the classification label corresponding to the input image By comparison, the classification accuracy of the data processing model can be obtained.
- the strategy parameters of the strategy model can be updated according to the inspection result to obtain the updated strategy model.
- S504 Determine the selection probability of each preset strategy through the updated strategy model.
- S505 According to the selection probability of each preset strategy, select an updated data enhancement strategy from each preset strategy.
- the selection probability of each preset strategy can be re-determined. According to the selection probability of each preset strategy, one of the preset strategies is selected as the updated strategy. Data enhancement strategy.
- the strategy parameter includes the weight corresponding to each preset strategy, and the strategy parameter is updated, that is, the weight corresponding to each preset strategy is updated.
- the same weight can be set for each preset strategy to achieve uniform and random selection of the initial data enhancement strategy from each preset strategy.
- the weight of each preset strategy changes differently, and the selection probability of each preset strategy gradually differs.
- the strategy parameters are adjusted, and then the selection probability of each preset strategy is re-determined according to the strategy model, and the better quality data is continuously selected from each preset strategy for enhancement
- the strategy not only improves the generation efficiency of the data enhancement strategy, but also guarantees the quality of the data enhancement strategy.
- the strategy model can be expressed as formula (1):
- e is the base of the natural logarithm
- ⁇ k is the k-th weight in the strategy parameter ⁇ , that is, the weight corresponding to the k-th preset strategy
- K represents the total number of preset strategies
- O (k) represents the k-th weight A preset strategy
- p ⁇ (O (k) ) represents the selection probability of the k-th preset strategy. Therefore, through the strategy model and the strategy parameters including the corresponding weights of each preset strategy, the selection probability of each preset strategy can be determined. By adjusting the strategy parameters, the selection probability of each preset strategy can be effectively adjusted, which improves the data enhancement strategy. The efficiency of the generation of data, and to ensure the quality of the data enhancement strategy.
- the policy parameters in the process of updating the policy parameters according to the test results of the data processing model trained in the second stage, can be updated through the preset heuristic search algorithm to improve the strategy. The effect of parameter updates.
- the policy parameter update can be expressed as formula (3):
- T n represents the nth search trajectory in the reinforcement learning algorithm
- p(T n ) is the probability that the search trajectory T n is searched in the reinforcement learning algorithm
- N represents the search trajectory in the reinforcement learning algorithm quantity
- the gradient value of the policy parameters can be Multiply it by the preset learning rate in the reinforcement learning algorithm to get the product, and then add the product to the strategy parameter to get the updated strategy parameter.
- the model parameters of the data-processing model obtained through the second-stage training It can be expressed as formula (4):
- x represents the input data in the training data
- y represents the label data corresponding to x in the training data
- O(x) represents the data enhancement of x
- L( ⁇ ) represents the preset loss function
- the data enhancement strategy in the first stage training is uniformly and randomly selected, according to the data in the first stage
- the enhancement strategy performs data enhancement on the training data.
- the data processing model is trained in the first stage, so that the training data is also enhanced in the first stage of training to improve the data processing after the first stage of training The training effect of the model.
- model parameters of the data processing model obtained through the first stage of training can be expressed as formula (5):
- ⁇ share represents the model parameters of the data processing model obtained through the first stage of training, It means that the data enhancement strategy is selected from each preset strategy according to the uniform probability distribution.
- the data enhancement strategy has a greater impact on the later training of the data processing model.
- the data processing model trained in the first stage is trained in the second stage.
- the data processing model trained in the second stage update the strategy model, determine the selection probability of each preset strategy through the updated strategy model, and optimize the quality of the updated data to enhance the strategy by adjusting the probability of each preset strategy.
- the quality of the data enhancement strategy is improved, and the generation efficiency of the data enhancement strategy is improved.
- FIG. 6 is a schematic flowchart of a method for updating a data enhancement strategy provided by another embodiment of the present disclosure. As shown in Figure 6, the method includes:
- the training effect of the data processing model trained in the second stage is tested by verifying the data, and the test is obtained.
- the test results of the second-stage data processing model in the first N-1 updates of the data enhancement strategy are obtained.
- the test results of the second-stage data processing model in the first N-1 updates of the data enhancement strategy are called historical test results.
- the test results and historical test results can be combined to update the strategy model to Ensure the stability of the strategy model update during each update process, thereby improving the update effect of the data enhancement strategy.
- N is the total number of current updates of the data enhancement strategy
- the Nth update refers to the current update process.
- the mean value of the historical test results can be determined, the difference between the test result and the mean value can be determined, and the difference is calculated based on the difference.
- the strategy parameters in the strategy model are updated to ensure the stability of the strategy model update during each update process, thereby improving the data enhancement strategy update effect.
- the update process of the strategy parameters can adopt a heuristic search algorithm, which will not be repeated.
- S604 Determine the selection probability of each preset strategy through the updated strategy model.
- S605 According to the selection probability of each preset strategy, select an updated data enhancement strategy from each preset strategy.
- steps S601 to S605 can refer to the detailed description of steps S501 to S505, which will not be repeated here.
- S606 Determine whether the updated data enhancement strategy meets a preset condition.
- S608 if the updated data enhancement strategy meets the preset condition, S608 is executed; if the updated data enhancement strategy does not meet the preset condition, S607 is executed.
- the updated initial data enhancement strategy is the updated data enhancement strategy
- step S602 is skipped to perform multiple updates to the data enhancement strategy to improve the quality of the data enhancement strategy.
- the updated data enhancement strategy is set as the final data enhancement strategy.
- determining whether to stop the continuous update of the data enhancement strategy by determining whether the update times of the data enhancement strategy reaches the preset threshold value, it can also be determined by determining whether the data enhancement strategy has undergone the second phase of training. Whether the test result of the data processing model meets the preset conditions is used to determine whether to stop the continuous update of the data enhancement strategy.
- the test result of the data processing model can be compared with a preset test threshold.
- the test result of the data processing model is greater than the test threshold, it means the data processing model that has passed the second training stage
- the data enhancement strategy is set as the final data enhancement strategy; if the test result of the data processing model is less than or equal to the test threshold, the data enhancement strategy is continued to be updated.
- the optimal data enhancement strategy is selected, and the updated data In the enhancement strategy, each data enhancement strategy except the optimal strategy is replaced with the optimal data enhancement strategy, thereby improving the convergence of the update process and the generation efficiency of the data enhancement strategy.
- the selection can be made based on the test result obtained by testing the training effect of the data processing model trained in the second stage.
- Figure 7 provides a parallel update process for multiple data enhancement strategies.
- each cube represents a data enhancement strategy
- each cube represents a data processing model
- accuracy (ACC) represents the test result of the data processing model obtained through the second stage of training
- each row represents one Data enhancement strategy update process
- each column represents an update of each data enhancement strategy.
- an initial data enhancement strategy can be selected uniformly and randomly from each preset strategy, and multiple copies of the initial data enhancement strategy can be obtained to obtain multiple identical initial data enhancement strategies.
- the strategy is updated in parallel. Every preset update times, the optimal data enhancement strategy is selected from each updated data enhancement strategy, and the optimal data enhancement strategy is copied, as shown by the dashed arrow.
- Strategy replication is: in each updated data enhancement strategy, the remaining data enhancement strategy except for the optimal data enhancement strategy is replaced with the optimal data enhancement strategy. Therefore, the convergence of multiple updates of the data enhancement strategy can be effectively improved, and a better quality data enhancement strategy can be obtained.
- the model parameter ⁇ share of the data processing model trained in the first stage is loaded into the data processing model to obtain the data processing model trained in the first stage.
- the second-stage training is performed on the data processing model trained in the first stage, and then the verification data is tested to obtain ACC, that is, the test result of the data processing model trained in the second stage.
- ACC that is, the test result of the data processing model trained in the second stage.
- the data enhancement strategy is updated, and the updated data enhancement strategy is obtained.
- multiple data enhancement strategies can be updated in parallel, and each update process of the data enhancement strategy only requires the second-stage training of the data processing model, and each preset number of updates will be
- Each updated data enhancement strategy is replaced with the current optimal data enhancement strategy, and the calculation amount of strategy parameter update is small, which effectively improves the efficiency of data enhancement strategy update, improves the efficiency of data enhancement strategy generation, and ensures The quality of the data enhancement strategy.
- the ratio of the training times of the first stage training to the total training times or the ratio of the training times of the second stage training to the total training times may be adjusted to improve the generation efficiency of the data enhancement strategy.
- FIG. 8 is a schematic flowchart of a data processing method provided by an embodiment of the disclosure. As shown in Figure 8, the method includes:
- the data to be processed input by the user can be acquired, and the data to be processed can also be collected in advance.
- the data to be processed is processed through the pre-trained data processing model.
- the data processing model goes through the first stage of training and the second stage of training in turn.
- the preset data enhancement strategy and preset training are used
- the data trains the data processing model.
- the data processing model is pre-trained.
- the data processing model is trained in the first stage, and then the data processing model is trained in the second stage according to the data enhancement strategy and training data, so as to make full use of
- the data to be processed is input into a data processing model, and the data to be processed is processed by the data processing model to obtain corresponding processing results.
- the data enhancement strategy used in the second-stage training of the data processing model can be obtained through the data enhancement strategy update method improved in any of the above embodiments, so as to improve the quality and generation of the data enhancement strategy. Efficiency, thereby improving the data processing effect of the data processing model and the efficiency of model training.
- the data processing model in the process of training the data processing model, may be trained in the first stage through the training data to obtain the data processing model trained in the first stage. Then use the data enhancement strategy to enhance the training data. Based on the training data after the data enhancement, the data processing model trained in the first stage is trained in the second stage to obtain the trained data processing model, so as to make full use of the data enhancement.
- the post-training of the data processing model has a more influential feature, which improves the data processing effect of the data processing model and the efficiency of model training.
- the data enhancement strategy can be selected uniformly and randomly among the preset strategies as the data enhancement strategy for the first stage of training.
- the data enhancement strategy is used to enhance the training data, and the data processing model is trained in the first stage through the data-enhanced training data, so that the data enhancement strategy is selected uniformly and randomly, and the time spent on model training is increased as much as possible.
- the training effect of the first-stage training of the data processing model thereby improving the overall training effect of the data processing model.
- the data to be processed and the training data can be image data or text data.
- the data processing model is an image processing model
- the data to be processed and the training data are image data
- the data processing model is In the case of the natural language processing model, the data to be processed and the training data are text data, thereby improving the image processing effect or the natural language processing effect.
- the data to be processed is processed through the pre-trained data processing model.
- the training process of the data processing model is divided into the first stage training and the second stage training.
- the preset The data enhancement strategy of the data processing model improves the data processing effect and model training efficiency of the data processing model, thereby improving the data processing effect.
- Automatic machine learning is a hot area in the current machine learning field, and its related technologies can play a role in improving model performance and reducing manpower required for tuning in many fields.
- Image data enhancement technology has also been widely used in the field of image processing. Automating the image data enhancement process through automatic machine learning technology can improve the pertinence of data enhancement and reduce unnecessary manual adjustments.
- it is more complicated to find a suitable enhancement strategy on the data set of a specific task. This is because the magnitude of the data set is generally large, and the overhead of direct search is unacceptable. And if it is only to find a common strategy and apply it to all tasks, the improvement function of the model will be lower.
- some of the existing automatic data-enhanced search technologies are still expensive, and some of the enhancement effects are not ideal. Among them, part or all of the process of automating machine learning.
- the most common task is to automatically adjust the parameters of machine learning, such as automatically finding a suitable model structure, a suitable data enhancement strategy, a suitable loss function, and a suitable optimizer.
- the update method of the data enhancement strategy provided by the embodiments of the present disclosure can achieve a good balance between time consumption and evaluation accuracy, that is, search can be carried out directly on a data set of a conventional scale, and a stable improvement can be obtained; and, applicable It is used in multiple image classification data sets, and has a certain transferability; it can also be easily embedded in various image classification tasks.
- the method for updating the data enhancement strategy includes searching for an image data enhancement strategy.
- the search process can be divided into the following three steps. First of all, the model is trained under a uniform random strategy. After that, the One-Shot (search strategy) search phase will be carried out, that is, the end state of the pre-training is repeatedly loaded and the post-training is performed, and the search is performed at the same time. The search goal is to optimize the performance of post-training. Finally, apply the searched strategy to the original task to re-train as a whole to get the final model performance.
- One-Shot is a search strategy.
- the original intention is to take a "path" in the entire search space each time, and it can also be broadly understood as a single sampling update that is repeated multiple times.
- the time efficiency of the search can be greatly improved.
- the stability of the evaluation index was not observed to be damaged in the experiment. Using this method can improve the performance of each image classification model under a given data set, and help the model achieve better performance in multiple task scenarios.
- Step A use uniform and random data enhancement for pre-training.
- this step A includes: obtaining an untrained initial model; training under uniform random data enhancement; and obtaining a model that has been previously trained.
- the input of step A is a designated image classification data set, a completely untrained model; the output is a model that has been trained in the previous period.
- this step A includes:
- the data enhancement operations we choose can be various automatic data enhancement operations to ensure fairness.
- the operation list is shown in Table 1, where the second column represents the different amplitude values of each operation. Considering the difference in amplitude values, there are a total of 36 possible data enhancement operations. During training, two operations will be used uniformly and randomly for each picture. The picture after the data enhancement operation is used as the actual input of the model.
- Step B Perform One-Shot search, that is, repeat post-training and continuously update the data enhancement strategy.
- each cube represents a data enhancement strategy
- each cube represents a data processing model.
- Accuracy (ACC) Represents the test results of the data processing model obtained through the second stage of training.
- Each row represents the update process of a data enhancement strategy
- each column represents an update of each data enhancement strategy.
- the single update process may include: loading the model obtained after pre-training. That is, each post-training will reset the model parameters to the parameters obtained after the pre-training.
- the historical index sliding average can be subtracted from each evaluation. Use this time model to evaluate the index update strategy. Reinforcement learning is used here to update, and the goal of the update is to improve the evaluation index of the model.
- the final strategy After several repeated training and updates, the final strategy will be obtained.
- the final strategy can be exported as a short script for convenient addition to the desired training process.
- Step C Retrain using the final strategy to obtain the final model and final performance.
- each picture will undergo data enhancement under the control of the final strategy (under the corresponding probability value).
- the final model and performance are obtained.
- the method for updating the data enhancement strategy utilizes the One-Shot idea, which achieves a good balance between search efficiency and evaluation accuracy, and achieves a better experimental effect under the same conditions.
- the results of the algorithm search can be easily derived and can be used flexibly for other tasks.
- the method for updating the data enhancement strategy can directly perform data enhancement in the training process of image classification tasks or other image processing tasks, in order to achieve better performance and stronger generalization; Search for data enhancement strategies under specified data sets and specified models to obtain highly customized data enhancement strategies; it can be combined with a customized search space to perform data enhancement strategy searches for a wider range of tasks. Such as natural language processing and other fields.
- FIG. 9 is a schematic structural diagram of an update apparatus for a data enhancement strategy provided by an embodiment of the present disclosure. As shown in Figure 9, the device includes:
- the obtaining part 901 is configured to obtain an initial data enhancement strategy
- the training part 902 is configured to perform second-stage training on the preset data processing model that has undergone the first-stage training according to the data enhancement strategy and preset training data;
- the update part 903 is configured to update the data enhancement strategy according to the data processing model trained in the second stage to obtain the updated data enhancement strategy.
- the update part 903 is also configured as:
- the updated initial data enhancement strategy is the updated data enhancement strategy to update the data enhancement strategy multiple times.
- the number of data enhancement strategies is multiple, and the update of each data enhancement strategy is performed in parallel; the update part 903 is also configured as:
- the optimal data enhancement strategy is selected;
- each data enhancement strategy except the optimal strategy is replaced with the optimal data enhancement strategy.
- the data enhancement strategy includes a plurality of preset data enhancement operations; the training part 902 is also configured to:
- the data processing model trained in the first stage is trained in the second stage.
- the training data is image data or text data.
- the update part 903 is also configured as:
- the updated data enhancement strategy is selected from each preset strategy.
- the updating part 903 is further configured to:
- the data processing model trained in the second stage is tested, and the test result is obtained;
- the strategy model is updated.
- the update part 903 is also configured as:
- the strategy parameters in the strategy model are updated.
- the training part 902 is further configured to:
- the data enhancement strategy in the first stage of training is uniformly and randomly selected;
- the data processing model is trained in the first stage.
- the device for updating the data enhancement strategy provided in FIG. 9 can execute the above-mentioned corresponding method embodiments, and its implementation principles and technical effects are similar, and will not be repeated here.
- FIG. 10 is a schematic structural diagram of a data processing device provided by an embodiment of the present disclosure. As shown in Figure 10, the device includes:
- the obtaining part 1001 is configured to obtain data to be processed
- the processing part 1002 is configured to process the data to be processed through a pre-trained data processing model.
- the data processing model sequentially undergoes the first stage of training and the second stage of training, and in the second training stage, the preset data enhancement strategy is adopted Train the data processing model with preset training data.
- the data enhancement strategy is generated using the method for updating the data enhancement strategy shown in any of the foregoing embodiments.
- the device further includes a training part, and the training part is further configured to:
- the data processing model trained in the first stage is trained in the second stage.
- the training part is also configured as:
- the data enhancement strategy in the first stage of training is uniformly and randomly selected
- the data processing model is trained in the first stage.
- the data to be processed and the training data are image data or text data.
- the data processing device provided in FIG. 10 can execute the foregoing corresponding method embodiments, and its implementation principles and technical effects are similar, and will not be repeated here.
- FIG. 11 is a schematic structural diagram of an electronic device provided by an embodiment of the disclosure.
- the terminal device may include: a processor 1101 and a memory 1102.
- the memory 1102 is used to store computer execution instructions, and the processor 1101 implements a method as in any of the foregoing embodiments when the processor 1101 executes a computer program.
- the above-mentioned processor 1101 may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it may also be a digital signal processor (Digital Signal Processing, DSP), a dedicated Integrated circuit (application-specific integrated circuit, ASIC), field-programmable gate array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
- the aforementioned memory 1102 may include random access memory (RAM), or may also include non-volatile memory (non-volatile memory), such as at least one disk memory.
- the embodiments of the present disclosure also provide a computer-readable storage medium, in which instructions are stored, which when run on a computer, cause the computer to execute the method in any of the above-mentioned embodiments.
- the embodiments of the present disclosure further provide a program product, the program product includes a computer program, the computer program is stored in a storage medium, at least one processor can read the computer program from the storage medium, the at least When a processor executes the computer program, the method of any of the foregoing embodiments can be implemented.
- FIG. 12 is a block diagram of a device 1200 for updating a data enhancement strategy provided according to this embodiment.
- the apparatus 1200 may be provided as a server or a computer. 12
- the apparatus 1200 includes a processing component 1201, which further includes one or more processors, and a memory resource represented by a memory 1202, for storing instructions executable by the processing component 1201, such as application programs.
- the application program stored in the memory 1202 may include one or more parts each corresponding to a set of instructions.
- the processing component 1201 is configured to execute instructions to execute the method of any one of the embodiments in FIG. 3 to FIG. 6 described above.
- the device 1200 may also include a power component 1203 configured to perform power management of the device 1200, a wired or wireless network interface 1204 configured to connect the device 1200 to a network, and an input output (I/O) interface 1205.
- the device 1200 can operate based on an operating system stored in the memory 1202, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
- "at least one” refers to one or more, and “multiple” refers to two or more.
- “And/or” describes the association relationship of the associated objects, indicating that there can be three relationships, for example, A and/or B, which can mean: A alone exists, A and B exist at the same time, and B exists alone, where A , B can be singular or plural.
- the character “/” generally indicates that the associated objects before and after are in an “or” relationship; in the formula, the character "/" indicates that the associated objects before and after are in a "division” relationship.
- At least one item (a) refers to any combination of these items, including any combination of a single item (a) or a plurality of items (a).
- at least one of a, b, or c can mean: a, b, c, ab, ac, bc, or abc, where a, b, and c can be single or multiple Piece.
- the size of the sequence numbers of the foregoing processes does not mean the order of execution.
- the execution order of the processes should be determined by their functions and internal logic, and should not be used in the embodiments of the present disclosure.
- the implementation process constitutes any limitation.
- the data processing model trained in the first stage is trained in the second stage, and the data enhancement strategy is updated according to the data processing model trained in the second stage.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
- Machine Translation (AREA)
- Devices For Checking Fares Or Tickets At Control Points (AREA)
Abstract
Description
数据增强操作 | 操作幅度 | 幅度单位 |
水平裁剪 | {0.1,0.2,0.3} | 宽度比例 |
垂直裁剪 | {0.1,0.2,0.3} | 高度比例 |
水平平移 | {0.15,0.3,0.45} | 宽度比例 |
垂直平移 | {0.15,0.2,0.45} | 高度比例 |
图像旋转 | {10,20,30} | 角度值 |
色彩调整 | {0.3,0.6,0.9} | 色彩平衡度 |
色调分离 | {4.4,5.6,6.8} | 像素位数值 |
日光化 | {26,102,179} | 像素阈值 |
对比度调整 | {1.3,1.6,1.9} | 对比度比例 |
锐度调整 | {1.3,1.6,1.9} | 锐化比例 |
亮度调整 | {1.3,1.6,1.9} | 亮度比例 |
自动对比度 | 无 | 无 |
均衡化 | 无 | 无 |
颜色反转 | 无 | 无 |
Claims (17)
- 一种数据增强策略的更新方法,所述方法包括:获取初始的数据增强策略;根据所述数据增强策略和预设的训练数据,对预设的经过第一阶段训练的数据处理模型进行第二阶段训练;根据经过第二阶段训练的数据处理模型,对所述数据增强策略进行更新,以得到更新后的数据增强策略。
- 根据权利要求1所述的方法,其中,所述方法还包括:获取第M次更新的所述数据增强策略,所述M大于或等于1;根据第M次更新的所述数据增强策略和所述训练数据,对所述经过第一阶段训练的数据处理模型进行第二阶段训练;根据经过第二阶段训练的数据增强模型,对所述数据增强策略进行第M+1次更新。
- 根据权利要求2所述的方法,其中,所述初始的数据增强策略的数量为多个,各所述数据增强策略的更新并行进行;所述方法还包括:每预设的更新次数,根据所述经过第二阶段训练的数据处理模型,在更新后的各所述数据增强策略中,选取最优的数据增强策略;在更新后的所述数据增强策略中,将除所述最优策略之外的各所述数据增强策略分别替换为所述最优的数据增强策略。
- 根据权利要求1-3任一项所述的方法,其中,所述数据增强策略包括多个预设的数据增强操作;所述根据所述数据增强策略和预设的训练数据,对预设的经过第一阶段训练的数据处理模型进行第二阶段训练,包括:按照各所述数据增强操作,依次对所述训练数据进行数据增强;通过数据增强后的所述训练数据,对所述经过第一阶段训练的数据处理模型进行第二阶段训练。
- 根据权利要求1-3任一项所述的方法,其中,所述根据经过第二阶段训练的数据处理模型,对所述数据增强策略进行更新,包括:根据所述经过第二阶段训练的数据处理模型,更新预设的策略模型;通过更新后的所述策略模型,确定各个预设策略的选中概率;按照各所述预设策略的选中概率,在各所述预设策略中选取更新后的所述数据增强策略。
- 根据权利要求5所述的方法,其中,在所述数据增强策略的更新次数为多次的情况下,所述根据经过第二阶段训练的数据处理模型,更新预设的策略模型,包括:根据预设的验证数据,对所述经过第二阶段训练的数据处理模型进行检验,得到检验结果;获取所述数据增强策略的前N-1次更新中所述经过第二阶段的数据处理模型的历史检验结果,所述N为所述数据增强策略当前更新的总次数;根据所述历史检验结果和所述检验结果,对所述策略模型进行更新。
- 根据权利要求6所述的方法,其中,所述根据所述历史检验结果和所述检验结果,对所述策略模型进行更新,包括:确定所述历史检验结果的均值;确定所述检验结果和所述均值的差值;根据所述差值,对所述策略模型中的策略参数进行更新。
- 根据权利要求1-3任一项所述的方法,其中,所述获取初始的数据增强策略之前,所述方法还包括:在各个预设策略中,均匀随机选取所述第一阶段训练中的数据增强策略;根据所述第一阶段训练中的数据增强策略和所述训练数据,对所述数据处理模型进行所述第一阶段训练。
- 一种数据处理方法,所述方法包括:获取待处理数据;通过预先训练好的数据处理模型,对所述待处理数据进行处理,所述数据处理模型依次经过第一阶段训练和第二阶段训练,在所述第二训练阶段中通过预设的数据增强策略和预设的训练数据对所述数据处理模型进行训练,所述数据增强策略采用如权利要求1-8任一项所述的数据增强策略的更新方法进行生成。
- 根据权利要求9所述的方法,其中,所述方法还包括:根据所述训练数据,对所述数据处理模型进行所述第一阶段训练;通过所述数据增强策略对所述训练数据进行数据增强;根据数据增强后的所述训练数据,对经过所述第一阶段训练的数据处理模型进行所述第二阶段训练。
- 根据权利要求10所述的方法,其中,所述根据所述训练数据,对所述数据处理模型进行所述第一阶段训练,包括:在各预设策略中,均匀随机选取所述第一阶段训练中的数据增强策略;根据所述第一阶段训练中的数据增强策略和所述训练数据,对所述数据处理模型进行所述第一阶段训练。
- 根据权利要求9-11任一项所述的方法,其中,所述待处理数据和所述训练数据为图像数据或者文本数据。
- 一种数据增强策略的更新装置,所述装置包括:获取部分,被配置为获取初始的数据增强策略;训练部分,被配置为根据所述数据增强策略和预设的训练数据,对预设的经过第一阶段训练的数据处理模型进行第二阶段训练;更新部分,被配置为根据经过第二阶段训练的数据处理模型,对所述数据增强策略进行更新,以得到更新后的所述数据增强策略。
- 一种数据处理装置,所述装置包括:获取部分,被配置为获取待处理数据;处理部分,被配置为通过预先训练好的数据处理模型,对所述待处理数据进行处理,所述数据处理模型依次经过第一阶段训练和第二阶段训练,在所述第二训练阶段中通过预设的数据增强策略和预设的训练数据对所述数据处理模型进行训练,所述数据增强策略采用如权利要求1-8任一项所述的数据增强策略的更新方法进行生成。
- 一种电子设备,其中,所述电子设备包括:存储器和处理器;所述存储器用于存储程序指令;所述处理器用于调用所述存储器中的程序指令执行如权利要求1-8中任一项或者权利要求9-12中任一项所述的方法。
- 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有计算机程序;所述计算机程序被执行时,实现如权利要求1-8中任一项或者权利要求9-12中任一项所述的方法。
- 一种计算机程序,包括计算机可读代码,当所述计算机可读代码在电子设备中运行时,所述电子设备中的处理器执行用于实现权利要求1-8中任一项或者权利要求9-12中任一项所述的方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021569392A JP2022541370A (ja) | 2020-06-09 | 2020-11-02 | データ強化ポリシーの更新方法、装置、デバイス及び記憶媒体 |
KR1020217038153A KR20220004692A (ko) | 2020-06-09 | 2020-11-02 | 데이터 증강 정책의 업데이트 방법, 장치, 기기 및 저장 매체 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010519507.3 | 2020-06-09 | ||
CN202010519507.3A CN111695624B (zh) | 2020-06-09 | 2020-06-09 | 数据增强策略的更新方法、装置、设备及存储介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021248791A1 true WO2021248791A1 (zh) | 2021-12-16 |
Family
ID=72479980
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/125967 WO2021248791A1 (zh) | 2020-06-09 | 2020-11-02 | 数据增强策略的更新方法、装置、设备及存储介质 |
Country Status (5)
Country | Link |
---|---|
JP (1) | JP2022541370A (zh) |
KR (1) | KR20220004692A (zh) |
CN (1) | CN111695624B (zh) |
TW (1) | TWI781576B (zh) |
WO (1) | WO2021248791A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114665986A (zh) * | 2022-03-15 | 2022-06-24 | 深圳市百泰实业股份有限公司 | 一种蓝牙钥匙的测试系统及方法 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111695624B (zh) * | 2020-06-09 | 2024-04-16 | 北京市商汤科技开发有限公司 | 数据增强策略的更新方法、装置、设备及存储介质 |
CN114462628A (zh) * | 2020-11-09 | 2022-05-10 | 华为技术有限公司 | 数据增强方法、装置、计算设备以及计算机可读存储介质 |
CN113537406B (zh) * | 2021-08-30 | 2023-04-07 | 重庆紫光华山智安科技有限公司 | 一种图像自动数据增强方法、系统、介质及终端 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110110861A (zh) * | 2019-05-09 | 2019-08-09 | 北京市商汤科技开发有限公司 | 确定模型超参数及模型训练的方法和装置、存储介质 |
CN110807109A (zh) * | 2019-11-08 | 2020-02-18 | 北京金山云网络技术有限公司 | 数据增强策略的生成方法、数据增强方法和装置 |
US20200114926A1 (en) * | 2018-10-16 | 2020-04-16 | Toyota Motor Engineering & Manufacturing North America, Inc. | Vehicle velocity predictor using neural networks based on v2x data augmentation to enable predictive optimal control of connected and automated vehicles |
CN111695624A (zh) * | 2020-06-09 | 2020-09-22 | 北京市商汤科技开发有限公司 | 数据增强策略的更新方法、装置、设备及存储介质 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9449283B1 (en) * | 2012-08-20 | 2016-09-20 | Context Relevant, Inc. | Selecting a training strategy for training a machine learning model |
CN111758105A (zh) * | 2018-05-18 | 2020-10-09 | 谷歌有限责任公司 | 学习数据增强策略 |
CN109961098B (zh) * | 2019-03-22 | 2022-03-01 | 中国科学技术大学 | 一种机器学习的训练数据选择方法 |
CN111127364B (zh) * | 2019-12-26 | 2022-08-02 | 吉林大学 | 图像数据增强策略选择方法及人脸识别图像数据增强方法 |
-
2020
- 2020-06-09 CN CN202010519507.3A patent/CN111695624B/zh active Active
- 2020-11-02 JP JP2021569392A patent/JP2022541370A/ja active Pending
- 2020-11-02 WO PCT/CN2020/125967 patent/WO2021248791A1/zh active Application Filing
- 2020-11-02 KR KR1020217038153A patent/KR20220004692A/ko active Search and Examination
-
2021
- 2021-04-07 TW TW110112619A patent/TWI781576B/zh active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200114926A1 (en) * | 2018-10-16 | 2020-04-16 | Toyota Motor Engineering & Manufacturing North America, Inc. | Vehicle velocity predictor using neural networks based on v2x data augmentation to enable predictive optimal control of connected and automated vehicles |
CN110110861A (zh) * | 2019-05-09 | 2019-08-09 | 北京市商汤科技开发有限公司 | 确定模型超参数及模型训练的方法和装置、存储介质 |
CN110807109A (zh) * | 2019-11-08 | 2020-02-18 | 北京金山云网络技术有限公司 | 数据增强策略的生成方法、数据增强方法和装置 |
CN111695624A (zh) * | 2020-06-09 | 2020-09-22 | 北京市商汤科技开发有限公司 | 数据增强策略的更新方法、装置、设备及存储介质 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114665986A (zh) * | 2022-03-15 | 2022-06-24 | 深圳市百泰实业股份有限公司 | 一种蓝牙钥匙的测试系统及方法 |
CN114665986B (zh) * | 2022-03-15 | 2023-05-16 | 深圳市百泰实业股份有限公司 | 一种蓝牙钥匙的测试系统及方法 |
Also Published As
Publication number | Publication date |
---|---|
JP2022541370A (ja) | 2022-09-26 |
TWI781576B (zh) | 2022-10-21 |
CN111695624A (zh) | 2020-09-22 |
KR20220004692A (ko) | 2022-01-11 |
TW202147180A (zh) | 2021-12-16 |
CN111695624B (zh) | 2024-04-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021248791A1 (zh) | 数据增强策略的更新方法、装置、设备及存储介质 | |
US20230043174A1 (en) | Method for pushing anchor information, computer device, and storage medium | |
WO2018227800A1 (zh) | 一种神经网络训练方法及装置 | |
KR20160037022A (ko) | 부스트 풀링 뉴럴 네트워크 기반의 데이터 분류 장치 및 그 데이터 분류 장치를 위한 뉴럴 네트워크 학습 방법 | |
US9361666B2 (en) | Learning user preferences for photo adjustments | |
CN110597878B (zh) | 一种多模态数据的跨模态检索方法、装置、设备及介质 | |
CN112257815A (zh) | 模型生成方法、目标检测方法、装置、电子设备及介质 | |
WO2020238039A1 (zh) | 神经网络搜索方法及装置 | |
US20220405579A1 (en) | Single-stage model training for neural architecture search | |
JP2023523029A (ja) | 画像認識モデル生成方法、装置、コンピュータ機器及び記憶媒体 | |
EP4094197A1 (en) | Neural architecture search with weight sharing | |
JP2022063250A (ja) | SuperLoss:堅牢なカリキュラム学習のための一般的な損失 | |
US20200334557A1 (en) | Chained influence scores for improving synthetic data generation | |
US20160070705A1 (en) | Interactive feedback for changes in search relevancy parameters | |
WO2023207139A1 (zh) | 利用交流型樽海鞘算法求解拉伸/压缩弹簧参数的方法 | |
CN112232397A (zh) | 图像分类模型的知识蒸馏方法、装置和计算机设备 | |
CN113537630A (zh) | 业务预测模型的训练方法及装置 | |
CN110991621A (zh) | 一种基于通道数搜索卷积神经网络的方法 | |
CN109033428B (zh) | 一种智能客服方法及系统 | |
WO2021253938A1 (zh) | 一种神经网络的训练方法、视频识别方法及装置 | |
CN112668718B (zh) | 神经网络训练方法、装置、电子设备以及存储介质 | |
US11507782B2 (en) | Method, device, and program product for determining model compression rate | |
US20230087774A1 (en) | Parameter optimization method, electronic device, and storage medium | |
CN116910210A (zh) | 基于文档的智能问答模型训练方法、装置及其应用 | |
CN115170902B (zh) | 图像处理模型的训练方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2021569392 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 20217038153 Country of ref document: KR Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20940452 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20940452 Country of ref document: EP Kind code of ref document: A1 |