CN113157453A

CN113157453A - Task complexity-based high-energy-efficiency target detection task dynamic scheduling method

Info

Publication number: CN113157453A
Application number: CN202110435960.0A
Authority: CN
Inventors: 高岭; 王祎昊; 杨冠华; 周冰; 任杰
Original assignee: Northwestern University
Current assignee: Northwestern University
Priority date: 2021-04-22
Filing date: 2021-04-22
Publication date: 2021-07-23
Anticipated expiration: 2041-04-22
Also published as: CN113157453B

Abstract

A high-energy-efficiency target detection task dynamic scheduling method based on task complexity is characterized in that a neural network is used for evaluating the complexity of a task, a target detection model and a cloud platform meeting user requirements are judged by weighting the current state of a mobile phone and the requirements of a user, when the user submits the target detection task, the task is unloaded according to the complexity of the task, the current state of the mobile phone and the requirements of the user, and the calculation pressure and energy consumption of a mobile phone end can be greatly reduced by unloading the calculation-intensive target detection task to the cloud for execution under the condition of good network; under the condition that the network is not good, the energy consumption of the mobile phone end can be reduced by calling the local small model of the mobile phone end, and the energy consumption pressure of the mobile phone end can be greatly reduced by utilizing the method and the system under the condition that certain user experience is provided and the precision of the target detection model is kept.

Description

Task complexity-based high-energy-efficiency target detection task dynamic scheduling method

Technical Field

The invention belongs to the technical field of computers, and particularly relates to a task complexity-based dynamic scheduling method for an energy-efficient target detection task.

Background

Most CNN-based target detection models are only applicable to recommendation systems. For example, the process of searching for free parking spaces by a city camera is performed by a slow accurate model, whereas car collision warnings are related to a fast inaccurate model. The accuracy of the real-time target detection models is improved, and the real-time target detection models can be used for prompting to generate a recommendation system and can also be used for independent process management and manual input reduction. Real-time object detector operations on conventional Graphics Processing Units (GPUs) allow them to be used in large quantities at affordable prices. The most accurate modern neural networks cannot operate in real time and require a large number of GPUs for large batches of training. Further accuracy requirements have prompted larger and deeper target detection models. However, the deployment of a large-scale target detection model at the mobile end is not only very expensive in time, but also very expensive in energy consumption. In recent years, many improvements have been proposed for depth target detection models, however these models are still bulky compared to mobile devices with little computational power.

In recent years, lightweight neural networks designed for mobile terminals have become increasingly popular, and designing deep neural network architectures to achieve the best compromise between accuracy and efficiency has been an active area of research. Novel manual structural and algorithmic neural architecture searches play an important role in advancing the field. Through these design techniques, some lightweight neural networks designed for mobile platforms play a tremendous role. However, these lightweight neural networks are difficult to satisfy in terms of accuracy compared to deep neural networks. Furthermore, these neural networks do not guarantee optimal results for any mobile-side architecture.

Therefore, a method is needed which can be applied to any mobile platform while maintaining the accuracy, and can reduce the inference cost of the mobile terminal.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention aims to provide a task complexity-based high-energy-efficiency target detection task dynamic scheduling method, which has the advantage of reducing the energy consumption of a mobile platform while maintaining the precision.

In order to achieve the purpose, the invention provides the following technical scheme:

a high-energy-efficiency target detection task dynamic scheduling method based on task complexity selects a proper task unloading strategy according to the state of a current user and the task complexity when the user submits a target detection task, and is characterized by comprising the following steps:

step S1, acquiring complexity C of the task submitted by the user by using the opencv library_xAnd through the complexity of the task, the prediction capability C of the supported ith target detection model to the task submitted by the user is given_iAcquiring the predicted fraction S of the model to the task by utilizing the prediction capability and the complexity of the task_i；

Step S2, using the network status of the current mobile terminal, modeling the network status, respectively calculating to unload the tasks to different server terminals Cloud_iGenerated communication duration

And communication energy consumption

Step S3, calculating the calculation energy consumption E required by running different models on different platforms_iAnd calculating the time T_iAnd generating different weights according to different requirements of users to generate final weighted scores of different models on different platforms for tasks

Wherein alpha, beta and gamma are weight values adjusted according to user settings, F is relu function, F is sigmoid function, G is logarithmic function with natural constant as base, and the expected fraction S_iThrough step S1 deriving, communication duration

Energy consumption of communication

The energy consumption E is calculated in step S2_iAnd duration T_iAnd finally, selecting the strategy with the maximum weighting score as the final strategy through calculation in the step.

The step S1 further includes the following steps:

step S11, respectively obtaining the statistical characteristics F of the tasks submitted by the users by using the opencv library in the local platform_iAnd using the obtained different statistical characteristics to model the statistical characteristics by using a neural network, and calculating the complexity C of the submitted task_x；

Step S12, the prediction capabilities of all models supported by the cloud platform are obtained locally, and the prediction capability C of a certain model on the task submitted by the user is calculated by utilizing the built neural network model_i；

Step S13, calculating the weighted value of basic attribute such as complexity of model and prediction ability to obtain the predicted score S of model to task_i。

In step S11, a multilayer fully-connected neural network is used as a default for modeling.

The step S2 further includes the following steps:

step S21, reading the device information file of the mobile phone end by using C/C + + language, and acquiring some hardware information related to the current transmission of the mobile phone;

step S22, using the transmission time and transmission energy consumption of some fixed points acquired in advance, making linear prediction on the information acquired in step S21, and calculating the generated communication time length

And communication energy consumption

In step S21, some hardware information includes, by default, the current voltage status, the current network speed, and the maximum bandwidth of the mobile phone.

The step S3 includes the steps of:

step S31, selecting a group of fixed weight combinations alpha, beta and gamma according to different setting strategies of users, and increasing the weight corresponding to a certain demand when the user is biased to the demand;

step S32, calculating energy consumption E required by running different models on different platforms_iAnd time T_iAnd generating a weighted score based on the data of step S13, step S22 and the present step;

in step S33, all the weighted scores generated in step S32 are sorted, and the selection policy that has the largest weighted score is selected as the final selection policy.

In step S31, the weight combination needs to satisfy α + β + γ of 1 by default.

In step S32, a weighted score is generated by an exponential function with a natural constant as a base as a default.

The invention has the beneficial effects that:

the invention provides a task complexity-based high-energy-efficiency target detection task dynamic scheduling method, and aims at designing a scheduling mode for unloading a computation task of a deep neural network by using a cloud under the background that a target detection requirement possibly proposed by a user can cause a large amount of computation and energy consumption pressure on a self mobile platform under the condition that the computation capability of a mobile terminal is insufficient.

According to the scheduling strategy provided by the invention, by comprehensively considering various factors such as user requirements, task complexity, the current mobile terminal state and the like, the calculation pressure brought to the mobile platform can be reduced as much as possible when the user has the target detection task requirements, so that the user experience is greatly improved. Meanwhile, the energy consumption of the mobile terminal can be reduced to the maximum extent while the calculation pressure of the mobile platform is reduced.

Drawings

Fig. 1 is a schematic view of a usage scenario of the present invention.

Fig. 2 is a schematic view of the working process of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings, but the present invention is not limited to the following embodiments.

Example 1

As shown in a usage scenario diagram of the invention shown in fig. 1, the method for dynamically scheduling an energy-efficient target detection task based on task complexity provided by the invention designs a scheduling method for offloading a computation task of a deep neural network by using a cloud under the background that a target detection requirement possibly proposed by a user causes a large amount of computation and energy consumption pressure on a mobile platform of the user under the situation that the computing power of a mobile terminal is insufficient. By comprehensively considering various factors such as user requirements, task complexity and the state of the current mobile terminal, the computing pressure on the mobile platform can be reduced as much as possible when the user has the requirement of the target detection task, and therefore the user experience is greatly improved and the energy consumption caused by computing is reduced.

As shown in the workflow diagram of fig. 2, after a user submits a task, some hardware information of the mobile platform of the user, such as network speed, power, etc., is first obtained; next, acquiring some configuration information set by the user, such as the minimum waiting time and the like; and finally, selecting the deployed hardware platform and the deployed model according to the information, and giving an optimal selection strategy.

Referring to fig. 1 and 2, a task complexity-based energy-efficient dynamic scheduling method for target detection tasks selects a proper task offloading strategy according to the state of a current user and the complexity of the task when the user submits a target detection task, and specifically includes the following steps:

step S1, acquiring complexity C of the task submitted by the user by using the opencv library_xAnd through the complexity of the task, the supported prediction energy of the ith target detection model on the task submitted by the user is givenForce C_iAcquiring the predicted fraction S of the model to the task by utilizing the prediction capability and the complexity of the task_i。

And communication energy consumption

Wherein alpha, beta and gamma are weight values adjusted according to user settings, F is relu function, F is sigmoid function, G is logarithmic function with natural constant as base, and the expected fraction S_iThe communication duration is determined by step S1

Energy consumption of communication

The step S1 includes the steps of:

Under the default condition, a multilayer fully-connected neural network is used for modeling;

The step S2 includes the steps of:

step S21, acquiring some current hardware information of the mobile phone by reading the device file of the mobile phone end;

under the default condition, the hardware information comprises the current voltage state, the current network speed and the maximum bandwidth of the mobile phone;

And communication energy consumption

The step S3 includes the steps of:

step S31, selecting a group of fixed weight combinations alpha, beta and gamma according to the setting strategy of the user;

by default, the weight combination needs to satisfy α + β + γ ═ 1.

Step S32, calculating energy consumption E required by running different models on different platforms_iAnd time T_iAnd generating a weighted score based on the data of steps S13, S22 and the step;

generating a weighted score by an exponential function with a natural constant as a base under a default condition;

Claims

1. A high-energy-efficiency target detection task dynamic scheduling method based on task complexity selects a proper task unloading strategy according to the state of a current user and the task complexity when the user submits a target detection task, and is characterized by comprising the following steps:

And communication energy consumption

Energy consumption of communication

2. The method for dynamically scheduling task-complexity-based energy-efficient target detection tasks according to claim 1, wherein the step S1 further comprises the following steps:

3. The method for dynamically scheduling task-complexity-based energy-efficient target detection tasks according to claim 2, wherein in step S11, a multi-layer fully-connected neural network is used for modeling by default.

4. The method for dynamically scheduling task-complexity-based energy-efficient target detection tasks according to claim 1, wherein the step S2 further comprises the following steps:

step S22, using the previously acquiredThe transmission time and transmission energy consumption of some fixed points are linearly predicted from the information obtained in step S21, and the resulting communication duration is calculated

And communication energy consumption

5. The method as claimed in claim 4, wherein in step S21, some hardware information includes current voltage status, current network speed, and maximum bandwidth of the mobile phone by default.

6. The method for dynamically scheduling energy-efficient object detection tasks based on task complexity as claimed in claim 1, wherein the step S3 comprises the following steps:

7. The method for dynamically scheduling task-complexity-based energy-efficient target detection tasks according to claim 6, wherein in step S31, the weight combination needs to satisfy α + β + γ ═ 1 by default.

8. The method for dynamically scheduling energy-efficient object detection tasks according to claim 6, wherein in step S32, the weighting score is generated by an exponential function with a natural constant as a base by default.