WO2022111466A1

WO2022111466A1 - Task scheduling method, control method, electronic device and computer-readable medium

Info

Publication number: WO2022111466A1
Application number: PCT/CN2021/132400
Authority: WO
Inventors: 吴臻志; 祝夭龙
Original assignee: 北京灵汐科技有限公司
Priority date: 2020-11-24
Filing date: 2021-11-23
Publication date: 2022-06-02
Also published as: CN114546631A

Abstract

Provided are a task scheduling method, a control method, an electronic device and a computer-readable medium. The control method for task scheduling is applied to a core of a many-core system, and the control method comprises: performing load detection on at least one core cluster of a many-core system, and determining whether there is a target core cluster in the core cluster which has been subjected to detection, wherein the target core cluster is a core cluster where task allocation needs to be adjusted in the at least one core cluster; and when the target core cluster is present, controlling, according to a load detection result, the target core cluster to adjust the task allocation, wherein the many-core system comprises a plurality of cores, at least one of the cores constitutes the core cluster, and the many-core system comprises the at least one core cluster.

Description

Task scheduling method, control method, electronic device, computer readable medium

technical field

The present disclosure relates to the field of computer technology, and in particular, to a control method for task scheduling, a task scheduling method, a task scheduling method, an electronic device, a computer-readable medium, and a computer program product.

Background technique

A many-core system can be composed of at least one chip, each chip has multiple computing units, and the smallest computing unit in each chip that can be independently scheduled and has complete computing power is called a core. In a many-core system, multiple cores can work together, and each core can run program instructions independently, using parallel computing capabilities to speed up program execution and provide multitasking capabilities.

SUMMARY OF THE INVENTION

The present disclosure provides a control method for task scheduling based on a many-core system, a task scheduling method, a task scheduling method, an electronic device, a computer-readable medium, and a computer program product.

In a first aspect, an embodiment of the present disclosure provides a task scheduling control method, which is applied to the core of a many-core system, including: performing load detection on at least one core cluster of the many-core system, and determining whether the detected core cluster is in the core cluster. There is a target core cluster, and the target core cluster is at least one of the core clusters whose task allocation needs to be adjusted; if the target core cluster exists, the target core cluster is controlled to adjust the task allocation according to the load detection result ; wherein the many-core system includes a plurality of cores, at least one of the cores forms the core cluster, and the many-core system includes at least one of the core clusters.

In a second aspect, an embodiment of the present disclosure provides a task scheduling method, which is applied to a first control core of a first core cluster of a many-core system. The task scheduling method includes: performing load detection on the first core cluster, determining the Whether the first core cluster is the target core cluster, and the target core cluster is the core cluster whose task allocation needs to be adjusted; if the first core cluster is the target core cluster, according to the load of the first core cluster As a result of the detection, the task allocation of the first core cluster is adjusted; wherein, the many-core system includes multiple cores, at least one of the cores forms the core cluster, and the many-core system includes at least one of the core clusters , the first core cluster is one of at least one of the core clusters.

In a third aspect, an embodiment of the present disclosure provides a task scheduling method, which is applied to the core of a many-core system, including: sending a request signaling for acquiring task information of a task processed by a target core cluster, so as to acquire the task information; the task information, form a second core cluster for replacing the target core cluster, and the core is the second control core of the second core cluster; run the target core on the second core cluster The task of cluster processing; wherein, the many-core system includes a plurality of cores, at least one of the cores forms the core cluster, the many-core system includes at least one of the core clusters, and the target core cluster is the task allocation that needs to be adjusted of the core cluster.

In a fourth aspect, an embodiment of the present disclosure provides a computer-readable medium on which a computer program is stored, and when the program is executed by a processor, implements at least one of the following methods: the method described in the first aspect of the embodiment of the present disclosure A control method for task scheduling; the task scheduling method according to the second aspect of the embodiment of the present disclosure; and the task scheduling method according to the third aspect of the embodiment of the present disclosure.

In a fifth aspect, embodiments of the present disclosure provide an electronic device, including: a plurality of cores; and a network-on-chip configured to exchange data and external data among the plurality of cores; One or more instructions that are executed by one or more of the cores to enable the one or more of the cores to perform at least one of the following methods: The control method for task scheduling described above; the task scheduling method described in the second aspect of the embodiment of the present disclosure; and the task scheduling method described in the third aspect of the embodiment of the present disclosure.

In a sixth aspect, an embodiment of the present disclosure provides a computer program product, which, when running on a computer, causes the computer to execute at least one of the following methods: the task scheduling described in the first aspect of the embodiment of the present disclosure A control method; the task scheduling method described in the second aspect of the embodiment of the present disclosure; and the task scheduling method described in the third aspect of the embodiment of the present disclosure.

In the embodiment of the present disclosure, load detection can be performed on each core cluster in the many-core system, and when there is a core cluster that needs to adjust the task allocation, the core cluster is controlled to adjust the task allocation according to the load detection result, thereby improving the flexibility of task processing in the many-core system. improve the utilization efficiency of computing resources in many-core systems.

It should be understood that what is described in this section is not intended to identify key or critical features of embodiments of the disclosure, nor is it intended to limit the scope of the disclosure. Other features of the present disclosure will become readily understood from the following description.

Description of drawings

The accompanying drawings are used to provide a further understanding of the present disclosure and constitute a part of the specification, and together with the embodiments of the present disclosure, they are used to explain the present disclosure, and are not intended to limit the present disclosure. The above and other features and advantages will become more apparent to those skilled in the art by describing detailed example embodiments with reference to the accompanying drawings, in which:

1 is a flowchart of a control method for task scheduling in an embodiment of the present disclosure;

2 is a schematic diagram of a many-core system in an embodiment of the present disclosure;

3 is a flowchart of a task scheduling method in an embodiment of the present disclosure;

4 is a flowchart of a task scheduling method in an embodiment of the present disclosure;

5 is a flowchart of a task scheduling method in an embodiment of the present disclosure;

6 is a flowchart of a task scheduling method in an embodiment of the present disclosure;

Fig. 7 is the composition block diagram of a kind of core in the embodiment of the present disclosure;

FIG. 8 is a block diagram of an electronic device in an embodiment of the present disclosure.

Detailed ways

In order for those skilled in the art to better understand the technical solutions of the present disclosure, the exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, including various details of the embodiments of the present disclosure to facilitate understanding, and they should be considered to be exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.

Various embodiments of the present disclosure and various features of the embodiments may be combined with each other without conflict.

As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.

The terminology used herein is used to describe particular embodiments only and is not intended to limit the present disclosure. As used herein, the singular forms "a" and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It will also be understood that when the terms "comprising" and/or "made of" are used in this specification, the stated features, integers, steps, operations, elements and/or components are specified to be present, but not precluded or Add one or more other features, integers, steps, operations, elements, components, and/or groups thereof. Words like "connected" or "connected" are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect.

Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art. It will also be understood that terms such as those defined in common dictionaries should be construed as having meanings consistent with their meanings in the context of the related art and the present disclosure, and will not be construed as having idealized or over-formal meanings, unless expressly so limited herein.

In some related technologies, through static scheduling, tasks are allocated to the many-core system to run according to a scheduling policy formulated in advance. Static scheduling maps a certain part of the algorithm to the core of the chip, and each core only runs the mapped part of the algorithm, resulting in poor flexibility of task processing in many-core systems.

In order to solve the problem that static scheduling makes the many-core system less flexible in task processing, in the embodiment of the present disclosure, a core cluster is dynamically formed according to computing tasks. The core cluster includes multiple cores, and the many-core system may have multiple cores. Core clusters, each core cluster performs corresponding computing tasks. However, each cluster of cores typically performs computational tasks in a pipelined fashion. When the processing tasks change, such as a sudden increase in business flow, some core clusters may be overloaded. In the case where each core cluster performs computing tasks in a pipelined manner, the reduction of the processing efficiency of a certain core cluster will lead to a reduction of the processing efficiency of the entire many-core system.

In view of this, an embodiment of the present disclosure provides a control method for task scheduling, which is applied to the core of a many-core system. FIG. 1 is a flowchart of a control method for task scheduling in an embodiment of the present disclosure. 1, the control method includes:

In step S110, load detection is performed on at least one core cluster of the many-core system, and it is determined whether there is a target core cluster in the detected core cluster, and the target core cluster is at least one of the core clusters that needs to adjust the task allocation the core cluster;

In step S120, when the target core cluster exists, control the target core cluster to adjust task allocation according to the load detection result;

The many-core system includes a plurality of cores, at least one of the cores forms the core cluster, and the many-core system includes at least one of the core clusters.

FIG. 2 is a schematic diagram of a many-core system in an embodiment of the present disclosure. As shown in FIG. 2 , the many-core system includes a first core and a second core, and the first core and the second core have different control capabilities and different functions in the many-core system. The first core may be the control core of the many-core system, which is used to receive instructions and tasks of the external system; control each core in the many-core system to perform processing tasks and the like. A plurality of second cores are formed into a core cluster (as shown by the dotted box in FIG. 2 ). There may be multiple core clusters in a many-core system, and each core cluster performs a corresponding computing task.

In the plurality of second cores of each core cluster, including one second core as the control core of the core cluster and at least one second core as the slave core, the control core is used to receive the first core or other components (eg Many-core system synchronizers, external devices, etc.) instructions and tasks; split tasks; control each slave core in the core cluster to execute subtasks, etc.; slave cores are used to execute corresponding subtasks. The present disclosure does not limit the specific functional classification of the cores and the types of tasks performed by each core.

In the embodiment of the present disclosure, the high-level control core in the many-core system performs load detection on each core cluster in the many-core system by performing steps S110 to S120, and when there are core clusters that need to adjust task allocation, the control needs The core cluster that adjusts the assignment of tasks adjusts the assignment of tasks.

In this embodiment of the present disclosure, the high-level control core that executes steps S110 to S120 may be the first core in the many-core system as shown in FIG. 2 , or may be any one of the many-core systems independent of each core cluster core. This embodiment of the present disclosure does not limit this.

In the embodiment of the present disclosure, the load detection refers to determining whether the tasks processed by the core cluster match the computing resources of the core cluster. The result of the load detection may be that the target core cluster is overloaded, and adjusting the task allocation in step S120 may be to increase the number of cores in the target core cluster to increase the computing resources of the target core cluster; the result of the load detection may also be that the target core cluster computing resources Excessive, adjusting the task allocation in step S120 may be to reduce the number of cores in the target core cluster to save the computing resources of the many-core system and improve the utilization rate of the computing resources of the many-core system; the load detection result can also be the core cluster processing task and The computing resources of the core cluster are matched. This embodiment of the present disclosure does not limit this.

In the control method for task scheduling provided by the embodiment of the present disclosure, load detection is performed on each core cluster in the many-core system through a high-level control check, and when there are core clusters that need to adjust the task allocation, the core cluster that needs to adjust the task allocation is controlled to adjust the task. Allocation can dynamically increase computing resources for overloaded core clusters, and can timely recycle computing resources of core clusters with excess computing resources, thereby improving the flexibility of task processing in many-core systems and reducing the processing efficiency of some core clusters. The overall processing efficiency of the many-core system is reduced, and the utilization efficiency of the computing resources of the many-core system is improved at the same time.

As an optional implementation manner, when the high-level control core finds that the core cluster is overloaded or has excessive computing resources through load detection, it determines that the core cluster needs to adjust task allocation.

Correspondingly, in some embodiments, step S110 includes:

Judging whether there is an overloaded core cluster or a core cluster with excess computing resources in the detected core clusters;

If there is a core cluster with overload or excess computing resources in the detected core clusters, it is determined that the core cluster with overload or excess computing resources is the target core cluster.

In the embodiment of the present disclosure, when each core cluster processes tasks in a pipeline manner, multiple core clusters have a uniform synchronization period. Typically, the synchronization period is determined by the runtime of the longest-running core cluster. The longer the run time, the higher the load on the core cluster. As an optional implementation manner, it may be determined whether there is an overloaded core cluster according to the relationship between the running durations of multiple core clusters within a certain period of time.

Correspondingly, in some embodiments, the step of judging whether there is an overloaded core cluster in at least one of the core clusters of the many-core system includes:

Judging whether there is a candidate target core cluster within a predetermined time period, the candidate target core cluster is the core cluster with the longest running time in N synchronization cycles, where N is a natural number greater than or equal to a predetermined number;

When the candidate target core cluster exists, it is determined that there is an overloaded core cluster; wherein the candidate target core cluster is an overloaded core cluster.

For example, a predetermined period of time may include multiple synchronization cycles (the number is greater than or equal to N). If there is a candidate target core cluster that has the longest running time in the N synchronization cycles within the predetermined time period, it can be It is determined that there are overloaded core clusters, and N is a natural number greater than or equal to a predetermined number. On the contrary, the judgment can be continued after the end of the next predetermined time period.

It should be understood that those skilled in the art can set the duration of the predetermined time period and the predetermined number of synchronization cycles with the longest running time according to actual conditions, which are not limited in the present disclosure.

In this way, the accuracy of judging the overloaded core clusters can be improved, and the pertinence of task adjustment can be improved.

As another optional implementation manner, a predetermined synchronization period may be preset, and when the running duration of any one core cluster exceeds the predetermined synchronization period, it means that the core cluster is overloaded.

Determine whether there is a core cluster whose running time exceeds the predetermined synchronization period;

In the case that there is a core cluster whose running duration exceeds the predetermined synchronization period, it is determined that there is an overloaded core cluster, wherein the core cluster whose running duration exceeds the predetermined synchronization period is an overloaded core cluster.

For example, the predetermined synchronization period can be set as: the time required for each core cluster to process a computing task in one phase during normal computing, and when the predetermined synchronization period is reached, each core in the core cluster can be phased Switch to the next phase. For any core cluster, if the running duration of a phase of all or part of the cores in the core cluster exceeds a predetermined synchronization period, the core cluster may be considered to be overloaded. The present disclosure does not limit the specific value of the predetermined synchronization period.

In some embodiments, multiple overloaded core clusters may also be determined from the core clusters of the many-core system, which is not limited in the present disclosure.

In the embodiment of the present disclosure, when the target core cluster is controlled to adjust the task allocation in step S120, a solution for adjusting the task allocation of the target core cluster without stopping is provided.

Accordingly, in some embodiments, step S120 includes:

apply for an idle core as the second control core of the second core cluster that replaces the target core cluster;

In response to the request signaling for the second control core to acquire task information of the task processed by the target core cluster, transmit the task information of the task processed by the target core cluster to the second control core

That is to say, when the task allocation of the target core cluster needs to be adjusted, an idle core can be dynamically applied for, a new core cluster (called the second core cluster) can be rebuilt, and the target core cluster can be replaced by the second core cluster; When the second core cluster is used, there is no need to suspend the operation of the target core cluster. After the second core cluster replaces the target core cluster, the target core cluster may be disbanded, and the cores in the target core cluster become idle cores, thereby releasing computing resources.

As an optional implementation manner, the high-level control core determines the control core of the second core cluster (referred to as the second control core), and the second core cluster interacts with the high-level control core and the control core of the target core cluster , to complete the formation of the second core cluster.

In some embodiments, task information of tasks processed by the target core cluster may be stored in the control core of the target core cluster; task information of tasks processed by each core cluster may also be stored in the high-level control core. This embodiment of the present disclosure does not limit this.

In some embodiments, the second control core may obtain task information of the task processed by the target core cluster from the control core of the target core cluster, and may also obtain task information of the task processed by the target core cluster from the high-level control core. This embodiment of the present disclosure also does not limit this.

In some embodiments, the task information of the task processed by the target core cluster is transmitted to the second control core in response to the request signaling by the second control core to obtain the task information of the task processed by the target core cluster nuclear.

In the embodiment of the present disclosure, the task information of the task processed by the target core cluster is not limited. For example, the task information may include configuration information of each core in the second core cluster, and may also include information representing storage content of each core in the second core cluster.

In this way, the target core cluster can be replaced by the second core cluster, so as to realize the process of adjusting the task allocation of the target core cluster, thereby improving the efficiency of the adjustment.

FIG. 3 is a flowchart of a task scheduling method in an embodiment of the present disclosure. Referring to FIG. 3 , an embodiment of the present disclosure provides a task scheduling method, which is applied to a first control core of a first core cluster of a many-core system. Task scheduling methods include:

In step S210, load detection is performed on the first core cluster to determine whether the first core cluster is a target core cluster, and the target core cluster is a core cluster that needs to adjust task allocation;

In step S220, when the first core cluster is the target core cluster, adjust the task allocation of the first core cluster according to the load detection result of the first core cluster;

The many-core system includes a plurality of cores, at least one of the cores constitutes the core cluster, the many-core system includes at least one of the core clusters, and the first core cluster is one of the at least one core cluster. One.

In the embodiment of the present disclosure, any one core cluster (referred to as the first core cluster) in the many-core system performs load detection by executing steps S210 to S220 through its control core (referred to as the first control core). When the core cluster needs to adjust the task allocation, the task allocation of the first core cluster is adjusted. In this way, the load detection and task adjustment of each core cluster in the many-core system can be realized.

In the embodiment of the present disclosure, the load detection refers to determining whether the tasks processed by the core cluster match the computing resources of the core cluster. The result of the load detection may be that the target core cluster is overloaded, and adjusting the task allocation in step S220 may be to increase the number of cores in the target core cluster to increase the computing resources of the target core cluster; the result of the load detection may also be that the target core cluster computing resources Excessive, in step S220, adjusting the task allocation can be to reduce the number of cores in the target core cluster to save the computing resources of the many-core system and improve the utilization rate of the computing resources of the many-core system; the load detection result can also be the core cluster processing task and The computing resources of the core cluster are matched. This embodiment of the present disclosure does not limit this.

In the task scheduling method provided by the embodiment of the present disclosure, the control of the core cluster in the many-core system checks the load of the core cluster, and adjusts the task allocation of the core cluster when the core cluster needs to adjust the task allocation, so that when the core cluster is overloaded Dynamically increase computing resources, and can release computing resources in time when the core cluster computing resources are excessive, thereby improving the flexibility of many-core system task processing and reducing the overall processing efficiency of the many-core system due to the reduction of the processing efficiency of some core clusters. , while improving the utilization efficiency of computing resources in many-core systems.

As an optional implementation manner, if the first control core finds that the first control core is overloaded or has excess computing resources through load detection, it is determined that the first core cluster is the target core cluster, and task allocation needs to be adjusted.

Correspondingly, in some embodiments, step S210 includes:

Determine whether the first core cluster is overloaded or has excess computing resources;

When the first core cluster is overloaded or has excess computing resources, it is determined that the first core cluster is the target core cluster.

For example, if the time required for each core in the first core cluster to process a phase of computing tasks significantly exceeds the average value of the cores in each core cluster, the first core cluster may be considered overloaded; otherwise, the first core cluster is overloaded. If the time required by each core in the core cluster to process a computing task of a phase is significantly lower than the average value of the cores in each core cluster, it can be considered that the first core cluster has excess computing resources.

In some embodiments, if the first core cluster is overloaded or has excess computing resources, it may be determined that the first core cluster is the target core cluster, and task allocation needs to be adjusted. It should be understood that those skilled in the art can set specific judgment conditions for overload or excess computing resources according to actual situations, which are not limited in the present disclosure.

In this way, the determination of the target core cluster can be realized, and the efficiency of the determination can be improved.

In the embodiment of the present disclosure, when each core cluster processes tasks in a pipeline manner, multiple core clusters have a uniform synchronization period. Typically, the synchronization period is determined by the runtime of the core cluster with the largest runtime. The longer the runtime, the higher the load on the core cluster. As an optional implementation manner, it may be determined whether the target core cluster is overloaded according to the relationship between the running duration of the target core cluster and the running durations of other core clusters within a certain period of time.

Accordingly, in some embodiments, the step of determining whether the first core cluster is overloaded includes:

determining the running duration of multiple synchronization cycles of the first core cluster within a predetermined time period;

According to the running durations of multiple synchronization cycles in the predetermined time period, determine the number of synchronization cycles with the longest running duration of the first core cluster in at least one of the core clusters;

When the number of synchronization cycles with the longest running duration of the first core cluster exceeds a predetermined value, it is determined that the first core cluster is overloaded.

For example, the predetermined time period may include multiple synchronization cycles (the number is greater than or equal to N), and the running durations of the multiple synchronization cycles of the first core cluster in the predetermined time period may be determined first; The running duration is sorted with the running duration of the synchronization cycle of each core cluster, and the synchronization cycle with the longest running duration is determined. If there is a synchronization period with the longest running time in the first core cluster, and the number N exceeds a predetermined value, it can be determined that the first core cluster is overloaded.

It should be understood that those skilled in the art can set the duration of the predetermined time period and the predetermined number of synchronization cycles with the longest running duration according to actual conditions, which are not limited in the present disclosure.

In this way, the accuracy of judging an overloaded core cluster can be improved.

As another optional implementation manner, a predetermined synchronization period may be preset, and when the running duration of the target core cluster exceeds the predetermined synchronization period, it means that the target core cluster is overloaded.

judging whether the running duration of the first core cluster exceeds a predetermined synchronization period;

In the case that the running time of the first core cluster exceeds the predetermined synchronization period, it is determined that the first core cluster is overloaded.

For example, the predetermined synchronization period can be set as: the time required for each core cluster to process a computing task in one phase during normal computing, and when the predetermined synchronization period is reached, each core in the core cluster can be phased Switch to the next phase. For the first core cluster, if the running duration of a phase of all or part of the cores in the first core cluster exceeds a predetermined synchronization period, the first core cluster may be considered to be overloaded. The present disclosure does not limit the specific value of the predetermined synchronization period.

In the embodiment of the present disclosure, when adjusting the task allocation of the target core cluster in step S220, a solution for adjusting the task allocation of the target core cluster without stopping is provided. For example, when it is necessary to adjust the task allocation of the target core cluster, you can dynamically apply for idle cores, re-establish a new core cluster, and replace the target core cluster with the new core cluster; among them, when forming a new core cluster, there is no need to pause. The operation of the target core cluster. After the new core cluster replaces the target core cluster, the target core cluster may be disbanded, and the cores in the target core cluster become idle cores, thereby releasing computing resources.

As an optional implementation manner, the control core of the target core cluster determines the control core of the new core cluster, and the control core of the new core cluster interacts with the control core of the target core cluster to complete the formation of the new core cluster .

FIG. 4 is a flowchart of a task scheduling method in an embodiment of the present disclosure.

In some embodiments, referring to FIG. 4, in step S220, the step of adjusting the task allocation of the first core cluster according to the load detection result of the first core cluster includes:

In step S221, apply for an idle core as the second control core of the second core cluster that replaces the first core cluster;

In step S222, in response to the request signaling of the second control core to acquire the task information of the task processed by the first core cluster, transmit the task information of the task processed by the first core cluster to the second control core control nucleus;

In step S223, in response to the request signaling of the second control core to obtain the original data, obtain the original data;

In step S224, transmitting the original data to the second control core;

In step S225, the received input and output information of the second core cluster is added to the pipeline composed of multiple core clusters;

In step S226, start signaling is sent to the second control core.

For example, when the task allocation of the target core cluster needs to be adjusted, the first control core can dynamically apply for an idle core in step S221 as the second control core of the second core cluster that replaces the first core cluster.

In some embodiments, the second control core may send a request signaling to the first control core, which is used to request task information of the task processed by the first core cluster; when the first control core receives the request signaling, it may In S22, in response to the request signaling, the task information of the task processed by the first core cluster is transmitted to the second control core.

In the embodiment of the present disclosure, the task information of the task processed by the target core cluster is not limited. For example, the task information may include configuration information of each core in the second core cluster, and may also include information representing the storage content of each core in the second core cluster.

In some embodiments, after determining the task information of the task to be processed, the second control core may form a second core cluster to replace the first core cluster. For example, apply to the many-core system for multiple idle cores according to task information; configure tasks for multiple idle cores.

In some embodiments, after completing the task configuration, the second control core may send request signaling to the first control core for requesting raw data of the task processed by the first core cluster. When the first control core receives the request signaling, in step S223, in response to the request signaling, obtain the original data, and in step S224, transmit the original data to the second control core.

This embodiment of the present disclosure does not limit how to perform step S223 to obtain the original data. As an optional implementation manner, the step of acquiring the original data by the first control core includes: searching for data from each core in the first core cluster according to the original compilation information when the tasks of the first core cluster are configured, and reorganizing them into the raw data.

In some embodiments, after receiving the raw data, the second control core allocates the raw data to each core in the second core cluster, so that the second core cluster can process the tasks processed by the first core cluster. Further, the second control core may determine the input and output routes of the second core cluster, and send the input and output information of the second core cluster to the first control core.

In some embodiments, the first control core may add the received input and output information of the second core cluster to a pipeline composed of multiple core clusters in step S225.

In the embodiment of the present disclosure, the first control core adds the second core cluster to the pipeline by replacing the input and output information of the second core cluster into the pipeline composed of multiple core clusters, so that the second core cluster enters the pipeline. Normal computing task flow.

In some embodiments, the first control core may send a start signaling to the second control core in step S226, so that the second core cluster starts to replace the first core cluster to process computing tasks.

In some embodiments, step S220 further includes:

After receiving the message that the second core cluster has been started sent by the second control core, the first core cluster is disbanded.

In the embodiment of the present disclosure, after the second core cluster starts the task of executing the processing of the first core cluster, the first core cluster can be disbanded, so that the cores in the first core cluster become idle cores, thereby releasing computing resources.

In this way, the second core cluster can replace the first core cluster to process computing tasks, so as to realize the process of adjusting the task allocation of the first core cluster, thereby improving the flexibility of task processing in the many-core system. Improve the utilization efficiency of computing resources in many-core systems.

In the embodiment of the present disclosure, when adjusting the task allocation of the first core cluster in step S220, a solution is provided for adjusting the task allocation of the first core cluster by shutting down. For example, when it is necessary to adjust the task allocation of the first core cluster, dynamically apply for an idle core as the core of the first core cluster, and perform task configuration.

Correspondingly, in some embodiments, step S220 includes: applying for an idle core as a core to which the first core cluster belongs; and re-assigning tasks to the cores of the first core cluster.

That is, the first control core can send a request for adding cores to the high-level control core of the many-core system, so that the high-level control core allocates a new idle core to the first core cluster as the core of the first core cluster. The first control core may set the number of cores to be added according to actual conditions, which is not limited in the present disclosure.

In some embodiments, after adding cores, the first control core may reconfigure tasks for each core of the first core cluster, and after the task configuration is completed, control each core of the first core cluster to process computing tasks, so as to achieve The process of adjusting the task assignment of the first core cluster. The present disclosure does not limit the specific manner of task configuration.

It should be noted that, in the embodiment of the present disclosure, a global notification mechanism is included, for example, all cores and personal computer (PC, Personal Computer) terminals in the many-core system are broadcasted to notify invalid state information and valid state information. In this way, it is convenient for other core clusters, cores and PCs in the many-core system to perform operations such as task switching, task suspension or task restart according to the state of the first core cluster.

FIG. 5 is a flowchart of a task scheduling method in an embodiment of the present disclosure.

Referring to FIG. 5 , an embodiment of the present disclosure provides a task scheduling method, which is applied to the core of a many-core system, and the method includes:

In step S310, a request signaling for obtaining task information of the task processed by the target core cluster is sent to obtain the task information;

In step S320, according to the acquired task information, a second core cluster for replacing the target core cluster is formed, and the core is the second control core of the second core cluster;

In step S330, the task processed by the target core cluster is executed on the second core cluster;

The many-core system includes a plurality of cores, at least one of the cores constitutes the core cluster, the many-core system includes at least one of the core clusters, and the target core cluster is the core cluster that needs to adjust task allocation .

In an embodiment of the present disclosure, a solution for adjusting task allocation of a target core cluster without stopping the system is provided. For example, when it is necessary to adjust the task allocation of the target core cluster, dynamically apply for idle cores, re-establish a second core cluster, and replace the target core cluster with the second core cluster; among which, when forming the second core cluster, the target does not need to be suspended. The operation of the core cluster. After the second core cluster replaces the target core cluster, the target core cluster may be disbanded, and the cores in the target core cluster become idle cores, thereby releasing computing resources.

In the embodiment of the present disclosure, the second control core of the second core cluster may be determined by the control core of the target core cluster, or the second control core of the second core cluster may be determined by the high-level control core. This embodiment of the present disclosure does not limit this. The second control core interacts with the control core or high-level control core of the target core cluster through steps S310 to S330 to complete the formation of the second core cluster.

In the embodiment of the present disclosure, in step S310, the second control core may send a request signaling for acquiring task information of the task processed by the target core cluster to the high-level control core, so as to obtain the task information; The control core sends a request signaling for acquiring task information of the task processed by the target core cluster to acquire the task information. This embodiment of the present disclosure does not limit this.

In the embodiment of the present disclosure, in step S320, the second control core may form a second core cluster for replacing the target core cluster according to the acquired task information; and then in step S330, run the target on the second core cluster The tasks handled by the core cluster.

In the task scheduling method provided by the embodiment of the present disclosure, the second control core is determined by the high-level control core or the control core of the core cluster whose task allocation needs to be adjusted, and then the second control core forms the second core cluster, so that the second control core can be In the case of shutdown, the computing resources are dynamically increased when the core cluster is overloaded, and the computing resources are released in time when the core cluster computing resources are excessive, which can improve the flexibility of task processing in the many-core system and avoid the reduction of the processing efficiency of some core clusters. The overall processing efficiency of the many-core system is reduced, while the utilization efficiency of the computing resources of the many-core system is improved.

FIG. 6 is a flowchart of a task scheduling method in an embodiment of the present disclosure. 6, in some embodiments, step S320 includes:

In step S321, apply for multiple idle cores according to the task information;

In step S322, task configuration is performed on a plurality of the idle cores;

In step S323, sending a request signaling for obtaining raw data to obtain the raw data;

In step S324, the obtained raw data is allocated to each of the idle cores according to the task information;

In step S325, determine the input and output information of the second core cluster;

In step S326, the input and output information of the second core cluster is sent to the control core of the target core cluster;

Wherein, step S330 includes:

In step S331, in response to the start signaling sent by the control core of the target core cluster, the task processed by the target core cluster is executed on the second core cluster.

For example, after acquiring the task information, the second control core can apply for multiple idle cores according to the task information in step S321. For example, the number of idle cores required to execute the task is determined according to parameters such as the calculation amount and completion time requirements of the task processed by the target core cluster in the task information, which may be greater than the current number of cores in the target core cluster. The present disclosure does not limit the specific number of idle cores to be applied for.

In some embodiments, according to the number of idle cores to be applied for, the second control core may send an allocation request for idle cores to the high-level control core, so that the high-level control core allocates a corresponding number of idle cores to the second core cluster.

In some embodiments, after applying for multiple idle cores, the second control core may perform task configuration on the multiple idle cores in step S322 according to the task information, for example, split tasks and assign them to each idle core. The present disclosure does not limit the specific manner of task configuration.

In this embodiment of the present disclosure, performing task configuration on idle cores may include determining configuration information corresponding to each core, and may also include specifying information that each core should store. This embodiment of the present disclosure does not limit this.

In some embodiments, after completing the task configuration, the second control core may, in step S323, send a request signaling for obtaining the original data to the control core of the target core cluster, so as to obtain the original data.

In some embodiments, when the second control core receives the original data sent by the control core of the target core cluster, in step S324, according to the task information and the task configuration, the acquired original data may be allocated to each idle core , so that the second core cluster can process the tasks processed by the target core cluster.

In some embodiments, the second control core may determine the input and output routes of the second core cluster in step S325, obtain input and output information, and send the input and output information to the control core of the target core cluster in step S326.

In some embodiments, after receiving the input and output information, the control core of the target core cluster replaces the input and output information of the second core cluster into a pipeline composed of a plurality of core clusters, and adds the second core cluster to the pipeline to Make the second core cluster enter the normal computing task flow. After the control core of the target core cluster adds the input and output information of the second core cluster to the pipeline, it sends a start signaling to the second control core of the second core cluster.

In some embodiments, in step S331, the second control core controls the second core cluster to execute the task processed by the target core cluster in response to the activation signaling, thereby realizing the formation of a second core cluster for replacing the target core cluster. the whole process.

In this embodiment of the present disclosure, after step S331, the task scheduling method further includes: sending a message that the second core cluster has been started to the control core of the target core cluster, so as to dissolve the target core cluster.

That is, after the second core cluster starts the task of executing the processing of the target core cluster, the control core of the target core cluster may send a message that the second core cluster has been started. After receiving the message, the control core of the target core cluster can dissolve the target core cluster, so that the cores in the target core cluster become idle cores, thereby releasing computing resources.

According to an embodiment of the present disclosure, a control device for task scheduling is also provided, which is applied to the core of a many-core system, and the control method includes:

The first detection module is used to perform load detection on at least one core cluster of the many-core system, and determine whether there is a target core cluster in the detected core cluster, and the target core cluster is at least one of the core clusters that needs to be adjusted a core cluster for task allocation; a first adjustment module, configured to control the target core cluster to adjust task allocation according to the load detection result in the presence of the target core cluster; wherein the many-core system includes a plurality of cores, at least One of the cores constitutes the core cluster, and the many-core system includes at least one of the core clusters.

In some embodiments, the first detection module is configured to: determine whether there is an overloaded core cluster or a core cluster with excess computing resources in the detected core cluster; In the case of a core cluster with excess computing resources, it is determined that the core cluster with excessive load or excess computing resources is the target core cluster.

In some embodiments, the first detection module is configured to: determine whether there is a candidate target core cluster within a predetermined period of time, where the candidate target core cluster is the core cluster with the longest running time in N synchronization cycles , where N is a natural number greater than or equal to a predetermined number; when the candidate target core cluster exists, it is determined that there is an overloaded core cluster; wherein, the candidate target core cluster is an overloaded core cluster.

In some embodiments, the first detection module is configured to: determine whether there is a core cluster whose running duration exceeds a predetermined synchronization period; in the case of a core cluster whose running duration exceeds the predetermined synchronization period, determine that there is overload A core cluster, wherein a core cluster whose running time exceeds the predetermined synchronization period is an overloaded core cluster.

In some embodiments, the first adjustment module is configured to: apply for an idle core as a second control core of a second core cluster that replaces the target core cluster; acquire the target core in response to the second control core The request signaling of the task information of the task processed by the cluster transmits the task information of the task processed by the target core cluster to the second control core.

According to an embodiment of the present disclosure, there is also provided a task scheduling apparatus, which is applied to a first control core of a first core cluster of a many-core system, and the task scheduling apparatus includes:

The second detection module is configured to perform load detection on the first core cluster, and determine whether the first core cluster is a target core cluster, and the target core cluster is a core cluster that needs to adjust task allocation;

A second adjustment module, configured to adjust the task allocation of the first core cluster according to the load detection result of the first core cluster when the first core cluster is the target core cluster; The many-core system includes a plurality of cores, at least one of the cores forms the core cluster, the many-core system includes at least one of the core clusters, and the first core cluster is one of the at least one core cluster .

In some embodiments, the second detection module is configured to: determine whether the first core cluster is overloaded or have excess computing resources; in the case of overloading or excess computing resources of the first core cluster, determine whether the first core cluster is overloaded or has excess computing resources. The first core cluster is the target core cluster.

In some embodiments, the second detection module is configured to: determine the running duration of multiple synchronization cycles of the first core cluster within a predetermined time period; The running duration is to determine the number of synchronization cycles with the longest running duration of the first core cluster in at least one of the core clusters; when the number of synchronization cycles with the longest running duration of the first core cluster exceeds a predetermined value, determine The first core cluster is overloaded.

In some embodiments, the second detection module is configured to: determine whether the running duration of the first core cluster exceeds a predetermined synchronization period; when the running duration of the first core cluster exceeds the predetermined synchronization period Next, it is determined that the core cluster is overloaded.

In some embodiments, the second adjustment module is configured to: apply for an idle core as a second control core of a second core cluster that replaces the first core cluster; acquire the first core cluster in response to the second control core A request signaling of task information of a task processed by a core cluster transmits the task information of a task processed by the first core cluster to the second control core; in response to a request signal of the second control core to acquire original data order, obtain the original data; transmit the original data to the second control core; add the received input and output information of the second core cluster to the pipeline composed of multiple core clusters; The second control core sends the start signaling.

In some embodiments, the second adjustment module is further configured to: receive a message sent by the second control core that the second core cluster has been started, and dissolve the first core cluster.

In some embodiments, the second adjustment module is further configured to: apply for an idle core as a core to which the first core cluster belongs; and re-assign tasks to the cores of the first core cluster.

According to an embodiment of the present disclosure, a task scheduling apparatus is also provided, which is applied to the core of a many-core system, and the task scheduling method includes:

A request signaling sending module, configured to send a request signaling for acquiring task information of a task processed by the target core cluster, to acquire the task information;

a core cluster forming module, configured to form a second core cluster for replacing the target core cluster according to the acquired task information, and the core is the second control core of the second core cluster;

A task running module for running the task processed by the target core cluster on the second core cluster; wherein, the many-core system includes multiple cores, at least one of the cores forms the core cluster, and the many-core system includes multiple cores. The system includes at least one of the core clusters, and the target core cluster is the core cluster for which task allocation needs to be adjusted.

In some embodiments, the core cluster building module is configured to: apply for multiple idle cores according to the task information; perform task configuration on the multiple idle cores; send request signaling for obtaining raw data to obtain all the idle cores. Allocate the acquired raw data to each of the idle cores according to the task information; determine the input and output information of the second core cluster; send the input and output information of the second core cluster to all the idle cores The control core of the target core cluster; wherein, the task operation module is configured to: in response to a start signaling sent by the control core of the target core cluster, run the target core cluster processing on the second core cluster task.

In some embodiments, after the task running module, the apparatus further includes: a startup message sending module, configured to send a message that the second core cluster has been started to the control core of the target core cluster, so as to make The target core cluster is dissolved.

FIG. 7 is a block diagram of the composition of a core in an embodiment of the present disclosure. 7 , according to an embodiment of the present disclosure, a core is further provided, which is applied to a many-core system. The core includes: one or more processing units 101 ; and a storage unit 102 on which one or more programs are stored, When one or more programs are executed by one or more processing units, the one or more processing units implement at least one of the following methods: the control method for task scheduling described in the first aspect of the embodiment of the present disclosure; the implementation of the present disclosure Examples include the task scheduling method described in the second aspect; the task scheduling method described in the third aspect of the embodiments of the present disclosure. Wherein, the processing unit 101 is a device with data processing capability, including but not limited to an arithmetic unit, etc.; the storage unit 102 is a device with data storage capability, including but not limited to random access memory (RAM), read-only memory ( ROM), Power Erasable Programmable Read-Only Memory (EEPROM), Flash Memory (FLASH).

According to an embodiment of the present disclosure, there is also provided a computer-readable medium on which a computer program is stored, and when the program is executed by a processor, at least one of the following methods is implemented: the method described in the first aspect of the embodiment of the present disclosure. A control method for task scheduling; the task scheduling method according to the second aspect of the embodiment of the present disclosure; and the task scheduling method according to the third aspect of the embodiment of the present disclosure.

FIG. 8 is a block diagram of an electronic device in an embodiment of the present disclosure. Referring to FIG. 8 , according to an embodiment of the present disclosure, an electronic device is further provided, including: a plurality of cores 201 ; and an on-chip network 202 configured to interact Data and external data between the multiple cores 201; one or more of the cores 201 store one or more instructions, and the one or more of the instructions are executed by the one or more of the cores 201 to make One or more of the cores 201 can execute at least one of the following methods: the control method for task scheduling described in the first aspect of the embodiment of the present disclosure; the task scheduling method described in the second aspect of the embodiment of the present disclosure; the present disclosure The task scheduling method described in the third aspect of the embodiment.

According to an embodiment of the present disclosure, a computer program product is also provided. When the computer program product runs on a computer, the computer program product causes the computer to execute at least one of the following methods: A control method; the task scheduling method described in the second aspect of the embodiment of the present disclosure; and the task scheduling method described in the third aspect of the embodiment of the present disclosure.

In order to enable those skilled in the art to more clearly understand the technical solutions provided by the embodiments of the present disclosure, the technical solutions provided by the embodiments of the present disclosure will be described in detail below through specific embodiments:

In some embodiments, when the task allocation of the target core cluster needs to be adjusted, a second core cluster is formed, and the second core cluster processes the tasks processed by the target core cluster without suspending the operation of the target core cluster.

In an example, the process of adjusting the task allocation of the target core cluster includes: determining, by the control core or high-level control core of the target core cluster, a second control core in a second core cluster of the target core cluster; obtaining the target control core by the second control core The task information of the task processed by the core; in the example, the control core of the target core cluster holds the task information of the task processed by the target core cluster, the high-level control core also holds the task information of the task processed by the target core cluster, the second control core The core can apply for task information to the control core of the target core cluster, and can also apply to the high-level control core for task information;

According to the task information, the second control core applies for a target number of idle cores and performs task configuration; the second control core sends request signaling to the control cores of the target core cluster to obtain original data;

After receiving the request signaling, the control core of the target core cluster searches for data from each core in the target core cluster according to the original compilation information, reorganizes the original data, and returns the original data to the second control core; The received raw data is allocated to each core in the second core cluster according to the task configuration information.

The second control core determines the input and output routes, and sends the input and output information of the second core cluster to the control core of the target core cluster; the control core of the target core cluster replaces the input and output information of the second core cluster into the pipeline, and sends the information to the first core cluster. The second core cluster sends start signaling; the second core cluster starts according to the start signaling, and notifies the target core cluster. The target core cluster is disbanded.

In an example, when the second control check performs task configuration on the idle cores in the second core cluster, the following configurations may be performed: allocate memory, determine the information and parameters that should be stored by each idle core; and so on to reduce the computational complexity; configure routing; configure calculation control information such as operation sequence and operation time; configure synchronization management information.

In the example, the task configuration includes the following flow:

Decompose the computing task into several computing steps, and determine the dependencies between the computing steps;

Each computational step is decomposed into several subtasks that can be parallelized according to the required memory and amount of computation. Among them, each subtask can be mapped to an idle core; the memory required by each subtask shall not exceed the memory capacity limit of a single idle core, and the routing transmission volume (input and output) of the subtask shall not exceed the routing transmission bandwidth limit of a single idle core; Preferably, the subtask computation amount of each core is evenly distributed;

Determine the number of idle cores and bandwidth required by the computing task according to the current decomposition of the computing task, and determine whether the current computing resources meet the requirements of the computing task; if not, re-decompose the computing task; More idle cores; if the computing task is decomposed after applying for more idle cores and still does not meet the requirements of the computing task, an error will be reported to the upper-level core; if the current computing resources meet the requirements of the computing task, the decomposition of the computing task will be determined success;

When the decomposition of the computing task is successful, according to the physical location of each idle core, find the optimal layout scheme, determine the subtasks that each idle core needs to execute, and the optimal layout scheme is the layout scheme that minimizes the routing transmission bandwidth;

Calculate routing configuration information according to the optimized layout scheme, and generate configuration information such as operation instructions, address generation logic, control logic and synchronization logic of each idle core;

The configuration information is sent to each idle core through the on-chip network, wherein each idle core will return a configuration completion signaling after receiving all the configuration information.

In some embodiments, when the task allocation of the target core cluster needs to be adjusted, the operation of the target core cluster is suspended, and the cores in the target core cluster are increased or decreased according to the result of the load detection, so that the computing resources of the target core cluster and the computing tasks match. The process of adjusting the task allocation of the target core cluster includes: applying for a new core, and performing task configuration. Among them, when remapping, including a global notification mechanism, for example, valid (invalid) and invalid (valid) to broadcast notification to all cores and PC side.

In the example, when applying for a new core and configuring tasks, the following configurations can be performed: allocate memory to determine the information and parameters that each core should store; configure arithmetic operators to reduce computational complexity by dismantling operations; configure routing; Configure calculation control information such as operation sequence and operation time; configure synchronization management information, etc.

In the example, the task configuration includes the following flow:

Each computational step is decomposed into several subtasks that can be parallelized according to the required memory and amount of computation. Among them, each subtask can be mapped to one core; the memory required by each subtask shall not exceed the memory capacity limit of a single core, and the routing transmission volume (input and output) of the subtask shall not exceed the routing transmission bandwidth limit of a single core; preferably, The calculation amount of subtasks of each core is evenly distributed;

Determine the number of cores and bandwidth required by the computing task according to the current decomposition of the computing task, and determine whether the current computing resources meet the requirements of the computing task; if not, re-decompose the computing task; There are many cores; if the computing tasks are decomposed after applying for more cores and the requirements of the computing tasks are still not satisfied, an error will be reported to the upper-level core; if the current computing resources meet the requirements of the computing tasks, it is determined that the decomposition of the computing tasks is successful;

When the decomposition of the computing task is successful, according to the physical location of each core, find the optimal layout scheme, determine the subtasks that each core needs to perform, and the optimal layout scheme is the layout scheme that minimizes the routing transmission bandwidth;

Calculate routing configuration information according to the optimized layout scheme, and generate configuration information such as operation instructions, address generation logic, control logic and synchronization logic of each core;

The configuration information is sent to each core through the on-chip network, and each core will return a configuration completion signaling after receiving all the configuration information.

Those of ordinary skill in the art can understand that all or some of the steps in the methods disclosed above, functional modules/units in the systems, and devices can be implemented as software, firmware, hardware, and appropriate combinations thereof. In a hardware implementation, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components Components execute cooperatively. Some or all physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit . Such software may be distributed on computer-readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). As known to those of ordinary skill in the art, the term computer storage media includes both volatile and nonvolatile implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules or other data flexible, removable and non-removable media. Computer storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, magnetic tape, magnetic disk storage or other magnetic storage devices, or may Any other medium used to store desired information and which can be accessed by a computer. In addition, communication media typically embodies computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and can include any information delivery media, as is well known to those of ordinary skill in the art .

Example embodiments have been disclosed herein, and although specific terms are employed, they are used and should only be construed in a general descriptive sense and not for purposes of limitation. In some instances, it will be apparent to those skilled in the art that features, characteristics and/or elements described in connection with a particular embodiment may be used alone or in combination with other embodiments, unless expressly stated otherwise. Features and/or elements are used in combination. Accordingly, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the scope of the present disclosure as set forth in the appended claims.

Claims

A control method for task scheduling, applied to the core of a many-core system, the control method comprising:

Perform load detection on at least one core cluster of the many-core system, and determine whether there is a target core cluster in the detected core cluster, and the target core cluster is a core cluster that needs to adjust task allocation in at least one of the core clusters;

In the presence of the target core cluster, controlling the target core cluster to adjust task allocation according to the load detection result;

The many-core system includes a plurality of cores, at least one of the cores forms the core cluster, and the many-core system includes at least one of the core clusters.
The control method according to claim 1, wherein the step of performing load detection on at least one core cluster of the many-core system and judging whether there is a target core cluster comprises:

Judging whether there is an overloaded core cluster or a core cluster with excess computing resources in the detected core clusters;

If there is a core cluster with overload or excess computing resources in the detected core clusters, it is determined that the core cluster with overload or excess computing resources is the target core cluster.
The control method according to claim 2, wherein the step of judging whether there is an overloaded core cluster in at least one of the core clusters of the many-core system comprises:

Judging whether there is a candidate target core cluster within a predetermined time period, the candidate target core cluster is the core cluster with the longest running time in N synchronization cycles, where N is a natural number greater than or equal to a predetermined number;

In the case that the candidate target core cluster exists, it is determined that there is an overloaded core cluster; wherein, the candidate target core cluster is an overloaded core cluster.
The control method according to claim 2, wherein the step of judging whether there is an overloaded core cluster in at least one of the core clusters of the many-core system comprises:

Determine whether there is a core cluster whose running time exceeds the predetermined synchronization period;

In the case that there is a core cluster whose running duration exceeds the predetermined synchronization period, it is determined that there is an overloaded core cluster, wherein the core cluster whose running duration exceeds the predetermined synchronization period is an overloaded core cluster.
The control method according to any one of claims 1 to 4, wherein the step of controlling the target core cluster to adjust task allocation according to the load detection result comprises:

apply for an idle core as the second control core of the second core cluster that replaces the target core cluster;

In response to the request signaling for the second control core to acquire task information of the task processed by the target core cluster, the task information of the task processed by the target core cluster is transmitted to the second control core.
A task scheduling method, applied to the first control core of the first core cluster of a many-core system, the task scheduling method comprising:

Perform load detection on the first core cluster to determine whether the first core cluster is a target core cluster, and the target core cluster is a core cluster that needs to adjust task allocation;

In the case that the first core cluster is the target core cluster, adjusting the task allocation of the first core cluster according to the load detection result of the first core cluster;

The many-core system includes a plurality of cores, at least one of the cores forms the core cluster, the many-core system includes at least one of the core clusters, and the first core cluster is at least one of the core clusters one of the.
The task scheduling method according to claim 6, wherein the step of judging whether the first core cluster is a target core cluster comprises:

Determine whether the first core cluster is overloaded or has excess computing resources;

When the first core cluster is overloaded or has excess computing resources, it is determined that the first core cluster is the target core cluster.
The task scheduling method according to claim 7, wherein the step of judging whether the first core cluster is overloaded comprises:

determining the running duration of multiple synchronization cycles of the first core cluster within a predetermined time period;

According to the running durations of multiple synchronization cycles in the predetermined time period, determine the number of synchronization cycles with the longest running duration of the first core cluster in at least one of the core clusters;

When the number of synchronization cycles with the longest running duration of the first core cluster exceeds a predetermined value, it is determined that the first core cluster is overloaded.
The task scheduling method according to claim 7, wherein the step of judging whether the first core cluster is overloaded comprises:

judging whether the running duration of the first core cluster exceeds a predetermined synchronization period;

When the running time of the first core cluster exceeds the predetermined synchronization period, it is determined that the core cluster is overloaded.
The task scheduling method according to any one of claims 6 to 9, wherein, according to the load detection result of the first core cluster, the step of adjusting the task allocation of the first core cluster comprises:

apply for an idle core as the second control core of the second core cluster that replaces the first core cluster;

In response to the request signaling that the second control core obtains the task information of the task processed by the first core cluster, the task information of the task processed by the first core cluster is transmitted to the second control core;

Acquiring the raw data in response to the request signaling of the second control core to obtain the raw data;

transmitting the raw data to the second control core;

adding the received input and output information of the second core cluster to a pipeline composed of multiple core clusters;

Send start signaling to the second control core.
The task scheduling method according to claim 10, wherein, according to the load detection result of the first core cluster, the step of adjusting the task allocation of the first core cluster further comprises:

After receiving the message that the second core cluster has been started sent by the second control core, the first core cluster is disbanded.
The task scheduling method according to any one of claims 6 to 9, wherein, according to the load detection result of the first core cluster, the step of adjusting the task allocation of the first core cluster comprises:

apply for an idle core as the core to which the first core cluster belongs;

Tasks are reassigned to the cores of the first core cluster.
A task scheduling method is applied to the core of a many-core system, and the task scheduling method includes:

Sending request signaling for obtaining task information of the task processed by the target core cluster to obtain the task information;

According to the acquired task information, a second core cluster for replacing the target core cluster is formed, and the core is the second control core of the second core cluster;

running the task processed by the target core cluster on the second core cluster;

The many-core system includes multiple cores, at least one of the cores constitutes the core cluster, the many-core system includes at least one of the core clusters, and the target core cluster is the core cluster that needs to adjust task allocation .
The task scheduling method according to claim 13, wherein, according to the acquired task information, the step of forming a second core cluster for replacing the target core cluster comprises:

apply for multiple idle cores according to the task information;

performing task configuration on a plurality of the idle cores;

Sending request signaling for obtaining raw data to obtain the raw data;

Allocate the acquired raw data to each of the idle cores according to the task information;

determining the input and output information of the second core cluster;

sending the input and output information of the second core cluster to the control core of the target core cluster;

Wherein, the step of running the task processed by the target core cluster on the second core cluster includes:

In response to the start signaling sent by the control core of the target core cluster, the task processed by the target core cluster is executed on the second core cluster.
The task scheduling method according to claim 13 or 14, wherein after the step of running the task processed by the target core cluster on the second core cluster, the task scheduling method further comprises:

Sending a message that the second core cluster has been started to the control core of the target core cluster to dissolve the target core cluster.
A computer-readable medium having stored thereon a computer program that, when executed by a processor, implements at least one of the following methods:

The control method for task scheduling according to any one of claims 1 to 5;

The task scheduling method according to any one of claims 6 to 12;

The task scheduling method according to any one of claims 13 to 15.
An electronic device comprising:

multiple cores; and

a network-on-chip configured to exchange data among the plurality of cores and external data;

One or more of the cores have one or more instructions stored therein, and the one or more of the instructions are executed by the one or more of the cores to enable the one or more of the cores to perform at least one of the following methods: By:

The control method for task scheduling according to any one of claims 1 to 5;

The task scheduling method according to any one of claims 6 to 12;

The task scheduling method according to any one of claims 13 to 15.
A computer program product that, when run on a computer, causes the computer to perform at least one of the following methods:

The control method for task scheduling according to any one of claims 1 to 5;

The task scheduling method according to any one of claims 6 to 12;

The task scheduling method according to any one of claims 13 to 15.