CN101799838B

CN101799838B - Method for seeking mapping scheme between tasks and nodes of network on chip

Info

Publication number: CN101799838B
Application number: CN2010100281577A
Authority: CN
Inventors: 陈亦欧; 胡剑浩; 凌翔; 符初生
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2010-01-22
Filing date: 2010-01-22
Publication date: 2011-12-28
Anticipated expiration: 2030-01-22
Also published as: CN101799838A

Abstract

The present invention proposes a method for finding a mapping scheme between tasks and nodes of an on-chip network. The process of the method is as follows: initialize a scheme group J _t including K allocation schemes, and then obtain _each On-chip network energy consumption, response time, and fitness value under the scheme, and then perform priority sorting, grouping, switching and other processes according to the size of the appropriate value to finally obtain the optimal scheme. The present invention separates energy consumption and response time as the priority of the assessment scheme In the process of finding the optimal solution, energy consumption and response time are considered at the same time, so that the solution found can minimize energy consumption and system delay at the same time, so that the optimal task and node-to-node mapping scheme can be designed The performance of the network reaches the best, and compared with other methods for finding the priority scheme of mapping between tasks and nodes under multiple objective functions, it has the characteristics of complexity and simple implementation, and the method convergence speed of the present invention has also been confirmed by simulation quick.

Description

Method for finding task-to-node mapping schemes for a network-on-chip

所属领域Field

本专利属于集成电路设计的技术领域，特别针对片上网络的多任务与多节点的映射方案的搜索过程。This patent belongs to the technical field of integrated circuit design, especially for the search process of the multi-task and multi-node mapping scheme of the network on chip.

技术背景 technical background

随着未来通信终端和设备的计算复杂度的增大，实时复杂系统芯片的集成规模需求将迅速增长，一块芯片上可能会集成数十个或成百上千的处理单元，基于多核的信号处理平台成为软件无线电的发展趋势。在这样的综合化系统中，设计可靠、高速、低功耗的高性能芯片内通信体系成为片上系统(System on Chip，SoC)发展的挑战与机遇。传统多核SoC的总线式通信结构无法满足大规模实时复杂数字信号处理过程中数据交换的需求，因此提出了新一代的片上互联与通信机制——片上网络。With the increasing computational complexity of communication terminals and equipment in the future, the demand for integration scale of real-time complex system chips will increase rapidly. Dozens or hundreds of processing units may be integrated on one chip, and signal processing based on multi-core Platform becomes the development trend of software defined radio. In such an integrated system, designing a reliable, high-speed, low-power high-performance on-chip communication system has become a challenge and an opportunity for the development of a System on Chip (SoC). The bus communication structure of the traditional multi-core SoC cannot meet the data exchange requirements in the process of large-scale real-time complex digital signal processing, so a new generation of on-chip interconnection and communication mechanism - on-chip network is proposed.

片上网络(上每个节点也就是处理器)实现的系统主要是对实时性要求较高的复杂通信系统或数字信号处理系统。通信与信号处理具有实时性和处理时延要求，因此，NoC的映射方案务必要使得系统的延时尽量小。另外，片上网络是在集成电路的芯片上实现的，为了保证芯片能够稳定的工作，芯片的温度不能高。芯片的温度是由芯片的能耗决定的，能耗越大，芯片的温度越高。因此，在选择映射方案时，必须选择使得能耗尽量低和时延尽量少的方案。The system implemented by the network on chip (each node on it is also a processor) is mainly a complex communication system or a digital signal processing system that requires high real-time performance. Communication and signal processing have real-time and processing delay requirements. Therefore, the NoC mapping scheme must make the system delay as small as possible. In addition, the network on chip is realized on the chip of the integrated circuit. In order to ensure the stable operation of the chip, the temperature of the chip should not be high. The temperature of the chip is determined by the energy consumption of the chip, the greater the energy consumption, the higher the temperature of the chip. Therefore, when choosing a mapping scheme, it is necessary to choose a scheme that minimizes energy consumption and delay.

通过查新和广泛收集文献资料，我们发现已经公开的用于寻找片上网络上任务与节点间映射方案的方法有如下几类：Through novelty searches and extensive literature collection, we found that the published methods for finding mapping schemes between tasks and nodes on a network-on-chip are as follows:

文献“周干民，尹勇生，胡永华等.基于蚁群优化算法的NoC映射.计算机工程与应用.2005，41(18)：7-10.”采用一种有效地蚁群优化算法实现了片上网络映射，在将任务映射到片上网络节点的同时，尽可能的减少了系统的通讯功耗。该方法的目标函数为功耗，该方法大致如下：Document "Zhou Ganmin, Yin Yongsheng, Hu Yonghua, etc. NoC Mapping Based on Ant Colony Optimization Algorithm. Computer Engineering and Applications. 2005, 41(18): 7-10." Using an effective ant colony optimization algorithm to realize on-chip network mapping , while the tasks are mapped to the network nodes on a chip, the communication power consumption of the system is reduced as much as possible. The objective function of this method is power consumption, and the method is roughly as follows:

(1)根据蚂蚁的个数循环地为每个未分配的节点按照概率分配任务(此概率是通过一种方式计算的将一个节点分配到某个任务的概率)，每循环一次都要使所有节点都分配到相应的任务，然后采用两交换法来进行局部搜索，找出功耗最小的最优解，并用最优解的功耗值信息素衰减度和当前信息素的值来更新最优解对应的蚂蚁的信息素，通过对上述过程多次迭代后，最后选出使功耗最小的分配方案。(1) According to the number of ants, assign tasks to each unassigned node according to the probability (this probability is the probability of assigning a node to a task calculated in a way), and each cycle must make all The nodes are assigned to the corresponding tasks, and then use the two-exchange method to perform local search to find the optimal solution with the minimum power consumption, and update the optimal solution with the power consumption value of the optimal solution pheromone attenuation degree and the current pheromone value. Solve the pheromones of the corresponding ants, and finally select the allocation scheme that minimizes power consumption after iterating the above process for many times.

这种方法获得的映射方案具有较小的通讯功耗，但没有考虑系统的延时性能。The mapping scheme obtained by this method has a small communication power consumption, but does not consider the delay performance of the system.

文献“杨盛光，李丽，高明伦等.面向能耗和延时的NoC映射方法.电子学报.2008，36(5)：937-942”基于二维网格结构NoC平台，建立了基于蚁群算法的旨在优化系统通信能耗和链路负载方差的统一目标函数。该方法的目标函数cost的计算式为：The document "Yang Shengguang, Li Li, Gao Minglun, etc. NoC mapping method for energy consumption and time delay. Acta Electronica Sinica. 2008, 36(5): 937-942" is based on a two-dimensional grid structure NoC platform, and establishes an ant-based A unified objective function of swarm algorithms aimed at optimizing system communication energy consumption and link load variance. The calculation formula of the objective function cost of this method is:

cost＝λ×E+(1-λ)×VARcost=λ×E+(1-λ)×VAR

其中，λ为比例系数，E为能耗，VAR为链路负载方差。该方法大致如下：Among them, λ is the proportional coefficient, E is the energy consumption, and VAR is the link load variance. The method is roughly as follows:

(1)根据蚂蚁的个数循环地为每个未分配的节点按照概率分配任务(概率是通过一种方式计算的将一个节点分配到某个任务的概率)，每循环一次都要使所有节点都分配到相应的任务，然后采用两交换法来进行局部搜索，找出功耗最小解，并用最优解的功耗值信息素衰减度和当前信息素的值来更新最优解对应的蚂蚁的信息素，通过对上述过程多次迭代后，最后选出使目标函数cost最小的分配方案。(1) According to the number of ants, assign tasks to each unassigned node according to the probability (probability is the probability of assigning a node to a task calculated in a way) according to the number of ants, and make all nodes They are all assigned to the corresponding tasks, and then use the two-exchange method to perform local search to find the solution with the minimum power consumption, and update the ant corresponding to the optimal solution with the power consumption value pheromone attenuation degree of the optimal solution and the current pheromone value After multiple iterations of the above process, the allocation scheme that minimizes the cost of the objective function is finally selected.

这种方法没有考虑通信能耗和执行时间是完全不同的物理量，获得的映射方案不一定具有优异的性能。This method does not consider that communication energy consumption and execution time are completely different physical quantities, and the obtained mapping scheme does not necessarily have excellent performance.

文献Tang lei，Shashi Kumar.A two-step genetic algorithm for mapping task graphs to a network onchip architecture.Proceedings of the Euromicro Symposium on Digital System Design 2003.Antalya，Turkey：IEEE，2003：180-187.提出一种采用两步遗传算法实现任务映射问题的方法。该方法大致如下：Literature Tang lei, Shashi Kumar. A two-step genetic algorithm for mapping task graphs to a network onchip architecture. Proceedings of the Euromicro Symposium on Digital System Design 2003. Antalya, Turkey: IEEE, 2003: 180-187. A two-step genetic algorithm approach to task mapping problems. The method is roughly as follows:

(1)首先假设任务流图中所有边的延时相等，为一常数，即平均数；为每个任务分配功能单元(如处理器、存储器等)，使得系统的总延时最小(1) First assume that the delays of all edges in the task flow graph are equal, which is a constant, that is, the average number; assign functional units (such as processors, memories, etc.) to each task to minimize the total delay of the system

(2)使用真实的数据流图的边延时，进一步优化，将功能单元分配到NoC的节点上。(2) Use the edge delay of the real data flow graph to further optimize and assign the functional units to the nodes of the NoC.

该方法将整个优化阶段分成了两个步骤，从而降低了计算的复杂度，缩短了计算时间，但获得的映射方案的能耗和延时不会同时最小。This method divides the entire optimization stage into two steps, thereby reducing the computational complexity and shortening the computational time, but the energy consumption and delay of the obtained mapping scheme will not be minimized at the same time.

文献Wenbiao Zhou，Yan Zhang，Zhigang Mao.Pareto based Multi-objective Mapping IP Coresonto NoC Architectures.IEEE Asia Pacific Conference on Circuits and Systems.2006：331-334.和Giuseppe Ascia，Vincenzo Catania，Maurizio Palesi.Multi-objective Mapping for Mesh-based NoCArchitectures.International Conference on Hardware/Software Dodesign and System Synthesis，2004：182-187.提出采用多目标进化算法将片上网络上任务映射到基于Mesh拓扑的片上网络结构的节点上。该方法的目标函数有两个，一个目标函数是平均数据包跳数(hop)，计算式如下：Literature Wenbiao Zhou, Yan Zhang, Zhigang Mao. Pareto based Multi-objective Mapping IP Coreson to NoC Architectures. IEEE Asia Pacific Conference on Circuits and Systems. 2006: 331-334. and Giuseppe Ascia, Vincenzo Catania, Mapsiobping. Miurizio Paleto for Mesh-based NoCA Architectures. International Conference on Hardware/Software Dodesign and System Synthesis, 2004: 182-187. It is proposed to use a multi-objective evolutionary algorithm to map the tasks on the chip network to the nodes of the network structure on the chip based on the Mesh topology. There are two objective functions of this method, one objective function is the average data packet hop count (hop), and the calculation formula is as follows:

$hop hop = = \frac{11}{\underset{&ForAll; &ForAll; s the s,, d d}{Σ Σ} {v v}_{s the s,, d d}} \underset{&ForAll; &ForAll; s the s,, d d}{Σ Σ} {v v}_{s the s,, d d} \underset{&ForAll; &ForAll; i i,, j j}{Σ Σ} {h h}_{i i,, j j} {p p}_{sidj sidj}$

另一个目标函数是热点温度(T)，计算式为：T＝max(R×P)，其中，R为阻抗转移矩阵，利用K.Skadron，M.R.Stan和W.Huang等人发表在ISCA’03会议论文集中的文章“Temperature-AwareMicro-architecture”中提出的热量模型工具HotSpot得到。该方法大致如下：Another objective function is the hot spot temperature (T), the calculation formula is: T=max(R×P), where R is the impedance transfer matrix, which was published in ISCA'03 by K.Skadron, M.R.Stan and W.Huang et al. The thermal model tool HotSpot proposed in the article "Temperature-AwareMicro-architecture" in the conference proceedings was obtained. The method is roughly as follows:

(1)首先随机选择一些方案构成初始化方案组，根据N.Srinivas and K.Deb发表在EvolutionaryComputation1995年第二卷的“Multi-objective optimization function optimization using non-dominatedsorting genetic algorithms”提出的方法进行非支配的遗传算法的复制、交叉和变异操作生成新的方案组，然后重复上述复制、交叉与变异操作数次，选出使得目标函数最小的方案。(1) First randomly select some schemes to form an initialization scheme group, and perform non-dominated according to the method proposed by N.Srinivas and K.Deb in "Multi-objective optimization function optimization using non-dominated sorting genetic algorithms" published in the second volume of Evolutionary Computation in 1995 The replication, crossover and mutation operations of the genetic algorithm generate a new program group, and then repeat the above replication, crossover and mutation operations several times to select the program that minimizes the objective function.

该方法中采用的非支配排序运算复杂度较高、解的性能不是特别优秀。The non-dominated sorting operation used in this method has high complexity, and the performance of the solution is not particularly good.

发明内容Contents of the invention

本专利为多任务多节点的片上网络提出一种寻找使工作能耗与响应时间同时最小时的任务与节点间映射方案的方法，本方法的特点是：得到的映射方案性能高，收敛速度快，复杂度低。This patent proposes a method for finding a mapping scheme between tasks and nodes that minimizes energy consumption and response time for multi-task and multi-node on-chip networks. The characteristics of this method are: the obtained mapping scheme has high performance and fast convergence speed , with low complexity.

为了便于描述，我们用向量X＝(x₁，x₂，...x_n...，x_N)表示片上网络的N个任务向M个节点映射的一种方案，向量X的第n个位置上的x_n(0≤x_n≤M-1)代表了第n个任务映射到某个节点的节点编号，即x_n表示了将编号为n的任务分配给了第x_n的节点。For the convenience of description, we use vector X=(x ₁ , x ₂ , ... x _n ..., x _N ) to represent a scheme for mapping N tasks to M nodes of the network on chip, and the nth of the vector X The x _n (0≤x _n ≤M-1) in the first position represents the node number of the nth task mapped to a certain node, that is, x _n indicates that the task numbered n is assigned to the _xnth node .

寻找能耗与响应时间同时最小的方案的步骤如下：The steps to find the solution that minimizes energy consumption and response time at the same time are as follows:

1、随机生成一个包含K个分配方案的方案组J_t，并为每个分配方案X＝(x₁，x₂，...，x_N)设定一个交叉的概率p_i和变异的概率q_i，以及总的迭代次数T，并令记录迭代的次数的变量t＝0；1. Randomly generate a scheme group J _t containing K allocation schemes, and set a crossover probability p _i and a mutation probability for each allocation scheme X=(x ₁ , x ₂ ,..., x _N ) q _i , and the total number of iterations T, and make the variable t=0 that records the number of iterations;

2、求出方案组J_t中每种方案下片上网络能耗与响应时间；2. Calculate the energy consumption and response time of the on-chip network under each scheme in the scheme group J _t ;

3、采用Deb的非支配排序法、排除法、庄家法、擂台赛法、递归法、快速排序法或改进的快速排序法等方法之一(2007年科学出版社出版，作者为郑金华的《多目标进化算法及其应用》中介绍)计算出方案组J_t中各方案的适应度值，该适应度值大小反映方案的优先级，值越小优先级越高，所谓优先级别是按能耗与响应时间小为优。3. Adopt one of Deb's non-dominated sorting method, elimination method, banker method, ring competition method, recursive method, quick sorting method or improved quick sorting method (published by Science Press in 2007, the author is Zheng Jinhua's " Introduced in "Multi-objective Evolutionary Algorithm and Its Application") to calculate the fitness value of each scheme in the scheme group J _t , the fitness value reflects the priority of the scheme, the smaller the value, the higher the priority, the so-called priority is based on energy Small power consumption and response time are preferred.

4、将方案组J_t中的K个方案随机分成个方案小组，其中L是每方案小组中的方案个数，按照第3步中确定的优先关系选出每方案小组中最优先的方案组成一个优先方案组Q_t；4. Randomly divide the K schemes in the scheme group J _t into program groups, wherein L is the number of programs in each program group, select the most preferred program in each program group to form a priority program group Q _t according to the priority relationship determined in the 3rd step;

5、将优先方案组Q_t中的方案按随机配对，所述的配对是指按照两个方案一组进行分组，并以步骤1中设定的交叉的概率p_i将每对方案中的两个方案上任意一个相同位置上的节点的编号进行互换，然后再将所有被互换后的每对方案合并起来得到方案组Q_t′；5. The schemes in the priority scheme group Q _t are randomly paired, and the pairing refers to grouping according to two schemes, and with the crossover probability p _i set in step 1, the two schemes in each pair of schemes are divided into groups. Exchange the number of any node at the same position on each scheme, and then combine all the exchanged schemes to obtain the scheme group Q _t ′;

6、以步骤1中设定的变异概率q_i改变方案组Q_t′中各个方案中在任意一个位置上的节点的编号，得到方案组G_t 6. Use the mutation probability q _i set in step 1 to change the number of nodes at any position in each scheme in the scheme group Q _t ′ to obtain the scheme group G _t

7、按第3步计算出来的适应度值从小到大的顺序，从方案组J_t中选择

个方案，与方案组G_t的方案合并在一起，构成包含K个方案的新的方案组G_t′7. According to the order of the fitness value calculated in step 3 from small to large, select from the program group J _t

schemes, combined with the schemes of scheme group G _t to form a new scheme group G _t ′ containing K schemes

8、将方案组J_t和G_t合并成方案组R_t 8. Merge scheme groups J _t and G _t into scheme group R _t

9、计算方案组R_t中每种方案下片上网络能耗与响应时间；9. Calculate the energy consumption and response time of the on-chip network under each scheme in the scheme group R _t ;

10、采用Deb在2002年发表在杂志IEEE Transactions on Evolutionary Computation第六卷上的文章A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimization：NSGA-II中介绍的边界集产生法将方案组R_t中的2K个方案分成多个小组，每个小组代表一个边界集，对小组编号，编号较小的小组里的方案比编号较大的小组的方案的优先级高；所谓优先级别是按能耗与响应时间小为优。10. Using Deb's article A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimization published in the sixth volume of the magazine IEEE Transactions on Evolutionary Computation in 2002: The boundary set generation method introduced in NSGA-II will scheme group R The 2K schemes in _t are divided into multiple groups, each group represents a boundary set, and the groups are numbered, and the schemes in the smaller numbered group have higher priority than the schemes in the larger numbered group; the so-called priority level is based on energy Small power consumption and response time are preferred.

11、初始化一个新的没有方案的方案组F_t，然后从第10步骤分好的小组中，按小组的编号从大到小的顺序，依次将小组内的方案加入到F_t中，直到方案组F_t里的方案个数超过K个，然后将最后加入的小组的方案全部取出，并将被最后加入的小组记为V；11. Initialize a new program group F _t without a program, and then add the programs in the group to F _t in sequence from the groups divided in step 10 in descending order until the program The number of plans in the group F _t exceeds K, and then all the plans of the last joined group are taken out, and the last joined group is recorded as V;

12、确定第10步中每个小组中各方案的优先次序，具体方法如下：12. Prioritize the programs in each group in step 10 by:

12-1令小组中每个方案的距离值为0，所述的距离值表示该方案与其它方案的联系是否紧密。12-1 Let the distance value of each scheme in the group be 0, and the distance value indicates whether the scheme is closely related to other schemes.

12-2根据第9步计算出的各个方案的能耗的大小对方案进行倒序排列，能耗较小的方案排在能耗较大的方案的前面，12-2 Arrange the schemes in reverse order according to the energy consumption of each scheme calculated in step 9, and the schemes with less energy consumption are ranked in front of the schemes with larger energy consumption.

12-3计算各方案的能耗距离，每个方案的能耗距离为排在它后面的那个方案的能耗减去排在它前面的那个方案的能耗所得到的值，12-3 Calculate the energy consumption distance of each scheme. The energy consumption distance of each scheme is the value obtained by subtracting the energy consumption of the scheme in front of it from the energy consumption of the scheme behind it.

12-4根据第9步计算出的各个方案的响应时间大小对方案进行倒序排列，响应时间较小的方案排在响应时间较大的方案的前面，12-4 Arrange the schemes in reverse order according to the response time of each scheme calculated in step 9, and the scheme with a smaller response time is ranked in front of the scheme with a larger response time.

12-5计算各方案的响应时间距离，每个方案的响应时间距离为排在它后面的那个方案的响应时间减去排在它前面的那个方案的响应时间所得到的值，12-5 Calculate the response time distance of each scheme, the response time distance of each scheme is the value obtained by subtracting the response time of the scheme ahead of it from the response time of the scheme behind it,

12-6将各个方案的响应时间距离和能耗距离相加得到方案的距离值，12-6 Add the response time distance and energy consumption distance of each scheme to get the distance value of the scheme,

12-7把能耗最小的方案和响应时间最小的方案排在最前面，然后根据距离大小对剩余的方案进行排序，并排在能耗最小的方案和响应时间最小的方案的后面，距离值较大的方案排在距离值较小的方案的前面，这样排在前面的方案的优先级比排在后面的方案的优先级高，12-7 Put the scheme with the least energy consumption and the scheme with the shortest response time at the top, and then sort the remaining schemes according to the distance, and arrange them behind the scheme with the least energy consumption and the scheme with the shortest response time. The larger scheme is ranked in front of the scheme with a smaller distance value, so that the priority of the scheme in the front is higher than the priority of the scheme in the back,

13、从经过了第12步排序后的小组V里按优先级从高到低的顺序选择方案加入到方案组F_t，直到方案组F_t里的方案个数为K个。13. From the group V sorted in the 12th step, select the schemes in order of priority from high to low and add them to the scheme group F _t until the number of schemes in the scheme group F _t is K.

14、将方案组F_t中的K个方案随机分成

个方案小组，其中L是每方案小组中的方案个数，按照第10步和第12步联合确定的优先关系选出每方案小组中最优先的方案组成一个优先方案组F_t′；14. Randomly divide the K schemes in the scheme group F _t into

program groups, where L is the number of programs in each program group, select the most preferred program in each program group to form a priority program group F _t ' according to the priority relationship jointly determined in the 10th step and the 12th step;

15、将优先方案组F_t′中的方案再随机配对，以步骤1中设定的交叉的概率p_i将每对方案中的两个方案上任意一个相同位置上的节点的编号进行互换，然后再将所有被互换后的每对方案合并起来得到方案组H_t 15. Randomly pair the schemes in the priority scheme group F _t ', and exchange the number of any node at the same position on the two schemes in each pair of schemes with the crossover probability p _i set in step 1 , and then combine all the swapped schemes to get the scheme group H _t

16、以步骤1中设定的变异的概率q_i改变方案组H_t中各个方案中在任意一个位置上的节点的编号，得到方案组H_t′16. Change the number of nodes at any position in each scheme in the scheme group H _t with the variation probability q _i set in step 1, and obtain the scheme group H _t ′

17、根据第10步和第12步联合确定的方案组F_t中方案的优先次序，按照优先级从高到低的顺序，从方案组F_t中选择

个方案，与方案组H_t′的方案合并在一起，构成包含K个方案的新的方案组J_t+1 17. According to the priority order of the programs in the program group F _t jointly determined in the 10th and 12th steps, select from the program group F _t in the order of priority from high to low

schemes, combined with the schemes of the scheme group H _t ′ to form a new scheme group J _t+1 containing K schemes

18、如果t＜T，则t＝t+1，然后返回到步骤2，否则进入步骤1918. If t<T, then t=t+1, then return to step 2, otherwise go to step 19

19、按照步骤3所述的方法计算出方案组J_T中各方案的适应度值，将适应度值最小的方案，即能耗和响应时间同时最小的方案作为片上网络的任务到节点间的映射方案。19. Calculate the fitness value of each scheme in the scheme group J _T according to the method described in step 3, and use the scheme with the smallest fitness value, that is, the scheme with the smallest energy consumption and response time at the same time, as the task of the network on chip to communicate between nodes. Mapping scheme.

本发明所述的能耗的可以为：The energy consumption of the present invention can be:

E＝E_PE+E_Net E＝E _PE +E _Net

其中，E为能耗，E_PE为片上网络上所有处理节点的能耗之和，E_Net为片上网络上连接各个处理节点的通信网络的能耗。通信网络的能耗E_Net可用Terry Tao Ye，Luca Benini，Giovanni DeMicheli发表在DAC’02.年的会议论文集里的”Analysis of Power Consumption on Switch Fabrics inNetwork Routers”一文中提出的能耗计算方法或J.Hu和R.Marculescu发表在IEEE Transaction onComputer-Aided Design of Integrated Circuits and Systems的2005年第24卷的文章“Energy-andperformance-aware mapping for regular NoC architectures”中提出的方法计算。Among them, E is energy consumption, E _PE is the sum of energy consumption of all processing nodes on the network-on-chip, and E _Net is the energy consumption of the communication network connecting each processing node on the network-on-chip. The energy consumption E _Net of the communication network can use the energy consumption calculation method proposed in the paper "Analysis of Power Consumption on Switch Fabrics in Network Routers" published by Terry Tao Ye, Luca Benini, and Giovanni DeMicheli in the conference proceedings of DAC'02. J. Hu and R. Marculescu published in the article "Energy-and performance-aware mapping for regular NoC architectures" in IEEE Transaction on Computer-Aided Design of Integrated Circuits and Systems, Vol. 24, 2005.

本发明所述的响应时间(数据从输入到输出经历的最长的延时)的计算式可以为：The calculation formula of the response time of the present invention (the longest delay experienced by data from input to output) can be:

D＝P+TR+TWD=P+TR+TW

其中，D为响应时间，P为输入到输出路径上所有节点的处理时间之和，TR为输入到输出路径上所有链路的延时之和，TW为输入到输出路径上数据包的排队与等待时间之和。Among them, D is the response time, P is the sum of the processing time of all nodes on the input-to-output path, TR is the sum of the delays of all links on the input-to-output path, and TW is the queuing and sum of data packets on the input-to-output path sum of waiting times.

发明的实质与效果The Essence and Effect of the Invention

本发明通过将能耗和响应时间分开作为考核方案的优先指标，但在寻找最优方案过程中又同时考虑能耗和响应时间，做到了寻找出来的方案使能耗和系统延时同时最小，从而使采用此优选出来的方案设计出来的网络的性能达到最好，并且与其它对多个目标函数下寻找任务与节点间映射的优先方案的方法比较，具有复杂度，实现简单的特点，通过仿真还可以得出本发明的方法收敛速度快。The present invention separates energy consumption and response time as the priority index of the assessment scheme, but considers energy consumption and response time at the same time in the process of finding the optimal scheme, so that the found scheme minimizes energy consumption and system delay at the same time, Therefore, the performance of the network designed by using this optimized scheme is the best, and compared with other methods for finding the priority scheme for mapping between tasks and nodes under multiple objective functions, it has the characteristics of complexity and simplicity, through It can also be concluded from the simulation that the method of the present invention has a fast convergence speed.

附图说明 Description of drawings

图1是有6个任务组成的任务图。Figure 1 is a task diagram composed of six tasks.

图2是由16个节点组成的片上网络。Figure 2 is an on-chip network composed of 16 nodes.

1是任务，2是任务之间的业务量，3是片上网络的节点，4是片上网络的路由器。1 is a task, 2 is the traffic between tasks, 3 is a node of the network on chip, and 4 is a router of the network on chip.

具体实施方式 Detailed ways

图1示出的任务图中有6个任务，图2示出的片上网络有16个节点。本实施例就是为了寻找一种将图1所示的6个任务分配到图2所示的片上网络的16个节点上、使得映射之后片上网络的能耗和响应时间同时达到最小值的映射方案。为了便于描述，我们用向量X＝(x₁，x₂，x₃，x₄，x₅，x₆)表示片上网络的6个任务向16个节点映射的一种方案，向量X的第n个位置上的x_n(0≤x_n≤15)代表了第n个任务映射到某个节点的节点编号，即x_n表示了将编号为n的任务分配给了第x_n的节点。There are 6 tasks in the task graph shown in FIG. 1 , and there are 16 nodes in the network on chip shown in FIG. 2 . This embodiment is to find a mapping scheme that distributes the 6 tasks shown in Figure 1 to the 16 nodes of the on-chip network shown in Figure 2, so that the energy consumption and response time of the on-chip network after mapping can reach the minimum value . For the convenience of description, we use vector X=(x ₁ , x ₂ , x ₃ , x ₄ , x ₅ , x ₆ ) to represent a scheme for mapping 6 tasks to 16 nodes in the network on chip, and the nth of the vector X The x _n (0≤x _n ≤15) in the first position represents the node number that the nth task is mapped to a certain node, that is, x _n indicates that the task numbered _n is assigned to the xnth node.

1、随机生成一个包含20个分配方案的方案组J_t，方案组中的各个分配方案用向量表示为：(11，1，15，9，0，15)，(4，1，0，8，1，6)，(7，5，15，8，7，8)，(10，11，5，0，6，8)，(0，6，7，6，15，9)，(14，11，11，0，8，5)，(15，5，14，0，15，1)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(3，13，8，15，5，8)，(8，8，9，12，10，2)，(14，0，3，2，15，5)，(5，2，4，9，0，8)，(7，2，15，11，1，11)，(7，1，15，2，1，11)，(9，12，3，9，10，11)，(11，3，12，14，14，6)，(11，12，4，7，1，9)，(4，8，12，12，9，4)，(1，15，6，9，3，9)。设定各方案的交叉的概率为0.4，变异的概率为0.01，总的迭代次数为100，并且记录迭代的次数的变量t＝0；1. Randomly generate a scheme group J _t containing 20 allocation schemes. Each allocation scheme in the scheme group is represented by a vector: (11, 1, 15, 9, 0, 15), (4, 1, 0, 8 , 1, 6), (7, 5, 15, 8, 7, 8), (10, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), (14 , 11, 11, 0, 8, 5), (15, 5, 14, 0, 15, 1), (3, 0, 8, 7, 1, 9), (2, 2, 1, 8, 2 , 8), (3, 13, 8, 15, 5, 8), (8, 8, 9, 12, 10, 2), (14, 0, 3, 2, 15, 5), (5, 2 , 4, 9, 0, 8), (7, 2, 15, 11, 1, 11), (7, 1, 15, 2, 1, 11), (9, 12, 3, 9, 10, 11 ), (11, 3, 12, 14, 14, 6), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 4), (1, 15, 6 , 9, 3, 9). The probability of crossover of each scheme is set as 0.4, the probability of variation is 0.01, the total number of iterations is 100, and the variable t=0 for recording the number of iterations;

2、求出方案组J_t中每种方案下片上网络的能耗和响应时间：2. Calculate the energy consumption and response time of the on-chip network under each scheme in the scheme group J _t :

2-1、能耗(E)的计算公式采用：E＝E_PE+E_Net，E_PE为片上网络上所有处理节点的能耗之和，E_Net为片上网络上连接各个处理节点的通信网络的能耗。网络上所有处理节点的能耗等于任务图中所有任务的处理时间，各个任务的处理时间可以通过对各个任务的运算量估计得到，即确定任务图的同时，确定任务的能耗。本实例中，对于各种不同的方案，E_PE的值恒为1.18。采用Terry TaoYe，Luca Benini，Giovanni De Micheli发表在DAC′02.年的会议论文集里的″Analysis of PowerConsumption on Switch Fabrics in Network Routers″一文中提出的能耗计算方法计算出各种方案下通信网络的能耗E_Net，分别为：54.69，38.09，61.69，45.84，43.84，46.39，75.19，41.84，46.99，48.59，33.09，55.24，38.14，43.89，50.09，40.94，44.64，43.89，29.44，53.74。2-1. The calculation formula of energy consumption (E) is: E=E _PE +E _Net , E _PE is the sum of energy consumption of all processing nodes on the network on chip, and E _Net is the communication network connecting each processing node on the network on chip energy consumption. The energy consumption of all processing nodes on the network is equal to the processing time of all tasks in the task graph, and the processing time of each task can be obtained by estimating the calculation amount of each task, that is, while determining the task graph, determine the energy consumption of the task. In this example, the value of E _PE is constant at 1.18 for various schemes. Using the energy consumption calculation method proposed in the article "Analysis of Power Consumption on Switch Fabrics in Network Routers" published by Terry TaoYe, Luca Benini, and Giovanni De Micheli in the conference proceedings of DAC'02. to calculate the communication network under various schemes The energy consumption E _Net is: 54.69, 38.09, 61.69, 45.84, 43.84, 46.39, 75.19, 41.84, 46.99, 48.59, 33.09, 55.24, 38.14, 43.89, 50.09, 40.94, 44.64, 43.89, 29.44, 5

2-2、响应时间(D)的计算式采用：D＝P+TR+TW，P为输入到输出路径上所有节点的处理时间之和，TR为输入到输出路径上所有链路的延时之和，TW为输入到输出路径上数据包的排队与等待时间之和。本实例中，各方案的响应时间分别为：0.16，0.1601，0.1601，0.1601，0.1601，0.1601，0.1602，0.1601，0.1601，0.1601，0.1601，0.1601，0.16，0.16，0.1601，0.1601，0.1601，0.1601，0.16，0.1601。2-2. The calculation formula of response time (D) adopts: D=P+TR+TW, P is the sum of the processing time of all nodes on the input to output path, and TR is the delay of all links on the input to output path The sum of TW is the sum of the queuing and waiting time of data packets on the input to output path. In this example, the response times of each scheme are: 0.16, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.1602, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.16, 0.16, 0.1601, 0.1601, 0.1601, 0 0.1601.

3、采用Deb的非支配排序法(2007年科学出版社出版，作者为郑金华的《多目标进化算法及其应用》中介绍)计算出方案组J_t中各方案的适应度值，为：0.7895，1.6316，0.1579，1，1.1579，0.1579，0，1.3684，0.4737，0.4737，1.6316，0.4737，1.8947，1.6316，0.4737，1.6316，1，1.2632，2，0.7895，该值大小反映方案的优先级，值越小优先级越高。3. Using Deb's non-dominated sorting method (published by Science Press in 2007, authored by Zheng Jinhua's "Multi-objective Evolutionary Algorithm and Its Application") to calculate the fitness value of each program in the program group J _t , which is: 0.7895, 1.6316, 0.1579, 1, 1.1579, 0.1579, 0, 1.3684, 0.4737, 0.4737, 1.6316, 0.4737, 1.8947, 1.6316, 0.4737, 1.6316, 1, 1.2632, 2, 0.7895, the value reflects the priority of the scheme The smaller the higher the priority.

4、将方案组J_t中的20个方案随机分成10个方案小组，每方案小组中有2个方案，按照第3步中确定的优先关系选出每方案小组最优先的方案，然后将从所有方案小组选出的最优先的一个方案组成一个新的方案组Q_t，新的方案组Q_t中的方案为：(4，1，0，8，1，6)，(10，11，5，0，6，8)，(0，6，7，6，15，9)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(8，8，9，12，10，2)，(5，2，4，9，0，8)，(9，12，3，9，10，11)，(11，12，4，7，1，9)，(4，8，12，12，9，3)4. Divide the 20 schemes in the scheme group J _t into 10 scheme groups at random, each scheme group has 2 schemes, select the scheme with the highest priority in each scheme group according to the priority relationship determined in step 3, and then select from The most preferred scheme selected by all scheme groups forms a new scheme group Q _t , and the schemes in the new scheme group Q _t are: (4, 1, 0, 8, 1, 6), (10, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), (3, 0, 8, 7, 1, 9), (2, 2, 1, 8, 2, 8) , (8, 8, 9, 12, 10, 2), (5, 2, 4, 9, 0, 8), (9, 12, 3, 9, 10, 11), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 3)

5、将方案组Q_t中的方案按随机配对，所述的配对是指按照两个方案一组进行分组，并以步骤1中设定的交叉的概率0.4对每对方案间在随机位置上的节点的编号进行互换，然后再将所有被交换后的每对方案合并起来得到方案组Q_t′，方案组Q_t′中的方案为：(10，1，0，8，1，6)，(4，11，5，0，6，8)，(0，6，7，6，15，9)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(8，8，9，12，10，2)，(5，2，4，9，0，8)，(9，12，3，9，10，11)，(11，12，4，7，1，9)，(4，8，12，12，9，3)5. The schemes in the scheme group Q _t are randomly paired, and the pairing refers to grouping according to two schemes, and with the crossover probability 0.4 set in step 1, pair each pair of schemes at random positions exchange the numbers of the nodes, and then combine all the exchanged schemes to obtain the scheme group Q _t ′. The schemes in the scheme group Q _t ′ are: (10, 1, 0, 8, 1, 6 ), (4, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), (3, 0, 8, 7, 1, 9), (2, 2, 1 , 8, 2, 8), (8, 8, 9, 12, 10, 2), (5, 2, 4, 9, 0, 8), (9, 12, 3, 9, 10, 11), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 3)

6、以步骤1中设定的变异概率0.01改变方案组Q_t′中各个方案中在随机位置上的节点的编号，得到方案组G_t，方案组G_t中的方案为：(10，1，0，8，1，6)，(4，11，5，0，6，8)，(0，6，7，6，15，9)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(8，8，9，12，10，2)，(5，2，4，9，0，8)，(9，12，3，9，10，11)，(11，12，4，7，1，9)，(4，8，12，12，9，3)6. With the variation probability 0.01 set in step 1, change the numbering of nodes at random positions in each scheme in the scheme group Q _t ′ to obtain the scheme group G _t , and the schemes in the scheme group G _t are: (10, 1 , 0, 8, 1, 6), (4, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), (3, 0, 8, 7, 1, 9 ), (2, 2, 1, 8, 2, 8), (8, 8, 9, 12, 10, 2), (5, 2, 4, 9, 0, 8), (9, 12, 3 , 9, 10, 11), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 3)

7、按适应度从小到大的顺序，从方案组J_t中选择10个方案，与方案组G_t的方案合并在一起，构成包含20个方案的新的方案组G_t′7. According to the order of fitness from small to large, select 10 schemes from the scheme group J _t and merge them with the schemes of scheme group G _t to form a new scheme group G _t ′ containing 20 schemes

8、将方案组J_t和G_t′合并成包含40个方案的方案组R_t，即：(11，1，15，9，0，15)，(4，1，0，8，1，6)，(7，5，15，8，7，8)，(10，11，5，0，6，8)，(0，6，7，6，15，9)，(14，11，11，0，8，5)，(15，5，14，0，15，1)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(3，1，3，8，15，5，8)，(8，8，9，12，10，2)，(14，0，3，2，15，5)，(5，2，4，9，0，8)，(7，2，15，11，1，11)，(7，1，15，2，1，11)，(9，12，3，9，10，11)，(11，3，12，14，14，6)，(11，12，4，7，1，9)，(4，8，12，12，9，4)，(1，15，6，9，3，9)，(4，1，0，8，1，6)，(10，11，5，0，6，8)，(0，6，7，6，15，9)，(3，0，8，7，1，9)，(2，2，1，8，2，8)，(8，8，9，12，10，2)，(5，2，4，9，0，8)，(9，12，3，9，10，11)，(11，12，4，7，1，9)，(4，8，12，12，9，3)，(11，3，12，14，14，6)，(0，6，7，6，15，9)，(11，12，4，7，1，9)，(3，0，8，7，1，9)，(4，1，0，8，1，6)，(8，8，9，12，10，2)，(7，2，15，11，1，11)，(9，12，3，9，10，11)，(5，2，4，9，0，8)，(4，8，12，12，9，4)8. Merge scheme groups J _t and G _t ' into scheme group R _t containing 40 schemes, namely: (11, 1, 15, 9, 0, 15), (4, 1, 0, 8, 1, 6), (7, 5, 15, 8, 7, 8), (10, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), (14, 11, 11, 0, 8, 5), (15, 5, 14, 0, 15, 1), (3, 0, 8, 7, 1, 9), (2, 2, 1, 8, 2, 8) , (3, 1, 3, 8, 15, 5, 8), (8, 8, 9, 12, 10, 2), (14, 0, 3, 2, 15, 5), (5, 2, 4, 9, 0, 8), (7, 2, 15, 11, 1, 11), (7, 1, 15, 2, 1, 11), (9, 12, 3, 9, 10, 11) , (11, 3, 12, 14, 14, 6), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 4), (1, 15, 6, 9, 3, 9), (4, 1, 0, 8, 1, 6), (10, 11, 5, 0, 6, 8), (0, 6, 7, 6, 15, 9), ( 3, 0, 8, 7, 1, 9), (2, 2, 1, 8, 2, 8), (8, 8, 9, 12, 10, 2), (5, 2, 4, 9, 0, 8), (9, 12, 3, 9, 10, 11), (11, 12, 4, 7, 1, 9), (4, 8, 12, 12, 9, 3), (11, 3, 12, 14, 14, 6), (0, 6, 7, 6, 15, 9), (11, 12, 4, 7, 1, 9), (3, 0, 8, 7, 1, 9), (4, 1, 0, 8, 1, 6), (8, 8, 9, 12, 10, 2), (7, 2, 15, 11, 1, 11), (9, 12, 3, 9, 10, 11), (5, 2, 4, 9, 0, 8), (4, 8, 12, 12, 9, 4)

9、计算方案组R_t中每种方案下片上网络能耗与响应时间，各方案的能耗分别为：54.69，38.09，61.69，45.84，43.84，46.39，75.19，41.84，46.99，48.59，33.09，55.24，38.14，43.89，50.09，40.94，44.64，43.89，29.44，53.74，38.09，45.84，43.84，41.84，46.99，33.09，38.14，40.94，43.89，40.84，44.64，43.84，43.89，41.84，38.09，33.09，43.89，40.94，38.14，29.44；各方案的响应时间分别为：9. Calculate the energy consumption and response time of the on-chip network under each scheme in the scheme group R _t . The energy consumption of each scheme is: 54.69, 38.09, 61.69, 45.84, 43.84, 46.39, 75.19, 41.84, 46.99, 48.59, 33.09, 55.24，38.14，43.89，50.09，40.94，44.64，43.89，29.44，53.74，38.09，45.84，43.84，41.84，46.99，33.09，38.14，40.94，43.89，40.84，44.64，43.84，43.89，41.84，38.09，33.09， 43.89, 40.94, 38.14, 29.44; the response time of each scheme is:

10、0.16，0.1601，0.1601，0.1601，0.1601，0.1601，0.1602，0.1601，0.1601，0.1601，0.1601，0.1601，0.16，0.16，0.1601，0.1601，0.1601，0.1601，0.16，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.16，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.1601，0.16，0.1601，0.16，0.16。10、0.16，0.1601，0.1601，0.1601，0.1601，0.1601，0.1602，0.1601，0.1601，0.1601，0.1601，0.1601，0.16，0.16，0.1601，0.1601，0.1601，0.1601，0.16，0.1601，0.1601，0.1601，0.1601，0.1601， 0.1601, 0.1601, 0.16, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.1601, 0.16, 0.1601, 0.16, 0.16.

11、采用Deb在2002年发表在杂志IEEE Transactions on Evolutionary Computation第六卷上的文章A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objectiveOptimization：NSGA-II中介绍的边界集产生法将方案组R_t中的40个方案分成13个小组，每个小组代表一个边界集，对小组编号，编号较小的小组里的方案比编号较大的小组的方案的优先级高；所谓优先级别是按能耗与响应时间小为优。按上述方法得到的小组1里的方案有方案19，方案40，小组2里面的方案有方案13，方案27，方案39，小组3里的方案有方案2，方案11，方案21，方案26，方案35，方案36，小组4里面的方案有方案16，方案28，方案38，小组5里的方案有方案8，方案24，方案34，小组6里的方案有方案14，方案37，小组7里面的方案有方案18，方案29，方案33，小组8里的方案有方案5，方案23，方案32，小组9里面的方案有方案4，方案17，方案22，方案30，方案31，小组10里的方案有方案1，方案20，小组11里面的方案有方案9，方案10，方案12，方案15，方案25，小组12里的方案有方案3，方案6，小组13里的方案有方案7。11. Using Deb's article A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimization published in Volume 6 of the journal IEEE Transactions on Evolutionary Computation in 2002: The boundary set generation method introduced in NSGA-II to divide the scheme group R _t The 40 schemes in the group are divided into 13 groups, each group represents a boundary set, and the groups are numbered, and the schemes in the smaller numbered group have higher priority than the schemes in the larger numbered group; the so-called priority level is based on energy consumption It is better to have a small response time. The schemes in Group 1 obtained by the above method include Scheme 19, Scheme 40, the schemes in Group 2 include Scheme 13, Scheme 27, and Scheme 39, and the schemes in Group 3 include Scheme 2, Scheme 11, Scheme 21, and Scheme 26. Scheme 35, Scheme 36, the schemes in Group 4 include Scheme 16, Scheme 28, and Scheme 38, the schemes in Group 5 include Scheme 8, Scheme 24, and Scheme 34, and the schemes in Group 6 include Scheme 14, Scheme 37, and Group 7 The plans in it include plan 18, plan 29, and plan 33. The plans in group 8 include plan 5, plan 23, and plan 32. The plans in group 9 include plan 4, plan 17, plan 22, plan 30, plan 31, and the group The plans in group 10 include plan 1 and plan 20, the plans in group 11 include plan 9, plan 10, plan 12, plan 15, and plan 25, the plans in group 12 include plan 3, plan 6, and the plans in group 13 include Option 7.

12、初始化一个新的没有方案的方案组F_t，然后从第10步骤分好的小组中，按小组的编号从小到大的顺序，从小组1到小组13，依次将小组内的方案加入到F_t中，直到方案组F_t里的方案个数超过20个，此时将最后加入的小组(即小组7)的方案全部取出。12. Initialize a new program group F _t without a program, and then from the groups divided in step 10, according to the order of group numbers from small to large, from group 1 to group 13, add the plans in the group to In F _t , until the number of schemes in the scheme group F _t exceeds 20, all the schemes of the last joined group (namely group 7) are taken out at this time.

13、确定第10步中得到的每个小组中各方案的优先次序，具体方法如下：13. Determine the priority of each program in each group obtained in step 10. The specific method is as follows:

12-4根据第9步计算出的各个方案的响应时间大小对方案进行倒序排列，响应时间较小的方案排在响应时间较大的方案的前面12-4 Arrange the schemes in reverse order according to the response time of each scheme calculated in step 9, and the scheme with a smaller response time is ranked in front of the scheme with a larger response time

12-7把能耗最小的方案和响应时间最小的方案排在最前面，然后根据距离大小对其他方案进行排序，并排在能耗最小的方案和响应时间最小的方案的后面，距离值较大的方案排在距离值较小的方案的前面，这样排在前面的方案的优先级比排在后面的方案的优先级高12-7 Put the scheme with the least energy consumption and the scheme with the shortest response time at the top, and then sort the other schemes according to the distance, and rank them behind the scheme with the least energy consumption and the scheme with the shortest response time, and the distance value is larger The scheme of the distance value is ranked in front of the scheme with a smaller distance value, so that the priority of the scheme in the front is higher than the priority of the scheme in the back

14、从第12步骤排序后的小组里按优先级高低选择方案加入方案组F_t，直到方案组F_t里的方案个数为20个，完成后，方案组F_t中的方案为：(4，8，12，12，9，4)，(4，8，12，12，9，4)，(5，2，4，9，0，8)，(5，2，4，9，0，8)，(5，2，4，9，0，8)，(4，1，0，8，1，6)，(8，8，9，12，10，2)，(4，1，0，8，1，6)，(8，8，9，12，10，2)，(4，1，0，8，1，6)，(8，8，9，12，10，2)，(9，12，3，9，10，11)，(9，12，3，9，10，11)，(9，12，3，9，10，11)，(3，0，8，7，1，9)，(3，0，8，7，1，9)，(3，0，8，7，1，9)，(7，2，15，11，1，11)，(7，2，15，11，1，11)，(11，12，4，7，1，9)14. From the group sorted in the 12th step, select the scheme according to the priority level and join the scheme group F _t until the number of schemes in the scheme group F _t is 20. After completion, the schemes in the scheme group F _t are: ( 4, 8, 12, 12, 9, 4), (4, 8, 12, 12, 9, 4), (5, 2, 4, 9, 0, 8), (5, 2, 4, 9, 0, 8), (5, 2, 4, 9, 0, 8), (4, 1, 0, 8, 1, 6), (8, 8, 9, 12, 10, 2), (4, 1, 0, 8, 1, 6), (8, 8, 9, 12, 10, 2), (4, 1, 0, 8, 1, 6), (8, 8, 9, 12, 10, 2), (9, 12, 3, 9, 10, 11), (9, 12, 3, 9, 10, 11), (9, 12, 3, 9, 10, 11), (3, 0, 8, 7, 1, 9), (3, 0, 8, 7, 1, 9), (3, 0, 8, 7, 1, 9), (7, 2, 15, 11, 1, 11) , (7, 2, 15, 11, 1, 11), (11, 12, 4, 7, 1, 9)

15、将方案组F_t中的20个方案随机分成10个方案小组，每方案小组中有2个方案，按照第10和12步联合确定的优先关系选出每方案小组最优先的方案，然后将从所有方案小组选出的最优先的一个方案组成一个新的方案组F_t′，方案组F_t′中的方案包括(4，8，12，12，9，4)，(5，2，4，9，0，8)，(5，2，4，9，0，8)，(8，8，9，12，10，2)，(8，8，9，12，10，2)，(8，8，9，12，10，2)，(9，12，3，9，10，11)，(3，0，8，7，1，9)，(3，0，8，7，1，9)，(7，2，15，11，1，11)15. The 20 programs in the program group F _t are randomly divided into 10 program groups, each program group has 2 programs, and the most preferred program of each program group is selected according to the priority relationship jointly determined in steps 10 and 12, and then A new scheme group F _t ′ will be formed from the most preferred scheme selected from all scheme groups. The schemes in the scheme group F _t ′ include (4, 8, 12, 12, 9, 4), (5, 2 , 4, 9, 0, 8), (5, 2, 4, 9, 0, 8), (8, 8, 9, 12, 10, 2), (8, 8, 9, 12, 10, 2 ), (8, 8, 9, 12, 10, 2), (9, 12, 3, 9, 10, 11), (3, 0, 8, 7, 1, 9), (3, 0, 8 , 7, 1, 9), (7, 2, 15, 11, 1, 11)

16、将方案组F_t′中的方案再随机配对，以步骤1中设定的交叉的概率0.4对每对方案间在随机位置上的节点的编号进行互换，然后再将所有对合并起来得到方案组H_t，即：(4，8，12，12，9，4)，(5，2，4，9，0，8)，(8，2，4，9，0，8)，(5，8，9，12，10，2)，(8，8，9，12，10，2)，(8，8，9，12，10，2)，(9，12，3，9，10，11)，(3，0，8，7，1，9)，(3，0，8，7，1，9)，(7，2，15，11，1，11)16. Randomly pair the schemes in the scheme group F _t ′, exchange the numbers of nodes at random positions between each pair of schemes with the crossover probability 0.4 set in step 1, and then merge all the pairs The scheme group H _t is obtained, namely: (4, 8, 12, 12, 9, 4), (5, 2, 4, 9, 0, 8), (8, 2, 4, 9, 0, 8), (5, 8, 9, 12, 10, 2), (8, 8, 9, 12, 10, 2), (8, 8, 9, 12, 10, 2), (9, 12, 3, 9 , 10, 11), (3, 0, 8, 7, 1, 9), (3, 0, 8, 7, 1, 9), (7, 2, 15, 11, 1, 11)

17、以步骤1中设定的变异的概率0.01改变方案组H_t中各个方案中在随机位置上的节点的编号，得到方案组H_t′，即：(4，8，12，12，9，4)，(5，2，4，9，0，8)，(8，2，4，9，0，8)，(5，8，9，12，10，2)，(8，8，9，12，10，2)，(8，8，9，12，10，2)，(9，12，3，9，10，11)，(3，0，8，7，1，9)，(3，0，8，7，1，9)，(7，2，15，11，1，11)17. Change the numbering of nodes at random positions in each scheme in the scheme group H _t with the probability 0.01 of the variation set in step 1 to obtain the scheme group H _t ′, namely: (4, 8, 12, 12, 9 , 4), (5, 2, 4, 9, 0, 8), (8, 2, 4, 9, 0, 8), (5, 8, 9, 12, 10, 2), (8, 8 , 9, 12, 10, 2), (8, 8, 9, 12, 10, 2), (9, 12, 3, 9, 10, 11), (3, 0, 8, 7, 1, 9 ), (3, 0, 8, 7, 1, 9), (7, 2, 15, 11, 1, 11)

18、按适应度从小到大的顺序，从方案组F_t中选择10个方案，与方案组H_t′的方案合并在一起，构成包含20个方案的新的方案组J_t+1 18. Select 10 schemes from the scheme group F _t according to the order of fitness from small to large, and merge them with the schemes of scheme group H _t ′ to form a new scheme group J _t+1 containing 20 schemes

19、如果迭代次数t小于100，则t＝t+1，然后返回到步骤2，进行新一轮迭代运算，否则进入步骤19；19. If the number of iterations t is less than 100, then t=t+1, and then return to step 2 to perform a new round of iterative operation, otherwise enter step 19;

20、按照步骤3所述的方法计算出方案组J₁₀₀中各方案的适应度值，将适应度值最小的方案，即方案(5，2，9，9，5，4)作为片上网络的任务到节点间的映射方案。20. Calculate the fitness value of each scheme in the scheme group J ₁₀₀ according to the method described in step 3, and use the scheme with the smallest fitness value, that is, scheme (5, 2, 9, 9, 5, 4) as the network on chip The mapping scheme between tasks and nodes.

Claims

1. be used to seek the method for mapping scheme between the task of network-on-chip and node, it is characterized in that: seek energy consumption and the response time task and the node mapping scheme of minimum simultaneously of sening as an envoy to, the searching process is as follows:

One, generates a scheme group J who comprises K allocative decision at random _t, and be each allocative decision X=(x ₁, x ₂..., x _N) set the Probability p of one intersection _iProbability q with variation _i, and total iterations T, and the variable t=0 of order record number of iterations;

Two, obtain scheme group J _tIn network-on-chip energy consumption and response time under every kind of scheme;

Three, calculate scheme group J _tIn the fitness value of each scheme, the priority of this fitness value size reflection scheme, it is high more be worth more little priority, so-called priority level be little by energy consumption and response time be excellent;

Four, with scheme group J _tIn K scheme be divided at random

Individual scheme group, wherein L is the scheme number in the every scheme group, selects in the every scheme group prepreerence scheme according to the precedence relationship of determining in the 3rd step and forms a priority scheme group Q _t

Five, with priority scheme group Q _tIn scheme press random pair, described pairing is meant according to two schemes divides into groups for one group, and with the Probability p of the intersection set in the step 1 _iThe numbering of the node on any one same position on two schemes in every pair of scheme is exchanged, and then all are merged by every pair of scheme after exchanging obtain scheme group Q ' _t

Six, the variation probability q to set in the first step _iChange scheme group Q ' _tIn in each scheme in the numbering of any one locational node, obtain scheme group G _t

Seven, the fitness value order of calculating by the 3rd step from small to large is from scheme group J _tThe middle selection

Individual scheme is with scheme group G _tScheme combine, constitute the new scheme group G ' comprise K scheme _t

Eight, with scheme group J _tAnd G ' _tBe merged into scheme group R _t

Nine, numerical procedure group R _tIn network-on-chip energy consumption and response time under every kind of scheme;

Ten, with scheme group R _tIn 2K scheme be divided into a plurality of groups, each group represents a boundary set, to little group #, the scheme in the less group of numbering is than the priority height of the scheme of the bigger group of numbering; So-called priority level is little of excellent by energy consumption and response time;

11, new scheme group F who does not have scheme of initialization _t, from the good group of the tenth step branch, the numbering order from big to small by group joins F with the scheme in the group successively then _tIn, up to scheme group F _tIn the scheme number surpass K, the scheme of the group that will add is at last all taken out then, and the group that will be added at last is designated as V;

12, determine in the tenth step the priority ranking of each scheme in each group, concrete grammar is as follows:

12-1 makes that the distance value of each scheme in the group is 0, and described distance value represents whether getting in touch of this scheme and other scheme be tight;

12-2 carries out inverted order according to the 9th size that goes on foot the energy consumption of each scheme that calculates to scheme and arranges, and the less scheme of energy consumption comes the front of the bigger scheme of energy consumption,

12-3 calculates the energy consumption distance of each scheme, and the energy consumption of each scheme distance deducts the resulting value of energy consumption of that scheme that comes its front for the energy consumption of that scheme of coming its back,

12-4 carries out the inverted order arrangement according to the response time size of each scheme that the 9th step calculated to scheme, and the response time, less scheme came the front of bigger scheme of response time,

12-5 calculates the response time distance of each scheme, and response time of each scheme distance deducts the resulting value of response time of that scheme that comes its front for response time of that scheme of coming its back,

12-6 obtains the distance value of scheme with the response time distance of each scheme and energy consumption apart from addition,

12-7 comes the foremost to the scheme of the scheme of energy consumption minimum and response time minimum, according to the distance size remaining scheme is sorted then, and come the back of the scheme of the scheme of energy consumption minimum and response time minimum, the scheme that distance value is bigger comes the front of the less scheme of distance value, and the priority ratio that comes the scheme of front like this comes the priority height of the scheme of back;

13, join scheme group F from the select progressively scheme of having passed through in the V of group after the ordering of the 12 step according to priority from high to low _t, up to scheme group F _tIn the scheme number be K;

14, with scheme group F _tIn K scheme be divided at random

Individual scheme group, wherein L is the scheme number in the every scheme group, selects priority scheme group F ' of prepreerence scheme composition in the every scheme group according to the precedence relationship of determining in the tenth step and the 12 step _t

15, with priority scheme group F ' _tIn scheme random pair again, with the Probability p of the intersection set in the first step _iThe numbering of the node on any one same position on two schemes in every pair of scheme is exchanged, and then all are merged by every pair of scheme after exchanging obtain scheme group H _t

16, with the probability q of the variation set in the first step _iChange scheme group H _tIn in each scheme in the numbering of any one locational node, obtain scheme group H ' _t

17, unite definite scheme group F according to the tenth step and the 12 step _tThe priority ranking of middle scheme is according to priority order from high to low, from scheme group F _tThe middle selection

Individual scheme is with scheme group H _tScheme combine, constitute the new scheme group J comprise K scheme _T+1

If 18 t＜T, then t=t+1 turned back to for second step then, otherwise entered for the 19 step;

19, calculate scheme group J _tIn the fitness value of each scheme, with the scheme of fitness value minimum, promptly minimum simultaneously scheme of energy consumption and response time as the task of network-on-chip to internodal mapping scheme;

2. according to the method that is used to seek mapping scheme between the task of network-on-chip and node of claim 1, it is characterized in that: described energy consumption is the energy consumption that the energy consumption sum of all processing nodes on the network-on-chip is added the communication network that connects each processing node.

3. according to the method that is used to seek mapping scheme between the task of network-on-chip and node of claim 1 or claim 2, it is characterized in that: the described response time is that the processing time sum that is input to all nodes on the outgoing route adds that the time-delay sum that is input to all links on the outgoing route adds queuing and the stand-by period sum that is input to packet on the outgoing route.

4. according to the method that is used to seek mapping scheme between the task of network-on-chip and node of claim 1 or claim 2, it is characterized in that: the fitness value that calculates each scheme adopts one of exclusive method, banker method, challenge tournament method, recurrence method, quick sort or improved quick sort.

5. according to the method that is used to seek mapping scheme between the task of network-on-chip and node of claim 3, it is characterized in that: the fitness value that calculates each scheme adopts one of exclusive method, banker method, challenge tournament method, recurrence method, quick sort or improved quick sort.