CN103970939A - Layering and reconfigurable on-chip network modeling and simulation system - Google Patents
Layering and reconfigurable on-chip network modeling and simulation system Download PDFInfo
- Publication number
- CN103970939A CN103970939A CN201410164158.2A CN201410164158A CN103970939A CN 103970939 A CN103970939 A CN 103970939A CN 201410164158 A CN201410164158 A CN 201410164158A CN 103970939 A CN103970939 A CN 103970939A
- Authority
- CN
- China
- Prior art keywords
- data
- network
- module
- chip
- routing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
本发明公开了一种层次化可重构的片上网络建模与仿真系统,该系统包含测试层、主体层与操作层三个层次。测试层的核心是可重构路由单元与网络接口单元,可动态改变网络结构参数;主体层以资源节点为主要构成单元,提供通用的OCP接口经由网络接口单元与路由单元相连,并集成流量产生机制、接收机制和性能分析逻辑单元;操作层以软件配置模块提供良好的人际交互界面,用于片上网络结构特征参数和通信模式的灵活配置与结构生成。本发明能对采用不同拓扑结构、网络规模、路由算法、缓冲区深度等片上网络体系结构进行建模,并进行不同通信模式和通信负载下的网络性能仿真,为不同应用需求下的片上网络架构设计提供依据。
The invention discloses a layered and reconfigurable on-chip network modeling and simulation system. The system includes three layers: a test layer, a main body layer and an operation layer. The core of the test layer is the reconfigurable routing unit and the network interface unit, which can dynamically change the network structure parameters; the main layer uses the resource node as the main component unit, provides a common OCP interface to connect with the routing unit through the network interface unit, and integrates traffic generation Mechanism, receiving mechanism and performance analysis logic unit; the operation layer provides a good human interaction interface with a software configuration module, which is used for flexible configuration and structure generation of on-chip network structure characteristic parameters and communication modes. The present invention can model on-chip network architectures using different topological structures, network scales, routing algorithms, buffer depths, etc., and perform network performance simulations under different communication modes and communication loads, providing an on-chip network architecture under different application requirements design basis.
Description
技术领域 technical field
本发明属于片上网络系统技术领域,具体涉及一种层次化可重构的片上网络建模与仿真系统。 The invention belongs to the technical field of network-on-chip systems, and in particular relates to a hierarchical and reconfigurable network-on-chip modeling and simulation system.
背景技术 Background technique
随着集成电路以及半导体工艺技术的快速发展,片上系统的规模越来越大。片上网络(Network-on-Chip,NoC)作为解决大规模片上系统中全局通信问题的新方法,显著改善了系统的性能,被认为是未来片上系统多核技术发展的必然方向。NoC设计包括拓扑结构选择、路由算法确定、以及基本组件设计等环节,各环节不同的设计方案在性能上存在巨大的差异,因而构建一个通用的可重构片上网络系统建模与仿真系统显得尤为重要。 With the rapid development of integrated circuits and semiconductor process technologies, the scale of SoCs is getting larger and larger. Network-on-Chip (NoC), as a new method to solve the global communication problem in large-scale system-on-chip, can significantly improve the performance of the system, and is considered to be an inevitable direction for the development of multi-core technology in system-on-chip in the future. NoC design includes topology selection, routing algorithm determination, and basic component design. Different design schemes in each link have huge differences in performance. Therefore, it is particularly important to build a general reconfigurable network-on-chip system modeling and simulation system. important.
已有的NoC仿真平台可以分为两大类,一类采用高级语言设计,能够对NoC快速地建模与性能分析,但由于抽象层次较高,忽视了物理层对性能的影响,会造成评估结果与实际情况有较大偏差;另一类是直接采用硬件描述语言对NoC系统进行模拟,但这种方法灵活性差、仿真速度慢,无法准确详细地对片上网络设计细节(如拓扑结构、流量模式等)进行研究。而且上述片上网络性能评价模型或模拟器尽管各具特点,但均缺少了对测试环境进行动态配置的能力。 Existing NoC simulation platforms can be divided into two categories. One is designed in a high-level language, which can quickly model and analyze NoC performance. However, due to the high level of abstraction, the impact of the physical layer on performance is ignored, which will cause evaluation problems. The result deviates greatly from the actual situation; the other is to directly use the hardware description language to simulate the NoC system, but this method has poor flexibility and slow simulation speed, and cannot accurately and detailedly design the details of the network on chip (such as topology, flow rate, etc. model, etc.) for research. Moreover, although the above-mentioned network-on-chip performance evaluation models or simulators have their own characteristics, they all lack the ability to dynamically configure the test environment.
发明内容 Contents of the invention
本发明的目的是针对目前片上网络仿真系统的不足,提供一种层次化可重构的片上网络建模与仿真系统。该系统支持不同拓扑结构、网络规模、路由算法、缓冲区深度等片上网络体系结构参数,以及多种通信流量模式的灵活动态配置,以实现多种片上网络架构在不同通信模式与通信负载下的快速建模与性能仿真。 The purpose of the present invention is to provide a layered and reconfigurable on-chip network modeling and simulation system aiming at the deficiencies of the current on-chip network simulation system. The system supports on-chip network architecture parameters such as different topologies, network scales, routing algorithms, and buffer depths, as well as flexible and dynamic configuration of various communication traffic modes, so as to realize various on-chip network architectures under different communication modes and communication loads. Rapid modeling and performance simulation.
实现本发明的技术方案如下: Realize the technical scheme of the present invention as follows:
一种层次化可重构的片上网络建模与仿真系统,其特征在于:该系统包含如下层次: A hierarchical reconfigurable network-on-chip modeling and simulation system is characterized in that the system includes the following layers:
a)测试层,由可重构的路由单元与网络接口单元组成,可动态改变片上网络结构特征参数,以支持多种片上网络结构; a) The test layer is composed of a reconfigurable routing unit and a network interface unit, which can dynamically change the characteristic parameters of the on-chip network structure to support various on-chip network structures;
b)主体层,由具有OCP接口的资源节点组成,通过OCP接口与路由单元相连,资源节点内部集成了流量生成、接收和性能分析模块,以支持多种通信模式和通信负载,以及通信数据统计分析; b) The main layer is composed of resource nodes with OCP interfaces, which are connected to routing units through OCP interfaces. The resource nodes integrate traffic generation, reception and performance analysis modules to support multiple communication modes and communication loads, as well as communication data statistics analyze;
c)操作层,由软件配置模块组成,提供良好的人机交互界面,用于片上网络结构特征参数和通信模式的灵活配置与片上网络结构生成。 c) The operation layer, which is composed of software configuration modules, provides a good human-computer interaction interface for flexible configuration of on-chip network structure characteristic parameters and communication modes and on-chip network structure generation.
本发明与现有技术相比,具有以下技术效果: Compared with the prior art, the present invention has the following technical effects:
本发明针对采用高级语言和硬件描述语言分别设计仿真平台的优劣,构建层次化的NoC仿真平台,包括以高级语言设计的软件配置模块组成的操作层,以及以硬件描述语言设计的片上网络通信节点和资源节点组成的测试层和主体层。其中,片上网络通信节点和资源节点具有可重构性,能对仿真的NoC系统结构及通信模式等进行动态配置。本发明还进一步将开放式核协议OCP运用到NoC仿真系统中,设计具有OCP接口的NoC资源节点,通过OCP协议与NoC路由单元进行通信,以提高模型的通用性,并将流量产生机制、接收机制和性能分析逻辑单元集成在资源节点内部,使得评价模型层次简单、结构清晰,可灵活配置。本发明能够实现不同架构片上网络在不同通信模式与通信负载下的快速建模与性能仿真,对基于各种不同参数设计的NoC体系结构进行准确全面地性能评价,为面向具体应用的NoC设计选择最佳互连体系结构提供依据。 The present invention aims at the advantages and disadvantages of separately designing the simulation platform using high-level language and hardware description language, and constructs a hierarchical NoC simulation platform, including an operation layer composed of software configuration modules designed in high-level language, and an on-chip network communication designed in hardware description language The test layer and the main layer composed of nodes and resource nodes. Among them, the network-on-chip communication nodes and resource nodes are reconfigurable, and can dynamically configure the simulated NoC system structure and communication modes. The present invention further applies the open core protocol OCP to the NoC simulation system, designs a NoC resource node with an OCP interface, and communicates with the NoC routing unit through the OCP protocol to improve the versatility of the model, and integrates the flow generation mechanism, receiving The mechanism and performance analysis logic unit are integrated in the resource node, which makes the evaluation model simple in hierarchy, clear in structure, and flexible in configuration. The present invention can realize rapid modeling and performance simulation of different architecture on-chip networks under different communication modes and communication loads, perform accurate and comprehensive performance evaluation on NoC architectures designed based on various parameters, and provide specific application-oriented NoC design options Optimal interconnection architecture provides the basis.
附图说明 Description of drawings
图1是层次化可重构的片上网络仿真系统结构图; Figure 1 is a structural diagram of a hierarchical reconfigurable network-on-chip simulation system;
图2是可重构路由单元结构图; Fig. 2 is a structural diagram of a reconfigurable routing unit;
图3是路由单元输入模块结构图; Fig. 3 is a structural diagram of the routing unit input module;
图4 是循环控制方式生成可配置输入端口代码结构图; Figure 4 is a code structure diagram of the configurable input port generated by the loop control method;
图5 是路由单元路由计算模块结构图; Figure 5 is a structural diagram of the routing calculation module of the routing unit;
图6 是路由单元仲裁模块结构图; Figure 6 is a structural diagram of the routing unit arbitration module;
图7是路由单元中数据交换开关模块可重构结构图; Fig. 7 is a reconfigurable structural diagram of the data exchange switch module in the routing unit;
图8是路由单元输出模块结构图; Fig. 8 is a structural diagram of a routing unit output module;
图9是网络接口结构图; Fig. 9 is a network interface structural diagram;
图10是OCP协议固定数据包格式; Fig. 10 is the OCP protocol fixed packet format;
图11是数据微片格式; Figure 11 is the data microchip format;
图12是资源节点结构图; Figure 12 is a resource node structure diagram;
图13是资源节点中流量生成模块结构图; Fig. 13 is a structural diagram of a traffic generation module in a resource node;
图14是资源节点中流量接收与性能分析模块结构图; Fig. 14 is a structural diagram of the traffic reception and performance analysis module in the resource node;
图15是基于OCP接口的片上网络通信结构图; Fig. 15 is a network communication structure diagram based on the OCP interface;
图16是基于操作层软件构建NoC模型流程图。 Figure 16 is a flow chart of building a NoC model based on the operating layer software.
具体实施方式 Detailed ways
以下结合附图,通过具体实施方式对本发明进行详细描述。 Hereinafter, the present invention will be described in detail through specific embodiments in conjunction with the accompanying drawings.
本发明层次化可重构的片上网络仿真系统包括操作层、主体层和测试层三个层次,用于对片上网络关键特性进行建模和性能指标统计分析,其结构如图1所示。每一层的功能组成及关系说明如下。 The hierarchical and reconfigurable network-on-chip simulation system of the present invention includes three layers: an operation layer, a main body layer and a test layer, and is used for modeling key characteristics of the network-on-chip and statistically analyzing performance indicators. Its structure is shown in FIG. 1 . The functional composition and relationship of each layer are described as follows.
(1)操作层,该层直接面向仿真平台使用者,用户通过软件配置模块与仿真模型进行交互,根据应用需求设置网络拓扑结构、网络规模、路由算法等定制片上网络结构,并选择测试流量的分布模型、数据包包长、网络注入率、微片大小等,便于测试片上网络在不同通信模式和通信负载下的各项性能指标,测试的性能结果亦通过软件配置模块显示。 (1) The operation layer, which is directly oriented to the users of the simulation platform. Users interact with the simulation model through the software configuration module, set the network topology, network scale, routing algorithm, etc. The distribution model, data packet length, network injection rate, microchip size, etc. are convenient for testing various performance indicators of the on-chip network under different communication modes and communication loads. The performance results of the test are also displayed through the software configuration module.
(2)主体层,该层由具有OCP接口的资源节点S构成,用于模拟片上网络系统中各种功能IP核(如,专用硬件资源、嵌入式微处理器、自定义的可重构硬件或者是上述各种资源的组合)的通信任务,包括根据所配置的空间和时间分布产生相应的通信流量注入网络,并根据网络的通信状况进行性能分析,计算各种性能指标。此外,用户可以根据网络的规模配置资源节点S的数目,如图1所示是对一个4×4大小的2D Mesh结构的NoC进行性能评价,分配了16个资源节点与路由单元相对应。当只需要测试几对节点在固定路径上的通信情况时,可以仅配置需要发送和接收数据的资源节点,这样有效的控制了主体层的规模。 (2) The main layer, which is composed of resource nodes S with OCP interface, which is used to simulate various functional IP cores in the network-on-chip system (such as dedicated hardware resources, embedded microprocessors, custom reconfigurable hardware or It is a combination of the above-mentioned various resources) communication tasks, including generating corresponding communication traffic into the network according to the configured space and time distribution, and performing performance analysis and calculating various performance indicators according to the communication status of the network. In addition, users can configure the number of resource nodes S according to the scale of the network. As shown in Figure 1, the performance evaluation of a 4×4 2D Mesh structure NoC is performed, and 16 resource nodes are allocated corresponding to the routing unit. When it is only necessary to test the communication of several pairs of nodes on a fixed path, only the resource nodes that need to send and receive data can be configured, which effectively controls the scale of the main layer.
(3)测试层,放置用于测试的片上网络实例,其核心组件是可重构的路由单元R与网络接口单元NI。根据用户在操作层的配置信息重构路由单元,在测试层形成指定结构特征参数(包括拓扑结构、交换策略、路由策略、缓冲区大小、端口数目等)的NoC实例。如图1中使用的NoC是一个4×4的2D Mesh网络,每个路由单元R与相邻的路由单元进行通信,并通过网络接口单元与资源节点进行通信。 (3) The test layer, where the on-chip network instance for testing is placed, and its core components are the reconfigurable routing unit R and the network interface unit NI. Reconstruct the routing unit according to the user's configuration information at the operation layer, and form a NoC instance with specified structural characteristic parameters (including topology, switching strategy, routing strategy, buffer size, port number, etc.) at the test layer. The NoC used in Figure 1 is a 4×4 2D Mesh network, each routing unit R communicates with adjacent routing units, and communicates with resource nodes through network interface units.
如上文所述,上述仿真系统的三个层次分别由软件配置模块、具有OCP接口的资源节点、可重构路由单元与网络接口单元这些核心部分组成。因而,本发明涉及的关键问题是这些核心组件的设计,下面详细说明核心组件的具体结构与设计方法。 As mentioned above, the three levels of the above-mentioned simulation system are composed of core parts such as software configuration module, resource node with OCP interface, reconfigurable routing unit and network interface unit. Therefore, the key issue involved in the present invention is the design of these core components, and the specific structure and design method of the core components will be described in detail below.
路由单元是片上网络通信的核心,主要负责数据的接收、存储,并根据特定的路由算法协议转发给下一个路由单元或资源节点。其传输信息的最小单元是微片,微片中包含了控制信息和负载数据。控制信息主要用来识别微片类型,标示微片的状态;负载数据即微片传输的数据内容。为了提高传输的效率,本发明设计了一种双通道的可重构路由单元,将控制信号与负载数据的传输分离开来,其结构框图如图2所示,主要由输入模块、仲裁模块、交换开关模块和输出模块组成。各部分实现功能与结构分别如下: The routing unit is the core of on-chip network communication. It is mainly responsible for receiving and storing data, and forwarding it to the next routing unit or resource node according to a specific routing algorithm protocol. The smallest unit for transmitting information is a microchip, which contains control information and load data. The control information is mainly used to identify the microchip type and indicate the status of the microchip; the payload data is the data content transmitted by the microchip. In order to improve the transmission efficiency, the present invention designs a dual-channel reconfigurable routing unit, which separates the transmission of the control signal from the load data. Its structural block diagram is shown in Figure 2. Composed of exchange switch module and output module. The function and structure of each part are as follows:
(1)输入模块 (1) Input module
该模块对接收到的微片数据进行解析,分离控制信号与负载数据,并根据规定的路由算法协议,将其分别存入相应的数据缓冲区,并向仲裁模块传输发送请求信号。其结构如图3所示,包含输入端口模块、路由计算模块和数据缓存模块。 The module analyzes the received microchip data, separates the control signal and load data, and stores them in the corresponding data buffer according to the specified routing algorithm protocol, and transmits the send request signal to the arbitration module. Its structure is shown in Figure 3, including an input port module, a routing calculation module and a data cache module.
输入模块接收上一级路由单元发送过来的微片,经输入端口的解析,将控制信号ctr_in与负载数据Data_in分离开来,分别送至相应的数据缓存模块。通过获得ctrl_in中微片类型控制位,判断微片类型,若为头微片,则要将负载数据送至路由计算模块进行路由,路由结果送给缓存控制器。数据缓存模块在接收到微片数据时,会触发缓存控制器向仲裁模块发起请求命令,在接收到仲裁模块的发送允许信号后,缓存控制器控制数据FIFO和控制信号,数据缓存模块发送数据包到相应的交换开关模块。 The input module receives the chip sent by the upper-level routing unit, and through the analysis of the input port, separates the control signal ctr_in from the load data Data_in, and sends them to the corresponding data buffer module respectively. By obtaining the microchip type control bit in ctrl_in, the microchip type is judged. If it is the header microchip, the load data is sent to the routing calculation module for routing, and the routing result is sent to the cache controller. When the data cache module receives microchip data, it will trigger the cache controller to initiate a request command to the arbitration module. After receiving the sending permission signal from the arbitration module, the cache controller controls the data FIFO and the control signal, and the data cache module sends the data packet to the corresponding switch module.
对于不同的NoC拓扑结构,路由单元的输入输出端口的数目以及与所连接的资源节点的数目不尽相同。因而,本发明采用循环控制的方式生成可配置端口数目的接口电路。以生成8端口为例,其代码结构如图4所示,代码中定义了参数num_ports为8,采用for循环语句,使输入模块input_module实例化8次。 For different NoC topologies, the number of input and output ports of the routing unit and the number of resource nodes connected to it are not the same. Therefore, the present invention generates an interface circuit with a configurable number of ports in a cyclic control manner. Taking the generation of 8 ports as an example, the code structure is shown in Figure 4. The parameter num_ports is defined as 8 in the code, and the for loop statement is used to instantiate the input module input_module 8 times.
此外,在输入模块中,路由计算模块接收头微片中路由信息,根据某种路由机制对路由信息进行路由,获得数据包的转发方向。为体现可重构性,路由机制可根据网络拓扑结构选择配置。路由计算模块结构如图5所示,包括了多种路由算法子模块以及可自定义的路由算法子模块。 In addition, in the input module, the routing calculation module receives the routing information in the header chip, routes the routing information according to a certain routing mechanism, and obtains the forwarding direction of the data packet. In order to reflect reconfigurability, the routing mechanism can be configured according to the network topology. The routing calculation module structure is shown in Figure 5, including a variety of routing algorithm sub-modules and customizable routing algorithm sub-modules.
在路由计算接口模块中配置各项参数,如路由算法类型控制参数router_type、路由信息位宽参数router_info_width、输出结果位宽参数router_out_width等,选择指定路由算法的子模块,实现该算法的路由方式。若为实现XY路由算法,只需将类型参数配置为ROUTING_TYPE_XY,路由信息位宽配置为8bit,输出结果位宽配置为5bit,并选择XY路由算法子模块即可。若所仿真的网络拓扑结构采用的路由算法在已有路由算法子模块库中没有实现,则配置类型参数为ROUTING_TYPE_DEFAULT以及相应的其他参数值,通过可扩展的自定义路由算法接口,添加自定义路由算法子模块至路由计算模块代码工程中,实现路由算法的扩展。 Configure various parameters in the routing calculation interface module, such as the routing algorithm type control parameter router_type, the routing information bit width parameter router_info_width, the output result bit width parameter router_out_width, etc., select the sub-module of the specified routing algorithm, and realize the routing method of the algorithm. To implement the XY routing algorithm, you only need to configure the type parameter as ROUTING_TYPE_XY, configure the routing information bit width as 8bit, configure the output result bit width as 5bit, and select the XY routing algorithm submodule. If the routing algorithm used by the simulated network topology is not implemented in the existing routing algorithm sub-module library, the configuration type parameter is ROUTING_TYPE_DEFAULT and other corresponding parameter values, and a custom route can be added through the scalable custom routing algorithm interface The algorithm sub-module is transferred to the code project of the routing calculation module to realize the expansion of the routing algorithm.
路由算法的输出结果为端口编号,对于不同的网络拓扑,路由单元的端口编号也不一样。如网格网络的路由单元端口编号0~3分别表示东西南北四个方向的端口,而第4个端口表示本地连接端口,树型网络的路由单元端口0~3端口分布在路由单元下边,负责连接子节点,4~7端口分布在路由单元上边,负责连接父节点。不同拓扑结构系统在连接路由单元时,需按照端口编号进行连接。 The output result of the routing algorithm is the port number, and for different network topologies, the port numbers of the routing unit are also different. For example, the port numbers 0~3 of the routing unit in the grid network indicate the ports in the four directions of east, west, north, south, respectively, and the fourth port indicates the local connection port. The routing unit ports 0~3 of the tree network are distributed under the routing unit, responsible for To connect child nodes, ports 4~7 are distributed on the routing unit and are responsible for connecting to the parent node. When systems with different topologies are connected to routing units, they need to be connected according to the port numbers.
(2)仲裁模块 (2) Arbitration module
该模块在接收到各输入模块发送的请求信号后,根据仲裁协议作出相应的响应,并控制交换开关模块进行数据交换。其可重构性主要体现在仲裁算法的可选择性及发送请求数目可变性。为提高传输效率,本发明在路由单元中采用了虚通道技术,因此,对于每个端口发送的转发请求还需具有分虚通道仲裁功能,仲裁模块一共分两级进行数据仲裁分配,分为输出端口申请和输出虚通道申请。该模块电路结构如图6所示。 After receiving the request signal sent by each input module, the module makes a corresponding response according to the arbitration agreement, and controls the exchange switch module to exchange data. Its reconfigurability is mainly reflected in the selectivity of the arbitration algorithm and the variability of the number of sending requests. In order to improve the transmission efficiency, the present invention adopts the virtual channel technology in the routing unit. Therefore, the forwarding request sent by each port also needs to have a virtual channel arbitration function. The arbitration module is divided into two stages for data arbitration and distribution, divided into output Port application and output virtual channel application. The circuit structure of the module is shown in Figure 6.
图中p0为路由单元端口数目,v为每个端口中虚通道数目。第一级向输出端口申请转发数据的虚通道,需要p0个v:1的仲裁器。第二级是每个输出端口的pi:1的仲裁器,响应端口输出请求。仲裁模块的输入信号位宽参数arbiter_width随仲裁器实例位置的改变而改变,在第一级仲裁时arbiter_width位宽为num_vcs,即虚通道数目,而在第二级仲裁时为num_ports,即端口数目。与路由算法模块类似,仲裁模块的重构主要也是实现不同仲裁算法的选择。仲裁算法的选择则通过case语句选择实例化不同的仲裁算法。在采用自定义仲裁算法时,对定义了输入与输出信号的自定义仲裁算法模块进行二次开发即可。 In the figure, p0 is the number of routing unit ports, and v is the number of virtual channels in each port. The first stage applies to the output port for a virtual channel for forwarding data, requiring p0 arbitrators of v:1. The second stage is an arbiter of pi:1 for each output port, responding to port output requests. The input signal bit width parameter arbiter_width of the arbitration module changes with the position of the arbiter instance. In the first level of arbitration, the arbiter_width bit width is num_vcs, that is, the number of virtual channels, and in the second level of arbitration, it is num_ports, that is, the number of ports. Similar to the routing algorithm module, the reconstruction of the arbitration module is mainly to realize the selection of different arbitration algorithms. The selection of the arbitration algorithm is to select and instantiate different arbitration algorithms through the case statement. When using a custom arbitration algorithm, the secondary development of the custom arbitration algorithm module that defines the input and output signals is sufficient.
(3)交换开关模块 (3) Exchange switch module
该模块分为负载数据交换模块和控制信号交换模块,由仲裁模块控制实现数据由输入模块到输出模块的物理传输。以数据交换开关模块为例,其电路结构图如图7所示。 The module is divided into a load data exchange module and a control signal exchange module, which is controlled by the arbitration module to realize the physical transmission of data from the input module to the output module. Taking the data exchange switch module as an example, its circuit structure diagram is shown in Figure 7.
模块输入信号由数据信号线data_in_ip和交叉端口控制信号ctrl_in_op_ip构成,数据信号线的位宽为num_ports*data_width,总位宽与路由单元端口数目成比例,实现重构配置。控制信号线的为num_ports*num_ports,分布控制输入端口与输出端口的连接状态。switch_type为交换阵列连接类型控制参数,分为CROSSBAR_TYPE_TRISTATE 、CROSSBAR_TYPE_MUX、 CROSSBAR_TYPE_DIST_MUX三种,分别实现三态矩阵开关、全连接矩阵以及部分连接矩阵的交换连接方式。 The input signal of the module is composed of the data signal line data_in_ip and the cross port control signal ctrl_in_op_ip. The bit width of the data signal line is num_ports*data_width, and the total bit width is proportional to the number of routing unit ports to realize reconfiguration configuration. The control signal line is num_ports*num_ports, and the connection status of the distribution control input port and output port. switch_type is the control parameter of the switching array connection type, which is divided into three types: CROSSBAR_TYPE_TRISTATATE, CROSSBAR_TYPE_MUX, and CROSSBAR_TYPE_DIST_MUX, which respectively realize the switching connection modes of three-state matrix switch, full connection matrix and partial connection matrix.
(4)输出模块 (4) Output module
该模块接收负载数据和控制信号,根据仲裁结果控制多路选择器,选择相应端口数据进行输出,输出给下一路由单元或目的资源节点。该模块电路结构图如图8所示,主要由多路选择器和输出控制器构成,其端口数目的可重构方式与输入模块一致。 The module receives load data and control signals, controls the multiplexer according to the arbitration result, selects the corresponding port data for output, and outputs to the next routing unit or destination resource node. The circuit structure diagram of this module is shown in Figure 8. It is mainly composed of a multiplexer and an output controller, and the number of ports can be reconfigured in the same way as the input module.
测试层的另一核心组件是可重构网络接口单元NI,其负责实现资源节点与路由单元间的数据连接。资源节点中的数据以固定数据包格式将数据传送给网络接口单元NI,而网络接口单元NI需根据不同的网络拓扑对数据包重新组包,以适应不同的拓扑结构。可重构网络接口单元结构图如图9所示,包含与资源节点接口、数据组包与解包、输入控制与输出控制、数据缓存这四个模块。各部分实现功能与结构分别如下: Another core component of the test layer is the reconfigurable network interface unit NI, which is responsible for realizing the data connection between resource nodes and routing units. The data in the resource nodes is transmitted to the network interface unit NI in a fixed data packet format, and the network interface unit NI needs to repackage the data packets according to different network topologies to adapt to different topological structures. The structure diagram of the reconfigurable network interface unit is shown in Figure 9, which includes four modules: interface with resource nodes, data packetization and unpacking, input control and output control, and data cache. The function and structure of each part are as follows:
(1)与资源节点接口模块:采用OCP协议与NoC系统中的资源节点通信,以提高通用性; (1) Interface module with resource nodes: use OCP protocol to communicate with resource nodes in the NoC system to improve versatility;
(2)数据组包与解包模块:数据组包模块将固定格式数据包转换为适应当前片上网络结构传输的数据微片,是网络接口单元中实现重构的关键模块,数据微片的组成、位宽等都随重构参数的变化而改变;解包是组包的逆向过程,数据解包模块将从片上网络传送来的数据微片格式转换成符合OCP协议通信的数据包格式。 (2) Data grouping and unpacking module: The data grouping module converts fixed-format data packets into data flakes suitable for transmission on the current on-chip network structure. It is a key module for reconfiguration in the network interface unit. , bit width, etc. all change with the change of the reconstruction parameters; unpacking is the reverse process of packeting, and the data unpacking module converts the data flake format transmitted from the network on chip into a data packet format that conforms to the OCP protocol communication.
(3)输入控制与输出控制模块:实现数据微片传输的调度,调配数据在资源节点与路由单元间的传输; (3) Input control and output control module: realize the scheduling of data microchip transmission, and allocate data transmission between resource nodes and routing units;
(4)数据缓存模块:按微片形式存储数据包,其深度可配置。 (4) Data cache module: store data packets in the form of microchips, and its depth is configurable.
其中,定义的OCP协议固定数据包格式如图10所示。Route_trans_cnt用于统计数据包在网络传输中的路由跳数,数据包每经过一个路由单元,该标志位计数加1,用于链路跳数统计分析;Packet_length为数据包包长,用于计算将数据包划分为微片的个数;Source_ID为数据包发送的源地址,是资源节点编码地址;Destination_ID为数据包发送的目的地址,同样是资源节点编码地址;Reserved为保留字;Time_cnt为记录数据包从数据发送到数据接收的时间延迟,用于统计分析网络拓扑传输的延时性能。 Wherein, the defined OCP protocol fixed data packet format is shown in FIG. 10 . Route_trans_cnt is used to count the number of routing hops of data packets in network transmission. Every time a data packet passes through a routing unit, the flag bit count is increased by 1, which is used for statistical analysis of link hops; Packet_length is the length of the data packet, used to calculate the The data packet is divided into the number of microchips; Source_ID is the source address of the data packet, which is the coded address of the resource node; Destination_ID is the destination address of the data packet, which is also the coded address of the resource node; Reserved is a reserved word; Time_cnt is the recorded data The time delay from data sending to data receiving of packets is used to statistically analyze the delay performance of network topology transmission.
本发明提供的可重构网络接口单元,具有将固定格式的数据包转换为重构特性下的数据微片,数据微片的组成、位宽等都随重构参数的变化而改变。根据对数据包格式的分析,可以将数据微片定义为如图11所示的格式形式。图11中,Ctrl部分为微片控制信号,包含了微片的有效信号标志V,微片所在虚通道编号VC,微片类型标志H(头尾片,Header)及T(尾微片,Tail),而Data为数据微片部分,在头尾片时,Data中包含了路由信息Router_info,微片长度L,保留字R。在不同的网络拓扑中,通过配置这些信号的位宽参数,就可以实现不同位宽的数据微片。如,定义4×4的2D Mesh结构的片上网络,虚通道数目为4,数据包包长最大为127,则数据微片的控制信号位宽为5,而数据信号位宽至少为11位(路由信息二维地址宽度为4,数据包微片长度位宽为7)。 The reconfigurable network interface unit provided by the present invention has the function of converting fixed-format data packets into data flakes under reconstruction characteristics, and the composition and bit width of the data flakes are all changed with the change of reconfiguration parameters. According to the analysis of the data packet format, the data flake can be defined as the format shown in FIG. 11 . In Figure 11, the Ctrl part is the microchip control signal, which includes the effective signal flag V of the microchip, the virtual channel number VC where the microchip is located, the microchip type flag H (header, Header) and T (tail microchip, Tail ), and Data is the part of the data flake. In the head and tail flakes, Data contains the routing information Router_info, the flake length L, and the reserved word R. In different network topologies, data flakes with different bit widths can be realized by configuring the bit width parameters of these signals. For example, if a network on chip with a 4×4 2D Mesh structure is defined, the number of virtual channels is 4, and the maximum data packet length is 127, then the bit width of the control signal of the data chip is 5, and the bit width of the data signal is at least 11 bits ( The two-dimensional address width of the routing information is 4, and the data packet flake length bit width is 7).
主体层的核心组件是具有OCP接口的资源节点。资源节点内部采用模块化的设计方法,封装了片上网络流量生成、接收模块、性能分析模块和接口模块,内部结构如图12所示。各模块在设计时可分别进行单独的修改,如需要添加某个流量分布模型时只需要对流量生成模块进行相应的修改,而不对其他模块产生影响,便于仿真平台的升级。用资源节点来模拟片上网络的IP核发生通信,使得平台配置更加灵活,其灵活性主要表现在以下两个方面:一方面可以根据需要通信的节点数量来配置资源节点,对于不需要进行数据发送和接收的节点则不配置资源节点,这样可以减小平台主体层的规模;另一方面,对于某些只发送数据或者只接收数据的通信节点,还可以根据需要仅配置资源节点中的流量生成模块或者流量接收模块,进一步节省片上资源。下面详细介绍资源节点各模块实现功能与结构。 The core component of the main layer is the resource node with OCP interface. The resource node adopts a modular design method to encapsulate the on-chip network traffic generation, receiving module, performance analysis module and interface module. The internal structure is shown in Figure 12. Each module can be individually modified during design. If a traffic distribution model needs to be added, only the traffic generation module needs to be modified accordingly, without affecting other modules, which facilitates the upgrade of the simulation platform. Using resource nodes to simulate the communication of the IP core of the network on chip makes the platform configuration more flexible. The flexibility is mainly reflected in the following two aspects: on the one hand, resource nodes can be configured according to the number of nodes that need to communicate, and for those that do not need to send data The nodes that receive and receive data do not configure resource nodes, which can reduce the scale of the main layer of the platform; on the other hand, for some communication nodes that only send data or only receive data, it is also possible to configure only traffic generation in resource nodes as needed module or flow receiving module, further saving on-chip resources. The function and structure of each module of the resource node are introduced in detail below.
(1)流量生成模块 (1) Traffic generation module
流量生成是整个性能评价工作的源头,本发明中的流量生成模块结构如图13所示,包含random模块、Time interval模块、Addr_gen模块以及Inject模块。 Traffic generation is the source of the entire performance evaluation work. The structure of the traffic generation module in the present invention is shown in Figure 13, including a random module, a Time interval module, an Addr_gen module and an Inject module.
random模块用来产生满足0到1之间均匀分布的随机数供时间分布控制模块和空间分布控制模块使用。 The random module is used to generate random numbers satisfying the uniform distribution between 0 and 1 for use by the time distribution control module and the space distribution control module.
时间分布控制模块Time interval用于控制网络流量在时间维度上的发生间隔,其内部集成了多种流量分布模型的运算逻辑,供测试时选择使用。平台中支持的各种时间分布(如,泊松分布、自相似分布、固定时间间隔)的时间间隔使用的随机数由random模块提供,其他参数均可以配置。当时钟周期计数器当到达计算出的时间间隔时,Time interval模块将输出信号WR_en置为高电平,通知Inject模块向网络中注入数据,在下一个时钟周期WR_en自动置为低电平。对于需要持续发送数据的ON-OFF模型,在突发周期内WR_en始终保持高电平,在空闲周期内WR_en始终保持低电平。 The time distribution control module Time interval is used to control the occurrence interval of network traffic in the time dimension. It integrates the operation logic of various traffic distribution models for selection during testing. The random numbers used in the time intervals of various time distributions supported in the platform (such as Poisson distribution, self-similar distribution, fixed time interval) are provided by the random module, and other parameters can be configured. When the clock cycle counter reaches the calculated time interval, the Time interval module sets the output signal WR_en to high level, notifies the Inject module to inject data into the network, and WR_en is automatically set to low level in the next clock cycle. For the ON-OFF model that needs to continuously send data, WR_en is always kept high during the burst period, and WR_en is always kept low during the idle period.
空间分布控制模块Addr_gen负责确定网络流量在空间维度上的通信关系,根据用户的配置,Addr_gen模块内部集成的多种空间分布模型运算逻辑,计算出满足指定分布结果作为当前节点通信的目标地址,平台中支持的空间分布包括均匀分布、降概率分布、固定传输路径分布。在每次发生流量注入时Inject模块根据Addr_gen模块产生的地址编号作为本次通信的目标地址,完成通信后,Addr_gen模块就立刻通过计算更新地址编号,为下一次通信提供满足指定分布的目标地址。 The spatial distribution control module Addr_gen is responsible for determining the communication relationship of network traffic in the spatial dimension. According to the user's configuration, the operational logic of various spatial distribution models integrated in the Addr_gen module calculates the target address that meets the specified distribution results as the current node communication, and the platform The spatial distributions supported in include uniform distribution, descending probability distribution, and fixed transmission path distribution. When traffic injection occurs each time, the Inject module uses the address number generated by the Addr_gen module as the target address of this communication. After the communication is completed, the Addr_gen module immediately updates the address number through calculation to provide a target address that satisfies the specified distribution for the next communication.
Inject模块读取从操作层传来的配置文件,根据配置信息选择Time interval模块和Addr_gen模块应生成的流量分布模型并设定指定分布的各项参数。Inject模块按照用户指定的微片长度产生待发送的数据包,以Addr_gen模块产生的服从特定分布的地址为通信目标,在信号WR_en为高电平时将数据和地址同时传送给网络接口单元模块。 The Inject module reads the configuration file transmitted from the operation layer, selects the traffic distribution model that should be generated by the Time interval module and the Addr_gen module according to the configuration information, and sets various parameters of the specified distribution. The Inject module generates the data packet to be sent according to the microchip length specified by the user, takes the address subject to the specific distribution generated by the Addr_gen module as the communication target, and transmits the data and address to the network interface unit module at the same time when the signal WR_en is at a high level.
(2)流量接收与性能分析模块 (2) Traffic reception and performance analysis module
资源节点中的流量接收与性能分析模块包含Receiver模块和Access模块,其结构如图14所示。 The traffic reception and performance analysis module in the resource node includes a Receiver module and an Access module, and its structure is shown in Figure 14.
Receiver模块为测试数据接收模块,用于存储片上网络传来的数据。每收到一个数据包,就记录数据包到达的时间,并对数据包头微片中数据包的发送时间、数据包长度和链路跳数等参数信息进行统计,然后将这些参数传给性能分析模块Access进行分析处理。 The Receiver module is a test data receiving module for storing data from the on-chip network. Every time a data packet is received, the arrival time of the data packet is recorded, and the parameter information such as the sending time of the data packet, the length of the data packet and the number of link hops in the data packet header chip are counted, and then these parameters are passed to the performance analysis The module Access performs analysis and processing.
根据网络性能指标(包括平均延时、吞吐量、链路利用率等)计算时需要统计的信息,结合数据传输时的必要信息,可以确定测试流量的数据包包头格式,平台中的数据微片长度为32位,头微片格式如下所示。 According to the information that needs to be counted during the calculation of network performance indicators (including average delay, throughput, link utilization, etc.), combined with the necessary information during data transmission, the format of the data packet header of the test traffic can be determined, and the data microchips in the platform The length is 32 bits, and the header chip format is as follows.
头微片内的数据均由平台的流量生成模块在数据产生时写入头微片中,其中发送时间用于记录数据包传入片上网络的时间,与Receiver模块记录的数据包接收时间一起用于网络平均延时指标的计算;源地址和目标地址信息为路由单元计算传输路径提供依据;数据包长度信息可以为网络吞吐量和链路利用率的计算提供重要参数;路由跳数记录了数据包传输过程中经历的链路数,用于对链路利用率和网络功耗的计算,其初始值为0,每经过一次路由转发就进行加一操作,在头微片的数据信息中只有路由跳数的值是可以修改的。 The data in the header microchip is written into the header microchip by the traffic generation module of the platform when the data is generated. The sending time is used to record the time when the data packet is transmitted to the on-chip network, and is used together with the data packet receiving time recorded by the Receiver module. It is used to calculate the average delay index of the network; the source address and destination address information provide the basis for the routing unit to calculate the transmission path; the data packet length information can provide important parameters for the calculation of network throughput and link utilization; the routing hop count records the data The number of links experienced during packet transmission is used to calculate link utilization and network power consumption. Its initial value is 0, and it will be incremented every time it is routed and forwarded. In the data information of the header chip, there is only The value of routing hops can be modified.
Access模块为性能指标计算模块,其内部集成了各项性能指标公式的算法逻辑。除此之外,Access模块中还包含了一个全局时钟计数器,用于计算平台运行时间,结合Receiver模块统计的各项信息以及操作层传来的网络功耗参数,为算法模块提供性能指标计算所需要的各种参数,最终计算结果通过主体层汇总并等待操纵层的读取。 The Access module is a performance index calculation module, which integrates the algorithmic logic of various performance index formulas. In addition, the Access module also includes a global clock counter, which is used to calculate the running time of the platform. Combined with the statistics of the Receiver module and the network power consumption parameters from the operation layer, it provides the algorithm module with the necessary information for the calculation of performance indicators. Various parameters are required, and the final calculation results are summarized through the main layer and wait for the reading of the manipulation layer.
(3)OCP接口模块 (3) OCP interface module
该模块负责实现资源节点与片上通信网络间的通信,包含主机接口与从机接口两部分。主机接口与流量生成模块连接,将命令和数据发送给与之相邻的片上通信网络中的路由单元,并接收路由单元的反馈信号;从机接口与流量接收性能分析模块相连,按照OCP时序接收路由单元发来的数据,并将数据传送给性能分析模块保存,同时发送反馈信号给数据发送方。基于OCP接口的片上网络通信结构如图15所示。 This module is responsible for realizing the communication between the resource node and the on-chip communication network, including two parts: the host interface and the slave interface. The host interface is connected to the traffic generation module, and sends commands and data to the routing unit in the adjacent on-chip communication network, and receives the feedback signal from the routing unit; the slave interface is connected to the traffic receiving performance analysis module, and receives according to the OCP sequence The data sent by the routing unit is transmitted to the performance analysis module for storage, and a feedback signal is sent to the data sender at the same time. The communication structure of the on-chip network based on the OCP interface is shown in Figure 15.
操作层的组成单元是软件配置模块。该模块提供良好的人际交互界面,将所提取的网络特征参数显示在软件界面上,以供用户选择配置生成用于仿真评估的各种NoC模型;根据所配置的通信流量模式参数,调用主体层的基本组件(资源节点模块),根据所配置的拓扑结构、路由算法、虚通道数量、缓冲区深度等参数,调用测试层的基本组件(可重构路由单元和网络接口单元),以及可扩展的自定义模块(自定义的路由算法、仲裁算法等),以此构建完成NoC系统结构;将生成的NoC工程文件(包括系统结构代码以及仿真测试数据TestBench)输入至EDA仿真工具(如,Modelsim),统计NoC网络节点收发数据、延时等性能数据,并将数据保存于一个文本文件中;最后读取EDA工具反馈的统计数据,并对该数据进行分析,产生网络性能评估报告。根据上述功能描述,本发明中的软件配置模块进一步划分为网络拓扑结构配置、路由单元结构配置、网络接口单元配置、流量参数配置以及EDA仿真数据分析这五个模块,各部分实现功能分别如下,基于此构建NoC仿真模型的流程如图16所示。 The constituent unit of the operation layer is the software configuration module. This module provides a good human interaction interface, and displays the extracted network characteristic parameters on the software interface for users to select and configure to generate various NoC models for simulation evaluation; according to the configured communication traffic pattern parameters, call the main layer The basic components of the test layer (resource node module), according to the configured topology, routing algorithm, number of virtual channels, buffer depth and other parameters, call the basic components of the test layer (reconfigurable routing unit and network interface unit), and expandable Custom modules (custom routing algorithm, arbitration algorithm, etc.) to complete the NoC system structure; input the generated NoC project files (including system structure code and simulation test data TestBench) to EDA simulation tools (such as Modelsim ), statistics of NoC network node sending and receiving data, delay and other performance data, and save the data in a text file; finally read the statistical data fed back by the EDA tool, analyze the data, and generate a network performance evaluation report. According to the above functional description, the software configuration module in the present invention is further divided into five modules of network topology configuration, routing unit configuration, network interface unit configuration, traffic parameter configuration and EDA simulation data analysis, and the functions of each part are as follows respectively, The process of building NoC simulation model based on this is shown in Figure 16.
(1)网络拓扑结构配置模块:主要实现拓扑结构参数(拓扑结构、网络规模、资源节点等)的配置,并根据配置参数建立网络节点连接,形成NoC网络对象,对该对象进行网络直径、平均距离、网络节点路由最短路径等性能参数计算; (1) Network topology configuration module: mainly implements the configuration of topology parameters (topology structure, network scale, resource nodes, etc.), and establishes network node connections according to the configuration parameters to form a NoC network object, and performs network diameter, average Calculation of performance parameters such as distance and shortest path of network node routing;
(2)路由单元结构配置模块:结合已配置好的拓扑结构参数,进一步对路由单元结构参数进行配置。可配置的参数有路由算法类型参数、虚通道数目、缓冲区深度、微片位宽、交换方式类型参数、仲裁类型参数、拥塞控制类型参数等。 (2) Routing unit structure configuration module: Combined with the configured topology parameters, further configure the routing unit structure parameters. Configurable parameters include routing algorithm type parameters, number of virtual channels, buffer depth, microchip bit width, switching mode type parameters, arbitration type parameters, congestion control type parameters, etc.
(3)网络接口配置模块:配置实现网络节点与资源节点的连接关系;根据网络拓扑结构参数及路由单元结构参数,配置网络接口单元相关参数,以便实现将固定格式数据包转换为适应当前拓扑结构网络系统的数据微片格式。 (3) Network interface configuration module: configure and realize the connection relationship between network nodes and resource nodes; configure network interface unit related parameters according to network topology parameters and routing unit structure parameters, so as to convert fixed-format data packets to adapt to the current topology structure A data chip format for network systems.
(4)流量参数配置模块:配置流量产生的空间分布与时间分布的参数、数据包长度、资源节点数目等,生成EDA仿真测试平台数据及TestBench等。 (4) Traffic parameter configuration module: configure the parameters of spatial distribution and time distribution generated by traffic, data packet length, number of resource nodes, etc., and generate EDA simulation test platform data and TestBench, etc.
(5)EDA仿真数据分析:对EDA仿真工具统计的NoC性能参数数据进行分析,产生NoC性能评估报告等。 (5) EDA simulation data analysis: analyze NoC performance parameter data collected by EDA simulation tools, and generate NoC performance evaluation reports, etc.
应用上述方法可以构成层次化可重构的片上网络建模与仿真系统,实现对具有不同拓扑结构、网络规模、路由算法、缓冲区深度等特征的片上网络体系结构进行正确快速建模,并在不同通信模式和通信负载下进行网络性能仿真,评估各种结构特征参数对于网络性能的影响,为不同应用需求下的片上网络架构设计提供重要依据。 By applying the above method, a hierarchical reconfigurable network-on-chip modeling and simulation system can be constructed to realize correct and fast modeling of network-on-chip architectures with different topological structures, network scales, routing algorithms, buffer depths, etc. Perform network performance simulation under different communication modes and communication loads, evaluate the impact of various structural characteristic parameters on network performance, and provide an important basis for on-chip network architecture design under different application requirements.
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410164158.2A CN103970939A (en) | 2014-04-22 | 2014-04-22 | Layering and reconfigurable on-chip network modeling and simulation system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410164158.2A CN103970939A (en) | 2014-04-22 | 2014-04-22 | Layering and reconfigurable on-chip network modeling and simulation system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103970939A true CN103970939A (en) | 2014-08-06 |
Family
ID=51240429
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410164158.2A Pending CN103970939A (en) | 2014-04-22 | 2014-04-22 | Layering and reconfigurable on-chip network modeling and simulation system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103970939A (en) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105391576A (en) * | 2015-11-02 | 2016-03-09 | 电子科技大学 | Network on chip operation process reproduction method and system |
CN105703948A (en) * | 2016-01-19 | 2016-06-22 | 河海大学常州校区 | On-chip network communication structure simulating and evaluating platform based on FPGA |
CN106302209A (en) * | 2015-06-12 | 2017-01-04 | 华为技术有限公司 | A kind of network-on-chip NoC and the method for data transmission |
CN106980921A (en) * | 2017-03-02 | 2017-07-25 | 上海歌略软件科技有限公司 | A kind of self-defined risk analysis method |
CN107003691A (en) * | 2014-12-27 | 2017-08-01 | 英特尔公司 | Technology for synchronizing sampling to counter based on global clock |
CN107196792A (en) * | 2017-05-17 | 2017-09-22 | 南京大学 | The Reconfigurable Computation Configuration network system of expansible support partially dynamical reconfiguration |
CN107450899A (en) * | 2016-06-01 | 2017-12-08 | 深圳市信锐网科技术有限公司 | The generation method and device of terminal control script |
CN108564170A (en) * | 2018-04-26 | 2018-09-21 | 福州瑞芯微电子股份有限公司 | A kind of restructural neural network computing method and circuit based on NOC |
CN109670268A (en) * | 2018-12-29 | 2019-04-23 | 京微齐力(北京)科技有限公司 | A kind of multiple IP and the connection method of the port EFPGA |
CN110958177A (en) * | 2019-11-07 | 2020-04-03 | 浪潮电子信息产业股份有限公司 | Network-on-chip route optimization method, device, equipment and readable storage medium |
CN111104775A (en) * | 2019-11-22 | 2020-05-05 | 核芯互联科技(青岛)有限公司 | Network-on-chip topological structure and implementation method thereof |
CN111193603A (en) * | 2019-08-06 | 2020-05-22 | 腾讯科技(深圳)有限公司 | Network architecture generation method and device, readable storage medium and computer equipment |
CN112073249A (en) * | 2020-09-17 | 2020-12-11 | 深圳市信锐网科技术有限公司 | Data transmission method, cluster switch system and related equipment |
CN112311701A (en) * | 2019-07-29 | 2021-02-02 | 奥塔索克技术有限公司 | Analog broadcasting in network on chip |
CN113055219A (en) * | 2019-12-27 | 2021-06-29 | 阿特里斯公司 | Physically aware topology synthesis of networks |
CN114691599A (en) * | 2020-12-30 | 2022-07-01 | 阿特里斯公司 | Synthesis of network on chip (NoC) using performance constraints and targets |
CN115348181A (en) * | 2022-10-18 | 2022-11-15 | 苏州浪潮智能科技有限公司 | A data transmission modeling method, system, device and storage medium |
CN115643167A (en) * | 2022-12-14 | 2023-01-24 | 摩尔线程智能科技(北京)有限责任公司 | Network-on-chip configuration method and device, and storage medium |
CN115665041A (en) * | 2022-11-18 | 2023-01-31 | 北京红山微电子技术有限公司 | Network-on-chip structure, data transmission method, electronic device, and storage medium |
CN117061427A (en) * | 2023-08-08 | 2023-11-14 | 苏州国科测试科技有限公司 | TTE network system under test equipment on-chip network architecture and design method thereof |
-
2014
- 2014-04-22 CN CN201410164158.2A patent/CN103970939A/en active Pending
Non-Patent Citations (4)
Title |
---|
周阳: "面向多种拓扑结构的可重构片上网络建模与仿真", 《万方数据学位论文库》 * |
罗丹 等: "基于OCP接口的片上网络性能评估平台", 《万方数据库》 * |
葛芬 等: "基于网络监控器的专用片上网络动态容错路由", 《电子学报》 * |
陈延仓 等: "面向性能优化的可重构片上网络技术研究", 《微电子学》 * |
Cited By (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107003691A (en) * | 2014-12-27 | 2017-08-01 | 英特尔公司 | Technology for synchronizing sampling to counter based on global clock |
CN107003691B (en) * | 2014-12-27 | 2021-02-02 | 英特尔公司 | Techniques for synchronously sampling counters based on global clock |
CN106302209B (en) * | 2015-06-12 | 2019-11-29 | 华为技术有限公司 | A kind of network-on-chip NoC and the method for data transmission |
CN106302209A (en) * | 2015-06-12 | 2017-01-04 | 华为技术有限公司 | A kind of network-on-chip NoC and the method for data transmission |
CN105391576B (en) * | 2015-11-02 | 2019-04-02 | 电子科技大学 | A kind of network-on-chip operational process replay method and system |
CN105391576A (en) * | 2015-11-02 | 2016-03-09 | 电子科技大学 | Network on chip operation process reproduction method and system |
CN105703948A (en) * | 2016-01-19 | 2016-06-22 | 河海大学常州校区 | On-chip network communication structure simulating and evaluating platform based on FPGA |
CN107450899A (en) * | 2016-06-01 | 2017-12-08 | 深圳市信锐网科技术有限公司 | The generation method and device of terminal control script |
CN107450899B (en) * | 2016-06-01 | 2022-04-26 | 深圳市信锐网科技术有限公司 | Method and device for generating terminal control script |
CN106980921A (en) * | 2017-03-02 | 2017-07-25 | 上海歌略软件科技有限公司 | A kind of self-defined risk analysis method |
CN107196792A (en) * | 2017-05-17 | 2017-09-22 | 南京大学 | The Reconfigurable Computation Configuration network system of expansible support partially dynamical reconfiguration |
CN107196792B (en) * | 2017-05-17 | 2020-08-04 | 南京大学 | Expandable reconfigurable computing configuration network system supporting dynamic partial reconfiguration |
CN108564170A (en) * | 2018-04-26 | 2018-09-21 | 福州瑞芯微电子股份有限公司 | A kind of restructural neural network computing method and circuit based on NOC |
CN108564170B (en) * | 2018-04-26 | 2020-06-19 | 福州瑞芯微电子股份有限公司 | Reconfigurable neural network operation method and circuit based on NOC |
CN109670268A (en) * | 2018-12-29 | 2019-04-23 | 京微齐力(北京)科技有限公司 | A kind of multiple IP and the connection method of the port EFPGA |
CN109670268B (en) * | 2018-12-29 | 2022-11-25 | 京微齐力(北京)科技有限公司 | Method for connecting multiple IP and EFPGA ports |
CN112311701A (en) * | 2019-07-29 | 2021-02-02 | 奥塔索克技术有限公司 | Analog broadcasting in network on chip |
CN112311701B (en) * | 2019-07-29 | 2024-03-19 | 西门子工业软件有限公司 | Analog broadcasting in a network on chip |
CN111193603B (en) * | 2019-08-06 | 2021-10-22 | 腾讯科技(深圳)有限公司 | Network architecture generation method and device, readable storage medium and computer equipment |
CN111193603A (en) * | 2019-08-06 | 2020-05-22 | 腾讯科技(深圳)有限公司 | Network architecture generation method and device, readable storage medium and computer equipment |
CN110958177A (en) * | 2019-11-07 | 2020-04-03 | 浪潮电子信息产业股份有限公司 | Network-on-chip route optimization method, device, equipment and readable storage medium |
CN110958177B (en) * | 2019-11-07 | 2022-02-18 | 浪潮电子信息产业股份有限公司 | Network-on-chip route optimization method, device, equipment and readable storage medium |
CN111104775B (en) * | 2019-11-22 | 2023-09-15 | 核芯互联科技(青岛)有限公司 | Network-on-chip topological structure and implementation method thereof |
CN111104775A (en) * | 2019-11-22 | 2020-05-05 | 核芯互联科技(青岛)有限公司 | Network-on-chip topological structure and implementation method thereof |
CN113055219B (en) * | 2019-12-27 | 2023-03-03 | 阿特里斯公司 | Physically aware topology synthesis of networks |
CN113055219A (en) * | 2019-12-27 | 2021-06-29 | 阿特里斯公司 | Physically aware topology synthesis of networks |
CN112073249A (en) * | 2020-09-17 | 2020-12-11 | 深圳市信锐网科技术有限公司 | Data transmission method, cluster switch system and related equipment |
CN114691599A (en) * | 2020-12-30 | 2022-07-01 | 阿特里斯公司 | Synthesis of network on chip (NoC) using performance constraints and targets |
CN114691599B (en) * | 2020-12-30 | 2024-10-22 | 阿特里斯公司 | Synthesis of Network-on-Chip (NoC) using performance constraints and targets |
CN115348181A (en) * | 2022-10-18 | 2022-11-15 | 苏州浪潮智能科技有限公司 | A data transmission modeling method, system, device and storage medium |
CN115665041A (en) * | 2022-11-18 | 2023-01-31 | 北京红山微电子技术有限公司 | Network-on-chip structure, data transmission method, electronic device, and storage medium |
CN115665041B (en) * | 2022-11-18 | 2023-03-28 | 北京红山微电子技术有限公司 | Network-on-chip structure, data transmission method, electronic device, and storage medium |
CN115643167B (en) * | 2022-12-14 | 2023-03-10 | 摩尔线程智能科技(北京)有限责任公司 | On-chip network configuration method and device, and storage medium |
CN115643167A (en) * | 2022-12-14 | 2023-01-24 | 摩尔线程智能科技(北京)有限责任公司 | Network-on-chip configuration method and device, and storage medium |
CN117061427A (en) * | 2023-08-08 | 2023-11-14 | 苏州国科测试科技有限公司 | TTE network system under test equipment on-chip network architecture and design method thereof |
CN117061427B (en) * | 2023-08-08 | 2024-06-28 | 苏州国科测试科技有限公司 | Time-triggered Ethernet network system under test equipment on-chip network architecture and implementation method thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103970939A (en) | Layering and reconfigurable on-chip network modeling and simulation system | |
Pande et al. | Performance evaluation and design trade-offs for network-on-chip interconnect architectures | |
Ben-Itzhak et al. | HNOCS: modular open-source simulator for heterogeneous NoCs | |
US9515961B2 (en) | Credit flow control scheme in a router with flexible link widths utilizing minimal storage | |
US10817627B1 (en) | Network on-chip topology generation | |
Abdallah et al. | Basic network-on-chip interconnection for future gigascale MCSoCs applications: Communication and computation orthogonalization | |
Kamali et al. | AdapNoC: A fast and flexible FPGA-based NoC simulator | |
US11194950B2 (en) | Network-on-chip topology generation | |
US11310169B2 (en) | Network-on-chip topology generation | |
CN105681123A (en) | Spacewire network delay testing and optimizing system | |
CN109450705A (en) | A kind of network-on-chip verification method and system towards mapping based on FPGA | |
US12250145B2 (en) | Network-on-chip topology generation | |
Chang et al. | Low-power algorithm for automatic topology generation for application-specific networks on chips | |
Wang et al. | Flexible and efficient QoS provisioning in AXI4-based network-on-chip architecture | |
CN108365996A (en) | A kind of network-on-chip emulation platform based on FPGA+ARM frameworks | |
Prasad et al. | YaNoC: Yet another network-on-chip simulation acceleration engine using FPGAs | |
Ji et al. | A high-performance fully adaptive routing based on software defined network-on-chip | |
Parane et al. | FPGA based NoC simulation acceleration framework supporting adaptive routing | |
Tan et al. | Generation of emulation platforms for NoC exploration on FPGA | |
Joseph et al. | A cycle-accurate network-on-chip simulator with support for abstract task graph modeling | |
Reshadi et al. | Elixir: a new bandwidth-constrained mapping for networks-on-chip | |
Aslam et al. | An efficient router architecture and its FPGA prototyping to support junction based routing in NoC platforms | |
Fresse et al. | Case study: deployment of the 2D NoC on 3D for the generation of large emulation platforms | |
Parane et al. | YaNoC: Yet another network-on-chip simulation acceleration engine supporting congestion-aware adaptive routing using FPGAs | |
Senouci et al. | Large scale on-chip networks: an accurate multi-FPGA emulation platform |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20140806 |