CN115668225A - 神经网络调度方法及装置 - Google Patents
神经网络调度方法及装置 Download PDFInfo
- Publication number
- CN115668225A CN115668225A CN202080101523.9A CN202080101523A CN115668225A CN 115668225 A CN115668225 A CN 115668225A CN 202080101523 A CN202080101523 A CN 202080101523A CN 115668225 A CN115668225 A CN 115668225A
- Authority
- CN
- China
- Prior art keywords
- layer
- neural network
- group
- layer group
- layers
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Program initiating; Program switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
- G06F9/5016—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals the resource being the memory
Abstract
一种神经网络调度方法及装置,涉及神经网络领域,能够提升片上存储容量的利用率,提升硬件的运行性能。该方法包括:确定神经网络中每一层分别对应的第一批尺寸(S701);基于第一批尺寸,将神经网络切分为包含至少一个第一层组的神经网络(S702);基于第一层组的切分结果,将神经网络切分为包含至少一个第二层组的神经网络(S703);基于第二层组的切分结果,调度神经网络(S704)。
Description
PCT国内申请,说明书已公开。
Claims (25)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/093544 WO2021237755A1 (zh) | 2020-05-29 | 2020-05-29 | 神经网络调度方法及装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115668225A true CN115668225A (zh) | 2023-01-31 |
Family
ID=78745499
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080101523.9A Pending CN115668225A (zh) | 2020-05-29 | 2020-05-29 | 神经网络调度方法及装置 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20230085718A1 (zh) |
EP (1) | EP4148627A4 (zh) |
CN (1) | CN115668225A (zh) |
WO (1) | WO2021237755A1 (zh) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10083395B2 (en) * | 2015-05-21 | 2018-09-25 | Google Llc | Batch processing in a neural network processor |
US10019668B1 (en) * | 2017-05-19 | 2018-07-10 | Google Llc | Scheduling neural network processing |
CN110321999B (zh) * | 2018-03-30 | 2021-10-01 | 赛灵思电子科技(北京)有限公司 | 神经网络计算图优化方法 |
CN110058943B (zh) * | 2019-04-12 | 2021-09-21 | 三星(中国)半导体有限公司 | 用于电子设备的内存优化方法和设备 |
-
2020
- 2020-05-29 EP EP20938103.7A patent/EP4148627A4/en active Pending
- 2020-05-29 WO PCT/CN2020/093544 patent/WO2021237755A1/zh unknown
- 2020-05-29 CN CN202080101523.9A patent/CN115668225A/zh active Pending
-
2022
- 2022-11-28 US US18/070,054 patent/US20230085718A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
WO2021237755A1 (zh) | 2021-12-02 |
EP4148627A1 (en) | 2023-03-15 |
US20230085718A1 (en) | 2023-03-23 |
EP4148627A4 (en) | 2023-06-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11449576B2 (en) | Convolution operation processing method and related product | |
CN210006057U (zh) | 用于深度学习引擎的设备和系统 | |
CN108765247B (zh) | 图像处理方法、装置、存储介质及设备 | |
CN112840356B (zh) | 运算加速器、处理方法及相关设备 | |
CN108108809B (zh) | 一种针对卷积神经元网络进行推理加速的硬件架构及其工作方法 | |
US11775430B1 (en) | Memory access for multiple circuit components | |
CN106228238B (zh) | 现场可编程门阵列平台上加速深度学习算法的方法和系统 | |
Pestana et al. | A full featured configurable accelerator for object detection with YOLO | |
CN111414994B (zh) | 一种基于FPGA的Yolov3网络计算加速系统及其加速方法 | |
CN110321997B (zh) | 高并行度计算平台、系统及计算实现方法 | |
US20230026006A1 (en) | Convolution computation engine, artificial intelligence chip, and data processing method | |
CN112668708A (zh) | 一种提高数据利用率的卷积运算装置 | |
Shahshahani et al. | Memory optimization techniques for fpga based cnn implementations | |
CN109993293B (zh) | 一种适用于堆叠式沙漏网络的深度学习加速器 | |
CN109472734B (zh) | 一种基于fpga的目标检测网络及其实现方法 | |
CN114399035A (zh) | 搬运数据的方法、直接存储器访问装置以及计算机系统 | |
CN114003201A (zh) | 矩阵变换方法、装置及卷积神经网络加速器 | |
JP7108702B2 (ja) | 複数の入力データセットのための処理 | |
EP3268859A1 (en) | Scheduling heterogenous processors | |
CN112200310A (zh) | 智能处理器、数据处理方法及存储介质 | |
CN115668225A (zh) | 神经网络调度方法及装置 | |
CN116227599A (zh) | 一种推理模型的优化方法、装置、电子设备及存储介质 | |
EP4071619A1 (en) | Address generation method, related device and storage medium | |
CN115668222A (zh) | 一种神经网络的数据处理方法及装置 | |
CN115346099A (zh) | 基于加速器芯片的图像卷积方法、芯片、设备及介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |