CN115668225A - Neural network scheduling method and apparatus - Google Patents

Neural network scheduling method and apparatus

Info

Publication number
CN115668225A
Authority
CN
China
Prior art keywords
layer
neural network
group
layer group
layers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080101523.9A
Other languages
English (en)
Inventor
袁宏辉
李述成
熊乐进
切尔涅加·尼基塔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-05-29
Filing date
2020-05-29
Publication date
2023-01-31
Application filed by Huawei Technologies Co Ltd
Publication of CN115668225A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/06: Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063: Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00: Arrangements for program control, e.g. control units
    • G06F9/06: Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46: Multiprogramming arrangements
    • G06F9/48: Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806: Task transfer initiation or dispatching
    • G06F9/4843: Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881: Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00: Arrangements for program control, e.g. control units
    • G06F9/06: Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46: Multiprogramming arrangements
    • G06F9/50: Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005: Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011: Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5016: Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals the resource being the memory

Abstract

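A neural network scheduling method and apparatus, relating to the field of neural networks, which can improve the utilization of on-chip storage capacity and improve the operating performance of hardware. The method includes: determining a first batch size corresponding to each layer in a neural network (S701); splitting the neural network, based on the first batch sizes, into a neural network containing at least one first layer group (S702); splitting the neural network, based on the splitting result of the first layer groups, into a neural network containing at least one second layer group (S703); and scheduling the neural network based on the splitting result of the second layer groups (S704).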
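
The four steps S701 to S704 can be pictured as a small pipeline. The Python sketch below is only an illustration under stated assumptions: the abstract names the steps but not the rules behind them, so the `Layer` type, the `bytes_per_sample` cost model, the `capacity` and `max_batch` parameters, and every grouping rule here are hypothetical stand-ins rather than the method disclosed in the application.

```python
from dataclasses import dataclass
from itertools import groupby

# Every name, the cost model, and the grouping rules are illustrative
# assumptions; the abstract only names the four steps S701-S704.


@dataclass(frozen=True)
class Layer:
    name: str
    bytes_per_sample: int  # hypothetical on-chip buffer cost of one sample (> 0)


def first_batch_sizes(layers, capacity, max_batch):
    """S701 (assumed rule): largest batch whose buffers fit on chip, at least 1."""
    return [max(1, min(max_batch, capacity // l.bytes_per_sample)) for l in layers]


def first_layer_groups(layers, batch_sizes):
    """S702 (assumed rule): group consecutive layers that share a batch size."""
    groups = []
    for batch, run in groupby(zip(layers, batch_sizes), key=lambda pair: pair[1]):
        groups.append((batch, [layer for layer, _ in run]))
    return groups


def group_cost(group):
    """Hypothetical buffer estimate: batch size times the costliest layer."""
    batch, members = group
    return batch * max(layer.bytes_per_sample for layer in members)


def second_layer_groups(first_groups, capacity):
    """S703 (assumed rule): greedily merge adjacent first layer groups while
    the summed buffer estimate stays within the on-chip capacity."""
    merged, current, used = [], [], 0
    for group in first_groups:
        cost = group_cost(group)
        if current and used + cost > capacity:
            merged.append(current)
            current, used = [], 0
        current.append(group)
        used += cost
    if current:
        merged.append(current)
    return merged


def schedule(second_groups):
    """S704 (assumed rule): emit (layer, batch) pairs group by group, in order."""
    return [
        [(layer.name, batch) for batch, members in second for layer in members]
        for second in second_groups
    ]


def plan(layers, capacity, max_batch):
    """Run the four steps in sequence."""
    sizes = first_batch_sizes(layers, capacity, max_batch)
    firsts = first_layer_groups(layers, sizes)
    return schedule(second_layer_groups(firsts, capacity))
```

With three hypothetical layers costing 400, 30 and 30 bytes per sample, a capacity of 1000 bytes and a batch cap of 4, `plan` assigns batch sizes 2, 4 and 4, forms two first layer groups, and merges them into a single second layer group because their combined estimate (920 bytes) fits on chip. The greedy merge is one simple choice among many; a real scheduler would likely search over groupings with a more detailed cost model.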

Description

PCT national-phase application; the description has been published.

Claims (25)

  1. PCT national-phase application; the claims have been published.
CN202080101523.9A 2020-05-29 2020-05-29 Neural network scheduling method and apparatus Pending CN115668225A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/093544 WO2021237755A1 (zh) 2020-05-29 2020-05-29 Neural network scheduling method and apparatus

Publications (1)

Publication Number Publication Date
CN115668225A (zh) 2023-01-31

Family

ID=78745499

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080101523.9A Pending CN115668225A (zh) Neural network scheduling method and apparatus 2020-05-29 2020-05-29

Country Status (4)

Country Link
US (1) US20230085718A1 (zh)
EP (1) EP4148627A4 (zh)
CN (1) CN115668225A (zh)
WO (1) WO2021237755A1 (zh)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10083395B2 (en) * 2015-05-21 2018-09-25 Google Llc Batch processing in a neural network processor
US10019668B1 (en) * 2017-05-19 2018-07-10 Google Llc Scheduling neural network processing
CN110321999B (zh) * 2018-03-30 2021-10-01 赛灵思电子科技(北京)有限公司 Neural network computation graph optimization method
CN110058943B (zh) * 2019-04-12 2021-09-21 三星(中国)半导体有限公司 Memory optimization method and device for electronic equipment

Also Published As

Publication number Publication date
WO2021237755A1 (zh) 2021-12-02
EP4148627A1 (en) 2023-03-15
US20230085718A1 (en) 2023-03-23
EP4148627A4 (en) 2023-06-28

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination