CN111542838A - 一种卷积神经网络的量化方法、装置及电子设备 - Google Patents

一种卷积神经网络的量化方法、装置及电子设备 Download PDF

Info

Publication number
CN111542838A
CN111542838A CN201880083718.8A CN201880083718A CN111542838A CN 111542838 A CN111542838 A CN 111542838A CN 201880083718 A CN201880083718 A CN 201880083718A CN 111542838 A CN111542838 A CN 111542838A
Authority
CN
China
Prior art keywords
quantized
neural network
layer
convolutional neural
ssdlite
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201880083718.8A
Other languages
English (en)
Other versions
CN111542838B (zh
Inventor
范鸿翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Corerain Technologies Co Ltd
Original Assignee
Shenzhen Corerain Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Corerain Technologies Co Ltd filed Critical Shenzhen Corerain Technologies Co Ltd
Publication of CN111542838A publication Critical patent/CN111542838A/zh
Application granted granted Critical
Publication of CN111542838B publication Critical patent/CN111542838B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

一种卷积神经网络的量化方法、装置及电子设备,所述方法包括:获取初始SSDlite卷积神经网络,所述初始SSDlite卷积神经网络包括用于特征提取的特征处理器与用于预测特征位置的位置预测器(101);将所述特征处理器中的网络层参数进行量化,得到量化特征处理器(102);保持所述位置预测器的网络层参数状态,基于所述量化特征处理器与所述位置预测器,得到量化SSDlite卷积神经网络(103);基于所述量化SSDlite卷积神经网络,输出得到目标SSDlite卷积神经网络(104)。通过对SSDlite卷积神经网络中的特征处理器进行量化,使得特征处理器中的特征处理算法复杂度下降,从而降低了SSDlite卷积神经网络的复杂度,解决了SSDLite卷积神经网络应用于小型嵌入式系统时存在算法复杂度高的问题。

Description

PCT国内申请,说明书已公开。

Claims (10)

  1. PCT国内申请,权利要求书已公开。
CN201880083718.8A 2018-12-12 2018-12-12 一种卷积神经网络的量化方法、装置及电子设备 Active CN111542838B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/120560 WO2020118553A1 (zh) 2018-12-12 2018-12-12 一种卷积神经网络的量化方法、装置及电子设备

Publications (2)

Publication Number Publication Date
CN111542838A true CN111542838A (zh) 2020-08-14
CN111542838B CN111542838B (zh) 2024-02-20

Family

ID=71075935

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880083718.8A Active CN111542838B (zh) 2018-12-12 2018-12-12 一种卷积神经网络的量化方法、装置及电子设备

Country Status (2)

Country Link
CN (1) CN111542838B (zh)
WO (1) WO2020118553A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112101284A (zh) * 2020-09-25 2020-12-18 北京百度网讯科技有限公司 图像识别方法、图像识别模型的训练方法、装置及系统
CN112232491A (zh) * 2020-10-29 2021-01-15 深兰人工智能(深圳)有限公司 基于卷积神经网络模型的特征提取方法和装置

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115165363A (zh) * 2022-06-27 2022-10-11 西南交通大学 一种基于cnn的轻型轴承故障诊断方法及系统

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108304919A (zh) * 2018-01-29 2018-07-20 百度在线网络技术(北京)有限公司 用于生成卷积神经网络的方法和装置
US20180247180A1 (en) * 2015-08-21 2018-08-30 Institute Of Automation, Chinese Academy Of Sciences Deep convolutional neural network acceleration and compression method based on parameter quantification
CN108596143A (zh) * 2018-05-03 2018-09-28 复旦大学 基于残差量化卷积神经网络的人脸识别方法及装置
US20180349758A1 (en) * 2017-06-06 2018-12-06 Via Alliance Semiconductor Co., Ltd. Computation method and device used in a convolutional neural network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10074041B2 (en) * 2015-04-17 2018-09-11 Nec Corporation Fine-grained image classification by exploring bipartite-graph labels
CN107480770B (zh) * 2017-07-27 2020-07-28 中国科学院自动化研究所 可调节量化位宽的神经网络量化与压缩的方法及装置
CN108510067B (zh) * 2018-04-11 2021-11-09 西安电子科技大学 基于工程化实现的卷积神经网络量化方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180247180A1 (en) * 2015-08-21 2018-08-30 Institute Of Automation, Chinese Academy Of Sciences Deep convolutional neural network acceleration and compression method based on parameter quantification
US20180349758A1 (en) * 2017-06-06 2018-12-06 Via Alliance Semiconductor Co., Ltd. Computation method and device used in a convolutional neural network
CN108304919A (zh) * 2018-01-29 2018-07-20 百度在线网络技术(北京)有限公司 用于生成卷积神经网络的方法和装置
CN108596143A (zh) * 2018-05-03 2018-09-28 复旦大学 基于残差量化卷积神经网络的人脸识别方法及装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112101284A (zh) * 2020-09-25 2020-12-18 北京百度网讯科技有限公司 图像识别方法、图像识别模型的训练方法、装置及系统
CN112232491A (zh) * 2020-10-29 2021-01-15 深兰人工智能(深圳)有限公司 基于卷积神经网络模型的特征提取方法和装置

Also Published As

Publication number Publication date
CN111542838B (zh) 2024-02-20
WO2020118553A1 (zh) 2020-06-18

Similar Documents

Publication Publication Date Title
US20190171927A1 (en) Layer-level quantization in neural networks
US10552737B2 (en) Artificial neural network class-based pruning
US11488002B2 (en) Binary neural network accelerator engine methods and systems
US10643124B2 (en) Method and device for quantizing complex artificial neural network
US10474430B2 (en) Mixed-precision processing elements, systems, and methods for computational models
EP3460792A1 (en) Optimization method and apparatus suitable for model of pattern recognition, and terminal device
CN110119745B (zh) 深度学习模型的压缩方法、装置、计算机设备及存储介质
KR20190034985A (ko) 인공 신경망의 양자화 방법 및 장치
CN111542838A (zh) 一种卷积神经网络的量化方法、装置及电子设备
CN110929865A (zh) 网络量化方法、业务处理方法及相关产品
KR20190054454A (ko) 인공 신경망의 양자화 방법 및 장치
EP3620982B1 (en) Sample processing method and device
CN112926570B (zh) 一种自适应比特网络量化方法、系统及图像处理方法
CN112287986A (zh) 一种图像处理方法、装置、设备及可读存储介质
CN112990438B (zh) 基于移位量化操作的全定点卷积计算方法、系统及设备
CN112200296B (zh) 网络模型量化方法、装置、存储介质及电子设备
CN111027684A (zh) 一种深度学习模型量化方法、装置、电子设备及存储介质
CN114627146A (zh) 图像处理方法、装置、电子设备及存储介质
CN114332500A (zh) 图像处理模型训练方法、装置、计算机设备和存储介质
CN111582229A (zh) 一种网络自适应半精度量化的图像处理方法和系统
US20200242467A1 (en) Calculation method and calculation device for sparse neural network, electronic device, computer readable storage medium, and computer program product
CN111126501B (zh) 一种图像识别方法、终端设备及存储介质
US11068784B2 (en) Generic quantization of artificial neural networks
WO2023185209A1 (zh) 模型剪枝
KR20220168527A (ko) 프라이버시 보존 엣지-서버 시너지 컴퓨팅 장치 및 방법

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant