CN111542838A - 一种卷积神经网络的量化方法、装置及电子设备 - Google Patents
一种卷积神经网络的量化方法、装置及电子设备 Download PDFInfo
- Publication number
- CN111542838A CN111542838A CN201880083718.8A CN201880083718A CN111542838A CN 111542838 A CN111542838 A CN 111542838A CN 201880083718 A CN201880083718 A CN 201880083718A CN 111542838 A CN111542838 A CN 111542838A
- Authority
- CN
- China
- Prior art keywords
- quantized
- neural network
- layer
- convolutional neural
- ssdlite
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000013527 convolutional neural network Methods 0.000 title claims abstract description 129
- 238000000034 method Methods 0.000 title claims abstract description 38
- 238000011002 quantification Methods 0.000 title claims description 7
- 238000013139 quantization Methods 0.000 claims abstract description 80
- 238000000605 extraction Methods 0.000 claims abstract description 11
- 230000015654 memory Effects 0.000 claims description 17
- 238000004590 computer program Methods 0.000 claims description 12
- 238000012549 training Methods 0.000 claims description 9
- 238000004422 calculation algorithm Methods 0.000 abstract description 14
- 238000012545 processing Methods 0.000 abstract description 5
- 238000004364 calculation method Methods 0.000 description 10
- 238000001514 detection method Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Image Analysis (AREA)
Abstract
一种卷积神经网络的量化方法、装置及电子设备,所述方法包括:获取初始SSDlite卷积神经网络,所述初始SSDlite卷积神经网络包括用于特征提取的特征处理器与用于预测特征位置的位置预测器(101);将所述特征处理器中的网络层参数进行量化,得到量化特征处理器(102);保持所述位置预测器的网络层参数状态,基于所述量化特征处理器与所述位置预测器,得到量化SSDlite卷积神经网络(103);基于所述量化SSDlite卷积神经网络,输出得到目标SSDlite卷积神经网络(104)。通过对SSDlite卷积神经网络中的特征处理器进行量化,使得特征处理器中的特征处理算法复杂度下降,从而降低了SSDlite卷积神经网络的复杂度,解决了SSDLite卷积神经网络应用于小型嵌入式系统时存在算法复杂度高的问题。
Description
PCT国内申请,说明书已公开。
Claims (10)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2018/120560 WO2020118553A1 (zh) | 2018-12-12 | 2018-12-12 | 一种卷积神经网络的量化方法、装置及电子设备 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111542838A true CN111542838A (zh) | 2020-08-14 |
CN111542838B CN111542838B (zh) | 2024-02-20 |
Family
ID=71075935
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880083718.8A Active CN111542838B (zh) | 2018-12-12 | 2018-12-12 | 一种卷积神经网络的量化方法、装置及电子设备 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111542838B (zh) |
WO (1) | WO2020118553A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112101284A (zh) * | 2020-09-25 | 2020-12-18 | 北京百度网讯科技有限公司 | 图像识别方法、图像识别模型的训练方法、装置及系统 |
CN112232491A (zh) * | 2020-10-29 | 2021-01-15 | 深兰人工智能(深圳)有限公司 | 基于卷积神经网络模型的特征提取方法和装置 |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115165363A (zh) * | 2022-06-27 | 2022-10-11 | 西南交通大学 | 一种基于cnn的轻型轴承故障诊断方法及系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108304919A (zh) * | 2018-01-29 | 2018-07-20 | 百度在线网络技术(北京)有限公司 | 用于生成卷积神经网络的方法和装置 |
US20180247180A1 (en) * | 2015-08-21 | 2018-08-30 | Institute Of Automation, Chinese Academy Of Sciences | Deep convolutional neural network acceleration and compression method based on parameter quantification |
CN108596143A (zh) * | 2018-05-03 | 2018-09-28 | 复旦大学 | 基于残差量化卷积神经网络的人脸识别方法及装置 |
US20180349758A1 (en) * | 2017-06-06 | 2018-12-06 | Via Alliance Semiconductor Co., Ltd. | Computation method and device used in a convolutional neural network |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10074041B2 (en) * | 2015-04-17 | 2018-09-11 | Nec Corporation | Fine-grained image classification by exploring bipartite-graph labels |
CN107480770B (zh) * | 2017-07-27 | 2020-07-28 | 中国科学院自动化研究所 | 可调节量化位宽的神经网络量化与压缩的方法及装置 |
CN108510067B (zh) * | 2018-04-11 | 2021-11-09 | 西安电子科技大学 | 基于工程化实现的卷积神经网络量化方法 |
-
2018
- 2018-12-12 CN CN201880083718.8A patent/CN111542838B/zh active Active
- 2018-12-12 WO PCT/CN2018/120560 patent/WO2020118553A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180247180A1 (en) * | 2015-08-21 | 2018-08-30 | Institute Of Automation, Chinese Academy Of Sciences | Deep convolutional neural network acceleration and compression method based on parameter quantification |
US20180349758A1 (en) * | 2017-06-06 | 2018-12-06 | Via Alliance Semiconductor Co., Ltd. | Computation method and device used in a convolutional neural network |
CN108304919A (zh) * | 2018-01-29 | 2018-07-20 | 百度在线网络技术(北京)有限公司 | 用于生成卷积神经网络的方法和装置 |
CN108596143A (zh) * | 2018-05-03 | 2018-09-28 | 复旦大学 | 基于残差量化卷积神经网络的人脸识别方法及装置 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112101284A (zh) * | 2020-09-25 | 2020-12-18 | 北京百度网讯科技有限公司 | 图像识别方法、图像识别模型的训练方法、装置及系统 |
CN112232491A (zh) * | 2020-10-29 | 2021-01-15 | 深兰人工智能(深圳)有限公司 | 基于卷积神经网络模型的特征提取方法和装置 |
Also Published As
Publication number | Publication date |
---|---|
CN111542838B (zh) | 2024-02-20 |
WO2020118553A1 (zh) | 2020-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20190171927A1 (en) | Layer-level quantization in neural networks | |
US10552737B2 (en) | Artificial neural network class-based pruning | |
US11488002B2 (en) | Binary neural network accelerator engine methods and systems | |
US10643124B2 (en) | Method and device for quantizing complex artificial neural network | |
US10474430B2 (en) | Mixed-precision processing elements, systems, and methods for computational models | |
EP3460792A1 (en) | Optimization method and apparatus suitable for model of pattern recognition, and terminal device | |
CN110119745B (zh) | 深度学习模型的压缩方法、装置、计算机设备及存储介质 | |
KR20190034985A (ko) | 인공 신경망의 양자화 방법 및 장치 | |
CN111542838A (zh) | 一种卷积神经网络的量化方法、装置及电子设备 | |
CN110929865A (zh) | 网络量化方法、业务处理方法及相关产品 | |
KR20190054454A (ko) | 인공 신경망의 양자화 방법 및 장치 | |
EP3620982B1 (en) | Sample processing method and device | |
CN112926570B (zh) | 一种自适应比特网络量化方法、系统及图像处理方法 | |
CN112287986A (zh) | 一种图像处理方法、装置、设备及可读存储介质 | |
CN112990438B (zh) | 基于移位量化操作的全定点卷积计算方法、系统及设备 | |
CN112200296B (zh) | 网络模型量化方法、装置、存储介质及电子设备 | |
CN111027684A (zh) | 一种深度学习模型量化方法、装置、电子设备及存储介质 | |
CN114627146A (zh) | 图像处理方法、装置、电子设备及存储介质 | |
CN114332500A (zh) | 图像处理模型训练方法、装置、计算机设备和存储介质 | |
CN111582229A (zh) | 一种网络自适应半精度量化的图像处理方法和系统 | |
US20200242467A1 (en) | Calculation method and calculation device for sparse neural network, electronic device, computer readable storage medium, and computer program product | |
CN111126501B (zh) | 一种图像识别方法、终端设备及存储介质 | |
US11068784B2 (en) | Generic quantization of artificial neural networks | |
WO2023185209A1 (zh) | 模型剪枝 | |
KR20220168527A (ko) | 프라이버시 보존 엣지-서버 시너지 컴퓨팅 장치 및 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |