CN115618943A - 一种模型部署方法、装置、系统及电子设备 - Google Patents
一种模型部署方法、装置、系统及电子设备 Download PDFInfo
- Publication number
- CN115618943A CN115618943A CN202211389776.8A CN202211389776A CN115618943A CN 115618943 A CN115618943 A CN 115618943A CN 202211389776 A CN202211389776 A CN 202211389776A CN 115618943 A CN115618943 A CN 115618943A
- Authority
- CN
- China
- Prior art keywords
- model
- original
- target
- deployment
- pruning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000013138 pruning Methods 0.000 claims abstract description 28
- 102100030148 Integrator complex subunit 8 Human genes 0.000 claims abstract description 11
- 101710092891 Integrator complex subunit 8 Proteins 0.000 claims abstract description 11
- 238000012545 processing Methods 0.000 claims description 17
- 230000006835 compression Effects 0.000 claims description 15
- 238000007906 compression Methods 0.000 claims description 15
- 238000011156 evaluation Methods 0.000 claims description 11
- 238000004590 computer program Methods 0.000 claims description 4
- 238000012795 verification Methods 0.000 claims description 4
- 238000004891 communication Methods 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 6
- 238000010586 diagram Methods 0.000 description 6
- 238000005457 optimization Methods 0.000 description 6
- 238000013135 deep learning Methods 0.000 description 5
- 238000013139 quantization Methods 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000013136 deep learning model Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1744—Redundancy elimination performed by the file system using compression, e.g. sparse files
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211389776.8A CN115618943A (zh) | 2022-11-08 | 2022-11-08 | 一种模型部署方法、装置、系统及电子设备 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211389776.8A CN115618943A (zh) | 2022-11-08 | 2022-11-08 | 一种模型部署方法、装置、系统及电子设备 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115618943A true CN115618943A (zh) | 2023-01-17 |
Family
ID=84879460
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211389776.8A Pending CN115618943A (zh) | 2022-11-08 | 2022-11-08 | 一种模型部署方法、装置、系统及电子设备 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115618943A (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116048542A (zh) * | 2023-02-11 | 2023-05-02 | 之江实验室 | 一种计算机视觉深度学习模型的优化部署方法与装置 |
-
2022
- 2022-11-08 CN CN202211389776.8A patent/CN115618943A/zh active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116048542A (zh) * | 2023-02-11 | 2023-05-02 | 之江实验室 | 一种计算机视觉深度学习模型的优化部署方法与装置 |
CN116048542B (zh) * | 2023-02-11 | 2023-10-31 | 之江实验室 | 一种计算机视觉深度学习模型的优化部署方法与装置 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Li et al. | Auto-tuning neural network quantization framework for collaborative inference between the cloud and edge | |
US11055063B2 (en) | Systems and methods for deep learning processor | |
US20200364552A1 (en) | Quantization method of improving the model inference accuracy | |
WO2021233069A1 (zh) | 量化训练、图像处理方法及装置、存储介质 | |
JP2022513404A (ja) | トレーニング済み長短期記憶ニューラルネットワークの量子化 | |
US20140094938A1 (en) | Method and system for updating tuning parameters of a controller | |
CN113449859A (zh) | 一种数据处理方法及其装置 | |
CN115794913B (zh) | 一种人工智能系统中数据处理方法及装置 | |
US20200226458A1 (en) | Optimizing artificial neural network computations based on automatic determination of a batch size | |
CN111738488A (zh) | 一种任务调度方法及其装置 | |
CN115618943A (zh) | 一种模型部署方法、装置、系统及电子设备 | |
Yao et al. | Faster yolo-lite: Faster object detection on robot and edge devices | |
US20200293865A1 (en) | Using identity layer in a cellular neural network architecture | |
US10990525B2 (en) | Caching data in artificial neural network computations | |
CN113902112A (zh) | 硬件计算模拟方法、系统及计算机可读存储介质 | |
CN115238883A (zh) | 神经网络模型的训练方法、装置、设备及存储介质 | |
US20210374561A1 (en) | Green artificial intelligence implementation | |
Bolme et al. | Face recognition oak ridge (faro): A framework for distributed and scalable biometrics applications | |
Song et al. | Research on the acceleration effect of tensorrt in deep learning | |
Hong et al. | Image Classification on Resource-Constrained Microcontrollers | |
CN112991382A (zh) | 一种基于pynq框架的异构视觉目标跟踪系统及方法 | |
US20230051344A1 (en) | Optimization of memory use for efficient neural network execution | |
CN117076335B (zh) | 一种模型测试方法、系统、介质及电子设备 | |
US20230043584A1 (en) | Optimization of memory use for efficient neural network execution | |
Tian et al. | An Inference Performance Evaluation of TensorFlow and PyTorch on GPU Platform Using Image Super-Resolution Workloads |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20240121 Address after: Room 202B East 965, 2nd Floor, Building 1, No.1 Courtyard, Lize Middle Road, Chaoyang District, Beijing, 100102 Applicant after: Beijing Yixin Yiyu Microelectronics Technology Co.,Ltd. Country or region after: China Address before: 519, Floor 5, No. 68, North Fourth Ring West Road, Haidian District, Beijing, 100089 Applicant before: Beijing Shihai Xintu Microelectronics Co.,Ltd. Country or region before: China |
|
TA01 | Transfer of patent application right | ||
CB02 | Change of applicant information |
Country or region after: China Address after: Room 965 East, Building 1, 2nd Floor, 202B, No.1 Courtyard, Lize Middle Road, Chaoyang District, Beijing, 100020 Applicant after: Beijing Qianhe Yibang Cloud Information Technology Co.,Ltd. Address before: Room 202B East 965, 2nd Floor, Building 1, No.1 Courtyard, Lize Middle Road, Chaoyang District, Beijing, 100102 Applicant before: Beijing Yixin Yiyu Microelectronics Technology Co.,Ltd. Country or region before: China |
|
CB02 | Change of applicant information |