CN112753016A - Method and apparatus for managing computing resources in the data preprocessing stage of a neural network
- Publication number
- CN112753016A (application CN201880098036.4A)
- Authority
- CN
- China
- Prior art keywords
- computing
- resource
- information
- node
- nodes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Debugging And Monitoring (AREA)
Abstract
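A method and apparatus for managing computing resources in the data preprocessing stage of a neural network, where the computing resources comprise multiple heterogeneous computing nodes. The method comprises: separately monitoring resource usage information of the multiple computing nodes (S310); generating, according to the resource usage information and based on a preset resource scheduling policy, resource adjustment information corresponding to a node to be adjusted among the multiple computing nodes (S320); and dynamically adjusting the computing resources of the node to be adjusted according to the resource adjustment information (S330). This can improve the utilization of the computing resources and helps reduce the training time of the neural network model.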
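The three steps S310 to S330 can be read as a monitor, plan, apply loop. The following is a minimal sketch of that loop, not the patented implementation: the `Node` class, the node names, the `tasks` field, and the threshold-based policy (`high`, `low`) are all hypothetical, since the abstract only says the policy is "preset" and does not describe it.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str           # hypothetical heterogeneous node, e.g. "cpu0", "gpu0"
    tasks: int          # preprocessing tasks currently assigned to the node
    utilization: float  # fraction of the node's capacity in use, 0.0 to 1.0


def monitor(nodes):
    """S310: collect resource usage information for each node."""
    return {n.name: n.utilization for n in nodes}


def plan_adjustments(usage, high=0.9, low=0.3):
    """S320: apply a preset policy to produce per-node adjustment info.

    Assumed policy (not from the source): take one task away from an
    overloaded node, give one more task to an underused node, and leave
    nodes between the two thresholds unchanged.
    """
    plan = {}
    for name, util in usage.items():
        if util > high:
            plan[name] = -1
        elif util < low:
            plan[name] = 1
    return plan


def apply_adjustments(nodes, plan):
    """S330: adjust only the nodes that appear in the plan."""
    for n in nodes:
        n.tasks = max(0, n.tasks + plan.get(n.name, 0))


nodes = [Node("cpu0", 4, 0.95), Node("gpu0", 2, 0.20), Node("fpga0", 1, 0.60)]
plan = plan_adjustments(monitor(nodes))
apply_adjustments(nodes, plan)
print(plan)
print([(n.name, n.tasks) for n in nodes])
```

In this sketch only `cpu0` and `gpu0` cross a threshold, so only they count as "nodes to be adjusted"; a real system would run the loop periodically during training rather than once.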
Description
PCT application in the national phase; the description has been published.
Claims (18)
- PCT application in the national phase; the claims have been published.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2018/109181 (published as WO2020062277A1, zh) | 2018-09-30 | 2018-09-30 | Method and apparatus for managing computing resources in the data preprocessing stage of a neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112753016A (zh) | 2021-05-04 |
CN112753016B (zh) | 2024-09-24 |
Family
ID=69950217
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201880098036.4A (Active; granted as CN112753016B, zh) | 2018-09-30 | 2018-09-30 | Method and apparatus for managing computing resources in the data preprocessing stage of a neural network |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112753016B (zh) |
WO (1) | WO2020062277A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
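CN113391907A (zh) * | 2021-06-25 | 2021-09-14 | 中债金科信息技术有限公司 | Task placement method, apparatus, device, and medium |
CN114756372A (zh) * | 2022-04-28 | 2022-07-15 | 北京百度网讯科技有限公司 | Method, apparatus, device, and medium for load balancing |
WO2023093375A1 (zh) * | 2021-11-25 | 2023-06-01 | 北京九章云极科技有限公司 | Computing resource acquisition method and apparatus, electronic device, and storage medium |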
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040216114A1 (en) * | 2003-04-22 | 2004-10-28 | Lin Sheng Ling | Balancing loads among computing nodes |
CN103617086A (zh) * | 2013-11-20 | 2014-03-05 | 东软集团股份有限公司 | Parallel computing method and system |
CN103812895A (zh) * | 2012-11-12 | 2014-05-21 | 华为技术有限公司 | Scheduling method, management node, and cloud computing cluster |
CN104168332A (zh) * | 2014-09-01 | 2014-11-26 | 广东电网公司信息中心 | Method for load balancing and node status monitoring in high-performance computing |
CN108200156A (zh) * | 2017-12-29 | 2018-06-22 | 南京邮电大学 | Dynamic load balancing method for a distributed file system in a cloud environment |
2018
- 2018-09-30: CN application CN201880098036.4A, patent CN112753016B (zh), status: Active
- 2018-09-30: WO application PCT/CN2018/109181, publication WO2020062277A1 (zh), status: Application Filing
Also Published As
Publication number | Publication date |
---|---|
CN112753016B (zh) | 2024-09-24 |
WO2020062277A1 (zh) | 2020-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111626430B (zh) | | Data processing method and related product |
US9424079B2 (en) | | Iteration support in a heterogeneous dataflow engine |
EP3502975A1 (en) | | Methods and apparatus for model parallelism in artificial neural networks |
CN110674936A (zh) | | Neural network processing method and apparatus, computer device, and storage medium |
US20210295168A1 (en) | | Gradient compression for distributed training |
CN107679625B (zh) | | Distributed system for performing machine learning on data records and method thereof |
CN112753016A (zh) | | Method and apparatus for managing computing resources in the data preprocessing stage of a neural network |
WO2023246801A1 (zh) | | Algorithm pipeline orchestration method and apparatus, electronic device, and storage medium |
US20210158131A1 (en) | | Hierarchical partitioning of operators |
CN111191789B (zh) | | Model optimization and deployment system, chip, electronic device, and medium |
CN114118433A (zh) | | Method and apparatus for recommending device configuration parameters |
JP2021507345A (ja) | | Fusion of sparse kernels to approximate a full kernel of a convolutional neural network |
CN105700956A (zh) | | Method and system for processing distributed jobs |
WO2018175164A1 (en) | | Resource-efficient machine learning |
CN112099882B (zh) | | Service processing method, apparatus, and device |
WO2020164644A2 (zh) | | Neural network model splitting method and apparatus, computer device, and storage medium |
Li et al. | | An intelligent collaborative inference approach of service partitioning and task offloading for deep learning based service in mobile edge computing networks |
CN112099848A (zh) | | Service processing method, apparatus, and device |
US11308396B2 (en) | | Neural network layer-by-layer debugging |
US20220067495A1 (en) | | Intelligent processor, data processing method and storage medium |
CN118313458A (zh) | | Data processing method, data processor, electronic device, and storage medium |
US12014202B2 (en) | | Method and apparatus with accelerator |
US11551095B2 (en) | | Sharing preprocessing, computations, and hardware resources between multiple neural networks |
US20210319307A1 (en) | | Heterogeneous computing on a system-on-chip, including machine learning inference |
CN116402091A (zh) | | Hybrid-engine intelligent computing method and apparatus for artificial intelligence chips |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | GR01 | Patent grant | |