CN111886593A - 数据处理系统和数据处理方法 - Google Patents
数据处理系统和数据处理方法 Download PDFInfo
- Publication number
- CN111886593A CN111886593A CN201880091518.7A CN201880091518A CN111886593A CN 111886593 A CN111886593 A CN 111886593A CN 201880091518 A CN201880091518 A CN 201880091518A CN 111886593 A CN111886593 A CN 111886593A
- Authority
- CN
- China
- Prior art keywords
- data
- aggregation
- processing system
- data processing
- aggregation operation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012545 processing Methods 0.000 title claims abstract description 70
- 238000003672 processing method Methods 0.000 title abstract description 8
- 230000002776 aggregation Effects 0.000 claims abstract description 212
- 238000004220 aggregation Methods 0.000 claims abstract description 212
- 230000015654 memory Effects 0.000 claims abstract description 135
- 238000013473 artificial intelligence Methods 0.000 claims description 129
- 238000006243 chemical reaction Methods 0.000 claims description 13
- 238000012549 training Methods 0.000 abstract description 34
- 238000013528 artificial neural network Methods 0.000 abstract description 33
- 238000000034 method Methods 0.000 description 34
- 238000004364 calculation method Methods 0.000 description 23
- 230000008569 process Effects 0.000 description 16
- 238000004422 calculation algorithm Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 12
- 230000009286 beneficial effect Effects 0.000 description 5
- 238000003860 storage Methods 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 239000012141 concentrate Substances 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 238000013527 convolutional neural network Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/27—Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/20—Handling requests for interconnection or transfer for access to input/output bus
- G06F13/28—Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access DMA, cycle steal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G06F15/17306—Intercommunication techniques
- G06F15/17331—Distributed shared memory [DSM], e.g. remote direct memory access [RDMA]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Computing Systems (AREA)
- Computer Hardware Design (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Neurology (AREA)
- Advance Control (AREA)
Abstract
一种数据处理系统(600)和一种数据处理方法。该数据处理系统(600)包括第一计算节点,第一计算节点包括AI处理器(610)和聚合运算器(620),AI处理器(610)用于:执行AI运算生成第一计算节点的第一数据;聚合运算器(620)用于:对来自第二计算节点的第二数据和第一数据执行聚合运算生成聚合运算结果。由于上述AI处理器(610)和聚合运算器(620)能够并行运行,能够减少聚合运算中对第一计算节点的内存模块的读写次数,减少调度次数,避免聚合运算对AI处理器(610)的缓存的影响,使得聚合运算和AI运算能够并行进行,从而提高了深度神经网络的训练效率。
Description
PCT国内申请,说明书已公开。
Claims (12)
- PCT国内申请,权利要求书已公开。
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2018/103669 WO2020042182A1 (zh) | 2018-08-31 | 2018-08-31 | 数据处理系统和数据处理方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111886593A true CN111886593A (zh) | 2020-11-03 |
Family
ID=69643150
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880091518.7A Pending CN111886593A (zh) | 2018-08-31 | 2018-08-31 | 数据处理系统和数据处理方法 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20210166156A1 (zh) |
EP (1) | EP3819788A4 (zh) |
CN (1) | CN111886593A (zh) |
WO (1) | WO2020042182A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116257494A (zh) * | 2021-04-21 | 2023-06-13 | 华为技术有限公司 | 一种聚合通信的方法、系统和计算机设备 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11842260B2 (en) * | 2020-09-25 | 2023-12-12 | International Business Machines Corporation | Incremental and decentralized model pruning in federated machine learning |
CN113297111B (zh) * | 2021-06-11 | 2023-06-23 | 上海壁仞智能科技有限公司 | 人工智能芯片及其操作方法 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100161846A1 (en) * | 2008-12-23 | 2010-06-24 | International Business Machines Corporation | Multithreaded Programmable Direct Memory Access Engine |
CN107545005A (zh) * | 2016-06-28 | 2018-01-05 | 华为软件技术有限公司 | 一种数据处理方法及装置 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103092886B (zh) * | 2011-11-07 | 2016-03-02 | 中国移动通信集团公司 | 一种数据查询操作的实现方法、装置及系统 |
CN103559247B (zh) * | 2013-10-29 | 2018-06-05 | 北京华胜天成科技股份有限公司 | 一种数据业务处理方法及装置 |
CN105760395A (zh) * | 2014-12-18 | 2016-07-13 | 华为技术有限公司 | 一种数据处理的方法、装置及系统 |
US20180046903A1 (en) * | 2016-08-12 | 2018-02-15 | DeePhi Technology Co., Ltd. | Deep processing unit (dpu) for implementing an artificial neural network (ann) |
-
2018
- 2018-08-31 CN CN201880091518.7A patent/CN111886593A/zh active Pending
- 2018-08-31 WO PCT/CN2018/103669 patent/WO2020042182A1/zh unknown
- 2018-08-31 EP EP18931858.7A patent/EP3819788A4/en active Pending
-
2021
- 2021-02-11 US US17/173,691 patent/US20210166156A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100161846A1 (en) * | 2008-12-23 | 2010-06-24 | International Business Machines Corporation | Multithreaded Programmable Direct Memory Access Engine |
CN107545005A (zh) * | 2016-06-28 | 2018-01-05 | 华为软件技术有限公司 | 一种数据处理方法及装置 |
Non-Patent Citations (1)
Title |
---|
RAGHID MORCEL ET AL.: "FPGA-based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment", 2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD, 18 November 2016 (2016-11-18), pages 126, XP033029318, DOI: 10.1109/SmartCloud.2016.31 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116257494A (zh) * | 2021-04-21 | 2023-06-13 | 华为技术有限公司 | 一种聚合通信的方法、系统和计算机设备 |
CN116257494B (zh) * | 2021-04-21 | 2023-12-08 | 华为技术有限公司 | 一种聚合通信的方法、系统和计算机设备 |
Also Published As
Publication number | Publication date |
---|---|
WO2020042182A1 (zh) | 2020-03-05 |
US20210166156A1 (en) | 2021-06-03 |
EP3819788A4 (en) | 2021-07-14 |
EP3819788A1 (en) | 2021-05-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110689138B (zh) | 运算方法、装置及相关产品 | |
US10872290B2 (en) | Neural network processor with direct memory access and hardware acceleration circuits | |
WO2017124644A1 (zh) | 一种人工神经网络压缩编码装置和方法 | |
US20210166156A1 (en) | Data processing system and data processing method | |
CN117724763A (zh) | 用于矩阵操作加速器的指令的装置、方法和系统 | |
CN111142938B (zh) | 一种异构芯片的任务处理方法、任务处理装置及电子设备 | |
JP7256811B2 (ja) | アドバンストインタコネクト技術を利用してaiトレーニングを加速するための方法及びシステム | |
US20140143524A1 (en) | Information processing apparatus, information processing apparatus control method, and a computer-readable storage medium storing a control program for controlling an information processing apparatus | |
CN113435682A (zh) | 分布式训练的梯度压缩 | |
KR20220145848A (ko) | 집적 회로 아키텍처 내에서 최적화된 데이터흐름을 위한 지능형 버퍼 추적 시스템 및 방법 | |
TW202127326A (zh) | 用於加速神經網路計算的硬體電路 | |
WO2016024508A1 (ja) | マルチプロセッサ装置 | |
US10476492B2 (en) | Structures and operations of integrated circuits having network of configurable switches | |
US11816061B2 (en) | Dynamic allocation of arithmetic logic units for vectorized operations | |
CN116468078A (zh) | 面向人工智能芯片的智能引擎处理方法和装置 | |
CN111078286A (zh) | 数据通信方法、计算系统和存储介质 | |
Siládi et al. | Adapted parallel Quine-McCluskey algorithm using GPGPU | |
US10769527B2 (en) | Accelerating artificial neural network computations by skipping input values | |
US20210150311A1 (en) | Data layout conscious processing in memory architecture for executing neural network model | |
US20230289065A1 (en) | Data flow control device in streaming architecture chip | |
US20230051344A1 (en) | Optimization of memory use for efficient neural network execution | |
US20230043584A1 (en) | Optimization of memory use for efficient neural network execution | |
Zmejev et al. | Hash Unit as One of Computations Control Elements in Parallel Dataflow Computing System | |
WO2020073874A1 (zh) | 机器学习运算的分配系统及方法 | |
US20210117800A1 (en) | Multiple locally stored artificial neural network computations |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |