CN112231630B - 基于fpga并行加速的稀疏矩阵求解方法 - Google Patents
基于fpga并行加速的稀疏矩阵求解方法 Download PDFInfo
- Publication number
- CN112231630B CN112231630B CN202011156271.8A CN202011156271A CN112231630B CN 112231630 B CN112231630 B CN 112231630B CN 202011156271 A CN202011156271 A CN 202011156271A CN 112231630 B CN112231630 B CN 112231630B
- Authority
- CN
- China
- Prior art keywords
- calculation
- processing unit
- memory
- unit
- multiplication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000011159 matrix material Substances 0.000 title claims abstract description 75
- 238000000034 method Methods 0.000 title claims abstract description 35
- 230000001133 acceleration Effects 0.000 title claims abstract description 24
- 238000012545 processing Methods 0.000 claims abstract description 111
- 238000004364 calculation method Methods 0.000 claims abstract description 79
- 238000004891 communication Methods 0.000 claims description 18
- 239000013598 vector Substances 0.000 claims description 18
- 238000000354 decomposition reaction Methods 0.000 claims description 5
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/12—Simultaneous equations, e.g. systems of linear equations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Optimization (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Operations Research (AREA)
- Computing Systems (AREA)
- Complex Calculations (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011156271.8A CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011156271.8A CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112231630A CN112231630A (zh) | 2021-01-15 |
CN112231630B true CN112231630B (zh) | 2024-02-02 |
Family
ID=74110860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011156271.8A Active CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112231630B (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113554574A (zh) * | 2021-09-23 | 2021-10-26 | 苏州浪潮智能科技有限公司 | 一种压缩感知图像恢复方法、装置、设备及介质 |
CN115658323A (zh) * | 2022-11-15 | 2023-01-31 | 国网上海能源互联网研究院有限公司 | 基于软硬件协同的fpga潮流计算加速架构和方法 |
CN116436012B (zh) * | 2023-06-07 | 2023-08-15 | 国网上海能源互联网研究院有限公司 | 一种基于fpga的电力潮流计算系统和方法 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101533387A (zh) * | 2009-04-24 | 2009-09-16 | 西安电子科技大学 | 基于fpga的边角块稀疏矩阵并行lu分解器 |
CN107341133A (zh) * | 2017-06-24 | 2017-11-10 | 中国人民解放军信息工程大学 | 基于任意维数矩阵lu分解的可重构计算结构的调度方法 |
CN108716916A (zh) * | 2018-05-31 | 2018-10-30 | 北京航空航天大学 | 一种基于超级块的分布式并行星点质心提取方法及fpga实现装置 |
CN108718091A (zh) * | 2018-07-09 | 2018-10-30 | 国网福建省电力有限公司 | 一种应用于主动配电网的三相极坐标系线性潮流计算方法 |
CN109101464A (zh) * | 2018-07-13 | 2018-12-28 | 清华大学 | 基于矩阵修正的电力系统稀疏矩阵并行求解方法及系统 |
CN109144702A (zh) * | 2018-09-06 | 2019-01-04 | 陈彦楠 | 一种用于行列并行粗粒度可重构阵列多目标优化自动映射调度方法 |
CN110535687A (zh) * | 2019-07-30 | 2019-12-03 | 大连理工大学 | 一种基于车联网环境下轻量级区块链的协同缓存方法 |
CN111796796A (zh) * | 2020-06-12 | 2020-10-20 | 杭州云象网络技术有限公司 | 基于稀疏矩阵乘法的fpga存储方法、计算方法、模块和fpga板 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2018206078A (ja) * | 2017-06-05 | 2018-12-27 | 富士通株式会社 | 並列処理装置、並列演算方法、及び並列演算プログラム |
-
2020
- 2020-10-26 CN CN202011156271.8A patent/CN112231630B/zh active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101533387A (zh) * | 2009-04-24 | 2009-09-16 | 西安电子科技大学 | 基于fpga的边角块稀疏矩阵并行lu分解器 |
CN107341133A (zh) * | 2017-06-24 | 2017-11-10 | 中国人民解放军信息工程大学 | 基于任意维数矩阵lu分解的可重构计算结构的调度方法 |
CN108716916A (zh) * | 2018-05-31 | 2018-10-30 | 北京航空航天大学 | 一种基于超级块的分布式并行星点质心提取方法及fpga实现装置 |
CN108718091A (zh) * | 2018-07-09 | 2018-10-30 | 国网福建省电力有限公司 | 一种应用于主动配电网的三相极坐标系线性潮流计算方法 |
CN109101464A (zh) * | 2018-07-13 | 2018-12-28 | 清华大学 | 基于矩阵修正的电力系统稀疏矩阵并行求解方法及系统 |
CN109144702A (zh) * | 2018-09-06 | 2019-01-04 | 陈彦楠 | 一种用于行列并行粗粒度可重构阵列多目标优化自动映射调度方法 |
CN110535687A (zh) * | 2019-07-30 | 2019-12-03 | 大连理工大学 | 一种基于车联网环境下轻量级区块链的协同缓存方法 |
CN111796796A (zh) * | 2020-06-12 | 2020-10-20 | 杭州云象网络技术有限公司 | 基于稀疏矩阵乘法的fpga存储方法、计算方法、模块和fpga板 |
Non-Patent Citations (6)
Title |
---|
An LU decomposition based direct integral equation solver of linear complexity and higher-order accuracy for large-scale interconnect extraction;Chai Wenwen 等;《IEEE Transactions on Advanced Packaging》;第33卷(第4期);794-803 * |
GPU加速不完全Cholesky分解预条件共轭梯度法;陈尧 等;《计算机研究与发展》(第04期);843-850 * |
Parallel direct solver for solving systems of linear equations resulting from finite element method on multi-core desktops and workstations;Fialko Sergiy;《Computers & Mathematics with Applications》;第70卷(第12期);2968-2987 * |
一种基于FPGA并行加速的稀疏矩阵求解方法;吴志勇 等;《电力系统保护与控制》;第49卷(第11期);155-162 * |
基于GPU平台的KLU并行算法的研究:对角线块的LU分解;游聪伟;《中国优秀硕士学位论文全文数据库信息科技辑》(第11期);I138-322 * |
基于异构系统的多对角矩阵并行计算研究;焦江磊;《中国优秀硕士学位论文全文数据库信息科技辑》(第04期);I137-246 * |
Also Published As
Publication number | Publication date |
---|---|
CN112231630A (zh) | 2021-01-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112231630B (zh) | 基于fpga并行加速的稀疏矩阵求解方法 | |
US20210182721A1 (en) | Method and apparatus for constructing quantum machine learning framework, quantum computer and computer storage medium | |
Schinianakis et al. | An RNS implementation of an $ F_ {p} $ elliptic curve point multiplier | |
Chen et al. | NICSLU: An adaptive sparse matrix solver for parallel circuit simulation | |
Brandfass et al. | Rank reordering for MPI communication optimization | |
Lin et al. | A fast parallel algorithm for selected inversion of structured sparse matrices with application to 2D electronic structure calculations | |
Polizzi et al. | SPIKE: A parallel environment for solving banded linear systems | |
Chen et al. | An escheduler-based data dependence analysis and task scheduling for parallel circuit simulation | |
Yamazaki et al. | On techniques to improve robustness and scalability of a parallel hybrid linear solver | |
Wang et al. | WinoNN: Optimizing FPGA-based convolutional neural network accelerators using sparse Winograd algorithm | |
Hifi et al. | Reduction strategies and exact algorithms for the disjunctively constrained knapsack problem | |
Chen et al. | An adaptive LU factorization algorithm for parallel circuit simulation | |
Akbudak et al. | Simultaneous input and output matrix partitioning for outer-product--parallel sparse matrix-matrix multiplication | |
CN102156777B (zh) | 电路仿真时电路稀疏矩阵的基于消去图的并行分解方法 | |
CN110460443A (zh) | 椭圆曲线密码的高速点加运算方法和装置 | |
Toma et al. | Decomposition and parallelization of strongly coupled fluid–structure interaction linear subsystems based on the Q1/P0 discretization | |
Auckenthaler et al. | Developing algorithms and software for the parallel solution of the symmetric eigenvalue problem | |
CN103853835A (zh) | 基于gpu加速的网络社区检测方法 | |
Chow et al. | An efficient sparse conjugate gradient solver using a Beneš permutation network | |
Jamal et al. | A hybrid CPU/GPU approach for the parallel algebraic recursive multilevel solver pARMS | |
Yu et al. | Travelling wave solutions in nonlocal reaction–diffusion systems with delays and applications | |
Sommer et al. | Reduce–factor–solve for fast Thevenin impedance computation and network reduction | |
CN109101708B (zh) | 基于二级区域分解的隐式有限元并行方法 | |
Yamazaki et al. | On techniques to improve robustness and scalability of the Schur complement method | |
Udupa et al. | IKW: Inter-kernel weights for power efficient edge computing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210120 Address after: 214000 No.1 YinBai Road, Binhu District, Wuxi City, Jiangsu Province Applicant after: NATIONAL SUPERCOMPUTING CENTER IN WUXI Applicant after: Taichu (Wuxi) Electronic Technology Co.,Ltd. Applicant after: STATE GRID HUBEI ELECTRIC POWER Co.,Ltd. Address before: 214000 No.1 YinBai Road, Binhu District, Wuxi City, Jiangsu Province Applicant before: NATIONAL SUPERCOMPUTING CENTER IN WUXI Applicant before: Taichu (Wuxi) Electronic Technology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |