CN112231630A - 基于fpga并行加速的稀疏矩阵求解方法 - Google Patents
基于fpga并行加速的稀疏矩阵求解方法 Download PDFInfo
- Publication number
- CN112231630A CN112231630A CN202011156271.8A CN202011156271A CN112231630A CN 112231630 A CN112231630 A CN 112231630A CN 202011156271 A CN202011156271 A CN 202011156271A CN 112231630 A CN112231630 A CN 112231630A
- Authority
- CN
- China
- Prior art keywords
- processing unit
- calculation
- memory
- unit
- multiplication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000011159 matrix material Substances 0.000 title claims abstract description 77
- 238000000034 method Methods 0.000 title claims abstract description 36
- 230000001133 acceleration Effects 0.000 title claims abstract description 26
- 238000004364 calculation method Methods 0.000 claims abstract description 74
- 238000000354 decomposition reaction Methods 0.000 claims abstract description 7
- 238000004891 communication Methods 0.000 claims description 18
- 239000013598 vector Substances 0.000 claims description 18
- 239000002699 waste material Substances 0.000 abstract description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000000638 solvent extraction Methods 0.000 description 2
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/12—Simultaneous equations, e.g. systems of linear equations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Computational Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Computing Systems (AREA)
- Operations Research (AREA)
- Complex Calculations (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011156271.8A CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011156271.8A CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112231630A true CN112231630A (zh) | 2021-01-15 |
CN112231630B CN112231630B (zh) | 2024-02-02 |
Family
ID=74110860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011156271.8A Active CN112231630B (zh) | 2020-10-26 | 2020-10-26 | 基于fpga并行加速的稀疏矩阵求解方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112231630B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113554574A (zh) * | 2021-09-23 | 2021-10-26 | 苏州浪潮智能科技有限公司 | 一种压缩感知图像恢复方法、装置、设备及介质 |
CN115658323A (zh) * | 2022-11-15 | 2023-01-31 | 国网上海能源互联网研究院有限公司 | 基于软硬件协同的fpga潮流计算加速架构和方法 |
CN116436012A (zh) * | 2023-06-07 | 2023-07-14 | 国网上海能源互联网研究院有限公司 | 一种基于fpga的电力潮流计算系统和方法 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101533387A (zh) * | 2009-04-24 | 2009-09-16 | 西安电子科技大学 | 基于fpga的边角块稀疏矩阵并行lu分解器 |
CN107341133A (zh) * | 2017-06-24 | 2017-11-10 | 中国人民解放军信息工程大学 | 基于任意维数矩阵lu分解的可重构计算结构的调度方法 |
CN108716916A (zh) * | 2018-05-31 | 2018-10-30 | 北京航空航天大学 | 一种基于超级块的分布式并行星点质心提取方法及fpga实现装置 |
CN108718091A (zh) * | 2018-07-09 | 2018-10-30 | 国网福建省电力有限公司 | 一种应用于主动配电网的三相极坐标系线性潮流计算方法 |
US20180349321A1 (en) * | 2017-06-05 | 2018-12-06 | Fujitsu Limited | Parallel processing apparatus, parallel operation method, and parallel operation program |
CN109101464A (zh) * | 2018-07-13 | 2018-12-28 | 清华大学 | 基于矩阵修正的电力系统稀疏矩阵并行求解方法及系统 |
CN109144702A (zh) * | 2018-09-06 | 2019-01-04 | 陈彦楠 | 一种用于行列并行粗粒度可重构阵列多目标优化自动映射调度方法 |
CN110535687A (zh) * | 2019-07-30 | 2019-12-03 | 大连理工大学 | 一种基于车联网环境下轻量级区块链的协同缓存方法 |
CN111796796A (zh) * | 2020-06-12 | 2020-10-20 | 杭州云象网络技术有限公司 | 基于稀疏矩阵乘法的fpga存储方法、计算方法、模块和fpga板 |
-
2020
- 2020-10-26 CN CN202011156271.8A patent/CN112231630B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101533387A (zh) * | 2009-04-24 | 2009-09-16 | 西安电子科技大学 | 基于fpga的边角块稀疏矩阵并行lu分解器 |
US20180349321A1 (en) * | 2017-06-05 | 2018-12-06 | Fujitsu Limited | Parallel processing apparatus, parallel operation method, and parallel operation program |
CN107341133A (zh) * | 2017-06-24 | 2017-11-10 | 中国人民解放军信息工程大学 | 基于任意维数矩阵lu分解的可重构计算结构的调度方法 |
CN108716916A (zh) * | 2018-05-31 | 2018-10-30 | 北京航空航天大学 | 一种基于超级块的分布式并行星点质心提取方法及fpga实现装置 |
CN108718091A (zh) * | 2018-07-09 | 2018-10-30 | 国网福建省电力有限公司 | 一种应用于主动配电网的三相极坐标系线性潮流计算方法 |
CN109101464A (zh) * | 2018-07-13 | 2018-12-28 | 清华大学 | 基于矩阵修正的电力系统稀疏矩阵并行求解方法及系统 |
CN109144702A (zh) * | 2018-09-06 | 2019-01-04 | 陈彦楠 | 一种用于行列并行粗粒度可重构阵列多目标优化自动映射调度方法 |
CN110535687A (zh) * | 2019-07-30 | 2019-12-03 | 大连理工大学 | 一种基于车联网环境下轻量级区块链的协同缓存方法 |
CN111796796A (zh) * | 2020-06-12 | 2020-10-20 | 杭州云象网络技术有限公司 | 基于稀疏矩阵乘法的fpga存储方法、计算方法、模块和fpga板 |
Non-Patent Citations (6)
Title |
---|
CHAI WENWEN 等: "An LU decomposition based direct integral equation solver of linear complexity and higher-order accuracy for large-scale interconnect extraction", 《IEEE TRANSACTIONS ON ADVANCED PACKAGING》, vol. 33, no. 4, pages 794 - 803, XP011354002, DOI: 10.1109/TADVP.2010.2053537 * |
FIALKO SERGIY: "Parallel direct solver for solving systems of linear equations resulting from finite element method on multi-core desktops and workstations", 《COMPUTERS & MATHEMATICS WITH APPLICATIONS》, vol. 70, no. 12, pages 2968 - 2987, XP029310874, DOI: 10.1016/j.camwa.2015.10.009 * |
吴志勇 等: "一种基于FPGA并行加速的稀疏矩阵求解方法", 《电力系统保护与控制》, vol. 49, no. 11, pages 155 - 162 * |
游聪伟: "基于GPU平台的KLU并行算法的研究:对角线块的LU分解", 《中国优秀硕士学位论文全文数据库信息科技辑》, no. 11, pages 138 - 322 * |
焦江磊: "基于异构系统的多对角矩阵并行计算研究", 《中国优秀硕士学位论文全文数据库信息科技辑》, no. 04, pages 137 - 246 * |
陈尧 等: "GPU加速不完全Cholesky分解预条件共轭梯度法", 《计算机研究与发展》, no. 04, pages 843 - 850 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113554574A (zh) * | 2021-09-23 | 2021-10-26 | 苏州浪潮智能科技有限公司 | 一种压缩感知图像恢复方法、装置、设备及介质 |
CN115658323A (zh) * | 2022-11-15 | 2023-01-31 | 国网上海能源互联网研究院有限公司 | 基于软硬件协同的fpga潮流计算加速架构和方法 |
CN116436012A (zh) * | 2023-06-07 | 2023-07-14 | 国网上海能源互联网研究院有限公司 | 一种基于fpga的电力潮流计算系统和方法 |
CN116436012B (zh) * | 2023-06-07 | 2023-08-15 | 国网上海能源互联网研究院有限公司 | 一种基于fpga的电力潮流计算系统和方法 |
Also Published As
Publication number | Publication date |
---|---|
CN112231630B (zh) | 2024-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112231630A (zh) | 基于fpga并行加速的稀疏矩阵求解方法 | |
Clark et al. | Solving Lattice QCD systems of equations using mixed precision solvers on GPUs | |
Hine et al. | Linear-scaling density-functional theory with tens of thousands of atoms: Expanding the scope and scale of calculations with ONETEP | |
Dziekonski et al. | Generation of large finite‐element matrices on multiple graphics processors | |
Yamazaki et al. | On techniques to improve robustness and scalability of a parallel hybrid linear solver | |
Brandfass et al. | Rank reordering for MPI communication optimization | |
CN108170639B (zh) | 基于分布式环境的张量cp分解实现方法 | |
Chenhan et al. | A CPU–GPU hybrid approach for the unsymmetric multifrontal method | |
Koric et al. | Sparse matrix factorization in the implicit finite element method on petascale architecture | |
Chen et al. | An adaptive LU factorization algorithm for parallel circuit simulation | |
Sáez et al. | Graphical reduction of reaction networks by linear elimination of species | |
Toma et al. | Decomposition and parallelization of strongly coupled fluid–structure interaction linear subsystems based on the Q1/P0 discretization | |
CN103853835A (zh) | 基于gpu加速的网络社区检测方法 | |
Bernaschi et al. | A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units | |
US9727529B2 (en) | Calculation device and calculation method for deriving solutions of system of linear equations and program that is applied to the same | |
Chow et al. | An efficient sparse conjugate gradient solver using a Beneš permutation network | |
Sommer et al. | Reduce–factor–solve for fast Thevenin impedance computation and network reduction | |
Gulati et al. | FPGA-based hardware acceleration for Boolean satisfiability | |
Nguyen et al. | A region-oriented hardware implementation for membrane computing applications | |
Yamazaki et al. | On techniques to improve robustness and scalability of the Schur complement method | |
Zhang et al. | Mixed-precision block incomplete sparse approximate preconditioner on Tensor core | |
CN109947861A (zh) | 用于数据仓库生成目标表的方法、装置和计算机可读介质 | |
Birke et al. | Block-relaxation methods for 3D constant-coefficient stencils on GPUs and multicore CPUs | |
Kim et al. | Hybrid Parallelism of Multifrontal Linear Solution Algorithm with Out Of Core Capability for Finite Element Analysis | |
Wang et al. | An efficient architecture for floating-point eigenvalue decomposition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20210120 Address after: 214000 No.1 YinBai Road, Binhu District, Wuxi City, Jiangsu Province Applicant after: NATIONAL SUPERCOMPUTING CENTER IN WUXI Applicant after: Taichu (Wuxi) Electronic Technology Co.,Ltd. Applicant after: STATE GRID HUBEI ELECTRIC POWER Co.,Ltd. Address before: 214000 No.1 YinBai Road, Binhu District, Wuxi City, Jiangsu Province Applicant before: NATIONAL SUPERCOMPUTING CENTER IN WUXI Applicant before: Taichu (Wuxi) Electronic Technology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |