JPWO2020091848A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2020091848A5
JPWO2020091848A5 JP2021523783A JP2021523783A JPWO2020091848A5 JP WO2020091848 A5 JPWO2020091848 A5 JP WO2020091848A5 JP 2021523783 A JP2021523783 A JP 2021523783A JP 2021523783 A JP2021523783 A JP 2021523783A JP WO2020091848 A5 JPWO2020091848 A5 JP WO2020091848A5
Authority
JP
Japan
Prior art keywords
matrix
submatrix
input register
multiplication
multiplication cycle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2021523783A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022506418A5 (https=
JP7461945B2 (ja
JP2022506418A (ja
Publication date
Priority claimed from US16/176,449 external-priority patent/US11093580B2/en
Application filed filed Critical
Publication of JP2022506418A publication Critical patent/JP2022506418A/ja
Publication of JP2022506418A5 publication Critical patent/JP2022506418A5/ja
Publication of JPWO2020091848A5 publication Critical patent/JPWO2020091848A5/ja
Priority to JP2023065959A priority Critical patent/JP2023089161A/ja
Application granted granted Critical
Publication of JP7461945B2 publication Critical patent/JP7461945B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2021523783A 2018-10-31 2019-06-18 部分行列の順序付けを伴う行列乗算器 Active JP7461945B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2023065959A JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/176,449 2018-10-31
US16/176,449 US11093580B2 (en) 2018-10-31 2018-10-31 Matrix multiplier with submatrix sequencing
PCT/US2019/037656 WO2020091848A1 (en) 2018-10-31 2019-06-18 Matrix multiplier with submatrix sequencing

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2023065959A Division JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Publications (4)

Publication Number Publication Date
JP2022506418A JP2022506418A (ja) 2022-01-17
JP2022506418A5 JP2022506418A5 (https=) 2022-06-22
JPWO2020091848A5 true JPWO2020091848A5 (https=) 2022-06-22
JP7461945B2 JP7461945B2 (ja) 2024-04-04

Family

ID=70327188

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2021523783A Active JP7461945B2 (ja) 2018-10-31 2019-06-18 部分行列の順序付けを伴う行列乗算器
JP2023065959A Withdrawn JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2023065959A Withdrawn JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Country Status (6)

Country Link
US (1) US11093580B2 (https=)
EP (1) EP3891626A4 (https=)
JP (2) JP7461945B2 (https=)
KR (1) KR102586989B1 (https=)
CN (1) CN113168430A (https=)
WO (1) WO2020091848A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871236B (zh) * 2017-12-01 2025-05-06 超威半导体公司 具有低功率并行矩阵乘法流水线的流处理器
US20210303987A1 (en) * 2020-03-26 2021-09-30 Advanced Micro Devices, Inc. Power reduction for machine learning accelerator background
US11720328B2 (en) 2020-06-26 2023-08-08 Advanced Micro Devices, Inc. Processing unit with small footprint arithmetic logic unit
CN112429475B (zh) * 2020-09-29 2023-06-30 贵州大学 一种胶囊排序送料装置
CN112433760B (zh) * 2020-11-27 2022-09-23 海光信息技术股份有限公司 数据排序方法和数据排序电路
CN112632464B (zh) * 2020-12-28 2022-11-29 上海壁仞智能科技有限公司 用于处理数据的处理装置
US11556337B2 (en) 2021-04-12 2023-01-17 Analog Devices International Unlimited Company Parallel matrix multiplication technique optimized for memory fetches
CN117407640A (zh) * 2022-07-15 2024-01-16 华为技术有限公司 一种矩阵计算方法及装置
KR102640249B1 (ko) * 2023-06-12 2024-02-27 주식회사 하이퍼엑셀 대규모 언어 모델을 위해 멀티-디바이스에 기반한 추론을 수행하는 방법 및 시스템
CN119883379B (zh) * 2024-12-24 2025-11-18 深圳市鸿合创新信息技术有限责任公司 数据排序方法、装置、电子设备和存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CH594477A5 (https=) * 1976-08-20 1978-01-13 Agie Ag Ind Elektronik
JPH05324700A (ja) * 1992-05-19 1993-12-07 N T T Data Tsushin Kk 行列乗算装置
JP3935678B2 (ja) * 2001-01-31 2007-06-27 富士通株式会社 Simd積和演算方法、積和演算回路、および、半導体集積回路装置
US6901422B1 (en) * 2001-03-21 2005-05-31 Apple Computer, Inc. Matrix multiplication in a vector processing system
US20040122887A1 (en) * 2002-12-20 2004-06-24 Macy William W. Efficient multiplication of small matrices using SIMD registers
US20050240646A1 (en) * 2004-04-23 2005-10-27 The Research Foundation Of State University Of New York Reconfigurable matrix multiplier architecture and extended borrow parallel counter and small-multiplier circuits
US8051124B2 (en) * 2007-07-19 2011-11-01 Itt Manufacturing Enterprises, Inc. High speed and efficient matrix multiplication hardware module
US9354944B2 (en) * 2009-07-27 2016-05-31 Advanced Micro Devices, Inc. Mapping processing logic having data-parallel threads across processors
US8577951B1 (en) * 2010-08-19 2013-11-05 Altera Corporation Matrix operations in an integrated circuit device
US8862653B2 (en) * 2011-04-26 2014-10-14 University Of South Carolina System and method for sparse matrix vector multiplication processing
US9886418B2 (en) * 2015-04-28 2018-02-06 Intel Corporation Matrix operands for linear algebra operations
CN108491359B (zh) 2016-04-22 2019-12-24 北京中科寒武纪科技有限公司 子矩阵运算装置及方法
US10032247B2 (en) 2016-06-22 2018-07-24 Palo Alto Research Center Incorporated System and method for speeding up general matrix-vector multiplication on GPU
US10067910B2 (en) 2016-07-01 2018-09-04 Palo Alto Research Center Incorporated System and method for GPU maximum register count optimization applied to general matrix-matrix multiplication
US10929944B2 (en) * 2016-11-23 2021-02-23 Advanced Micro Devices, Inc. Low power and low latency GPU coprocessor for persistent computing
US10817587B2 (en) 2017-02-28 2020-10-27 Texas Instruments Incorporated Reconfigurable matrix multiplier system and method
JP6912703B2 (ja) * 2017-02-24 2021-08-04 富士通株式会社 演算方法、演算装置、演算プログラム及び演算システム
US10521225B2 (en) * 2017-06-29 2019-12-31 Oracle International Corporation Matrix multiplication at memory bandwidth
CN107622037A (zh) 2017-09-27 2018-01-23 郑州云海信息技术有限公司 一种提高图形处理单元的矩阵乘计算性能的方法和装置

Similar Documents

Publication Publication Date Title
JP2022506418A5 (https=)
WO2004061705A3 (en) Efficient multiplication of small matrices using simd registers
JP5408913B2 (ja) 高速かつ効率的な行列乗算ハードウェアモジュール
JPWO2020091848A5 (https=)
CN109144469B (zh) 流水线结构神经网络矩阵运算架构及方法
JP7793585B2 (ja) ハードウェアにおけるスパース行列乗算
EP4290371A3 (en) Systems and methods for performing instructions to transform matrices into row-interleaved format
WO2018134740A3 (en) Sparse matrix multiplication in associative memory device
JP7461945B2 (ja) 部分行列の順序付けを伴う行列乗算器
CN102053948A (zh) 在单指令多数据多核处理器架构上转置矩阵的方法和系统
GB2582094A (en) Matrix computation engine
JP2024028901A5 (https=)
GB2601701A (en) Performing dot product operations using a memristive crossbar array
JP7700142B2 (ja) 機械学習アクセラレータの電力削減
CN110673824B (zh) 矩阵向量乘电路以及循环神经网络硬件加速器
JP2005122141A5 (https=)
JP7506086B2 (ja) データ処理
WO2008037975A3 (en) Matrix multiplication
JP2000148730A5 (https=)
CN111723906A (zh) 一种循环神经网络的加速计算方法、系统及相关装置
CN1198206C (zh) 时分型矩阵计算器
JP2008506191A5 (https=)
WO2018228703A1 (en) Multiply accumulator array and processor device
Faugàre et al. Fast change of ordering with exponent ω
US20070260660A1 (en) Efficient mapping of FFT to a reconfigurable parallel and pipeline data flow machine