JP2022506418A5 - - Google Patents

Info

Publication number
JP2022506418A5
JP2022506418A5 JP2021523783A JP2021523783A JP2022506418A5 JP 2022506418 A5 JP2022506418 A5 JP 2022506418A5 JP 2021523783 A JP2021523783 A JP 2021523783A JP 2021523783 A JP2021523783 A JP 2021523783A JP 2022506418 A5 JP2022506418 A5 JP 2022506418A5
Authority
JP
Japan
Prior art keywords
matrix
submatrix
input register
multiplication
stored
Prior art date
Application number
JP2021523783A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2020091848A5 (https=
JP7461945B2 (ja
JP2022506418A (ja
Filing date
Publication date
Priority claimed from US16/176,449 external-priority patent/US11093580B2/en
Application filed filed Critical
Publication of JP2022506418A publication Critical patent/JP2022506418A/ja
Publication of JP2022506418A5 publication Critical patent/JP2022506418A5/ja
Publication of JPWO2020091848A5 publication Critical patent/JPWO2020091848A5/ja
Priority to JP2023065959A priority Critical patent/JP2023089161A/ja
Application granted granted Critical
Publication of JP7461945B2 publication Critical patent/JP7461945B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2021523783A 2018-10-31 2019-06-18 部分行列の順序付けを伴う行列乗算器 Active JP7461945B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2023065959A JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/176,449 2018-10-31
US16/176,449 US11093580B2 (en) 2018-10-31 2018-10-31 Matrix multiplier with submatrix sequencing
PCT/US2019/037656 WO2020091848A1 (en) 2018-10-31 2019-06-18 Matrix multiplier with submatrix sequencing

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2023065959A Division JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Publications (4)

Publication Number Publication Date
JP2022506418A JP2022506418A (ja) 2022-01-17
JP2022506418A5 true JP2022506418A5 (https=) 2022-06-22
JPWO2020091848A5 JPWO2020091848A5 (https=) 2022-06-22
JP7461945B2 JP7461945B2 (ja) 2024-04-04

Family

ID=70327188

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2021523783A Active JP7461945B2 (ja) 2018-10-31 2019-06-18 部分行列の順序付けを伴う行列乗算器
JP2023065959A Withdrawn JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2023065959A Withdrawn JP2023089161A (ja) 2018-10-31 2023-04-13 部分行列の順序付けを伴う行列乗算器

Country Status (6)

Country Link
US (1) US11093580B2 (https=)
EP (1) EP3891626A4 (https=)
JP (2) JP7461945B2 (https=)
KR (1) KR102586989B1 (https=)
CN (1) CN113168430A (https=)
WO (1) WO2020091848A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871236B (zh) * 2017-12-01 2025-05-06 超威半导体公司 具有低功率并行矩阵乘法流水线的流处理器
US20210303987A1 (en) * 2020-03-26 2021-09-30 Advanced Micro Devices, Inc. Power reduction for machine learning accelerator background
US11720328B2 (en) 2020-06-26 2023-08-08 Advanced Micro Devices, Inc. Processing unit with small footprint arithmetic logic unit
CN112429475B (zh) * 2020-09-29 2023-06-30 贵州大学 一种胶囊排序送料装置
CN112433760B (zh) * 2020-11-27 2022-09-23 海光信息技术股份有限公司 数据排序方法和数据排序电路
CN112632464B (zh) * 2020-12-28 2022-11-29 上海壁仞智能科技有限公司 用于处理数据的处理装置
US11556337B2 (en) 2021-04-12 2023-01-17 Analog Devices International Unlimited Company Parallel matrix multiplication technique optimized for memory fetches
CN117407640A (zh) * 2022-07-15 2024-01-16 华为技术有限公司 一种矩阵计算方法及装置
KR102640249B1 (ko) * 2023-06-12 2024-02-27 주식회사 하이퍼엑셀 대규모 언어 모델을 위해 멀티-디바이스에 기반한 추론을 수행하는 방법 및 시스템
CN119883379B (zh) * 2024-12-24 2025-11-18 深圳市鸿合创新信息技术有限责任公司 数据排序方法、装置、电子设备和存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CH594477A5 (https=) * 1976-08-20 1978-01-13 Agie Ag Ind Elektronik
JPH05324700A (ja) * 1992-05-19 1993-12-07 N T T Data Tsushin Kk 行列乗算装置
JP3935678B2 (ja) * 2001-01-31 2007-06-27 富士通株式会社 Simd積和演算方法、積和演算回路、および、半導体集積回路装置
US6901422B1 (en) * 2001-03-21 2005-05-31 Apple Computer, Inc. Matrix multiplication in a vector processing system
US20040122887A1 (en) * 2002-12-20 2004-06-24 Macy William W. Efficient multiplication of small matrices using SIMD registers
US20050240646A1 (en) * 2004-04-23 2005-10-27 The Research Foundation Of State University Of New York Reconfigurable matrix multiplier architecture and extended borrow parallel counter and small-multiplier circuits
US8051124B2 (en) * 2007-07-19 2011-11-01 Itt Manufacturing Enterprises, Inc. High speed and efficient matrix multiplication hardware module
US9354944B2 (en) * 2009-07-27 2016-05-31 Advanced Micro Devices, Inc. Mapping processing logic having data-parallel threads across processors
US8577951B1 (en) * 2010-08-19 2013-11-05 Altera Corporation Matrix operations in an integrated circuit device
US8862653B2 (en) * 2011-04-26 2014-10-14 University Of South Carolina System and method for sparse matrix vector multiplication processing
US9886418B2 (en) * 2015-04-28 2018-02-06 Intel Corporation Matrix operands for linear algebra operations
CN108491359B (zh) 2016-04-22 2019-12-24 北京中科寒武纪科技有限公司 子矩阵运算装置及方法
US10032247B2 (en) 2016-06-22 2018-07-24 Palo Alto Research Center Incorporated System and method for speeding up general matrix-vector multiplication on GPU
US10067910B2 (en) 2016-07-01 2018-09-04 Palo Alto Research Center Incorporated System and method for GPU maximum register count optimization applied to general matrix-matrix multiplication
US10929944B2 (en) * 2016-11-23 2021-02-23 Advanced Micro Devices, Inc. Low power and low latency GPU coprocessor for persistent computing
US10817587B2 (en) 2017-02-28 2020-10-27 Texas Instruments Incorporated Reconfigurable matrix multiplier system and method
JP6912703B2 (ja) * 2017-02-24 2021-08-04 富士通株式会社 演算方法、演算装置、演算プログラム及び演算システム
US10521225B2 (en) * 2017-06-29 2019-12-31 Oracle International Corporation Matrix multiplication at memory bandwidth
CN107622037A (zh) 2017-09-27 2018-01-23 郑州云海信息技术有限公司 一种提高图形处理单元的矩阵乘计算性能的方法和装置

Similar Documents

Publication Publication Date Title
JP2022506418A5 (https=)
JP5408913B2 (ja) 高速かつ効率的な行列乗算ハードウェアモジュール
JP6900487B2 (ja) ハードウェアにおける平均プーリングの実行
JP6987860B2 (ja) ハードウェアにおけるカーネルストライドの実行
WO2004061705A3 (en) Efficient multiplication of small matrices using simd registers
JP2019537139A5 (https=)
JP7461945B2 (ja) 部分行列の順序付けを伴う行列乗算器
WO2018134740A3 (en) Sparse matrix multiplication in associative memory device
FI3779681T3 (fi) Kiihdytin harvan-tiheän matriisin kertomista varten
CN109144469B (zh) 流水线结构神经网络矩阵运算架构及方法
JPWO2021113044A5 (https=)
JP2011222024A5 (https=)
US11573765B2 (en) Fused convolution and batch normalization for neural networks
JP7377869B2 (ja) グラフィックスプロセッシングユニットでのパイプライン化された行列乗算
JP2010539593A5 (https=)
JP2022181161A (ja) ハードウェアにおけるスパース行列乗算
JP2012528391A5 (https=)
EP3655851B1 (en) Register-based complex number processing
WO1999038088A1 (en) Method and apparatus for arithmetic operation
JP2005122141A5 (https=)
JPWO2020091848A5 (https=)
WO2008037975A3 (en) Matrix multiplication
JP2001067206A (ja) モジュラー乗算を実行するためのシステム並びに方法
CN113032009A (zh) 使用处理器指令的硬件加速矩阵操纵运算
JP2008506191A5 (https=)