KR20250002449A - 처리 요소들의 어레이를 포함하는 컨볼루션 엔진을 이용하여 수행되는 매트릭스 곱셈 - Google Patents
처리 요소들의 어레이를 포함하는 컨볼루션 엔진을 이용하여 수행되는 매트릭스 곱셈 Download PDFInfo
- Publication number
- KR20250002449A KR20250002449A KR1020247037544A KR20247037544A KR20250002449A KR 20250002449 A KR20250002449 A KR 20250002449A KR 1020247037544 A KR1020247037544 A KR 1020247037544A KR 20247037544 A KR20247037544 A KR 20247037544A KR 20250002449 A KR20250002449 A KR 20250002449A
- Authority
- KR
- South Korea
- Prior art keywords
- matrix
- processor
- processing elements
- processing
- paragraph
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/15—Correlation function computation including computation of convolution operations
- G06F17/153—Multidimensional correlation or convolution
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/544—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices for evaluating functions by calculation
- G06F7/5443—Sum of products
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Algebra (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263336586P | 2022-04-29 | 2022-04-29 | |
| US63/336,586 | 2022-04-29 | ||
| PCT/US2023/020213 WO2023212203A1 (en) | 2022-04-29 | 2023-04-27 | Matrix multiplication performed using convolution engine which includes array of processing elements |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20250002449A true KR20250002449A (ko) | 2025-01-07 |
Family
ID=86469086
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020247037544A Pending KR20250002449A (ko) | 2022-04-29 | 2023-04-27 | 처리 요소들의 어레이를 포함하는 컨볼루션 엔진을 이용하여 수행되는 매트릭스 곱셈 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20250284767A1 (https=) |
| EP (1) | EP4515426A1 (https=) |
| JP (1) | JP2025514088A (https=) |
| KR (1) | KR20250002449A (https=) |
| CN (1) | CN119278445A (https=) |
| WO (1) | WO2023212203A1 (https=) |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11157287B2 (en) | 2017-07-24 | 2021-10-26 | Tesla, Inc. | Computational array microprocessor system with variable latency memory access |
| US11157441B2 (en) | 2017-07-24 | 2021-10-26 | Tesla, Inc. | Computational array microprocessor system using non-consecutive data formatting |
| US11409692B2 (en) | 2017-07-24 | 2022-08-09 | Tesla, Inc. | Vector computational unit |
| US11256977B2 (en) * | 2017-12-29 | 2022-02-22 | Facebook, Inc. | Lowering hardware for neural networks |
| EP3674982A1 (en) * | 2018-12-27 | 2020-07-01 | IMEC vzw | Hardware accelerator architecture for convolutional neural network |
-
2023
- 2023-04-27 EP EP23725527.8A patent/EP4515426A1/en active Pending
- 2023-04-27 CN CN202380043098.6A patent/CN119278445A/zh active Pending
- 2023-04-27 JP JP2024562065A patent/JP2025514088A/ja active Pending
- 2023-04-27 US US18/859,039 patent/US20250284767A1/en active Pending
- 2023-04-27 KR KR1020247037544A patent/KR20250002449A/ko active Pending
- 2023-04-27 WO PCT/US2023/020213 patent/WO2023212203A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| US20250284767A1 (en) | 2025-09-11 |
| WO2023212203A1 (en) | 2023-11-02 |
| CN119278445A (zh) | 2025-01-07 |
| JP2025514088A (ja) | 2025-05-02 |
| EP4515426A1 (en) | 2025-03-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11698773B2 (en) | Accelerated mathematical engine | |
| US12174910B2 (en) | Methods and systems for implementing a convolution transpose layer of a neural network | |
| TWI832006B (zh) | 實行卷積運算的系統及方法 | |
| JP7271820B2 (ja) | 行列乗算アクセラレータ(mma)を用いる基本計算原始関数の実装 | |
| KR102861760B1 (ko) | 뉴럴 네트워크의 컨볼루션 연산을 처리하는 방법 및 장치 | |
| EP3093757B1 (en) | Multi-dimensional sliding window operation for a vector processor | |
| US11899741B2 (en) | Memory device and method | |
| KR20200095300A (ko) | 뉴럴 네트워크의 컨볼루션 연산을 처리하는 방법 및 장치 | |
| EP3690757B1 (en) | Method and apparatus with convolution neural network processing | |
| KR20240068634A (ko) | 깊이별 콘볼루션들을 위한 메모리 병목 현상들의 제거 | |
| KR20250002449A (ko) | 처리 요소들의 어레이를 포함하는 컨볼루션 엔진을 이용하여 수행되는 매트릭스 곱셈 | |
| US12579413B2 (en) | Method and apparatus for performing convolution neural network operations | |
| US20250209132A1 (en) | Efficient multiply-accumulate units for convolutional neural network processing including max pooling | |
| GB2598918A (en) | Downscaler and method of downscaling | |
| US20240160692A1 (en) | Implementing a scatter function on a neural network accelerator | |
| US20250231742A1 (en) | Transposing information using shadow latches and active latches for efficient die area in processing system | |
| KR20250008751A (ko) | 뉴럴 프로세싱 유니트들을 위한 단일 명령어 복수 데이터 연산들의 효율적인 선택 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| E13 | Pre-grant limitation requested |
Free format text: ST27 STATUS EVENT CODE: A-2-3-E10-E13-LIM-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| E13-X000 | Pre-grant limitation requested |
St.27 status event code: A-2-3-E10-E13-lim-X000 |
|
| P11 | Amendment of application requested |
Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P11-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |