DE112019005586T5 - Skalarproduktrechner und verfahren zu deren betrieb - Google Patents
Skalarproduktrechner und verfahren zu deren betrieb Download PDFInfo
- Publication number
- DE112019005586T5 DE112019005586T5 DE112019005586.0T DE112019005586T DE112019005586T5 DE 112019005586 T5 DE112019005586 T5 DE 112019005586T5 DE 112019005586 T DE112019005586 T DE 112019005586T DE 112019005586 T5 DE112019005586 T5 DE 112019005586T5
- Authority
- DE
- Germany
- Prior art keywords
- vector
- value
- product
- mask
- binary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/544—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices for evaluating functions by calculation
- G06F7/5443—Sum of products
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/52—Multiplying; Dividing
- G06F7/523—Multiplying only
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/3001—Arithmetic instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30018—Bit or string instructions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30003—Arrangements for executing specific machine instructions
- G06F9/30007—Arrangements for executing specific machine instructions to perform operations on data operands
- G06F9/30036—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations
- G06F9/30038—Instructions to perform operations on packed data, e.g. vector, tile or matrix operations using a mask
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0495—Quantised networks; Sparse networks; Compressed networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2207/00—Indexing scheme relating to methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F2207/38—Indexing scheme relating to groups G06F7/38 - G06F7/575
- G06F2207/48—Indexing scheme relating to groups G06F7/48 - G06F7/575
- G06F2207/4802—Special implementations
- G06F2207/4818—Threshold devices
- G06F2207/4824—Neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Computational Mathematics (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Neurology (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/184,985 | 2018-11-08 | ||
| US16/184,985 US10768895B2 (en) | 2018-11-08 | 2018-11-08 | Dot product calculators and methods of operating the same |
| PCT/EP2019/080136 WO2020094586A1 (en) | 2018-11-08 | 2019-11-04 | Dot product calculators and methods of operating the same |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| DE112019005586T5 true DE112019005586T5 (de) | 2021-12-16 |
Family
ID=68461801
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| DE112019005586.0T Withdrawn DE112019005586T5 (de) | 2018-11-08 | 2019-11-04 | Skalarproduktrechner und verfahren zu deren betrieb |
Country Status (7)
| Country | Link |
|---|---|
| US (3) | US10768895B2 (https=) |
| EP (1) | EP3877839A1 (https=) |
| JP (1) | JP2022506345A (https=) |
| KR (1) | KR20210092751A (https=) |
| CN (1) | CN113330421B (https=) |
| DE (1) | DE112019005586T5 (https=) |
| WO (1) | WO2020094586A1 (https=) |
Families Citing this family (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11169809B2 (en) * | 2017-03-31 | 2021-11-09 | Intel Corporation | Method and apparatus for converting scatter control elements to gather control elements used to sort vector data elements |
| US10409614B2 (en) | 2017-04-24 | 2019-09-10 | Intel Corporation | Instructions having support for floating point and integer data types in the same register |
| US10474458B2 (en) | 2017-04-28 | 2019-11-12 | Intel Corporation | Instructions and logic to perform floating-point and integer operations for machine learning |
| CN110770722B (zh) * | 2017-06-29 | 2023-08-18 | 北京清影机器视觉技术有限公司 | 二维数据匹配方法、装置和逻辑电路 |
| US10768895B2 (en) | 2018-11-08 | 2020-09-08 | Movidius Limited | Dot product calculators and methods of operating the same |
| US12182035B2 (en) | 2019-03-15 | 2024-12-31 | Intel Corporation | Systems and methods for cache optimization |
| US11934342B2 (en) | 2019-03-15 | 2024-03-19 | Intel Corporation | Assistance for hardware prefetch in cache access |
| WO2020190807A1 (en) * | 2019-03-15 | 2020-09-24 | Intel Corporation | Systolic disaggregation within a matrix accelerator architecture |
| PL3938914T3 (pl) | 2019-03-15 | 2025-03-31 | Intel Corporation | Dynamiczna rekonfiguracja pamięci |
| US12131251B2 (en) * | 2019-03-19 | 2024-10-29 | Cirrus Logic Inc. | Neurons for artificial neural networks |
| WO2020218157A1 (ja) * | 2019-04-25 | 2020-10-29 | 国立大学法人静岡大学 | 予測システム、予測方法、および予測プログラム |
| US11741349B2 (en) * | 2019-10-31 | 2023-08-29 | Arm Limited | Performing matrix-vector multiply operations for neural networks on electronic devices |
| US11861761B2 (en) | 2019-11-15 | 2024-01-02 | Intel Corporation | Graphics processing unit processing and caching improvements |
| US11663746B2 (en) | 2019-11-15 | 2023-05-30 | Intel Corporation | Systolic arithmetic on sparse data |
| US11500680B2 (en) * | 2020-04-24 | 2022-11-15 | Alibaba Group Holding Limited | Systolic array-friendly data placement and control based on masked write |
| US20220405571A1 (en) * | 2021-06-16 | 2022-12-22 | Microsoft Technology Licensing, Llc | Sparsifying narrow data formats for neural networks |
| US12321857B2 (en) | 2021-06-24 | 2025-06-03 | Intel Corporation | Methods and apparatus to perform machine-learning model operations on sparse accelerators |
| US20230035474A1 (en) * | 2021-07-20 | 2023-02-02 | Gsi Technology Inc. | Compiler for a parallel processor |
| US20230083270A1 (en) * | 2021-09-14 | 2023-03-16 | International Business Machines Corporation | Mixed signal circuitry for bitwise multiplication with different accuracies |
| US11789646B2 (en) | 2021-09-24 | 2023-10-17 | Intel Corporation | Methods, apparatus, and articles of manufacture to increase data reuse for multiply and accumulate (MAC) operations |
| US11669489B2 (en) * | 2021-09-30 | 2023-06-06 | International Business Machines Corporation | Sparse systolic array design |
Family Cites Families (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0731669B2 (ja) * | 1986-04-04 | 1995-04-10 | 株式会社日立製作所 | ベクトル・プロセツサ |
| US20080071851A1 (en) * | 2006-09-20 | 2008-03-20 | Ronen Zohar | Instruction and logic for performing a dot-product operation |
| GB2457303A (en) * | 2008-02-11 | 2009-08-12 | Linear Algebra Technologies | Randomly accessing elements of compressed matrix data by calculating offsets from non-zero values of a bitmap |
| US20160358069A1 (en) * | 2015-06-03 | 2016-12-08 | Samsung Electronics Co., Ltd. | Neural network suppression |
| US10223359B2 (en) * | 2016-10-10 | 2019-03-05 | The Directv Group, Inc. | Determining recommended media programming from sparse consumption data |
| US10360163B2 (en) * | 2016-10-27 | 2019-07-23 | Google Llc | Exploiting input data sparsity in neural network compute units |
| US10146738B2 (en) * | 2016-12-31 | 2018-12-04 | Intel Corporation | Hardware accelerator architecture for processing very-sparse and hyper-sparse matrix data |
| US20180189675A1 (en) * | 2016-12-31 | 2018-07-05 | Intel Corporation | Hardware accelerator architecture and template for web-scale k-means clustering |
| US10096134B2 (en) * | 2017-02-01 | 2018-10-09 | Nvidia Corporation | Data compaction and memory bandwidth reduction for sparse neural networks |
| US10171084B2 (en) * | 2017-04-24 | 2019-01-01 | The Regents Of The University Of Michigan | Sparse coding with Memristor networks |
| US20180330235A1 (en) * | 2017-05-15 | 2018-11-15 | National Taiwan University | Apparatus and Method of Using Dual Indexing in Input Neurons and Corresponding Weights of Sparse Neural Network |
| TWI684141B (zh) | 2017-10-12 | 2020-02-01 | 英屬開曼群島商意騰科技股份有限公司 | 人工神經元中以非零封包加速乘法運算的裝置及方法 |
| KR102697300B1 (ko) * | 2018-03-07 | 2024-08-23 | 삼성전자주식회사 | 전자 장치 및 머신 러닝 수행 방법 |
| US10768895B2 (en) | 2018-11-08 | 2020-09-08 | Movidius Limited | Dot product calculators and methods of operating the same |
-
2018
- 2018-11-08 US US16/184,985 patent/US10768895B2/en not_active Expired - Fee Related
-
2019
- 2019-11-04 KR KR1020217015961A patent/KR20210092751A/ko not_active Withdrawn
- 2019-11-04 JP JP2021523664A patent/JP2022506345A/ja not_active Ceased
- 2019-11-04 CN CN201980088183.8A patent/CN113330421B/zh active Active
- 2019-11-04 DE DE112019005586.0T patent/DE112019005586T5/de not_active Withdrawn
- 2019-11-04 WO PCT/EP2019/080136 patent/WO2020094586A1/en not_active Ceased
- 2019-11-04 EP EP19798070.9A patent/EP3877839A1/en not_active Withdrawn
-
2020
- 2020-08-24 US US17/001,455 patent/US11023206B2/en not_active Expired - Fee Related
-
2021
- 2021-04-28 US US17/243,282 patent/US11656845B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| JP2022506345A (ja) | 2022-01-17 |
| US20200387350A1 (en) | 2020-12-10 |
| US11023206B2 (en) | 2021-06-01 |
| US20210247961A1 (en) | 2021-08-12 |
| US11656845B2 (en) | 2023-05-23 |
| CN113330421A (zh) | 2021-08-31 |
| EP3877839A1 (en) | 2021-09-15 |
| US10768895B2 (en) | 2020-09-08 |
| CN113330421B (zh) | 2024-01-16 |
| WO2020094586A1 (en) | 2020-05-14 |
| US20200150926A1 (en) | 2020-05-14 |
| KR20210092751A (ko) | 2021-07-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE112019005586T5 (de) | Skalarproduktrechner und verfahren zu deren betrieb | |
| DE112020004625T5 (de) | Transponierte faltung mit systolischem array | |
| DE112005001277B4 (de) | Verfahren und Vorrichtung zum Vektorisieren mehrerer Eingabebefehle | |
| DE112020003128T5 (de) | Dilatierte faltung mit systolischem array | |
| DE102025103251A1 (de) | Dreidimensionale gaussian-splatting-initialisierung basierend auf trainierten neuronalen strahlungsfeld-repräsentationen | |
| DE60008397T2 (de) | Benutzer emulation für datenaustausch beim rechnergestützten entwurf | |
| DE112020003066T5 (de) | Transponierungsoperationen mit verarbeitungselementarray | |
| DE102020201154A1 (de) | Verfahren und vorrichtung zum speichern von und zugreifen auf mehrdimensionale daten | |
| DE102019133028A1 (de) | Für neuronale netzwerke geeignetes effizientes matrixformat | |
| DE112020005789T5 (de) | Hierarchische partitionierung von operatoren | |
| DE102020118004A1 (de) | Verfahren und vorrichtungen zum kachelförmigen durchlaufen eines tensors für faltungsoperationen | |
| DE102023105572A1 (de) | Effiziente Matrixmultiplikation und -addition mit einer Gruppe von Warps | |
| DE102018132781A1 (de) | Heterogenes Rechensystem, welches konfiguriert ist, um eine Cachekohärenz adaptiv zu steuern | |
| DE102018126670A1 (de) | Fortschreitende Modifizierung von generativen adversativen neuronalen Netzen | |
| DE102019102009A1 (de) | Reduzierung des rauschens während des renderings durch parallele path-space-filterung unter verwendung von hashing | |
| EP2765528B1 (de) | Wahlfreier Zugriff auf Signalwerte eines FPGA zur Laufzeit | |
| DE102017121257A1 (de) | Ausführen von Mittelwert-Poolbildung in Hardware | |
| DE202016107443U1 (de) | Berechnen von Faltungen mithilfe eines neuronalen Netzwerkprozessors | |
| DE102021107510A1 (de) | Training eines neuronalen netzwerks unter speicherbeschränkung | |
| DE102020112826A1 (de) | Verfahren zur effizienten durchführung von datenreduktionen in parallelverarbeitungseinheiten | |
| DE112020007010T5 (de) | Inferenzmaschinen-schaltungsarchitektur | |
| DE112020003105T5 (de) | Schicht für schicht-debugging von neuronalen netzwerken | |
| DE112019002622T5 (de) | Verfahren, systeme, artikel zur herstellung und vorrichtungen zurrekonstruktion von szenen unter verwendung von convolutional neural networks | |
| DE102013218594A1 (de) | System, Verfahren und Computerprogrammprodukt zur parallelen Rekonstruktion eines gesampelten Suffixarrays | |
| DE102020119519A1 (de) | Verfahren und einrichtungen zum ermöglichen einer "out-of-order"-pipeline-ausführung der statischen abbildung einer arbeitslast |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| R119 | Application deemed withdrawn, or ip right lapsed, due to non-payment of renewal fee |