JP2024508596A - 階層的な共有指数浮動小数点データタイプ - Google Patents
階層的な共有指数浮動小数点データタイプ Download PDFInfo
- Publication number
- JP2024508596A JP2024508596A JP2023541370A JP2023541370A JP2024508596A JP 2024508596 A JP2024508596 A JP 2024508596A JP 2023541370 A JP2023541370 A JP 2023541370A JP 2023541370 A JP2023541370 A JP 2023541370A JP 2024508596 A JP2024508596 A JP 2024508596A
- Authority
- JP
- Japan
- Prior art keywords
- value
- shared
- floating point
- values
- exponent
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/483—Computations with numbers represented by a non-linear combination of denominational numbers, e.g. rational numbers, logarithmic number system or floating-point numbers
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/14—Conversion to or from non-weighted codes
- H03M7/24—Conversion to or from floating-point codes
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Nonlinear Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Neurology (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Complex Calculations (AREA)
- Electromagnetism (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163148086P | 2021-02-10 | 2021-02-10 | |
| US63/148,086 | 2021-02-10 | ||
| US17/361,263 US11886833B2 (en) | 2021-02-10 | 2021-06-28 | Hierarchical and shared exponent floating point data types |
| US17/361,263 | 2021-06-28 | ||
| PCT/US2022/013086 WO2022173572A1 (en) | 2021-02-10 | 2022-01-20 | Hierarchical and shared exponent floating point data types |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2024508596A true JP2024508596A (ja) | 2024-02-28 |
| JP2024508596A5 JP2024508596A5 (https=) | 2024-12-27 |
Family
ID=82704967
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023541370A Pending JP2024508596A (ja) | 2021-02-10 | 2022-01-20 | 階層的な共有指数浮動小数点データタイプ |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US11886833B2 (https=) |
| EP (1) | EP4291979A1 (https=) |
| JP (1) | JP2024508596A (https=) |
| KR (1) | KR20230137356A (https=) |
| CN (1) | CN116830077A (https=) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20240402993A1 (en) * | 2023-05-30 | 2024-12-05 | Microsoft Technology Licensing, Llc | Determining shared exponent values for shared exponent floating point data types |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190347072A1 (en) * | 2018-05-08 | 2019-11-14 | Microsoft Technology Licensing, Llc | Block floating point computations using shared exponents |
| JP2019212295A (ja) * | 2018-06-08 | 2019-12-12 | インテル・コーポレーション | フレキシブルな浮動小数点テンソルを用いた人工ニューラルネットワーク訓練 |
| JP2021536076A (ja) * | 2018-09-19 | 2021-12-23 | ザイリンクス インコーポレイテッドXilinx Incorporated | 乗算累積回路 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8301803B2 (en) | 2009-10-23 | 2012-10-30 | Samplify Systems, Inc. | Block floating point compression of signal data |
| WO2013003479A2 (en) | 2011-06-30 | 2013-01-03 | Samplify Systems, Inc. | Compression of floating-point data |
| US12141689B2 (en) | 2019-03-18 | 2024-11-12 | Nvidia Corporation | Data compression for a neural network |
-
2021
- 2021-06-28 US US17/361,263 patent/US11886833B2/en active Active
-
2022
- 2022-01-20 JP JP2023541370A patent/JP2024508596A/ja active Pending
- 2022-01-20 CN CN202280014048.0A patent/CN116830077A/zh active Pending
- 2022-01-20 KR KR1020237027167A patent/KR20230137356A/ko active Pending
- 2022-01-20 EP EP22704074.8A patent/EP4291979A1/en active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190347072A1 (en) * | 2018-05-08 | 2019-11-14 | Microsoft Technology Licensing, Llc | Block floating point computations using shared exponents |
| JP2019212295A (ja) * | 2018-06-08 | 2019-12-12 | インテル・コーポレーション | フレキシブルな浮動小数点テンソルを用いた人工ニューラルネットワーク訓練 |
| JP2021536076A (ja) * | 2018-09-19 | 2021-12-23 | ザイリンクス インコーポレイテッドXilinx Incorporated | 乗算累積回路 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220253281A1 (en) | 2022-08-11 |
| TW202234229A (zh) | 2022-09-01 |
| EP4291979A1 (en) | 2023-12-20 |
| KR20230137356A (ko) | 2023-10-04 |
| US11886833B2 (en) | 2024-01-30 |
| CN116830077A (zh) | 2023-09-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110880038B (zh) | 基于fpga的加速卷积计算的系统、卷积神经网络 | |
| US12182687B2 (en) | Data representation for dynamic precision in neural network cores | |
| JP6977864B2 (ja) | 推論装置、畳み込み演算実行方法及びプログラム | |
| WO2019211226A1 (en) | Neural hardware accelerator for parallel and distributed tensor computations | |
| KR20200004700A (ko) | 뉴럴 네트워크에서 파라미터를 처리하는 방법 및 장치 | |
| EP3788559A1 (en) | Quantization for dnn accelerators | |
| KR102655950B1 (ko) | 뉴럴 네트워크의 고속 처리 방법 및 그 방법을 이용한 장치 | |
| JP2018010618A (ja) | 畳み込みニューラルネットワークハードウエア構成 | |
| US20200257986A1 (en) | Artificial neural network implementation in field-programmable gate arrays | |
| CN114418057A (zh) | 卷积神经网络的运算方法及相关设备 | |
| KR20240021853A (ko) | 신경망에 대한 좁은 데이터 형식의 희소화 | |
| TW202234232A (zh) | 用於標準化功能的數位電路系統 | |
| WO2022247368A1 (en) | Methods, systems, and mediafor low-bit neural networks using bit shift operations | |
| CN119005265A (zh) | 面向高性能数据并行dnn训练的稀疏化压缩方法及装置 | |
| JP2024508596A (ja) | 階層的な共有指数浮動小数点データタイプ | |
| CN115965048A (zh) | 数据处理装置、数据处理方法和电子设备 | |
| CN116187413A (zh) | 基于动态精度量化的神经网络模型训练加速方法及系统 | |
| TWI913392B (zh) | 階層和共享指數浮點數資料類型 | |
| US20210216867A1 (en) | Information processing apparatus, neural network computation program, and neural network computation method | |
| WO2021036412A1 (zh) | 数据处理方法、装置、计算机设备和存储介质 | |
| WO2022173572A1 (en) | Hierarchical and shared exponent floating point data types | |
| WO2020177863A1 (en) | Training of algorithms | |
| JP7632286B2 (ja) | 情報処理装置、情報処理システム及び情報処理方法 | |
| JP2023031296A (ja) | 演算方法及び装置 | |
| US20240402993A1 (en) | Determining shared exponent values for shared exponent floating point data types |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20241218 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20241218 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20260206 |