JP2024508596A - 階層的な共有指数浮動小数点データタイプ - Google Patents

階層的な共有指数浮動小数点データタイプ Download PDF

Info

Publication number
JP2024508596A
JP2024508596A JP2023541370A JP2023541370A JP2024508596A JP 2024508596 A JP2024508596 A JP 2024508596A JP 2023541370 A JP2023541370 A JP 2023541370A JP 2023541370 A JP2023541370 A JP 2023541370A JP 2024508596 A JP2024508596 A JP 2024508596A
Authority
JP
Japan
Prior art keywords
value
shared
floating point
values
exponent
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023541370A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024508596A5 (https=
Inventor
ロウハーニー,ビタ ダルビッシュ
エランゴ,ヴェンムギル
シャフィプール,ラスール
フォワーズ,ジェレミー
ガン リウ,ミン
シー,ジンウェン
シー. バーガー,ダグラス
エス. チュン,エリック
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Priority claimed from PCT/US2022/013086 external-priority patent/WO2022173572A1/en
Publication of JP2024508596A publication Critical patent/JP2024508596A/ja
Publication of JP2024508596A5 publication Critical patent/JP2024508596A5/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/483Computations with numbers represented by a non-linear combination of denominational numbers, e.g. rational numbers, logarithmic number system or floating-point numbers
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/14Conversion to or from non-weighted codes
    • H03M7/24Conversion to or from floating-point codes
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Nonlinear Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Neurology (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Complex Calculations (AREA)
  • Electromagnetism (AREA)
JP2023541370A 2021-02-10 2022-01-20 階層的な共有指数浮動小数点データタイプ Pending JP2024508596A (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202163148086P 2021-02-10 2021-02-10
US63/148,086 2021-02-10
US17/361,263 US11886833B2 (en) 2021-02-10 2021-06-28 Hierarchical and shared exponent floating point data types
US17/361,263 2021-06-28
PCT/US2022/013086 WO2022173572A1 (en) 2021-02-10 2022-01-20 Hierarchical and shared exponent floating point data types

Publications (2)

Publication Number Publication Date
JP2024508596A true JP2024508596A (ja) 2024-02-28
JP2024508596A5 JP2024508596A5 (https=) 2024-12-27

Family

ID=82704967

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023541370A Pending JP2024508596A (ja) 2021-02-10 2022-01-20 階層的な共有指数浮動小数点データタイプ

Country Status (5)

Country Link
US (1) US11886833B2 (https=)
EP (1) EP4291979A1 (https=)
JP (1) JP2024508596A (https=)
KR (1) KR20230137356A (https=)
CN (1) CN116830077A (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240402993A1 (en) * 2023-05-30 2024-12-05 Microsoft Technology Licensing, Llc Determining shared exponent values for shared exponent floating point data types

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190347072A1 (en) * 2018-05-08 2019-11-14 Microsoft Technology Licensing, Llc Block floating point computations using shared exponents
JP2019212295A (ja) * 2018-06-08 2019-12-12 インテル・コーポレーション フレキシブルな浮動小数点テンソルを用いた人工ニューラルネットワーク訓練
JP2021536076A (ja) * 2018-09-19 2021-12-23 ザイリンクス インコーポレイテッドXilinx Incorporated 乗算累積回路

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8301803B2 (en) 2009-10-23 2012-10-30 Samplify Systems, Inc. Block floating point compression of signal data
WO2013003479A2 (en) 2011-06-30 2013-01-03 Samplify Systems, Inc. Compression of floating-point data
US12141689B2 (en) 2019-03-18 2024-11-12 Nvidia Corporation Data compression for a neural network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190347072A1 (en) * 2018-05-08 2019-11-14 Microsoft Technology Licensing, Llc Block floating point computations using shared exponents
JP2019212295A (ja) * 2018-06-08 2019-12-12 インテル・コーポレーション フレキシブルな浮動小数点テンソルを用いた人工ニューラルネットワーク訓練
JP2021536076A (ja) * 2018-09-19 2021-12-23 ザイリンクス インコーポレイテッドXilinx Incorporated 乗算累積回路

Also Published As

Publication number Publication date
US20220253281A1 (en) 2022-08-11
TW202234229A (zh) 2022-09-01
EP4291979A1 (en) 2023-12-20
KR20230137356A (ko) 2023-10-04
US11886833B2 (en) 2024-01-30
CN116830077A (zh) 2023-09-29

Similar Documents

Publication Publication Date Title
CN110880038B (zh) 基于fpga的加速卷积计算的系统、卷积神经网络
US12182687B2 (en) Data representation for dynamic precision in neural network cores
JP6977864B2 (ja) 推論装置、畳み込み演算実行方法及びプログラム
WO2019211226A1 (en) Neural hardware accelerator for parallel and distributed tensor computations
KR20200004700A (ko) 뉴럴 네트워크에서 파라미터를 처리하는 방법 및 장치
EP3788559A1 (en) Quantization for dnn accelerators
KR102655950B1 (ko) 뉴럴 네트워크의 고속 처리 방법 및 그 방법을 이용한 장치
JP2018010618A (ja) 畳み込みニューラルネットワークハードウエア構成
US20200257986A1 (en) Artificial neural network implementation in field-programmable gate arrays
CN114418057A (zh) 卷积神经网络的运算方法及相关设备
KR20240021853A (ko) 신경망에 대한 좁은 데이터 형식의 희소화
TW202234232A (zh) 用於標準化功能的數位電路系統
WO2022247368A1 (en) Methods, systems, and mediafor low-bit neural networks using bit shift operations
CN119005265A (zh) 面向高性能数据并行dnn训练的稀疏化压缩方法及装置
JP2024508596A (ja) 階層的な共有指数浮動小数点データタイプ
CN115965048A (zh) 数据处理装置、数据处理方法和电子设备
CN116187413A (zh) 基于动态精度量化的神经网络模型训练加速方法及系统
TWI913392B (zh) 階層和共享指數浮點數資料類型
US20210216867A1 (en) Information processing apparatus, neural network computation program, and neural network computation method
WO2021036412A1 (zh) 数据处理方法、装置、计算机设备和存储介质
WO2022173572A1 (en) Hierarchical and shared exponent floating point data types
WO2020177863A1 (en) Training of algorithms
JP7632286B2 (ja) 情報処理装置、情報処理システム及び情報処理方法
JP2023031296A (ja) 演算方法及び装置
US20240402993A1 (en) Determining shared exponent values for shared exponent floating point data types

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241218

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20241218

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20260206