CN116830077A - Hierarchical and shared exponent floating point data types - Google Patents

Hierarchical and shared exponent floating point data types

Info

Publication number
CN116830077A
CN116830077A CN202280014048.0A CN202280014048A CN116830077A CN 116830077 A CN116830077 A CN 116830077A CN 202280014048 A CN202280014048 A CN 202280014048A CN 116830077 A CN116830077 A CN 116830077A
Authority
CN
China
Prior art keywords
value
exponent
shared
floating point
values
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280014048.0A
Other languages
English (en)
Chinese (zh)
Inventor
B·达尔维什·鲁哈尼
V·埃兰戈
R·沙菲普尔
J·弗沃斯
刘明罡
奚锦文
D·C·伯格
E·S·钟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC
Priority claimed from PCT/US2022/013086 (WO2022173572A1)
Publication of CN116830077A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38 Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48 Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/483 Computations with numbers represented by a non-linear combination of denominational numbers, e.g. rational numbers, logarithmic number system or floating-point numbers
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/06 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • G06N3/063 Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/14 Conversion to or from non-weighted codes
    • H03M7/24 Conversion to or from floating-point codes
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Nonlinear Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Neurology (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Complex Calculations (AREA)
  • Electromagnetism (AREA)
CN202280014048.0A 2021-02-10 2022-01-20 Hierarchical and shared exponent floating point data types Pending CN116830077A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202163148086P 2021-02-10 2021-02-10
US63/148,086 2021-02-10
US17/361,263 US11886833B2 (en) 2021-02-10 2021-06-28 Hierarchical and shared exponent floating point data types
US17/361,263 2021-06-28
PCT/US2022/013086 WO2022173572A1 (en) 2021-02-10 2022-01-20 Hierarchical and shared exponent floating point data types

Publications (1)

Publication Number Publication Date
CN116830077A (zh) 2023-09-29

Family

ID=82704967

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280014048.0A Pending CN116830077A (zh) 2021-02-10 2022-01-20 Hierarchical and shared exponent floating point data types

Country Status (5)

Country Link
US (1) US11886833B2
EP (1) EP4291979A1
JP (1) JP2024508596A
KR (1) KR20230137356A
CN (1) CN116830077A

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240402993A1 (en) * 2023-05-30 2024-12-05 Microsoft Technology Licensing, Llc Determining shared exponent values for shared exponent floating point data types

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8301803B2 (en) 2009-10-23 2012-10-30 Samplify Systems, Inc. Block floating point compression of signal data
WO2013003479A2 (en) 2011-06-30 2013-01-03 Samplify Systems, Inc. Compression of floating-point data
US10579334B2 (en) * 2018-05-08 2020-03-03 Microsoft Technology Licensing, Llc Block floating point computations using shared exponents
US12205035B2 (en) * 2018-06-08 2025-01-21 Intel Corporation Artificial neural network training using flexible floating point tensors
US10747502B2 (en) * 2018-09-19 2020-08-18 Xilinx, Inc. Multiply and accumulate circuit
US12141689B2 (en) 2019-03-18 2024-11-12 Nvidia Corporation Data compression for a neural network

Also Published As

Publication number Publication date
US20220253281A1 (en) 2022-08-11
TW202234229A (zh) 2022-09-01
EP4291979A1 (en) 2023-12-20
KR20230137356A (ko) 2023-10-04
US11886833B2 (en) 2024-01-30
JP2024508596A (ja) 2024-02-28

Similar Documents

Publication Publication Date Title
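CN110880038B (zh) FPGA-based system for accelerating convolution computation, and convolutional neural network
CN110222821B (zh) Low-bit-width quantization method for convolutional neural networks based on weight distribution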
US10491239B1 (en) Large-scale computations using an adaptive numerical format
KR102381770B1 (ko) Precise exponent and precise softmax computation
WO2020074989A1 (en) Data representation for dynamic precision in neural network cores
CA3044660C (en) Information processing device and information processing method
EP3931758A1 (en) Neural network layer processing with scaled quantization
WO2020176250A1 (en) Neural network layer processing with normalization and transformation of data
EP4022773B1 (en) Compression of data that exhibits mixed compressibility
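CN114418057A (zh) Operation method of convolutional neural network and related device
CN115237992B (zh) Method and apparatus for data format conversion, and method and apparatus for matrix processing
KR20240021853A (ko) Sparsification of narrow data formats for neural networks
CN116830077A (zh) Hierarchical and shared exponent floating point data types
CN117574966A (zh) Model quantization method and apparatus, electronic device, and storage medium
CN115965048A (zh) Data processing apparatus, data processing method, and electronic device
CN116187413A (zh) Method and system for accelerating neural network model training based on dynamic precision quantization
TWI913392B (zh) Hierarchical and shared exponent floating point data types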
WO2022173572A1 (en) Hierarchical and shared exponent floating point data types
US20210216867A1 (en) Information processing apparatus, neural network computation program, and neural network computation method
CN119884571A (zh) Matrix multiplier and operating method of a matrix multiplication device including the matrix multiplier
US20240402993A1 (en) Determining shared exponent values for shared exponent floating point data types
WO2020177863A1 (en) Training of algorithms
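CN114207609B (zh) Information processing device, information processing system, and information processing method
KR20230076641A (ko) Apparatus and method for floating-point operations
TWI846454B (zh) Optimization method and computing system for deep learning networks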

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination