WO2022170809A1 - Reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation - Google Patents

Reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation Download PDF

Info

Publication number
WO2022170809A1
WO2022170809A1 PCT/CN2021/131745 CN2021131745W WO2022170809A1 WO 2022170809 A1 WO2022170809 A1 WO 2022170809A1 CN 2021131745 W CN2021131745 W CN 2021131745W WO 2022170809 A1 WO2022170809 A1 WO 2022170809A1
Authority
WO
WIPO (PCT)
Prior art keywords
floating
point
operated
precision
unit
Prior art date
Application number
PCT/CN2021/131745
Other languages
French (fr)
Chinese (zh)
Inventor
毛伟
余浩
谢歆昂
李凯
李博宇
杜来民
代柳瑶
Original Assignee
南方科技大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南方科技大学 filed Critical 南方科技大学
Publication of WO2022170809A1 publication Critical patent/WO2022170809A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/57Arithmetic logic units [ALU], i.e. arrangements or devices for performing two or more of the operations covered by groups G06F7/483 – G06F7/556 or for performing logical operations
    • G06F7/575Basic arithmetic logic units, i.e. devices selectable to perform either addition, subtraction or one of several logical operations, using, at least partially, the same circuitry
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G06F7/38Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
    • G06F7/48Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
    • G06F7/52Multiplying; Dividing
    • G06F7/523Multiplying only
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • the invention relates to the field of digital circuits, in particular to a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation.
  • the technical problem to be solved by the present invention is to provide a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation, aiming at solving the problem of supporting multi-precision floating point in the prior art
  • the operation method of the multiplication operation will cause problems such as bit redundancy and low hardware utilization.
  • an embodiment of the present invention provides a reconfigurable floating-point multiply-add operation method suitable for multi-precision computing, wherein the method includes:
  • a shift-add operation is performed on the product, and an operation result generated based on the shift-add operation is used as the result of the multiply-accumulate operation of the floating-point number to be operated.
  • the significant number is divided according to the number of bits of the unit multiplier, and after division, several target segments are generated; including one.
  • the number of called unit multipliers is determined according to the precision of the floating-point number to be operated, a target segment is used as an operand of a unit multiplier, and the unit multiplier is obtained based on the Operand-generated products include:
  • a number of row products are generated after the operands are input to the unit multiplier.
  • the determining the number of unit multipliers to be called according to the precision and logarithm of the floating-point number to be operated includes:
  • n calls n unit multipliers for the floating-point number to be operated
  • n When the floating-point number to be operated is a single-precision floating-point number, n calls 4n unit multipliers for the floating-point number to be operated;
  • n 16n unit multipliers for the floating-point number to be operated
  • n is an integer greater than 0.
  • the generating of several row products after the operand is input to the unit multiplier includes:
  • the operand is input into the unit multiplier, and the operand is encoded by the unsigned bit Booth to generate several row products.
  • the floating-point number to be operated is a double-precision floating-point number
  • before the using a target segment as an operand of a unit multiplier further includes:
  • performing a shift-add operation on the product, and using an operation result generated based on the shift-add operation as the result of the multiply-accumulate operation of the floating-point number to be operated includes:
  • the displacement includes at least one of an internal displacement and an external displacement
  • the calculation method of the internal displacement amount is: taking the sum of the high and low bits of the segment numbers divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment number;
  • the calculation method of the external displacement is as follows: adding the exponent parts of the floating-point numbers to be operated to obtain an exponent sum, and taking the maximum value of all the exponent sums obtained as a reference value; The difference obtains the exponent difference, and the exponent difference is used as the external shift amount of the product corresponding to the floating-point number to be operated.
  • an embodiment of the present invention also provides a reconfigurable floating-point multiply-add operation unit suitable for multi-precision computing, characterized in that the operation unit includes:
  • a division module used to obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several include one;
  • a unit multiplier used for determining the number of unit multipliers to be called according to the precision of the floating-point number to be operated, taking a target segment as an operand of a unit multiplier, and obtaining the unit multiplier generated based on the operand the product of ;
  • An addition tree configured to perform a shift-add operation on the product, and use an operation result generated based on the shift-add operation as a result of the multiply-accumulate operation of the floating-point number to be operated.
  • the operation unit includes 16n unit multipliers, and n is a non-negative number.
  • the embodiment of the present invention avoids the problem of bit redundancy by adopting a unified mantissa division scheme, improves the hardware utilization rate by adopting a unified unit multiplier, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, single The multiply-accumulate operation of precision floating-point numbers and the multiply-accumulate operation of double-precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.
  • FIG. 1 is a schematic flowchart of a reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a division scheme of significant digits of floating-point numbers of different precisions provided by an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a working principle of a 14-bit basic multiplier provided by an embodiment of the present invention.
  • FIG. 4 is a calculation diagram of 16 groups of products input to an adder tree when a pair of FP64 is calculated according to an embodiment of the present invention.
  • FIG. 5 is an internal basic block diagram of a reconfigurable floating-point multiply-add operation unit suitable for multi-precision calculation provided by an embodiment of the present invention.
  • FIG. 6 is a reference diagram of a minimum operation unit that can implement mantissa multiply-accumulate operations of three types of floating-point numbers of different precisions provided by an embodiment of the present invention.
  • the architecture is based on a 15-bit multiplier, which is optimized to support FP128-precision floating-point multiplication, but uses When performing floating-point multiplication operations of other precisions, a large amount of bit redundancy and waste of hardware resources will be generated.
  • the fixed-point multiplier has a fixed number of input bits, which cannot meet the requirements of multi-precision computing, and cannot maximize the use of hardware resources for application requirements to improve energy efficiency ratio and throughput; while the existing ones support multi-precision floating-point.
  • the multiplication operation method also has problems such as loss of precision, bit redundancy, and low hardware utilization.
  • the present invention provides a reconfigurable floating-point multiply-add operation method suitable for multi-precision calculation.
  • a reconfigurable floating-point multiply-add operation method suitable for multi-precision calculation.
  • the invention adopts a unified mantissa division scheme to avoid the problem of bit redundancy, adopts a unified unit multiplier to improve the hardware utilization rate, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, the multiply-accumulate operation of single-precision floating-point numbers, and the double-accumulation operation of double-precision floating-point numbers. Multiply-accumulate operation of precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.
  • the method includes the following steps:
  • Step S100 Obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several target segments include one.
  • the exponent part of the multiplication result is the sum of the exponent parts of the two floating-point numbers to be multiplied
  • the mantissa part of the multiplication result is the product of the mantissas of the two multiplied floating-point numbers.
  • the present embodiment needs to obtain the significant digits of the floating-point number to be operated, and the significant digits refer to the data that needs to participate in the multiplication operation in the mantissa of the floating-point number to be operated. Only the significant digits that need to participate in the multiplication operation are determined first, Subsequent multiplication operations can be performed. After the significant figures are obtained, this embodiment needs to generate one or more target segments based on the significant figures, and then use the target segments as input data of the unit multiplier.
  • the step S100 includes the following steps:
  • Step S110 adding a 1-bit integer to the mantissa part of the floating-point number to be operated;
  • Step S120 taking the number on the significant digits of the floating-point number obtained after adding as the significant number of the floating-point number to be operated;
  • Step S130 when the number of bits of the significant figure is greater than the number of bits of the unit multiplier, divide the significant number according to the number of bits of the unit multiplier, and generate several target segments after division; The several include one.
  • a 1-bit integer needs to be added to the mantissa part of the floating-point number to be calculated, and then the significant digits of the floating-point number obtained after adding are added.
  • the number is used as the significant figure of the floating point number to be operated on.
  • the significant number of digits is 11; for a single-precision floating point number (floating point 32-bit number, FP32)
  • the mantissa After adding a 1-bit integer to the part, the significand is 24 bits; for a double-precision floating point number (floating point 64-bit number, FP64), after adding a 1-bit integer to the mantissa, the significand is 53 bits.
  • the target segment needs to be obtained according to the significant figures, and the target segment is used as the input data of the subsequent unit multiplier.
  • the significant number may be directly input into the unit multiplier as a target segment.
  • the unit multiplier is a 14-bit basic unit multiplier
  • the significant figures of the 16-bit floating-point numbers have only 11 bits, so there is no need to divide the significant figures of the 16-bit floating-point numbers. Use it directly as a target segment.
  • the significant figure of a 32-bit floating point number is 24 bits, so the significant figure needs to be divided to generate two 12-bit target segments.
  • the significant figure of a 64-bit floating point number is 53 bits, so the significant figure also needs to be divided, and then 4 target segments of 14:13:13:13 are generated.
  • the target segment After the target segment is acquired, the target segment needs to be input into the unit multiplier, so as shown in FIG. 1 , the method further includes the following steps:
  • Step S200 Determine the number of called unit multipliers according to the precision of the floating-point number to be operated, take a target segment as an operand of a unit multiplier, and obtain a product generated by the unit multiplier based on the operand.
  • the number of called unit multiplications needs to be determined according to the precision of the floating-point number to be operated. Then use the obtained target segment as an operand of a unit multiplier. It can be understood that a unit multiplier needs two operands to perform multiplication, one operand is used as a multiplier, and the other operand is used as a multiplicand . The product generated by the unit multiplier based on the operands is then obtained. In multiplication, if the multiplier is a number with two or more digits, when multiplying, each digit of the multiplier must be used to multiply the multiplicand, and the product obtained each time is called the product, or called incomplete. product.
  • the step S200 specifically includes the following steps:
  • Step 210 determining the number of called unit multipliers according to the precision of the floating-point number to be operated
  • Step 220 using a target segment as an operand of a unit multiplier
  • Step 230 Input the operand into the unit multiplier to generate several row products.
  • the determining the number of unit multipliers to call according to the precision and logarithm of the floating-point number to be operated includes: when the to-be-operated floating-point number is When the floating-point number is a half-precision floating-point number, n calls n-unit multipliers for the floating-point number to be operated; when the floating-point number to be operated is a single-precision floating-point number, n calls 4n-unit multipliers for the floating-point number to be operated; When the floating-point number to be operated is a double-precision floating-point number, n calls 16n unit multipliers for the floating-point number to be operated; n is an integer greater than 0.
  • the unit multiplier is a 14-bit basic unit multiplier
  • the unit multiplier when you need to calculate the multiplication and accumulation results of 16 pairs of half-precision floating-point numbers at the same time, you need to call 16 14-bit basic unit multipliers for the following reasons.
  • the embodiment is to divide the significant figures based on the number of bits of the unit multiplier, and the significant figures of the half-precision floating-point numbers can be directly used as a target segment.
  • a pair of half-precision floating-point numbers needs to call a 14-bit basic unit multiplier, and a total of 16 14-bit basic unit multipliers need to be called.
  • a target segment since it is possible to generate target segments with unequal number of bits after the significant digits are divided, in an implementation manner, before using a target segment as an operand of a unit multiplier, it further includes: when the to-be-to-be-multiplier is used When the number of bits of the target segment to which the floating-point number should be operated is not equal, a complementing operation is performed on the target segment with the smallest number of bits, and the complementing operation may be implemented in the form of zero-filling.
  • the unit multiplier is a 14-bit basic unit multiplier
  • the significant figure corresponding to the double-precision floating-point number is 53 bits
  • the four target ends of 14:13:13:13 generated after division need to The 13-bit target segment is zero-padded.
  • a target segment is then used as one operand of a unit multiplier, after which several row products generated by the unit multiplier are taken. Specifically, after the operand is input to the unit multiplier, the unit multiplier encodes the operand through an unsigned bit booth and generates several row products (as shown in FIG. 3 ). .
  • the method further includes the following steps:
  • Step S300 Perform a shift-add operation on the product, and use an operation result generated based on the shift-add operation as a result of the multiply-accumulate operation of the floating-point number to be operated.
  • step S300 specifically includes the following steps:
  • Step S310 inputting the product into a preset addition tree
  • Step S320 calculating the displacement of the product, and performing a shift operation on the product according to the displacement through the addition tree;
  • Step S330 performing a summation operation on the data obtained after the shift operation to obtain a result of the multiply-accumulate operation of the floating-point number to be operated.
  • an addition tree is preset for the scheme of generating the target segment and the usage of the unit multiplier, so as to realize the lossless processing of the data. Specifically, after the product is obtained, the product is input into the addition tree, then the displacement of the product is calculated in the addition tree, and then the product is shifted according to the displacement. .
  • the displacement calculated in the addition tree includes at least one of an internal displacement and an external displacement, that is, it may include only the internal displacement, only the external displacement, or both the internal displacement The final value of the displacement amount needs to be determined according to the precision of the floating point number to be operated and the logarithm of the calculation.
  • the calculation method of the internal displacement amount is to use the sum of the high and low bits of the segment numbers divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment numbers.
  • the calculation method of the external displacement is as follows: adding the exponent parts of the floating-point numbers to be operated to obtain an exponent sum, taking the maximum value of all exponent sums obtained as a reference value, and then adding the reference value to the exponent sum. A difference is obtained to obtain an exponent difference, and finally the exponent difference is used as the external shift amount of the product corresponding to the floating-point number to be operated.
  • 16 14-bit basic unit multipliers need to be called. Since the significant digits of half-precision floating-point numbers are not divided, it is necessary to calculate the multiplication of multiple pairs of floating-point numbers at the same time. The result of multiply-accumulate operation, so in this case, the displacement of the product is only the external displacement.
  • the displacement of the product in this case includes both the internal displacement and the external displacement.
  • the displacement of the product only includes the internal displacement.
  • a 1 ⁇ b 0 results in a displacement of 1 ⁇ 14bit
  • the resulting displacement is 2 ⁇ 14bit ( As shown in Figure 4).
  • the present invention also provides a reconfigurable floating-point multiply-add operation unit suitable for multi-precision calculation.
  • the operation unit includes:
  • a division module 01 is used to obtain the significant figures of the floating-point numbers to be operated, and generate several target segments based on the significant figures; the several include one;
  • Unit multiplier 02 for determining the number of unit multipliers to be called according to the precision of the floating-point number to be operated, taking a target segment as an operand of a unit multiplier, and obtaining the unit multiplier based on the operand the resulting product;
  • the addition tree 03 is configured to perform a shift-add operation on the product, and use the operation result generated based on the shift-add operation as the result of the multiply-accumulate operation of the floating-point number to be operated.
  • the operation unit in order to enable the operation unit to realize the multiply-accumulate operation of half-precision floating-point numbers, single-precision floating-point numbers and double-precision floating-point numbers, the operation unit includes 16n unit multipliers, and n is a non-negative number .
  • each unit multiplier can realize 1 set of half-precision floating-point multiplication operations, so 16 pairs of half-precision can be realized at the same time Multiplication and accumulation of floating-point numbers. Every 4 unit multipliers can realize 1 set of single-precision floating-point multiplication operations, so it can also realize the multiply-accumulate operation of 4 pairs of single-precision floating-point numbers at the same time.
  • the 16 unit multipliers can realize a set of double-precision floating-point multiplication operations, so it can also realize a multiply-accumulate operation of a pair of double-precision floating-point numbers.
  • FIG. 6 is a reference diagram of a minimum operation unit provided by the present invention that can realize the multiply-accumulate operation of three floating-point numbers of different precisions. Therefore, the embodiments of the present invention can at least complete the multiply-accumulate operations of multiple pairs of half-precision floating-point numbers, the multiply-accumulate operations of multiple pairs of single-precision floating-point numbers, or the multiply-accumulate operations of one pair of double-precision floating-point numbers within one clock cycle without limiting hardware resources. Multiply and accumulate operations. Compared with the fixed FP32 and FP64 multiply-add units, the arithmetic unit provided by the present invention can increase the maximum throughput rate by 4 times and 16 times respectively.
  • the present invention discloses a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation.
  • a reconfigurable floating-point multiply-add operation unit By adopting a unified method to divide the mantissas of floating-point numbers of different precisions, a plurality of bit segments are obtained. , and call different numbers of the same type of unit multipliers to complete the multiplication operation of multiple bit segments in one cycle and output the corresponding product, and then perform the shift and addition operation on the product to obtain the multiplication and accumulation of floating-point numbers. Operation result.
  • the invention adopts a unified mantissa division scheme to avoid the problem of bit redundancy, adopts a unified unit multiplier to improve the hardware utilization rate, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, the multiply-accumulate operation of single-precision floating-point numbers, and the double-accumulation operation of double-precision floating-point numbers. Multiply-accumulate operation of precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Complex Calculations (AREA)

Abstract

Disclosed in the present invention are a reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation. A uniform method is used to divide mantissas of floating points of different precision to obtain a plurality of bit segments; different numbers of same-type unit multipliers are called to implement multiplication operations of the plurality of bit segments in one period, and corresponding products are outputted; then, a shift-add operation is performed on the products to obtain a multiply-accumulate operation result of floating-point numbers. In the present invention, the problem of bit redundancy is avoided by employing a uniform mantissa division scheme, the hardware utilization rate is increased by employing a uniform unit multiplier, and the multiply-accumulate operation of half-precision floating-point numbers, the multiply-accumulate operation of single-precision dot product floating-point numbers, and the multiply-accumulate operation of double-precision floating-point numbers can be achieved. The problems in the prior art of bit redundancy, low hardware utilization rate and the like of an operation method supporting a multi-precision floating point multiplication operation are solved.

Description

一种适用于多精度计算的可重构浮点乘加运算单元及方法A reconfigurable floating-point multiply-accumulate unit and method suitable for multi-precision computing 技术领域technical field
本发明涉及数字电路领域,尤其涉及的是一种适用于多精度计算的可重构浮点乘加运算单元及方法。The invention relates to the field of digital circuits, in particular to a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation.
背景技术Background technique
随着科学计算和机器学习训练等的高速发展与广泛应用,能够支持浮点数据处理的乘法单元应运而生。常规定点乘法器的输入比特数固定,不能满足多精度计算的要求,因此出现了支持多精度浮点乘法运算的方法。然而现有的支持多精度浮点乘法运算的运算方法由于需要多种尾数划分方案以及需要将产生的乘积用补零方法分离为两个并行部分,因此存在精度损失以及比特冗余、硬件利用率低等问题。With the rapid development and wide application of scientific computing and machine learning training, multiplication units that can support floating-point data processing emerge as the times require. The fixed number of input bits of the conventional floating point multiplier cannot meet the requirements of multi-precision calculation, so there is a method to support multi-precision floating-point multiplication. However, the existing operation methods that support multi-precision floating-point multiplication require multiple mantissa division schemes and need to separate the generated product into two parallel parts by zero-filling, so there is a loss of precision, bit redundancy, and hardware utilization. lower issues.
因此,现有技术还有待改进和发展。Therefore, the existing technology still needs to be improved and developed.
发明内容SUMMARY OF THE INVENTION
本发明要解决的技术问题在于,针对现有技术的上述缺陷,提供一种适用于多精度计算的可重构浮点乘加运算单元及方法,旨在解决现有技术中支持多精度浮点乘法运算的运算方法会产生比特冗余、硬件利用率低等情况的问题。The technical problem to be solved by the present invention is to provide a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation, aiming at solving the problem of supporting multi-precision floating point in the prior art The operation method of the multiplication operation will cause problems such as bit redundancy and low hardware utilization.
本发明解决问题所采用的技术方案如下:The technical scheme adopted by the present invention to solve the problem is as follows:
第一方面,本发明实施例提供一种适用于多精度计算的可重构浮点乘加运算方法,其中,所述方法包括:In a first aspect, an embodiment of the present invention provides a reconfigurable floating-point multiply-add operation method suitable for multi-precision computing, wherein the method includes:
获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个;Obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several include one;
根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积;Determine the number of called unit multipliers according to the precision of the floating-point number to be operated, take a target segment as an operand of a unit multiplier, and obtain a product generated by the unit multiplier based on the operand;
对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。A shift-add operation is performed on the product, and an operation result generated based on the shift-add operation is used as the result of the multiply-accumulate operation of the floating-point number to be operated.
在一种实施方式中,所述获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个,包括:In one embodiment, the obtaining the significant digits of the floating-point number to be operated, and generating several target segments based on the significant digits; the several target segments include one, including:
在所述待运算浮点数的尾数部分添加1比特整数;Add a 1-bit integer to the mantissa part of the floating-point number to be operated;
将添加完毕以后得到的浮点数的有效位数上的数字作为所述待运算浮点数的有效数字;Taking the number on the significant digits of the floating-point number obtained after the addition is completed as the significant number of the floating-point number to be operated;
当所述有效数字的比特位数大于所述单元乘法器的比特位数时,根据所述单元乘法器的比特位数对所述有效数字进行划分,划分后生成若干个目标段;所述若干个包括一个。When the number of bits of the significant figure is greater than the number of bits of the unit multiplier, the significant number is divided according to the number of bits of the unit multiplier, and after division, several target segments are generated; including one.
在一种实施方式中,所述根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积包括:In an embodiment, the number of called unit multipliers is determined according to the precision of the floating-point number to be operated, a target segment is used as an operand of a unit multiplier, and the unit multiplier is obtained based on the Operand-generated products include:
根据所述待运算浮点数的精度确定调用的单元乘法器的个数;Determine the number of called unit multipliers according to the precision of the floating-point number to be operated;
将一个目标段作为一个单元乘法器的一个操作数;take a target segment as an operand of a unit multiplier;
将所述操作数输入所述单元乘法器后生成若干行乘积。A number of row products are generated after the operands are input to the unit multiplier.
在一种实施方式中,当所述单元乘法器为14比特乘法器时,所述根据所述待运算浮点数的精度和对数确定调用的单元乘法器的个数包括:In an implementation manner, when the unit multiplier is a 14-bit multiplier, the determining the number of unit multipliers to be called according to the precision and logarithm of the floating-point number to be operated includes:
当所述待运算浮点数为半精度浮点数时,n对待运算浮点数调用n个单元乘法器;When the floating-point number to be operated is a half-precision floating-point number, n calls n unit multipliers for the floating-point number to be operated;
当所述待运算浮点数为单精度浮点数时,n对待运算浮点数调用4n个单元乘法器;When the floating-point number to be operated is a single-precision floating-point number, n calls 4n unit multipliers for the floating-point number to be operated;
当所述待运算浮点数为双精度浮点数时,n对待运算浮点数调用16n个单元乘法器;When the floating-point number to be operated is a double-precision floating-point number, n calls 16n unit multipliers for the floating-point number to be operated;
n为大于0的整数。n is an integer greater than 0.
在一种实施方式中,所述将所述操作数输入所述单元乘法器后生成若干行乘积包括:In one embodiment, the generating of several row products after the operand is input to the unit multiplier includes:
将所述操作数输入所述单元乘法器中,通过无符号位布斯对所述操作数进行编码后生成若干行乘积。The operand is input into the unit multiplier, and the operand is encoded by the unsigned bit Booth to generate several row products.
在一种实施方式中,当所述待运算浮点数为双精度浮点数时,所述将一个目标段作为一个单元乘法器的一个操作数之前还包括:In one embodiment, when the floating-point number to be operated is a double-precision floating-point number, before the using a target segment as an operand of a unit multiplier further includes:
当所述待运算浮点数应的目标段的比特位数不相等时,对比特位数最小的目标段进行补位操作。When the number of bits of the target segment corresponding to the floating-point number to be operated is not equal, a complement operation is performed on the target segment with the smallest number of bits.
在一种实施方式中,所述对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果包括:In an implementation manner, performing a shift-add operation on the product, and using an operation result generated based on the shift-add operation as the result of the multiply-accumulate operation of the floating-point number to be operated includes:
将所述乘积输入预设的加法树中;inputting the product into a preset addition tree;
计算所述乘积的位移量,通过所述加法树根据所述位移量对所述乘积进行移位操作;Calculate the displacement amount of the product, and perform a shift operation on the product according to the displacement amount through the addition tree;
对所述移位操作后得到的数据进行求和操作后得到所述待运算浮点数的乘累加运算的结果。After the data obtained after the shift operation is summed, the result of the multiply-accumulate operation of the floating-point number to be operated is obtained.
在一种实施方式中,所述位移量包括内部位移量以及外部位移量中至少一种位移量;In one embodiment, the displacement includes at least one of an internal displacement and an external displacement;
所述内部位移量的计算方式为:将基于所述待运算浮点数划分出的段数的高低位之和作为所述段数对应的乘积的内部移位量;The calculation method of the internal displacement amount is: taking the sum of the high and low bits of the segment numbers divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment number;
所述外部位移量的计算方式为:将所述待运算浮点数的指数部分相加得到指数和,将得到的所有指数和的最大值作为参考值;将所述参考值与所述指数和作差得到指数差,将所述指数差作为所述待运算浮点数对应的乘积的外部移位量。The calculation method of the external displacement is as follows: adding the exponent parts of the floating-point numbers to be operated to obtain an exponent sum, and taking the maximum value of all the exponent sums obtained as a reference value; The difference obtains the exponent difference, and the exponent difference is used as the external shift amount of the product corresponding to the floating-point number to be operated.
第二方面,本发明实施例还提供一种适用于多精度计算的可重构浮点乘加运算单元,其特征在于,所述运算单元包括:In a second aspect, an embodiment of the present invention also provides a reconfigurable floating-point multiply-add operation unit suitable for multi-precision computing, characterized in that the operation unit includes:
划分模块,用于获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个;A division module, used to obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several include one;
单元乘法器,用于根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积;a unit multiplier, used for determining the number of unit multipliers to be called according to the precision of the floating-point number to be operated, taking a target segment as an operand of a unit multiplier, and obtaining the unit multiplier generated based on the operand the product of ;
加法树,用于对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。An addition tree, configured to perform a shift-add operation on the product, and use an operation result generated based on the shift-add operation as a result of the multiply-accumulate operation of the floating-point number to be operated.
在一种实施方式中,所述运算单元中包含16n个单元乘法器,n为非负数。In one embodiment, the operation unit includes 16n unit multipliers, and n is a non-negative number.
本发明的有益效果:本发明实施例通过采用统一的尾数划分方案避免了比特冗余的问题,采用统一的单元乘法器提高了硬件利用率,还可以实现半精度浮点数的乘累加运算、单精度浮点数的乘累加运算和双精度浮点数的乘累加运算。解决了现有技术中支持多精度浮点乘法运算的运算方法会产生比特冗余、硬件利用率低等情况的问题。Beneficial effects of the present invention: The embodiment of the present invention avoids the problem of bit redundancy by adopting a unified mantissa division scheme, improves the hardware utilization rate by adopting a unified unit multiplier, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, single The multiply-accumulate operation of precision floating-point numbers and the multiply-accumulate operation of double-precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明中记载的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments described in the present invention. For those of ordinary skill in the art, other drawings can also be obtained based on these drawings without any creative effort.
图1是本发明实施例提供的一种适用于多精度计算的可重构浮点乘加运算方法的流程示意图。FIG. 1 is a schematic flowchart of a reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to an embodiment of the present invention.
图2是本发明实施例提供的不同精度的浮点数的有效数字的划分方案的示意图。FIG. 2 is a schematic diagram of a division scheme of significant digits of floating-point numbers of different precisions provided by an embodiment of the present invention.
图3是本发明实施例提供的14比特基本乘法器的工作原理示意图。FIG. 3 is a schematic diagram of a working principle of a 14-bit basic multiplier provided by an embodiment of the present invention.
图4是本发明实施例提供的计算一对FP64时,16组乘积输入加法器树的计算图。FIG. 4 is a calculation diagram of 16 groups of products input to an adder tree when a pair of FP64 is calculated according to an embodiment of the present invention.
图5是本发明实施例提供的一种适用于多精度计算的可重构浮点乘加运算单元的内部基本模块图。FIG. 5 is an internal basic block diagram of a reconfigurable floating-point multiply-add operation unit suitable for multi-precision calculation provided by an embodiment of the present invention.
图6是本发明实施例提供的可以实现3种不同精度的浮点数的尾数乘累加运算的最小运算单元的参考图。FIG. 6 is a reference diagram of a minimum operation unit that can implement mantissa multiply-accumulate operations of three types of floating-point numbers of different precisions provided by an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明的目的、技术方案及优点更加清楚、明确,以下参照附图并举实施例对本发明进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the objectives, technical solutions and advantages of the present invention clearer and clearer, the present invention will be further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
需要说明,若本发明实施例中有涉及方向性指示(诸如上、下、左、右、前、后……),则该方向性指示仅用于解释在某一特定姿态(如附图所示)下各部件之间的相对位置关系、运动情况等,如果该特定姿态发生改变时,则该方向性指示也相应地随之改变。It should be noted that if there are directional indications (such as up, down, left, right, front, back, etc.) involved in the embodiments of the present invention, the directional indications are only used to explain a certain posture (as shown in the accompanying drawings). If the specific posture changes, the directional indication also changes accordingly.
随着科学计算和机器学习训练等的高速发展与广泛应用,能够支持浮点数据处理的乘法单元应运而生。常规定点乘法器的输入比特数固定,不能满足多精度计算的要求,无法针对应用需求,最大化的利用硬件资源从而提高能效比和吞吐率。因此产生了多精度浮点乘法运算的方法。然而部分现有的支持多精度浮点乘法运算的方法在实现多精度乘法时,需要将产生的乘积用补零方法分离为两个并行部分,进而导致系统模块的利用率降低;还有部分现有的支持多精度浮点乘法运算的方法在实现多精度乘法时,需要采用不同的尾数划分方案,如使架构基于15位乘法器,其优化用以支持FP128精度的浮点乘法运算,但用于其他精度的浮点乘法运算时,将会产生了大量的比特冗余和硬件资源浪费。简言之,常规定点乘法器的输入比特数固定,不能满足多精度计算的要求,无法针对应用需求,最大化的利用硬件资源从而提高能效比和吞吐率;而现有的支持多精度浮点乘法运算的运算方法又存在精度损失以及比特冗余、硬件利用率低等问题。With the rapid development and wide application of scientific computing and machine learning training, multiplication units that can support floating-point data processing emerge as the times require. The number of input bits of the conventional fixed-point multiplier is fixed, which cannot meet the requirements of multi-precision computing, and cannot maximize the use of hardware resources to improve the energy efficiency ratio and throughput rate according to application requirements. Hence the method for multi-precision floating-point multiplication. However, some existing methods that support multi-precision floating-point multiplication operations need to separate the generated product into two parallel parts by zero-filling when implementing multi-precision multiplication, which leads to a reduction in the utilization of system modules; Some methods that support multi-precision floating-point multiplication require different mantissa division schemes when implementing multi-precision multiplication. For example, the architecture is based on a 15-bit multiplier, which is optimized to support FP128-precision floating-point multiplication, but uses When performing floating-point multiplication operations of other precisions, a large amount of bit redundancy and waste of hardware resources will be generated. In short, the fixed-point multiplier has a fixed number of input bits, which cannot meet the requirements of multi-precision computing, and cannot maximize the use of hardware resources for application requirements to improve energy efficiency ratio and throughput; while the existing ones support multi-precision floating-point. The multiplication operation method also has problems such as loss of precision, bit redundancy, and low hardware utilization.
基于现有技术的上述缺陷,本发明提供一种适用于多精度计算的可重构浮点乘加运算方法,通过采用统一的方法对不同精度的浮点的尾数进行划分,得到多个比特段,并调用不同数量的同一类单元乘法器实现多个比特段的乘法运算在一个周期内完成并输出对应的乘积,然后对所述乘积进行移位相加操作后即可得到浮点的尾数相乘的运算结果。本发明采用统一的尾数划分方案避免了比特冗余的问题,采用统一的单元乘法器提高了硬件利用率,还可以实现半精度浮点数的乘累加运算、单精度浮点数的乘累加运算 和双精度浮点数的乘累加运算。解决了现有技术中支持多精度浮点乘法运算的运算方法会产生比特冗余、硬件利用率低等情况的问题。Based on the above-mentioned defects of the prior art, the present invention provides a reconfigurable floating-point multiply-add operation method suitable for multi-precision calculation. By adopting a unified method to divide the mantissas of floating-point numbers of different precisions, multiple bit segments are obtained. , and call different numbers of the same type of unit multipliers to complete the multiplication operation of multiple bit segments in one cycle and output the corresponding product, and then shift and add the product to obtain the mantissa phase of the floating point. The result of the multiplication operation. The invention adopts a unified mantissa division scheme to avoid the problem of bit redundancy, adopts a unified unit multiplier to improve the hardware utilization rate, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, the multiply-accumulate operation of single-precision floating-point numbers, and the double-accumulation operation of double-precision floating-point numbers. Multiply-accumulate operation of precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.
如图1所示,所述方法包括如下步骤:As shown in Figure 1, the method includes the following steps:
步骤S100、获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个。Step S100: Obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several target segments include one.
在浮点数的乘法运算中,乘法运算结果的指数部分为相乘的两个浮点数的指数部分之和,乘法运算结果的尾数部分为相乘的两个浮点数的尾数之积。本实施例主要是针对浮点数的乘法运算中生成乘法运算结果的尾数部分,即相乘的两个浮点数的尾数之积的方法进行优化。具体地,首先本实施例需要获取待运算浮点数的有效数字,所述有效数字指的是待运算浮点数的尾数中需要参与乘法运算的数据,只有首先确定了需要参与乘法运算的有效数字,才能进行后续的乘法运算。获取到所述有效数字以后,本实施例需要基于所述有效数字生成一个或者多个目标段,然后再将所述目标段作为单元乘法器的输入数据。In the multiplication of floating-point numbers, the exponent part of the multiplication result is the sum of the exponent parts of the two floating-point numbers to be multiplied, and the mantissa part of the multiplication result is the product of the mantissas of the two multiplied floating-point numbers. This embodiment mainly optimizes the method for generating the mantissa part of the multiplication result in the multiplication of floating-point numbers, that is, the product of the mantissas of two multiplied floating-point numbers. Specifically, firstly, the present embodiment needs to obtain the significant digits of the floating-point number to be operated, and the significant digits refer to the data that needs to participate in the multiplication operation in the mantissa of the floating-point number to be operated. Only the significant digits that need to participate in the multiplication operation are determined first, Subsequent multiplication operations can be performed. After the significant figures are obtained, this embodiment needs to generate one or more target segments based on the significant figures, and then use the target segments as input data of the unit multiplier.
所述步骤S100包括如下步骤:The step S100 includes the following steps:
步骤S110、在所述待运算浮点数的尾数部分添加1比特整数;Step S110, adding a 1-bit integer to the mantissa part of the floating-point number to be operated;
步骤S120、将添加完毕以后得到的浮点数的有效位数上的数字作为所述待运算浮点数的有效数字;Step S120, taking the number on the significant digits of the floating-point number obtained after adding as the significant number of the floating-point number to be operated;
步骤S130、当所述有效数字的比特位数大于所述单元乘法器的比特位数时,根据所述单元乘法器的比特位数对所述有效数字进行划分,划分后生成若干个目标段;所述若干个包括一个。Step S130, when the number of bits of the significant figure is greater than the number of bits of the unit multiplier, divide the significant number according to the number of bits of the unit multiplier, and generate several target segments after division; The several include one.
具体地,为了获取到所述待运算浮点数的有效数字,本实施例首先需要在所述待运算浮点数的尾数部分添加1比特整数,然后将添加完毕以后得到的浮点数的有效位上的数字作为所述待运算浮点数的有效数字。举例说明,对于半精度浮点数(浮点16比特,FP16)的尾数部分加上1比特整数后,其有效位数为11位;对于单精度浮点数(浮点32 比特数,FP32)的尾数部分加上1比特整数后,其有效位数为24位;对于双精度浮点数(浮点64比特数,FP64)的尾数部分加上1比特整数后,其有效位数为53位。获取到有效数字以后,本实施例还需要根据所述有效数字得到目标段,并将所述目标段作为后续单元乘法器的输入数据。Specifically, in order to obtain the significant digits of the floating-point number to be calculated, in this embodiment, a 1-bit integer needs to be added to the mantissa part of the floating-point number to be calculated, and then the significant digits of the floating-point number obtained after adding are added. The number is used as the significant figure of the floating point number to be operated on. For example, after adding a 1-bit integer to the mantissa part of a half-precision floating point number (floating point 16-bit, FP16), the significant number of digits is 11; for a single-precision floating point number (floating point 32-bit number, FP32) The mantissa After adding a 1-bit integer to the part, the significand is 24 bits; for a double-precision floating point number (floating point 64-bit number, FP64), after adding a 1-bit integer to the mantissa, the significand is 53 bits. After the significant figures are obtained, in this embodiment, the target segment needs to be obtained according to the significant figures, and the target segment is used as the input data of the subsequent unit multiplier.
具体地,本实施例需要比较所述有效数字的比特位数与所述单元乘法器的比特位数,并最终判断应该对所述有效数字进行何种处理,进而生成目标段。当所述有效数字的比特位数小于或者等于所述单元乘法器的比特位数时,可以直接将所述有效数字作为目标段输入所述单元乘法器中。举例说明,如图2所示,当所述单元乘法器为14位基本单元乘法器时,16比特浮点数的有效数字只有11位,因此不需要对16比特浮点数的有效数字进行划分,可以直接将其作为一个目标段。Specifically, in this embodiment, it is necessary to compare the number of bits of the significant figure and the number of bits of the unit multiplier, and finally determine what processing should be performed on the significant number, and then generate the target segment. When the number of bits of the significant figure is less than or equal to the number of bits of the unit multiplier, the significant number may be directly input into the unit multiplier as a target segment. For example, as shown in Figure 2, when the unit multiplier is a 14-bit basic unit multiplier, the significant figures of the 16-bit floating-point numbers have only 11 bits, so there is no need to divide the significant figures of the 16-bit floating-point numbers. Use it directly as a target segment.
当所述有效数字的比特位数大于所述单元乘法器的比特位数时,很明显无法将所述有效数字直接输入所述单元乘法器中,因此需要对所述有效数字进行划分,再将划分后生成的若干个目标段输入单元乘法器中。举例说明,当所述单元乘法器为14位基本单元乘法器时,32比特浮点数的有效数字为24位,因此需要对该有效数字进行划分,生成2个12比特的目标段。同理可得,64比特浮点数的有效数字为53位,因此也需要对该有效数字进行划分,进而生成14:13:13:13的4个目标段。When the number of bits of the significant figure is greater than the number of bits of the unit multiplier, it is obvious that the significant number cannot be directly input into the unit multiplier, so the significant number needs to be divided, and then Several target segments generated after division are input into the unit multiplier. For example, when the unit multiplier is a 14-bit basic unit multiplier, the significant figure of a 32-bit floating point number is 24 bits, so the significant figure needs to be divided to generate two 12-bit target segments. In the same way, the significant figure of a 64-bit floating point number is 53 bits, so the significant figure also needs to be divided, and then 4 target segments of 14:13:13:13 are generated.
获取到目标段以后,需要将所述目标段输入单元乘法器,因此如图1所示,所述方法还包括如下步骤:After the target segment is acquired, the target segment needs to be input into the unit multiplier, so as shown in FIG. 1 , the method further includes the following steps:
步骤S200、根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积。Step S200: Determine the number of called unit multipliers according to the precision of the floating-point number to be operated, take a target segment as an operand of a unit multiplier, and obtain a product generated by the unit multiplier based on the operand.
本实施例首先需要根据所述待运算浮点数的精度确定调用的单位乘法的个数。然后将得到的一个目标段作为一个单元乘法器的一个操作数,可以理解的是一个单元乘法器需要两个操作数才能进行乘法运算,一个操作数作为乘数,另一个操作数作为被乘数。然后获取所述单元乘法器基于所述操作数生成的乘积。在乘法中,如果乘数是两位或两 位以上的数,乘的时候,就要用乘数的每一位去乘被乘数,每次乘得的积,叫做乘积,或叫做不完全积。In this embodiment, firstly, the number of called unit multiplications needs to be determined according to the precision of the floating-point number to be operated. Then use the obtained target segment as an operand of a unit multiplier. It can be understood that a unit multiplier needs two operands to perform multiplication, one operand is used as a multiplier, and the other operand is used as a multiplicand . The product generated by the unit multiplier based on the operands is then obtained. In multiplication, if the multiplier is a number with two or more digits, when multiplying, each digit of the multiplier must be used to multiply the multiplicand, and the product obtained each time is called the product, or called incomplete. product.
所述步骤S200具体包括如下步骤:The step S200 specifically includes the following steps:
步骤210、根据所述待运算浮点数的精度确定调用的单元乘法器的个数;Step 210, determining the number of called unit multipliers according to the precision of the floating-point number to be operated;
步骤220、将一个目标段作为一个单元乘法器的一个操作数;Step 220, using a target segment as an operand of a unit multiplier;
步骤230、将所述操作数输入所述单元乘法器后生成若干行乘积。Step 230: Input the operand into the unit multiplier to generate several row products.
本实施例中首先需要确定调用的单元乘法器的个数。在一种实现方式中,当所述单元乘法器为14比特乘法器时,所述根据所述待运算浮点数的精度和对数确定调用的单元乘法器的个数包括:当所述待运算浮点数为半精度浮点数时,n对待运算浮点数调用n个单元乘法器;当所述待运算浮点数为单精度浮点数时,n对待运算浮点数调用4n个单元乘法器;当所述待运算浮点数为双精度浮点数时,n对待运算浮点数调用16n个单元乘法器;n为大于0的整数。In this embodiment, it is first necessary to determine the number of called unit multipliers. In an implementation manner, when the unit multiplier is a 14-bit multiplier, the determining the number of unit multipliers to call according to the precision and logarithm of the floating-point number to be operated includes: when the to-be-operated floating-point number is When the floating-point number is a half-precision floating-point number, n calls n-unit multipliers for the floating-point number to be operated; when the floating-point number to be operated is a single-precision floating-point number, n calls 4n-unit multipliers for the floating-point number to be operated; When the floating-point number to be operated is a double-precision floating-point number, n calls 16n unit multipliers for the floating-point number to be operated; n is an integer greater than 0.
举例说明,假设所述单元乘法器为14比特基本单元乘法器,当需要同时计算16对半精度浮点数的乘累加运算结果时,需要调用16个14比特基本单元乘法器,理由如下,因为本实施例是基于单元乘法器的比特位数对所述有效数字进行划分,半精度浮点数的有效数字可以直接作为一个目标段,因此同时计算16对半精度浮点数的乘累加运算结果时,每一对半精度浮点数需要调用1个14比特基本单元乘法器,一共需要调用16个14比特基本单元乘法器。同理,当需要同时计算4对单精度浮点数的乘累加运算结果时,也需要调用16个14比特基本单元乘法器。因为单精度浮点数的有效数字需要划分后才能输入单元乘法器中,其划分结果是生成2个目标段,则1对单精度浮点数对应的4个目标段之间有2*2=4种乘法组合方式,即需要4个单元乘法器,则4对单精度浮点数的乘累加运算就需要4*4=16个单元乘法器。同理当需要同时计算1对双精度浮点数的乘累加运算结果时,也需要调用16个14比特基本单元乘法器,理由如下,因为双精度浮点数的有效数字划分后生成4个目标段,则1对双精度浮点数对应的8个目标段之间有4*4=16种乘法组 合方式,因此一共需要16个14比特基本单元乘法器。For example, assuming that the unit multiplier is a 14-bit basic unit multiplier, when you need to calculate the multiplication and accumulation results of 16 pairs of half-precision floating-point numbers at the same time, you need to call 16 14-bit basic unit multipliers for the following reasons. The embodiment is to divide the significant figures based on the number of bits of the unit multiplier, and the significant figures of the half-precision floating-point numbers can be directly used as a target segment. A pair of half-precision floating-point numbers needs to call a 14-bit basic unit multiplier, and a total of 16 14-bit basic unit multipliers need to be called. Similarly, when the multiplication and accumulation operation results of 4 pairs of single-precision floating-point numbers need to be calculated at the same time, 16 14-bit basic unit multipliers also need to be called. Because the significant digits of single-precision floating-point numbers need to be divided before they can be input into the unit multiplier, and the result of the division is to generate 2 target segments, then there are 2*2=4 types between the 4 target segments corresponding to 1 pair of single-precision floating-point numbers In the multiplication and combination mode, 4 unit multipliers are required, and 4*4=16 unit multipliers are required for the multiply-accumulate operation of 4 pairs of single-precision floating-point numbers. Similarly, when the multiplication and accumulation results of a pair of double-precision floating-point numbers need to be calculated at the same time, 16 14-bit basic unit multipliers also need to be called for the following reasons, because the significant digits of double-precision floating-point numbers are divided to generate 4 target segments, then There are 4*4=16 multiplication combinations between the 8 target segments corresponding to a pair of double-precision floating-point numbers, so a total of 16 14-bit basic unit multipliers are required.
此外,由于对有效数字进行划分后有可能生成比特位数不相等的目标段,因此在一种实现方式中,将一个目标段作为一个单元乘法器的一个操作数之前还包括:当所述待运算浮点数应的目标段的比特位数不相等时,对比特位数最小的目标段进行补位操作,所述补位操作可以以补零的方式实现。举例说明,当所述单元乘法器为14比特基本单元乘法器时,双精度浮点数对应的有效数字为53位,划分后生成的14:13:13:13的4个目标端,则需要对13比特的目标段进行补零操作。In addition, since it is possible to generate target segments with unequal number of bits after the significant digits are divided, in an implementation manner, before using a target segment as an operand of a unit multiplier, it further includes: when the to-be-to-be-multiplier is used When the number of bits of the target segment to which the floating-point number should be operated is not equal, a complementing operation is performed on the target segment with the smallest number of bits, and the complementing operation may be implemented in the form of zero-filling. For example, when the unit multiplier is a 14-bit basic unit multiplier, the significant figure corresponding to the double-precision floating-point number is 53 bits, and the four target ends of 14:13:13:13 generated after division need to The 13-bit target segment is zero-padded.
然后将一个目标段作为一个单元乘法器的一个操作数,之后获取所述单元乘法器生成的若干行乘积。具体地,所述操作数输入到单元乘法器以后,在所述单元乘法器里会通过无符号位布斯(booth)对所述操作数进行编码后生成若干行乘积(如图3所示)。A target segment is then used as one operand of a unit multiplier, after which several row products generated by the unit multiplier are taken. Specifically, after the operand is input to the unit multiplier, the unit multiplier encodes the operand through an unsigned bit booth and generates several row products (as shown in FIG. 3 ). .
获取到乘积以后,为了获得浮点数的乘累加运算的结果,如图1所示,所述方法还包括如下步骤:After the product is obtained, in order to obtain the result of the multiply-accumulate operation of floating-point numbers, as shown in FIG. 1 , the method further includes the following steps:
步骤S300、对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。Step S300: Perform a shift-add operation on the product, and use an operation result generated based on the shift-add operation as a result of the multiply-accumulate operation of the floating-point number to be operated.
本实施例获取到单元乘法器输出的乘积以后,为了获取到准确的乘法运算结果,还需要对得到的乘积进行移位相加操作以后,再将移位相加操作后的运算结果作为所述待运算浮点数的乘累加运算的结果。In this embodiment, after the product output by the unit multiplier is obtained, in order to obtain an accurate multiplication result, it is necessary to perform a shift-add operation on the obtained product, and then use the result of the shift-add operation as the above-mentioned operation result. The result of the multiply-accumulate operation of the floating-point number to be operated on.
在一种实现方式中,所述步骤S300具体包括如下步骤:In an implementation manner, the step S300 specifically includes the following steps:
步骤S310、将所述乘积输入预设的加法树中;Step S310, inputting the product into a preset addition tree;
步骤S320、计算所述乘积的位移量,通过所述加法树根据所述位移量对所述乘积进行移位操作;Step S320, calculating the displacement of the product, and performing a shift operation on the product according to the displacement through the addition tree;
步骤S330、对所述移位操作后得到的数据进行求和操作后得到所述待运算浮点数的乘累加运算的结果。Step S330 , performing a summation operation on the data obtained after the shift operation to obtain a result of the multiply-accumulate operation of the floating-point number to be operated.
本实施例针对生成目标段的方案以及单元乘法器的使用情况,预先设置了一个加法 树,以实现数据的无损处理。具体地,获取到乘积以后,将所述乘积输入到所述加法树中,然后在所述加法树中计算出所述乘积的位移量,再根据所述位移量对所述乘积进行移位操作。具体地,所述加法树中计算出的位移量包括内部位移量以及外部位移量中至少一种位移量,即可以仅包括内部位移量,也可以仅包括外部位移量,还可以同时包括内部位移量和外部位移量,位移量的最终值需要具体根据待运算浮点数的精度以及计算对数确定。其中,所述内部位移量的计算方式为将基于所述待运算浮点数划分出的段数的高低位之和作为所述段数对应的乘积的内部移位量。所述外部位移量的计算方式为:将所述待运算浮点数的指数部分相加得到指数和,将得到的所有指数和的最大值作为参考值,然后将所述参考值与所述指数和作差得到指数差,最后将所述指数差作为所述待运算浮点数对应的乘积的外部移位量。In this embodiment, an addition tree is preset for the scheme of generating the target segment and the usage of the unit multiplier, so as to realize the lossless processing of the data. Specifically, after the product is obtained, the product is input into the addition tree, then the displacement of the product is calculated in the addition tree, and then the product is shifted according to the displacement. . Specifically, the displacement calculated in the addition tree includes at least one of an internal displacement and an external displacement, that is, it may include only the internal displacement, only the external displacement, or both the internal displacement The final value of the displacement amount needs to be determined according to the precision of the floating point number to be operated and the logarithm of the calculation. The calculation method of the internal displacement amount is to use the sum of the high and low bits of the segment numbers divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment numbers. The calculation method of the external displacement is as follows: adding the exponent parts of the floating-point numbers to be operated to obtain an exponent sum, taking the maximum value of all exponent sums obtained as a reference value, and then adding the reference value to the exponent sum. A difference is obtained to obtain an exponent difference, and finally the exponent difference is used as the external shift amount of the product corresponding to the floating-point number to be operated.
简言之,当同时计算多对浮点数的时候,则浮点数本身指数部分的数字需要作为外部位移量。当对所述有效数字进行划分后,由于调用了不同的单元乘法器进行运算,虽然每一个单元乘法器输出乘积的比特位数相同,但是需要将划分出的段数的高低位之和作为内部位移量,并进行相应的高低位移位后再累加才可生成正确的浮点数的乘累加运算结果。In short, when calculating multiple pairs of floating-point numbers at the same time, the number in the exponent part of the floating-point number itself needs to be used as the external displacement. After the significant digits are divided, since different unit multipliers are called for operation, although the number of bits of the output product of each unit multiplier is the same, the sum of the high and low bits of the divided segment numbers needs to be used as the internal displacement The correct multiplication-accumulation result of floating-point numbers can be generated after accumulating the corresponding high and low bits and then accumulating.
举例说明,同时计算16对半精度浮点数的乘累加运算结果,需要调用16个14比特基本单元乘法器,由于半精度浮点数的有效数字并未进行划分,但是需要同时计算多对浮点数的乘累加运算结果,因此在这种情况下乘积的位移量只有外部位移量,首先查询16对半精度浮点数的16个指数和,将16个指数和中的最大的和作为参考值,将所述参考值分别减去16个不同的指数和,以此得到外部位移量。For example, to calculate the multiplication and accumulation results of 16 pairs of half-precision floating-point numbers at the same time, 16 14-bit basic unit multipliers need to be called. Since the significant digits of half-precision floating-point numbers are not divided, it is necessary to calculate the multiplication of multiple pairs of floating-point numbers at the same time. The result of multiply-accumulate operation, so in this case, the displacement of the product is only the external displacement. First, query the sum of 16 exponents of 16 pairs of half-precision floating-point numbers, and use the largest sum of the 16 exponent sums as the reference value. Subtract 16 different exponent sums from the above reference values to obtain the external displacement.
当同时计算4对单精度浮点数的乘累加运算结果时,由于单精度浮点数的有效数字是经过划分以后才输入单元乘法器中的,且需要同时计算多对浮点数的乘累加运算结果,因此这种情况下乘积的位移量既包含内部位移量也包含外部位移量。首先查询4对单精度浮点数的4个指数和,将4个指数和中的最大的和作为参考值,然后将所述参考值 分别减去4个不同的指数和,以此得到外部位移量。此外,还需要将基于所述待运算浮点数划分出的段数的高低位之作为所述段数对应的乘积的内部移位量,例如a 0×b 0的段数和位0+0=0,则内部位移量为0;a 1×b 0的段数和位1+0=1,则内部位移量为向左移位1×14=14位;a 3×b 1的段数和位3+1=4,则内部位移量为向左移位4×14=56位。 When calculating the multiplication-accumulation results of 4 pairs of single-precision floating-point numbers at the same time, since the significant digits of the single-precision floating-point numbers are input into the unit multiplier after division, and the multiplication-accumulation results of multiple pairs of floating-point numbers need to be calculated at the same time, Therefore, the displacement of the product in this case includes both the internal displacement and the external displacement. First, query the sum of 4 exponents of 4 pairs of single-precision floating-point numbers, take the largest sum of the 4 exponent sums as the reference value, and then subtract 4 different sums of exponents from the reference value to obtain the external displacement . In addition, it is also necessary to use the high and low bits of the segment number divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment number. For example, the segment number of a 0 ×b 0 and bit 0+0=0, then The internal displacement is 0; the segment number of a 1 ×b 0 and bit 1+0=1, then the internal displacement is 1×14=14 bits to the left; the segment number of a 3 ×b 1 and bit 3+1= 4, then the internal displacement is 4×14=56 bits to the left.
当计算1对双精度浮点数的乘累加运算结果时,由于双精度浮点数的有效数字是经过划分以后才输入单元乘法器中的,但是只需要计算1对浮点数的乘累加运算结果,因此这种情况下乘积的位移量只包含内部位移量,例如a 1×b 0结果移位量为1×14bit、a 1×b 1及a 2×b 0结果移位量均为2×14bit(如图4所示)。 When calculating the result of multiply-accumulate operation of one pair of double-precision floating-point numbers, since the significant figures of double-precision floating-point numbers are input into the unit multiplier after division, but only need to calculate the result of multiply-accumulate operation of one pair of floating-point numbers, so In this case, the displacement of the product only includes the internal displacement. For example, a 1 ×b 0 results in a displacement of 1 × 14bit, and a 1 ×b 1 and a 2 ×b 0 The resulting displacement is 2 × 14bit ( As shown in Figure 4).
计算出乘积的位移量以后,根据所述位移量对所述乘积进行移位操作,然后对所述移位操作后得到的数据进行求和操作以后,即可得到所述待运算浮点数的乘累加运算的结果。After calculating the displacement of the product, perform a shift operation on the product according to the displacement, and then perform a sum operation on the data obtained after the shift operation, and then the multiplication of the floating-point number to be operated can be obtained. The result of the accumulation operation.
基于上述实施例,本发明还提供一种适用于多精度计算的可重构浮点乘加运算单元,如图5所示,所述运算单元包括:Based on the above embodiment, the present invention also provides a reconfigurable floating-point multiply-add operation unit suitable for multi-precision calculation. As shown in FIG. 5 , the operation unit includes:
划分模块01,用于获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个;A division module 01 is used to obtain the significant figures of the floating-point numbers to be operated, and generate several target segments based on the significant figures; the several include one;
单元乘法器02,用于根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积; Unit multiplier 02, for determining the number of unit multipliers to be called according to the precision of the floating-point number to be operated, taking a target segment as an operand of a unit multiplier, and obtaining the unit multiplier based on the operand the resulting product;
加法树03,用于对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。The addition tree 03 is configured to perform a shift-add operation on the product, and use the operation result generated based on the shift-add operation as the result of the multiply-accumulate operation of the floating-point number to be operated.
在一种实现方式中,为了使所述运算单元可以实现半精度浮点数、单精度浮点数以及双精度浮点数的乘累加运算,所述运算单元中包含16n个单元乘法器,n为非负数。举例说明,如图6所示,当所述运算单元中包含16个14比特基本单元乘法器时,每个单元乘法器可以实现1组半精度浮点数乘法运算,因此可以同时实现16对半精度浮点数的乘 累加运算。每4个单元乘法器可以实现1组单精度浮点数乘法运算,因此还可以同时实现4对单精度浮点数的乘累加运算。16个单元乘法器可以实现1组双精度浮点数乘法运算,因此还可以实现1对双精度浮点数的乘累加运算。图6是本发明提供的可以实现3种不同精度的浮点数的乘累加运算的最小运算单元的参考图。因此本发明实施例可在无限制硬件资源的情况下在一个时钟周期内至少完成多对半精度浮点数的乘累加运算、多对单精度浮点数的乘累加运算或者1对双精度浮点数的乘累加运算。与固定的FP32和FP64乘加单元相比,本发明提供的运算单元可分别提高4倍和16倍的最大吞吐率。In an implementation manner, in order to enable the operation unit to realize the multiply-accumulate operation of half-precision floating-point numbers, single-precision floating-point numbers and double-precision floating-point numbers, the operation unit includes 16n unit multipliers, and n is a non-negative number . For example, as shown in FIG. 6 , when the operation unit includes 16 14-bit basic unit multipliers, each unit multiplier can realize 1 set of half-precision floating-point multiplication operations, so 16 pairs of half-precision can be realized at the same time Multiplication and accumulation of floating-point numbers. Every 4 unit multipliers can realize 1 set of single-precision floating-point multiplication operations, so it can also realize the multiply-accumulate operation of 4 pairs of single-precision floating-point numbers at the same time. The 16 unit multipliers can realize a set of double-precision floating-point multiplication operations, so it can also realize a multiply-accumulate operation of a pair of double-precision floating-point numbers. FIG. 6 is a reference diagram of a minimum operation unit provided by the present invention that can realize the multiply-accumulate operation of three floating-point numbers of different precisions. Therefore, the embodiments of the present invention can at least complete the multiply-accumulate operations of multiple pairs of half-precision floating-point numbers, the multiply-accumulate operations of multiple pairs of single-precision floating-point numbers, or the multiply-accumulate operations of one pair of double-precision floating-point numbers within one clock cycle without limiting hardware resources. Multiply and accumulate operations. Compared with the fixed FP32 and FP64 multiply-add units, the arithmetic unit provided by the present invention can increase the maximum throughput rate by 4 times and 16 times respectively.
综上所述,本发明公开了一种适用于多精度计算的可重构浮点乘加运算单元及方法,通过采用统一的方法对不同精度的浮点的尾数进行划分,得到多个比特段,并调用不同数量的同一类单元乘法器实现多个比特段的乘法运算在一个周期内完成并输出对应的乘积,然后对所述乘积进行移位相加操作后即可得到浮点数的乘累加运算结果。本发明采用统一的尾数划分方案避免了比特冗余的问题,采用统一的单元乘法器提高了硬件利用率,还可以实现半精度浮点数的乘累加运算、单精度浮点数的乘累加运算和双精度浮点数的乘累加运算。解决了现有技术中支持多精度浮点乘法运算的运算方法会产生比特冗余、硬件利用率低等情况的问题。To sum up, the present invention discloses a reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation. By adopting a unified method to divide the mantissas of floating-point numbers of different precisions, a plurality of bit segments are obtained. , and call different numbers of the same type of unit multipliers to complete the multiplication operation of multiple bit segments in one cycle and output the corresponding product, and then perform the shift and addition operation on the product to obtain the multiplication and accumulation of floating-point numbers. Operation result. The invention adopts a unified mantissa division scheme to avoid the problem of bit redundancy, adopts a unified unit multiplier to improve the hardware utilization rate, and can also realize the multiply-accumulate operation of half-precision floating-point numbers, the multiply-accumulate operation of single-precision floating-point numbers, and the double-accumulation operation of double-precision floating-point numbers. Multiply-accumulate operation of precision floating-point numbers. It solves the problems of bit redundancy and low hardware utilization in the operation method supporting multi-precision floating-point multiplication in the prior art.
应当理解的是,本发明的应用不限于上述的举例,对本领域普通技术人员来说,可以根据上述说明加以改进或变换,所有这些改进和变换都应属于本发明所附权利要求的保护范围。It should be understood that the application of the present invention is not limited to the above examples. For those of ordinary skill in the art, improvements or transformations can be made according to the above descriptions, and all these improvements and transformations should belong to the protection scope of the appended claims of the present invention.

Claims (10)

  1. 一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述方法包括:A reconfigurable floating-point multiply-add operation method suitable for multi-precision computing, characterized in that the method comprises:
    获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个;Obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several include one;
    根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积;Determine the number of called unit multipliers according to the precision of the floating-point number to be operated, take a target segment as an operand of a unit multiplier, and obtain a product generated by the unit multiplier based on the operand;
    对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。A shift-add operation is performed on the product, and an operation result generated based on the shift-add operation is used as the result of the multiply-accumulate operation of the floating-point number to be operated.
  2. 根据权利要求1所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 1, wherein the obtaining the significant digits of the floating-point number to be operated, generates several target segments based on the significant digits ; the several include one including:
    在所述待运算浮点数的尾数部分添加1比特整数;Add a 1-bit integer to the mantissa part of the floating-point number to be operated;
    将添加完毕以后得到的浮点数的有效位数上的数字作为所述待运算浮点数的有效数字;Taking the number on the significant digits of the floating-point number obtained after the addition is completed as the significant number of the floating-point number to be operated;
    当所述有效数字的比特位数大于所述单元乘法器的比特位数时,根据所述单元乘法器的比特位数对所述有效数字进行划分,划分后生成若干个目标段;所述若干个包括一个。When the number of bits of the significant figure is greater than the number of bits of the unit multiplier, the significant number is divided according to the number of bits of the unit multiplier, and after division, several target segments are generated; including one.
  3. 根据权利要求1所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 1, wherein the number of called unit multipliers is determined according to the precision of the floating-point number to be operated, Taking a target segment as an operand of a unit multiplier, obtaining the product generated by the unit multiplier based on the operand includes:
    根据所述待运算浮点数的精度确定调用的单元乘法器的个数;Determine the number of called unit multipliers according to the precision of the floating-point number to be operated;
    将一个目标段作为一个单元乘法器的一个操作数;take a target segment as an operand of a unit multiplier;
    将所述操作数输入所述单元乘法器后生成若干行乘积。A number of row products are generated after the operands are input to the unit multiplier.
  4. 根据权利要求3所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,当所述单元乘法器为14比特乘法器时,所述根据所述待运算浮点数的精度和对数 确定调用的单元乘法器的个数包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 3, wherein when the unit multiplier is a 14-bit multiplier, the floating-point multiplier according to the to-be-operated multiplier Point precision and logarithms determine the number of cell multipliers called by:
    当所述待运算浮点数为半精度浮点数时,n对待运算浮点数调用n个单元乘法器;When the floating-point number to be operated is a half-precision floating-point number, n calls n unit multipliers for the floating-point number to be operated;
    当所述待运算浮点数为单精度浮点数时,n对待运算浮点数调用4n个单元乘法器;When the floating-point number to be operated is a single-precision floating-point number, n calls 4n unit multipliers for the floating-point number to be operated;
    当所述待运算浮点数为双精度浮点数时,n对待运算浮点数调用16n个单元乘法器;When the floating-point number to be operated is a double-precision floating-point number, n calls 16n unit multipliers for the floating-point number to be operated;
    n为大于0的整数。n is an integer greater than 0.
  5. 根据权利要求3所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述将所述操作数输入所述单元乘法器后生成若干行乘积包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 3, wherein the generating of several row products after the operand is input to the unit multiplier comprises:
    将所述操作数输入所述单元乘法器中,通过无符号位布斯对所述操作数进行编码后生成若干行乘积。The operand is input into the unit multiplier, and the operand is encoded by the unsigned bit Booth to generate several row products.
  6. 根据权利要求3所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,当所述待运算浮点数为双精度浮点数时,所述将一个目标段作为一个单元乘法器的一个操作数之前还包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 3, wherein when the floating-point number to be operated is a double-precision floating-point number, the target segment is used as the An operand of a unit multiplier also includes:
    当所述待运算浮点数应的目标段的比特位数不相等时,对比特位数最小的目标段进行补位操作。When the number of bits of the target segment corresponding to the floating-point number to be operated is not equal, a complement operation is performed on the target segment with the smallest number of bits.
  7. 根据权利要求1所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果包括:The reconfigurable floating-point multiply-add operation method suitable for multi-precision computation according to claim 1, wherein the shift-add operation is performed on the product, and the shift-add operation is performed based on the shift-add operation. The operation result generated by the addition operation, as the result of the multiply-accumulate operation of the floating-point number to be operated, includes:
    将所述乘积输入预设的加法树中;inputting the product into a preset addition tree;
    计算所述乘积的位移量,通过所述加法树根据所述位移量对所述乘积进行移位操作;Calculate the displacement amount of the product, and perform a shift operation on the product according to the displacement amount through the addition tree;
    对所述移位操作后得到的数据进行求和操作后得到所述待运算浮点数的乘累加运算的结果。After the data obtained after the shift operation is summed, the result of the multiply-accumulate operation of the floating-point number to be operated is obtained.
  8. 根据权利要求7所述的一种适用于多精度计算的可重构浮点乘加运算方法,其特征在于,所述位移量包括内部位移量以及外部位移量中至少一种位移量;The reconfigurable floating-point multiply-add operation method suitable for multi-precision computing according to claim 7, wherein the displacement includes at least one of an internal displacement and an external displacement;
    所述内部位移量的计算方式为:将基于所述待运算浮点数划分出的段数的高低位之和作为所述段数对应的乘积的内部移位量;The calculation method of the internal displacement amount is: taking the sum of the high and low bits of the segment numbers divided based on the floating-point number to be operated as the internal shift amount of the product corresponding to the segment number;
    所述外部位移量的计算方式为:将所述待运算浮点数的指数部分相加得到指数和,将得到的所有指数和的最大值作为参考值;将所述参考值与所述指数和作差得到指数差,将所述指数差作为所述待运算浮点数对应的乘积的外部移位量。The calculation method of the external displacement is as follows: adding the exponent parts of the floating-point numbers to be operated to obtain an exponent sum, and taking the maximum value of all the exponent sums obtained as a reference value; The difference obtains the exponent difference, and the exponent difference is used as the external shift amount of the product corresponding to the floating-point number to be operated.
  9. 一种适用于多精度计算的可重构浮点乘加运算单元,其特征在于,所述运算单元包括:A reconfigurable floating-point multiply-add operation unit suitable for multi-precision computing, characterized in that the operation unit includes:
    划分模块,用于获取待运算浮点数的有效数字,基于所述有效数字生成若干个目标段;所述若干个包括一个;A division module, used to obtain the significant digits of the floating-point number to be operated, and generate several target segments based on the significant digits; the several include one;
    单元乘法器,用于根据所述待运算浮点数的精度确定调用的单元乘法器的个数,将一个目标段作为一个单元乘法器的操作数,获取所述单元乘法器基于所述操作数生成的乘积;a unit multiplier, used for determining the number of unit multipliers to be called according to the precision of the floating-point number to be operated, taking a target segment as an operand of a unit multiplier, and obtaining the unit multiplier generated based on the operand the product of ;
    加法树,用于对所述乘积进行移位相加操作,并将基于所述移位相加操作生成的运算结果作为所述待运算浮点数的乘累加运算的结果。An addition tree, configured to perform a shift-add operation on the product, and use an operation result generated based on the shift-add operation as a result of the multiply-accumulate operation of the floating-point number to be operated.
  10. 根据权利要求9所述的一种适用于多精度计算的可重构浮点乘加运算单元,其特征在于,所述运算单元中包含16n个单元乘法器,n为非负数。A reconfigurable floating-point multiply-add operation unit suitable for multi-precision calculation according to claim 9, characterized in that, the operation unit includes 16n unit multipliers, and n is a non-negative number.
PCT/CN2021/131745 2021-02-09 2021-11-19 Reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation WO2022170809A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110178984.2A CN112860220B (en) 2021-02-09 2021-02-09 Reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation
CN202110178984.2 2021-02-09

Publications (1)

Publication Number Publication Date
WO2022170809A1 true WO2022170809A1 (en) 2022-08-18

Family

ID=75989427

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/131745 WO2022170809A1 (en) 2021-02-09 2021-11-19 Reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation

Country Status (2)

Country Link
CN (1) CN112860220B (en)
WO (1) WO2022170809A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115827555A (en) * 2022-11-30 2023-03-21 格兰菲智能科技有限公司 Data processing method, computer device, storage medium and multiplier structure
CN117908835A (en) * 2024-03-20 2024-04-19 南京邮电大学 Method for accelerating SM2 cryptographic algorithm based on floating point number computing capability

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112860220B (en) * 2021-02-09 2023-03-24 南方科技大学 Reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation
US20230083270A1 (en) * 2021-09-14 2023-03-16 International Business Machines Corporation Mixed signal circuitry for bitwise multiplication with different accuracies
CN114237551B (en) * 2021-11-26 2022-11-11 南方科技大学 Multi-precision accelerator based on pulse array and data processing method thereof
CN117435164A (en) * 2022-07-15 2024-01-23 格兰菲智能科技有限公司 High-performance multiply-add device, multiply-add method and electronic equipment
CN116301717A (en) * 2022-11-22 2023-06-23 中国科学院自动化研究所 Method and device for determining multiply-add sum, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107273090A (en) * 2017-05-05 2017-10-20 中国科学院计算技术研究所 Towards the approximate floating-point multiplier and floating number multiplication of neural network processor
CN109739555A (en) * 2019-01-04 2019-05-10 腾讯科技(深圳)有限公司 Chip, terminal and control method including multiplying accumulating module
CN111492343A (en) * 2017-12-21 2020-08-04 高通股份有限公司 System and method for floating-point multiply operation processing
CN112860220A (en) * 2021-02-09 2021-05-28 南方科技大学 Reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8468191B2 (en) * 2009-09-02 2013-06-18 Advanced Micro Devices, Inc. Method and system for multi-precision computation
CN104111816B (en) * 2014-06-25 2017-04-12 中国人民解放军国防科学技术大学 Multifunctional SIMD structure floating point fusion multiplying and adding arithmetic device in GPDSP
CN109062540B (en) * 2018-06-06 2022-11-25 北京理工大学 Reconfigurable floating point operation device based on CORDIC algorithm
WO2021056507A1 (en) * 2019-09-29 2021-04-01 深圳市大疆创新科技有限公司 Method for processing floating point number, device, and mobile platform

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107273090A (en) * 2017-05-05 2017-10-20 中国科学院计算技术研究所 Towards the approximate floating-point multiplier and floating number multiplication of neural network processor
CN111492343A (en) * 2017-12-21 2020-08-04 高通股份有限公司 System and method for floating-point multiply operation processing
CN109739555A (en) * 2019-01-04 2019-05-10 腾讯科技(深圳)有限公司 Chip, terminal and control method including multiplying accumulating module
CN112860220A (en) * 2021-02-09 2021-05-28 南方科技大学 Reconfigurable floating-point multiply-add operation unit and method suitable for multi-precision calculation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MAO WEI; LI KAI; XIE XINANG; ZHAO SHIRUI; LI HE; YU HAO: "A Reconfigurable Multiple-Precision Floating-Point Dot Product Unit for High-Performance Computing", 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), EDAA, 1 February 2021 (2021-02-01), pages 1793 - 1798, XP033941161, DOI: 10.23919/DATE51398.2021.9473928 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115827555A (en) * 2022-11-30 2023-03-21 格兰菲智能科技有限公司 Data processing method, computer device, storage medium and multiplier structure
CN115827555B (en) * 2022-11-30 2024-05-28 格兰菲智能科技有限公司 Data processing method, computer device, storage medium, and multiplier structure
CN117908835A (en) * 2024-03-20 2024-04-19 南京邮电大学 Method for accelerating SM2 cryptographic algorithm based on floating point number computing capability
CN117908835B (en) * 2024-03-20 2024-05-17 南京邮电大学 Method for accelerating SM2 cryptographic algorithm based on floating point number computing capability

Also Published As

Publication number Publication date
CN112860220B (en) 2023-03-24
CN112860220A (en) 2021-05-28

Similar Documents

Publication Publication Date Title
WO2022170809A1 (en) Reconfigurable floating point multiply-accumulate operation unit and method suitable for multi-precision calculation
US6847985B1 (en) Floating point divide and square root processor
US4969118A (en) Floating point unit for calculating A=XY+Z having simultaneous multiply and add
CN107168678B (en) Multiply-add computing device and floating-point multiply-add computing method
US8606840B2 (en) Apparatus and method for floating-point fused multiply add
US4949296A (en) Method and apparatus for computing square roots of binary numbers
US10949168B2 (en) Compressing like-magnitude partial products in multiply accumulation
US20140089371A1 (en) Mixed precision fused multiply-add operator
CA2530015C (en) Division and square root arithmetic unit
EP0472139A2 (en) A floating-point processor
US20100125621A1 (en) Arithmetic processing device and methods thereof
US8316071B2 (en) Arithmetic processing unit that performs multiply and multiply-add operations with saturation and method therefor
US8060551B2 (en) Method and apparatus for integer division
WO2020191417A2 (en) Techniques for fast dot-product computation
KR20170138143A (en) Method and apparatus for fused multiply-add
US5260889A (en) Computation of sticky-bit in parallel with partial products in a floating point multiplier unit
US20200133633A1 (en) Arithmetic processing apparatus and controlling method therefor
US5278782A (en) Square root operation device
GB2511314A (en) Fast fused-multiply-add pipeline
US5206826A (en) Floating-point division cell
Gonzalez-Navarro et al. A binary integer decimal-based multiplier for decimal floating-point arithmetic
Lutz et al. Fused FP8 4-Way Dot Product With Scaling and FP32 Accumulation
KR20040033198A (en) Floating point with multiply-add unit
US20230144030A1 (en) Multi-input multi-output adder and operating method thereof
Balasaraswathi et al. IMPLEMENTATION OF FLOATING POINT FFT PROCESSOR WITH SINGLE PRECISION FOR REDUCTION IN POWER

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21925456

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21925456

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 21925456

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20.02.2024)