CN107168928A - The eight point Winograd Fourier transformers without rearrangement - Google Patents

The eight point Winograd Fourier transformers without rearrangement Download PDF

Info

Publication number
CN107168928A
CN107168928A CN201710303794.2A CN201710303794A CN107168928A CN 107168928 A CN107168928 A CN 107168928A CN 201710303794 A CN201710303794 A CN 201710303794A CN 107168928 A CN107168928 A CN 107168928A
Authority
CN
China
Prior art keywords
vector
point
matrix
complex multiplier
reordering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201710303794.2A
Other languages
Chinese (zh)
Inventor
刘明璐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
RONGCHENG DINGTONG ELECTRONIC INFORMATION TECHNOLOGY Co Ltd
Original Assignee
RONGCHENG DINGTONG ELECTRONIC INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by RONGCHENG DINGTONG ELECTRONIC INFORMATION TECHNOLOGY Co Ltd filed Critical RONGCHENG DINGTONG ELECTRONIC INFORMATION TECHNOLOGY Co Ltd
Priority to CN201710303794.2A priority Critical patent/CN107168928A/en
Publication of CN107168928A publication Critical patent/CN107168928A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/14Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • G06F17/141Discrete Fourier transforms
    • G06F17/142Fast Fourier transforms, e.g. using a Cooley-Tukey type algorithm

Landscapes

  • Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Discrete Mathematics (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Complex Calculations (AREA)

Abstract

The present invention relates to a kind of eight point Winograd Fourier transformers without rearrangement, it is characterised in that the processor is main by input matrix I, variable diagonal matrix A, output matrix O and complex multiplier M1~M3Four parts are constituted.Input matrix I passes through complex multiplier M1It is multiplied with input vector v and obtains vectorial p, variable diagonal matrix A passes through complex multiplier M2It is multiplied with vectorial p and obtains vectorial q, output matrix O passes through complex multiplier M3It is multiplied with vectorial q and obtains output vector V.Invention removes the index of N points with the rearrangement operation that eight point Winograd Fourier transformations are related in sequence prime factor algorithm, control logic is simplified, arithmetic speed is improved, memory consumption has been saved, hardware cost is reduced.

Description

Eight-point Winograd Fourier transformer without reordering
Technical Field
The invention relates to the field of digital signal processing, in particular to a method for realizing a small-point Winograd fast Fourier Transform Algorithm (WFTA) without reordering.
Background
With the ever-increasing growth of wireless communication services, the available spectrum resources are increasingly strained. In order to improve the spectrum utilization rate and the communication quality, the modern wireless communication system widely adopts an Orthogonal Frequency Division Multiplexing (OFDM) technology with strong immunity to Frequency selective fading. The core of OFDM technology is the FFT. The number of points of the FFT is divided into two categories, power-of-2 and non-power-of-2. The FFT algorithm with the number of points being the power of 2 is relatively mature. In contrast, FFT with a non-power-of-2 point number is more flexible and has recently been applied to DRM, DTMB, and LTE systems. Therefore, the algorithm and implementation of the non-power-of-2 point FFT is worthy of intensive study.
At present, Prime Factor Algorithm (PFA) is the most effective non-power-2 FFT, and a nested multidimensional structure is adopted, so that the calculation complexity can be effectively reduced. For an N-point non-power-of-2 FFT, it is assumed that N can be decomposed into the product of s two-by-two prime factors, i.e., N equals N1N2…Ns. The basic principle of N-point PFA is to map a one-dimensional large-point FFT to an s-dimensional small-point FFT, and perform N/N on the ith (i ═ 1,2, …, s) -dimensional FFTiSub NiPoint-by-point FFT. The small point number FFT may be aided by Cooley-Tukey algorithms, WFTA, and other efficient algorithms.
In some cases, the PFA needs to be reordered. The reordering is divided into pre-scrambling and post-scrambling according to where it is located during the calculation. Irrespective of Ni(i-1, 2, …, s) point FFT, which is co-located if the ith dimension FFT does not need to be reordered; otherwise, it is indexed, and the reordering is at NiWithin the sequence of points, pre-scrambling and post-scrambling being performed at NiThe point FFT is performed before and after. Similarly, regardless of the internal mechanism of each-dimensional FFT, an N-point PFA is in-order if it does not need to be reordered as a whole; otherwise, it is permuted, the reordering is performed within a sequence of N points, and the pre-scrambling and post-scrambling are performed before the first-dimension FFT starts and after the last-dimension FFT ends, respectively. Thus, PFA can be theoretically divided into 4 types: indexing and sequencing, same-address and sequence, and same-address and sequence.
Currently, PFA is either indexed in order or indexed in order, inevitably introducing reordering operations. As is well known, reordering means that a level one buffer must be added, more memory resources are consumed, and hardware cost is increased. In addition, reordering reduces the computation speed and increases the complexity of control. Compared with the same-address and same-sequence PFA, the same-address and same-sequence PFA consumes less memory resources, and the total time delay of reordering is the same, N clock cycles, so the same-address and same-sequence PFA is more preferable.
Disclosure of Invention
In response to the technical disadvantage of the need for reordering in the prior art implementation of PFA, the present invention provides an eight-point Winograd fourier transformer that does not require reordering. When N-point non-2-power FFT is realized by adopting indexing same-order PFA, if a certain mutlipin factor N of Ni8 (i-1, 2, …, s), then the i-th dimension FFT need not be reordered using this patent.
To remove the reordering operation of the ith dimension 8-point FFT, the conventional 8-point WFTA needs to be modified. The diagonal matrix of the conventional 8-point WFTA is fixed and invariable, and the invention expresses the angle parameter theta as 2 pi/8 x by each element on the diagonal of the diagonal matrix<N/8>8The function of (a), wherein,<N/8>8representing a modulo-8 operation on N/8. The diagonal matrices of the modified 8-point WFTA are different for different N.
For the ith-dimension FFT, N/8 times of 8-point WFTA which does not need to be reordered is needed, reordering is not needed, control logic is simplified, N/8 × 8 is saved in total, N clock cycles are saved, operation speed is improved, memory consumption is reduced by half, and hardware cost is reduced.
The advantages and spirit of the present invention can be further understood by the following detailed description and accompanying drawings.
Drawings
FIG. 1 is a functional block diagram of a conventional eight-point Winograd Fourier transformer;
FIG. 2 is a concrete constitution of an input matrix I;
fig. 3 is a specific configuration of the output matrix O;
FIG. 4 is a specific composition on the diagonal of the diagonal matrix D;
FIG. 5 is a schematic diagram of a structure of a pre-scrambled eight-point Winograd Fourier transformer;
FIG. 6 is a schematic structural diagram of an eight-point Winograd Fourier transformer for post-scrambling;
FIG. 7 is a functional block diagram of an eight-point Winograd Fourier transformer without reordering;
fig. 8 is a specific configuration on the diagonal of the variable diagonal matrix a.
Detailed Description
The invention is further described with reference to the following figures and specific examples, which are not intended to be limiting.
FFT of N-point sequence x (N) as
Wherein N, k is 0,1, …, N-1, WN=e-j2π/N. The multiplication and addition operations for directly computing the N-point FFT are both proportional to the square of N. When N is large, the amount of calculation is large.
To reduce computational complexity, an N-point FFT can be implemented using nested multidimensional PFA when N is not a power of 2. Suppose N can be decomposed into s products of two-by-two interdependent factors, i.e., N equals N1N2…Ns. That is, any two factors NiAnd NjThe greatest common divisor of (i, j ≠ j) is 1,2, …, s, and i ≠ j. Note that NiNot necessarily a prime number. The basic principle of the N-point PFA is that one-dimensional large-point FFT is mapped into s-dimensional FFT, and the ith-dimensional FFT is used for N/NiSub NiPoint-by-point FFT. The small point number FFT may be aided by Cooley-Tukey algorithms, WFTA, and other efficient algorithms.
In order to make PFA generally in the same order, when mapping one-dimensional FFT to s-dimensional FFT, according to the chinese remainder theorem, the input index n and the output index k adopt the same mapping manner as follows:
wherein, the symbol<>NRepresenting modulo N operation, Ni,ki=0,1,…,Ni-1. The formula (2) and the formula (3) are substituted into the formula (1) and can be obtained by finishing the following steps:
wherein,
comparing equations (2) and (3) it is easy to find that the mapping manner of indices n and k is essentially identical. Therefore, as long as the index n of each-dimensional FFT in the equation (4)iAnd kiAll are naturally ordered, and N-point PFA is in-order.
In equation (4), the fourier transform factor of the i (i ═ 1,2, …, s) -th dimension FFT can be written as
Or
In the formula,
as is well known, conventional NiThe input and output of the point FFT algorithm are in natural order. If the ith dimension FFT in equation (4) adopts the conventional NiPoint FFT algorithm, then equation (5) is in terms of niNatural order input of, ki' the natural sequence is output, and the formula (6) is according to ni' Natural order input, kiIs output in a natural order. However, as can be seen from equations (2) and (3), the i-th dimension FFT in equation (4) is required for the same order PFA in terms of niNatural order input of, kiIs output in a natural order. It can be seen that if the ith dimension FFT in equation (4) employs the conventional NiPoint FFT algorithm, then reordering is necessary. Specifically, equations (5) and (6) are post-scrambled and pre-scrambled according to the rules in equations (7) and (8), respectively. It can be seen that the i-th dimension FFT in equation (4) is indexed. Indexing is achieved by reordering. To get rid of this extra operation of reordering, we have to modify the regular NiA point FFT algorithm, which takes the reordering operation into account.
Conventional NiPoint WFTA can be represented by a continuous multiplication of a vector and a matrix, i.e.
V=O*D*I*v (9)
Wherein V and V are each independently of the other NiThe vector formed by the input and output sequences of points, I and O are the input and output matrices, respectively, and D is the diagonal matrix. In general, the elements in both matrices I and O are only possible 0, ± 1 and ± j, and multiplication with a vector involves no substantial multiplication. For the diagonal matrix D, the elements at other positions are 0 except for the elements at the diagonal.
When N is presentiWhen the value is 8, the i-th dimension FFT of the N-point indexed homosequential PFA may use 8-point WFTA. Fig. 1 shows a functional block diagram of a conventional eight-point Winograd fourier transformer. The 8-point input sequence forms a vector V, the vector V is multiplied by a matrix I, the vector obtained by operation is multiplied by a diagonal matrix D, the vector obtained by operation is multiplied by a matrix O, and the vector V obtained by operation is the 8-point output sequence. Fig. 2 and 3 show specific configurations of the input matrix I and the output matrix O, respectively. FIG. 4 shows the specific composition of the diagonal matrix D, where the elements from the top left corner to the bottom right corner are D0~d7
Indexing of the ith dimension FFT of the indexed, in-order PFA is accomplished by reordering. When N is presentiFig. 5 and 6 show schematic diagrams of the eight-point Winograd fourier transformer for pre-scrambling and post-scrambling, respectively, at 8. To remove the reordering operation in pre-scrambling or post-scrambling, we have to modify the regular NiThe reordering operation is taken in 8 points WFTA.
Fig. 7 shows a functional block diagram of an eight-point Winograd fourier transformer without reordering, which mainly comprises four functional blocks, i.e., an input matrix I, a variable diagonal matrix a, an output matrix O, and a complex multiplier. The output vector and the input vector satisfy: v ═ O × a ═ I × V. Compared with the conventional 8-point WFTA, the matrixes I and O of the 8-point WFTA without reordering are kept unchanged, the diagonal matrix is not a constant any more, and each element on the diagonal is modified into an angle parameter theta of 2 pi/8<N/8>8The function of (a), wherein,<N/8>8representing a modulo-8 operation on N/8. FIG. 8 shows the specific structure of the diagonal of the variable diagonal matrix A, where the elements from the top left corner to the bottom right corner are a0~a7. The diagonal matrices of 8-point WFTAs that do not need to be reordered are different for different N.
The invention provides a method for removing 8-point WFTA reordering in indexing same-order PFA, when N isiWhen the value is 8, the i-th dimension FFT of the N-point indexed same-order PFA can be implemented by N/8 times of 8-point WFTA without reordering, and the steps are as follows:
(1) determining an angle parameter theta 2 pi/8 x from N<N/8>8Initializing the value of each element on the diagonal of the variable diagonal matrix A on the basis of the specific value of (1), so that A becomes a constant, and initializing a variable l to be 0(0 is less than or equal to l)<N/8);
(2) From the input sequence x n]Of 8 data whose index is n ═ c<N/8*m+8*l>N(0≤m<8) They constitute a vector v;
(3) by means of a complex multiplier M1Multiplying the input matrix I by the vector v to obtain a vector p;
(4) by means of a complex multiplier M2Multiplying the variable diagonal matrix A by the vector p to obtain a vector q;
(5) by means of a complex multiplier M3Multiplying the output matrix O by the vector q to obtain a vector V;
(6) writing 8 data in vector V into output sequence X [ k ] in sequence]In (2), the written index is identical to the read index, and k is still equal to<N/8*m+8*l>N
(7) And (5) gradually changing the value of l by taking 1 as a step length, and repeating the steps (2) to (6) until N/8 times of 8-point WFTA without reordering are completed.
If the i-th dimension FFT of the N-point indexed co-ordered PFA uses a regular 8-point WFTA, then 8 clock cycles are required for each reordering, which means that N/8 × 8 cycles of the regular 8-point WFTA are required for N/8 × 8 reordering. Therefore, if the ith dimension FFT of the N-point index same-sequence PFA adopts the invention, the reordering is not needed, thereby simplifying the control logic, saving N clock cycles, improving the operation speed, reducing the memory requirement by half and reducing the hardware cost.
While the present invention has been described in detail and by way of examples and embodiments, it will be apparent to those skilled in the art that various changes and modifications can be made without departing from the spirit and scope of the invention.

Claims (4)

1. An eight-point Winograd Fourier transformer without reordering is nested in an s-dimensional N-point indexing same-order prime factor algorithm, wherein N is N1N2…NsAny two different factors NiAnd NjReciprocity, i-1, 2, …, s, j-1, 2, …, s, when a certain factor NiWhen the value is 8, the processor may be configured to remove reordering operations of an ith-dimension FFT of an N-point indexed prime factor algorithm, and the processor may include:
complex multiplier M1~M3They perform a momentMultiplication of the array and the vector;
an input matrix I which is passed through a complex multiplier M1Multiplying the input vector v to obtain a vector p;
a variable diagonal matrix A which is passed through a complex multiplier M2Multiplying the vector p to obtain a vector q;
an output matrix O which passes through a complex multiplier M3And multiplying the vector q to obtain an output vector V.
2. The eight-point Winograd fourier transformer of claim 1, wherein the input matrix I and the output matrix O are the same as a conventional eight-point Winograd fourier transformer, and the diagonal matrix is modified from a conventional constant matrix to a variable matrix a.
3. The eight-point Winograd fourier transformer of claim 1, wherein each element on a diagonal of the variable diagonal matrix a is an angular parameter θ 2 pi/8 ·<N/8>8The function of (a), wherein,<N/8>8representing a modulo-8 operation on N/8.
4. An eight-point Winograd Fourier transform method for removing ith-dimension FFT reordering operation of an N-point index same-sequence prime factor algorithm, wherein N is N1N2…NsAny two different factors NiAnd NjMutilins, i-1, 2, …, s, j-1, 2, …, s, NiThe processing method is characterized by comprising the following steps:
(1) determining an angle parameter theta 2 pi/8 x from N<N/8>8Initializing the value of each element on the diagonal of the variable diagonal matrix A on the basis of the specific value of (1), so that A becomes a constant, and initializing a variable l to 0, wherein l is not less than 0<N/8,<N/8>8Represents the operation of taking the module 8 of the N/8;
(2) from the input sequence x n]Of 8 data whose index is n ═ c<N/8*m+8*l>NThey form a vector v, where 0 ≦ m<8,<N/8*m+8*l>NShows taking the modulus N for N/8 m +8 lOperating;
(3) by means of a complex multiplier M1Multiplying the input matrix I by the vector v to obtain a vector p;
(4) by means of a complex multiplier M2Multiplying the variable diagonal matrix A by the vector p to obtain a vector q;
(5) by means of a complex multiplier M3Multiplying the output matrix O by the vector q to obtain a vector V;
(6) writing 8 data in vector V into output sequence X [ k ] in sequence]In (2), the written index is identical to the read index, and k is still equal to<N/8*m+8*l>N
(7) And (5) gradually changing the value of l by taking 1 as a step length, and repeating the steps (2) to (6) until N/8 times of 8-point WFTA without reordering are completed.
CN201710303794.2A 2017-05-03 2017-05-03 The eight point Winograd Fourier transformers without rearrangement Withdrawn CN107168928A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710303794.2A CN107168928A (en) 2017-05-03 2017-05-03 The eight point Winograd Fourier transformers without rearrangement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710303794.2A CN107168928A (en) 2017-05-03 2017-05-03 The eight point Winograd Fourier transformers without rearrangement

Publications (1)

Publication Number Publication Date
CN107168928A true CN107168928A (en) 2017-09-15

Family

ID=59814023

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710303794.2A Withdrawn CN107168928A (en) 2017-05-03 2017-05-03 The eight point Winograd Fourier transformers without rearrangement

Country Status (1)

Country Link
CN (1) CN107168928A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021082747A1 (en) * 2019-11-01 2021-05-06 中科寒武纪科技股份有限公司 Operational apparatus and related product

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021082747A1 (en) * 2019-11-01 2021-05-06 中科寒武纪科技股份有限公司 Operational apparatus and related product

Similar Documents

Publication Publication Date Title
JP3749022B2 (en) Parallel system with fast latency and array processing with short waiting time
CN102419741A (en) In-place simultaneous prime factor algorithm-based 3780-point discrete Fourier transform processing device and method
CN102609396A (en) Discrete Fourier transform processing device and method in data rights management (DRM) system
WO2022001550A1 (en) Address generation method, related device and storage medium
Yu et al. FPGA architecture for 2D Discrete Fourier Transform based on 2D decomposition for large-sized data
Wang et al. Generic mixed-radix FFT pruning
CN115706799A (en) Efficient transform and transpose for rate-distortion optimization and reconstruction in video encoders
Ayhan et al. A 128∶ 2048/1536 point FFT hardware implementation with output pruning
KR20120072226A (en) Fast fourier transform
EP2144172A1 (en) Computation module to compute a multi radix butterfly to be used in DTF computation
CN107168928A (en) The eight point Winograd Fourier transformers without rearrangement
CN112953549B (en) Storage processing method and device for sparse matrix
JP5486226B2 (en) Apparatus and method for calculating DFT of various sizes according to PFA algorithm using Ruritanian mapping
JP2010016831A (en) Device for computing various sizes of dft
CN107168929A (en) The nine point Winograd Fourier transformers without rearrangement
CN107193777A (en) The seven point Winograd Fourier transformers without rearrangement
CN107180012A (en) The five point Winograd Fourier transformers without rearrangement
CN107038145A (en) The four point Winograd Fourier transformers without rearrangement
CN107193778A (en) The three point Winograd Fourier transformers without rearrangement
US20140365547A1 (en) Mixed-radix pipelined fft processor and fft processing method using the same
US20080228845A1 (en) Apparatus for calculating an n-point discrete fourier transform by utilizing cooley-tukey algorithm
Srinivasaiah et al. Low power and area efficient FFT architecture through decomposition technique
TWI237773B (en) Fast fourier transform processor and dynamic scaling method thereof and radix-8 fast Fourier transform computation method
CN113434811B (en) IP core of 2048-point FFT (fast Fourier transform) processor
CN111404858A (en) Efficient FFT processing method and device applied to broadband satellite communication system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20170915

WW01 Invention patent application withdrawn after publication