ABSTRACT
This poster introduces the "BxBFFT" parallel-pipelined Fast Fourier Transform (FFT), which gives higher clock speeds (Fmax) than competitors with substantial savings in power and logic resources. In comparisons with the Xilinx SSR FFT, Spiral FFT, Astron FFT, and ZipCPU FFT, the BxBFFT had clock speeds above 650MHz in cases where all others were below 300MHz. The BxBFFT's LUTs and power were lower by a factor of ~1.5. The BxBFFT had faster Vivado implementation and faster RTL simulation, for improved productivity in design, testing, and verification. BxBFFT simulations were over 10 times faster than the Xilinx SSR FFT. The BxBFFT supports more features than other FFTs, including real-to-complex FFTs, non-power-of-2 FFTs, and features for high reliability in adverse environments. The BxBFFT's improved performance has been verified in real applications. One customer design had to operate with a reduced workload due to excessive current draw of the Xilinx SSR FFT. A quick replacement of the Xilinx SSR FFT with the BxBFFT lowered die temperature by 34.8 degree Celsius and allowed the design to operate under full load. The source of the BxBFFT's performance is intensive optimization of well-known FFT algorithms, not new algorithms. The BxBFFT's coding style gives better control over synthesis to avoid and resolve performance bottlenecks. Automated generation of top-level code supports 13 different choices for radix and 2 different choices for data flow at each stage, to make optimal choices for each BxBFFT size. This results in a highly efficient FFT.
Index Terms
- An Efficient High-Speed FFT Implementation
Recommendations
A Series-parallel Mixed Structure FFT Processor
ICDSP '19: Proceedings of the 2019 3rd International Conference on Digital Signal ProcessingThe proposed fixed-point radix-2 decimation-in-time Fast Fourier Transform (R2DIT FFT) processor is implemented by the series-parallel mixed structure. The 64-point FFT is decomposed into 8-channel 8-point FFT structure. Each channel is a serial 8-point ...
FPGA Implementation of High Speed FFT Algorithm Based on Split-Radix
IITAW '08: Proceedings of the 2008 International Symposium on Intelligent Information Technology Application WorkshopsFast Fourier Transform (FFT) is the collection of algorithms that performed the Discrete Fourier Transform (DFT). As a medium to perform the transform from time domain to frequency domain, FFT is widely used as an indispensible tool in signal processing ...
Efficient 2D FFT implementation on mediaprocessors
We have developed an efficient implementation to compute the 2D fast Fourier transform (FFT) on a new very long instruction word programmable mediaprocessor. Using instruction-level parallelism and a multimedia instruction set, our radix-4 Cooley-Tukey ...
Comments