Version 14 (modified by benl, 7 years ago) |
---|
fft2d-highpass
Applies a highpass filter to a BMP image, using a Fast Fourier Transform. The first argument is the lower cutoff frequency.
Each of the RGB channels is converted to the frequency domain, the lower frequencies are set to zero, then the channels are converted back to the image domain.
repa-fft2d-highpass 2 lena.bmp lena-high2.bmp
Code
The main algorithm is at FFT.hs
The wrapper is at Main.hs
Test Data
http://en.wikipedia.org/wiki/Lenna is a standard test image.
lena.bmp | lena-high2.bmp |
full size | full size |
Runtime
Head version. Compiled with GHC 6.13.20100309. 512x512 image.
Running on a Intel i7 iMac. 2.8Ghz, 4 cores x 2 threads/core. 256k L1, 8MB L2, 8GB main memory.
Times stated include IO.
Threads | Time(s) |
1 | 6.93 |
2 | 4.07 |
3 | 3.42 |
4 | 3.13 |
5 | 3.15 |
6 | 3.08 |
7 | 3.08 |
8 | 3.39 |
Comparisons
Version | Time(s) | Source |
Using Data.Vector.Unboxed | 2.88 | FFT.hs |
Using Jones's inplace C implementation | 0.24 | Jones.c |
Using FFTW using Estimate mode | 0.09 | FFTW.c |
The vector version uses the same recursive radix-2 decimation in time (DIT) algorithm as the Repa version, but is not rank generalised. It applies a recursive 1d FFT to each row and then transposes the matrix, twice each. Recursive FFT algorithms tend to be slower than in-place ones because the data is copied into new vectors at each recursion. A 512 point FFT is built from two 256 point FFTs, which are build from 4 128 point FFTs and so on. The result of each FFT is a new vector which needs to be allocated and then filled.
Jones's version also uses a 1d radix-2 DIT FFT kernel, but it first reorders the values then performs a in-place transform using three nested loops.
FFTW contains deep magic, and is comparable with vendor optimised versions.
Attachments
- lena-thumb.jpg (6.6 kB) - added by benl 7 years ago.
- lena-high2-thumb.jpg (5.6 kB) - added by benl 7 years ago.