
Simple DFT in C

Posted on December 7, 2013 by batchloaf


I've been spending some time recently thinking about the Discrete Fourier Transform (DFT), as well as the Fast Fourier
Transform (FFT), which provides a computationally efficient means of calculating the DFT of a signal.
The DFT is defined as follows. Given a discrete-time sequence x[n], where n = 0, 1, 2, ..., N-1,

$$X[k] = \sum_{n=0}^{N-1} x[n]\, e^{-j 2\pi k n / N}, \qquad k = 0, 1, 2, \ldots, N-1$$

De Moivre's Theorem tells us that

$$e^{-j\theta} = \cos(\theta) - j\sin(\theta)$$

Therefore, we can rewrite the DFT summation as separate expressions for the real and imaginary parts of X[k] as follows
(assuming x[n] is real):

$$X[k] = X_{re}[k] + jX_{im}[k]$$

where

$$X_{re}[k] = \sum_{n=0}^{N-1} x[n]\cos\!\left(\frac{2\pi k n}{N}\right)$$

and

$$X_{im}[k] = -\sum_{n=0}^{N-1} x[n]\sin\!\left(\frac{2\pi k n}{N}\right)$$
As part of my thought process, I've been sketching out different implementations of the DFT and FFT in C and
MATLAB/Octave. This one struck me as particularly nice because it's a very plain and simple DFT by brute force.
//
// dft.c - Simple brute force DFT
//
// Written by Ted Burke
// Last updated 7-12-2013
//
// To compile:
//    gcc dft.c -o dft.exe
//
// To run:
//    dft.exe
//

#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <time.h> // for time(), used to seed the random number generator
#define N 16
#define PI2 6.2832
int main()
{
// time and frequency domain data arrays
int n, k;             // indices for time and frequency domains
float x[N];           // discrete-time signal, x
float Xre[N], Xim[N]; // DFT of x (real and imaginary parts)
float P[N];           // power spectrum of x

// Generate random discrete-time signal x in range (-1,+1)
srand(time(0));
for (n=0 ; n<N ; ++n) x[n] = ((2.0 * rand()) / RAND_MAX) - 1.0;
// Calculate DFT of x using brute force
for (k=0 ; k<N ; ++k)
{
// Real part of X[k]
Xre[k] = 0;
for (n=0 ; n<N ; ++n) Xre[k] += x[n] * cos(n * k * PI2 / N);
// Imaginary part of X[k]
Xim[k] = 0;
for (n=0 ; n<N ; ++n) Xim[k] -= x[n] * sin(n * k * PI2 / N);
// Power at kth frequency bin
P[k] = Xre[k]*Xre[k] + Xim[k]*Xim[k];
}
// Output results to MATLAB / Octave M-file for plotting
FILE *f = fopen("dftplots.m", "w");
fprintf(f, "n = [0:%d];\n", N-1);
fprintf(f, "x = [ ");
for (n=0 ; n<N ; ++n) fprintf(f, "%f ", x[n]);
fprintf(f, "];\n");
fprintf(f, "Xre = [ ");
for (k=0 ; k<N ; ++k) fprintf(f, "%f ", Xre[k]);
fprintf(f, "];\n");
fprintf(f, "Xim = [ ");
for (k=0 ; k<N ; ++k) fprintf(f, "%f ", Xim[k]);
fprintf(f, "];\n");
fprintf(f, "P = [ ");
for (k=0 ; k<N ; ++k) fprintf(f, "%f ", P[k]);
fprintf(f, "];\n");
fprintf(f, "subplot(3,1,1)\nplot(n,x)\n");
fprintf(f, "xlim([0 %d])\n", N-1);
fprintf(f, "subplot(3,1,2)\nplot(n,Xre,n,Xim)\n");
fprintf(f, "xlim([0 %d])\n", N-1);
fprintf(f, "subplot(3,1,3)\nstem(n,P)\n");
fprintf(f, "xlim([0 %d])\n", N-1);
fclose(f);
// exit normally
return 0;

}
To compile and run the above code, use the commands shown in the header comment: compile with gcc dft.c -o dft.exe, then run dft.exe.

When you run the executable file dft.exe, it writes an M-file called dftplots.m which can then be run in MATLAB or
Octave to produce plots of the results. This is the M-file produced by dft.exe:
n = [0:15];
x = [ -0.763848 0.445723 0.350261 0.036225 -0.217444 0.121921 -0.604968 0.623280
-0.966613 0.806513 -0.387738 0.412580 0.580554 0.705191 0.440352 -0.077792 ];
Xre = [ 1.504196 0.561935 -1.724586 -1.655775 -1.165293 -0.460708 -2.462571
2.365616 -4.643086 2.365602 -2.462494 -0.460714 -1.165152 -1.655658 -1.724676
0.561582 ];
Xim = [ 0.000000 1.771657 -0.359353 -1.262270 -1.085051 -0.100950 -0.105040
-0.259128 0.000203 0.258721 0.105206 0.100861 1.085066 1.262431 0.359592
-1.771775 ];
P = [ 2.262606 3.454538 3.103332 4.334917 2.535245 0.222443 6.075290 5.663288
21.558247 5.663008 6.074944 0.222430 2.534947 4.334937 3.103816 3.454561 ];
subplot(3,1,1)
plot(n,x)
xlim([0 15])
subplot(3,1,2)
plot(n,Xre,n,Xim)
xlim([0 15])
subplot(3,1,3)
stem(n,P)
xlim([0 15])
I ran dftplots.m in Octave, as shown below:

It produced the following graphs:

The top plot is the original random discrete-time sequence x[n]. The second plot shows the real and imaginary parts of
X[k]. Note the symmetry of X[k], which is just as we expect for real x[n]. The final plot is P[k], the power spectrum of
x[n]. Basically, each value of P[k] is the square of the magnitude of the corresponding complex value X[k]. Again the
symmetry is just as we expect for real x[n].
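For reference, this symmetry is just the conjugate symmetry of the DFT of a real-valued signal:

$$X[N-k] = X^{*}[k] \quad\Longrightarrow\quad X_{re}[N-k] = X_{re}[k], \qquad X_{im}[N-k] = -X_{im}[k], \qquad P[N-k] = P[k]$$

which matches the mirrored values of Xre, Xim and P in the output above.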
As a follow-up to the above example, I modified the program a little to perform the DFT somewhat more efficiently and to
transform a longer (64-sample) frame of data. I also modified the random time domain signal generation process a bit to
add a dominant frequency component (at k=5.7, i.e. between bins 5 and 6).
This is the modified C code:
//
// dft2.c - Basic, but slightly more efficient DFT
//
// Written by Ted Burke
// Last updated 10-12-2013
//
// To compile:
//    gcc dft2.c -o dft2.exe
//
// To run:
//    dft2.exe
//
#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <time.h> // for time(), used to seed the random number generator
// N is assumed to be greater than 4 and a power of 2
#define N 64
#define PI2 6.2832
// Twiddle factors (64th roots of unity)
const float W[] = {
1.00000, 0.99518, 0.98079, 0.95694, 0.92388, 0.88192, 0.83147, 0.77301,
0.70711, 0.63439, 0.55557, 0.47139, 0.38268, 0.29028, 0.19509, 0.09801,
-0.00000,-0.09802,-0.19509,-0.29029,-0.38269,-0.47140,-0.55557,-0.63440,
-0.70711,-0.77301,-0.83147,-0.88192,-0.92388,-0.95694,-0.98079,-0.99519,
-1.00000,-0.99518,-0.98078,-0.95694,-0.92388,-0.88192,-0.83146,-0.77300,
-0.70710,-0.63439,-0.55556,-0.47139,-0.38267,-0.29027,-0.19508,-0.09801,
0.00001, 0.09803, 0.19510, 0.29030, 0.38269, 0.47141, 0.55558, 0.63440,
0.70712, 0.77302, 0.83148, 0.88193, 0.92388, 0.95694, 0.98079, 0.99519
};
int main()
{
// time and frequency domain data arrays
int n, k;                     // time and frequency domain indices
float x[N];                   // discrete-time signal, x
float Xre[N/2+1], Xim[N/2+1]; // DFT of x (real and imaginary parts)
float P[N/2+1];               // power spectrum of x
// Generate random discrete-time signal x in range (-1,+1)
srand(time(0));
for (n=0 ; n<N ; ++n) x[n] = ((2.0 * rand()) / RAND_MAX) - 1.0 + sin(PI2 * n * 5.7 / N);
// Calculate DFT and power spectrum up to Nyquist frequency
int to_sin = 3*N/4; // index offset for sin
int a, b;
for (k=0 ; k<=N/2 ; ++k)
{
Xre[k] = 0; Xim[k] = 0;
a = 0; b = to_sin;
for (n=0 ; n<N ; ++n)
{
Xre[k] += x[n] * W[a%N];
Xim[k] -= x[n] * W[b%N];
a += k; b += k;
}
P[k] = Xre[k]*Xre[k] + Xim[k]*Xim[k];
}
// Output results to MATLAB / Octave M-file for plotting
FILE *f = fopen("dftplots.m", "w");
fprintf(f, "n = [0:%d];\n", N-1);
fprintf(f, "x = [ ");
for (n=0 ; n<N ; ++n) fprintf(f, "%f ", x[n]);

fprintf(f, "];\n");
fprintf(f, "Xre = [ ");
for (k=0 ; k<=N/2 ; ++k) fprintf(f, "%f ", Xre[k]);
fprintf(f, "];\n");
fprintf(f, "Xim = [ ");
for (k=0 ; k<=N/2 ; ++k) fprintf(f, "%f ", Xim[k]);
fprintf(f, "];\n");
fprintf(f, "P = [ ");
for (k=0 ; k<=N/2 ; ++k) fprintf(f, "%f ", P[k]);
fprintf(f, "];\n");
fprintf(f, "subplot(3,1,1)\nplot(n,x)\n");
fprintf(f, "xlim([0 %d])\n", N-1);
fprintf(f, "subplot(3,1,2)\nplot([0:%d],Xre,[0:%d],Xim)\n", N/2, N/2);
fprintf(f, "xlim([0 %d])\n", N/2);
fprintf(f, "subplot(3,1,3)\nstem([0:%d],P)\n", N/2);
fprintf(f, "xlim([0 %d])\n", N/2);
fclose(f);

// exit normally
return 0;

}
When I ran the resulting M-file in Octave, the following graphs were produced:

The dominant frequency component introduced into the random signal results in the single prominent peak in the above
graph.
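In case the to_sin offset in the code above looks cryptic: the table W holds cosine values, W[m] = cos(2πm/N), and the sine needed for the imaginary part is read from the same table by shifting the index by 3N/4 samples, since

$$\cos\!\left(\frac{2\pi}{N}\left(m + \frac{3N}{4}\right)\right) = \cos\!\left(\frac{2\pi m}{N} + \frac{3\pi}{2}\right) = \sin\!\left(\frac{2\pi m}{N}\right)$$

So W[(n*k + 3*N/4) % N] is, to within the table's precision, sin(2πnk/N), which is exactly the factor accumulated into Xim[k].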
By the way, I generated the values for the global array of twiddle factors in the above example using the following short C
program (which runs on the PC rather than the dsPIC):
//
// twiddle.c - Print array of twiddle factors
// Written by Ted Burke
// Last updated 10-12-2013
//
// To compile:
//    gcc twiddle.c -o twiddle.exe
//
// To run:
//    twiddle.exe
//
#include <stdio.h>
#include <math.h>
#define N 64
int main()
{
int n;
for (n=0 ; n<N ; ++n)
{
printf("%8.5lf", cos(n*6.2832/N));
if (n<N-1) printf(",");
if ((n+1)%8==0) printf("\n");
}
return 0;
}

Fourier Transform
Goal
In this section, we will learn

To find the Fourier Transform of images using OpenCV

To utilize the FFT functions available in Numpy

Some applications of Fourier Transform

We will see the following functions: cv2.dft(), cv2.idft() etc.

Theory

Fourier Transform is used to analyze the frequency characteristics of various filters. For images, the 2D
Discrete Fourier Transform (DFT) is used to find the frequency domain. A fast algorithm called the
Fast Fourier Transform (FFT) is used for calculation of the DFT. Details about these can be found in
any image processing or signal processing textbook. Please see the Additional Resources section.
For a sinusoidal signal $x(t) = A \sin(2\pi f t)$, we can say $f$ is the frequency of the signal, and if its
frequency domain is taken, we can see a spike at $f$. If the signal is sampled to form a discrete signal, we
get the same frequency domain, but it is periodic in the range $[-\pi, \pi]$ or $[0, 2\pi]$ (or $[0, N]$ for an N-point
DFT). You can consider an image as a signal which is sampled in two directions. So taking the Fourier
transform in both the X and Y directions gives you the frequency representation of the image.
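For reference, the 2D DFT referred to here is the standard one: for an M×N image f(x, y),

$$F(u, v) = \sum_{x=0}^{M-1}\sum_{y=0}^{N-1} f(x, y)\, e^{-j 2\pi \left(\frac{ux}{M} + \frac{vy}{N}\right)}$$

and the inverse transform recovers f(x, y) from F(u, v) up to a scale factor of 1/(MN).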
More intuitively, for the sinusoidal signal, if the amplitude varies quickly over a short time, you can say it
is a high frequency signal. If it varies slowly, it is a low frequency signal. You can extend the same
idea to images. Where does the amplitude vary drastically in images? At edge points, or at noise.
So we can say that edges and noise are high frequency content in an image. If there are not many changes
in amplitude, it is a low frequency component. (Some links are added to Additional Resources which
explain the frequency transform intuitively with examples.)
Now we will see how to find the Fourier Transform.
Fourier Transform in Numpy

First we will see how to find the Fourier Transform using Numpy. Numpy has an FFT package to do this.
np.fft.fft2() provides us the frequency transform, which will be a complex array. Its first argument is
the input image, which is grayscale. The second argument is optional and decides the size of the output
array. If it is greater than the size of the input image, the input image is padded with zeros before calculation of
the FFT. If it is less than the input image, the input image will be cropped. If no arguments are passed, the output array
size will be the same as the input.
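As a quick sketch of that optional size argument (the 512x512 size below is just an arbitrary illustration, not something the tutorial prescribes):

import cv2
import numpy as np

img = cv2.imread('messi5.jpg',0)        # grayscale input
f_same = np.fft.fft2(img)               # output is the same size as img
f_pad = np.fft.fft2(img, (512, 512))    # img is zero-padded (or cropped) to 512x512 before the FFT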
Once you get the result, the zero frequency component (DC component) will be at the top left corner. If
you want to bring it to the center, you need to shift the result by N/2 in both directions. This is simply
done by the function np.fft.fftshift(). (It makes the analysis easier.) Once you have found the frequency
transform, you can find the magnitude spectrum.

import cv2
import numpy as np
from matplotlib import pyplot as plt
img = cv2.imread('messi5.jpg',0)
f = np.fft.fft2(img)
fshift = np.fft.fftshift(f)
magnitude_spectrum = 20*np.log(np.abs(fshift))
plt.subplot(121),plt.imshow(img, cmap = 'gray')
plt.title('Input Image'), plt.xticks([]), plt.yticks([])
plt.subplot(122),plt.imshow(magnitude_spectrum, cmap = 'gray')
plt.title('Magnitude Spectrum'), plt.xticks([]), plt.yticks([])
plt.show()

The result looks like this:

You can see a whiter region at the center, showing that there is more low frequency content.
So you found the frequency transform. Now you can do some operations in the frequency domain, like
high pass filtering, and reconstruct the image, i.e. find the inverse DFT. For that, you simply remove the
low frequencies by masking with a rectangular window of size 60x60. Then apply the inverse shift
using np.fft.ifftshift() so that the DC component again comes to the top-left corner. Then find the inverse FFT
using the np.fft.ifft2() function. The result, again, will be a complex array, so take its absolute value.
rows, cols = img.shape
crow,ccol = rows//2 , cols//2
fshift[crow-30:crow+30, ccol-30:ccol+30] = 0
f_ishift = np.fft.ifftshift(fshift)
img_back = np.fft.ifft2(f_ishift)
img_back = np.abs(img_back)
plt.subplot(131),plt.imshow(img, cmap = 'gray')
plt.title('Input Image'), plt.xticks([]), plt.yticks([])
plt.subplot(132),plt.imshow(img_back, cmap = 'gray')
plt.title('Image after HPF'), plt.xticks([]), plt.yticks([])
plt.subplot(133),plt.imshow(img_back)
plt.title('Result in JET'), plt.xticks([]), plt.yticks([])
plt.show()

The result looks like this:

The result shows that High Pass Filtering is an edge detection operation. This is what we saw in the
Image Gradients chapter. It also shows that most of the image data is present in the low frequency
region of the spectrum. Anyway, we have seen how to find the DFT, IDFT etc. in Numpy. Now let's see
how to do it in OpenCV.
If you look closely at the result, especially the last image in the JET colormap, you can see some artifacts (I
have marked one instance with a red arrow). It shows some ripple-like structures there, which are called
ringing effects. They are caused by the rectangular window we used for masking. The mask is converted
to a sinc shape, which causes this problem. So rectangular windows are not used for filtering; a better
option is a Gaussian window, as sketched below.
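As a rough sketch of that idea (img and fshift are the arrays from the Numpy example above, and sigma is an arbitrary example value, not a recommended setting):

import numpy as np

rows, cols = img.shape
crow, ccol = rows//2, cols//2
sigma = 20.0   # width of the Gaussian in frequency bins (example value)

# 2D Gaussian centred on the shifted DC component
y = np.arange(rows) - crow
x = np.arange(cols) - ccol
gauss = np.exp(-(y[:,None]**2 + x[None,:]**2) / (2.0*sigma**2))

# Smooth high pass filter: attenuate low frequencies instead of zeroing a square
fshift_hpf = fshift * (1.0 - gauss)
img_back = np.abs(np.fft.ifft2(np.fft.ifftshift(fshift_hpf)))

Because the mask falls off smoothly, the sharp sinc ripples produced by the rectangular window are largely avoided.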
Fourier Transform in OpenCV

OpenCV provides the functions cv2.dft() and cv2.idft() for this. They return the same result as before,
but with two channels. The first channel will have the real part of the result and the second channel will have
the imaginary part. The input image should be converted to np.float32 first. We will see
how to do it.
import numpy as np
import cv2
from matplotlib import pyplot as plt
img = cv2.imread('messi5.jpg',0)
dft = cv2.dft(np.float32(img),flags = cv2.DFT_COMPLEX_OUTPUT)
dft_shift = np.fft.fftshift(dft)
magnitude_spectrum = 20*np.log(cv2.magnitude(dft_shift[:,:,0],dft_shift[:,:,1]))
plt.subplot(121),plt.imshow(img, cmap = 'gray')
plt.title('Input Image'), plt.xticks([]), plt.yticks([])
plt.subplot(122),plt.imshow(magnitude_spectrum, cmap = 'gray')
plt.title('Magnitude Spectrum'), plt.xticks([]), plt.yticks([])
plt.show()

Note
You can also use cv2.cartToPolar() which returns both magnitude and phase in a single shot
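For example (a one-line sketch reusing dft_shift from the code above):

magnitude, phase = cv2.cartToPolar(dft_shift[:,:,0], dft_shift[:,:,1])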
So, now we have to do the inverse DFT. In the previous section, we created an HPF; this time we will see how
to remove the high frequency content from the image, i.e. we apply an LPF to the image. It actually blurs the
image. For this, we first create a mask with a high value (1) at low frequencies, i.e. we pass the LF
content, and 0 in the HF region.

rows, cols = img.shape
crow,ccol = rows//2 , cols//2
# create a mask first, center square is 1, remaining all zeros
mask = np.zeros((rows,cols,2),np.uint8)
mask[crow-30:crow+30, ccol-30:ccol+30] = 1
# apply mask and inverse DFT
fshift = dft_shift*mask
f_ishift = np.fft.ifftshift(fshift)
img_back = cv2.idft(f_ishift)
img_back = cv2.magnitude(img_back[:,:,0],img_back[:,:,1])
plt.subplot(121),plt.imshow(img, cmap = 'gray')
plt.title('Input Image'), plt.xticks([]), plt.yticks([])
plt.subplot(122),plt.imshow(img_back, cmap = 'gray')
plt.title('Magnitude Spectrum'), plt.xticks([]), plt.yticks([])
plt.show()

See the result:

Note
As usual, the OpenCV functions cv2.dft() and cv2.idft() are faster than their Numpy counterparts, but the Numpy
functions are more user-friendly. For more details about performance issues, see the section below.
Performance Optimization of DFT

Performance of DFT calculation is better for some array sizes. It is fastest when the array size is a power of
two. Arrays whose size is a product of 2s, 3s, and 5s are also processed quite efficiently. So if
you are worried about the performance of your code, you can modify the size of the array to an
optimal size (by padding zeros) before finding the DFT. For OpenCV, you have to pad zeros manually.
For Numpy, you specify the new size of the FFT calculation and it will automatically pad zeros for
you.
So how do we find this optimal size? OpenCV provides the function cv2.getOptimalDFTSize() for
this. It is applicable to both cv2.dft() and np.fft.fft2(). Let's check their performance using the IPython
magic command %timeit.
In [16]: img = cv2.imread('messi5.jpg',0)
In [17]: rows,cols = img.shape
In [18]: print rows,cols
342 548
In [19]: nrows = cv2.getOptimalDFTSize(rows)
In [20]: ncols = cv2.getOptimalDFTSize(cols)

In [21]: print nrows, ncols
360 576

See, the size (342, 548) is modified to (360, 576). Now let's pad it with zeros (for OpenCV) and compare
the DFT calculation performance. You can do this by creating a new big zero array and copying the data
into it, or by using cv2.copyMakeBorder().
nimg = np.zeros((nrows,ncols))
nimg[:rows,:cols] = img

OR:
right = ncols - cols
bottom = nrows - rows
bordertype = cv2.BORDER_CONSTANT #just to avoid line breakup in PDF file
nimg = cv2.copyMakeBorder(img,0,bottom,0,right,bordertype, value = 0)

Now let's compare the DFT performance of the Numpy function:


In [22]: %timeit fft1 = np.fft.fft2(img)
10 loops, best of 3: 40.9 ms per loop
In [23]: %timeit fft2 = np.fft.fft2(img,[nrows,ncols])
100 loops, best of 3: 10.4 ms per loop

It shows a 4x speedup. Now we will try the same with OpenCV functions.
In [24]: %timeit dft1= cv2.dft(np.float32(img),flags=cv2.DFT_COMPLEX_OUTPUT)
100 loops, best of 3: 13.5 ms per loop
In [27]: %timeit dft2= cv2.dft(np.float32(nimg),flags=cv2.DFT_COMPLEX_OUTPUT)
100 loops, best of 3: 3.11 ms per loop

It also shows a 4x speed-up. You can also see that OpenCV functions are around 3x faster than
Numpy functions. This can be tested for inverse FFT also, and that is left as an exercise for you.
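If you want to try that exercise, a minimal sketch of the timing commands might look like the following (reusing img, nimg, nrows and ncols from above; the actual timings will depend on your machine, so none are listed here):

%timeit np.fft.ifft2(np.fft.fft2(img, [nrows, ncols]))
%timeit cv2.idft(cv2.dft(np.float32(nimg), flags=cv2.DFT_COMPLEX_OUTPUT))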
Why Laplacian is a High Pass Filter?

A similar question was asked in a forum. The question is, why is the Laplacian a high pass filter? Why
is Sobel an HPF? etc. And the first answer given was in terms of the Fourier Transform. Just take the
Fourier transform of the Laplacian for a larger FFT size and analyze it:
import cv2
import numpy as np
from matplotlib import pyplot as plt
# simple averaging filter without scaling parameter
mean_filter = np.ones((3,3))
# creating a gaussian filter
x = cv2.getGaussianKernel(5,10)
gaussian = x*x.T
# different edge detecting filters
# scharr in x-direction
scharr = np.array([[-3, 0, 3],
[-10,0,10],
[-3, 0, 3]])
# sobel in x direction
sobel_x= np.array([[-1, 0, 1],
[-2, 0, 2],
[-1, 0, 1]])
# sobel in y direction
sobel_y= np.array([[-1,-2,-1],
                   [0, 0, 0],
                   [1, 2, 1]])
# laplacian
laplacian=np.array([[0, 1, 0],
[1,-4, 1],
[0, 1, 0]])
filters = [mean_filter, gaussian, laplacian, sobel_x, sobel_y, scharr]
filter_name = ['mean_filter', 'gaussian','laplacian', 'sobel_x', \
'sobel_y', 'scharr_x']
fft_filters = [np.fft.fft2(x) for x in filters]
fft_shift = [np.fft.fftshift(y) for y in fft_filters]
mag_spectrum = [np.log(np.abs(z)+1) for z in fft_shift]
for i in xrange(6):
    plt.subplot(2,3,i+1),plt.imshow(mag_spectrum[i],cmap = 'gray')
    plt.title(filter_name[i]), plt.xticks([]), plt.yticks([])
plt.show()

See the result:

From the images, you can see what frequency region each kernel blocks and what region it passes. From
that information, we can say why each kernel is an HPF or an LPF.

