Fast Fourier Transform for image processing

BlitzMax Forums/BlitzMax Programming/Fast Fourier Transform for image processing

ImaginaryHuman(Posted 2010) [#1]
Hey, I'm fiddling with image processing and trying to get a Fast Fourier Transform to work... i.e. you convert an image with the Fourier transform, then supposedly multiply the `complex numbers` of the transformed image by the complex numbers of the transformed `convolution kernel`, and then inverse-transform the result back - thus applying the convolution to the image.

I found C code and have tried to convert it to BlitzMax. Apart from the multiplying/convolving part, I can get an image to `Fourier transform` and come back to an image again. I *think* this means the FFT is working, but it could be misleading. But I cannot get the multiply/convolution to work - I just get next to nothing as output. If you take out the two lines within the `multiply the image` loop you'll see the image restored to normal at the bottom right, but with the multiply in (which is supposed to do a horizontal blur), I get junk back. Any ideas what I'm doing wrong or how to make it work? Did I make a mistake in the conversion from C that might be stopping the FFT from working right?

Any help greatly appreciated.




Warpy(Posted 2010) [#2]
This looks like a perfect topic for my maths blog! I'll write a proper post later, but if multiplying complex numbers is your problem:

A complex number is something like a+ib, where a is the real part and b the imaginary part. i is the square root of -1, so i*i=-1. So, when you multiply two complex numbers together, you get:

(a+ib)*(c+id) = a*c + ib*c + a*id + ib*id
= a*c + i*(b*c+a*d) + i*i*b*d
= a*c - b*d + i(b*c + a*d)

So the real part is ac-bd, and the imaginary part is bc+ad. I can't see what the multiplication in your code is meant to be doing, but maybe that will help.
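That multiplication rule is easy to sanity-check. Here's a minimal sketch in Python (the thread's actual code is BlitzMax, but the arithmetic is identical; `complex_mul` is just an illustrative name):

```python
# Multiply two complex numbers stored as (real, imag) pairs, using the
# rule derived above: (a+ib)*(c+id) = (a*c - b*d) + i*(b*c + a*d).
def complex_mul(a, b, c, d):
    return (a * c - b * d, b * c + a * d)

# Check against Python's built-in complex type: (1+2i)*(3+4i) = -5+10i.
re, im = complex_mul(1.0, 2.0, 3.0, 4.0)
z = (1 + 2j) * (3 + 4j)
assert (re, im) == (z.real, z.imag)
```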


ImaginaryHuman(Posted 2010) [#3]
Hmm. What does the i() part mean, multiply what's in the brackets by i?

The multiplication in my code above is meant to simply multiply one Fourier-transformed image by another. I can't find anywhere online where it tells you exactly HOW to multiply it, and the two pieces of C and Java source code I tried to mimic didn't work.

Basically I take an image and convert it with an FFT, I also take the image of a kernel, e.g. 5 horizontal pixels, and convert it with the FFT too, then `multiply` the two together and do an inverse FFT to convert back to an image... this is supposed to perform an image `convolution`, like applying a blur. If I can get the blur to work that would be great.
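For what it's worth, that pipeline can be sketched in a few lines of Python with a naive DFT (an FFT computes the same transform, just faster; the names here are illustrative, not from the thread's code). One detail that commonly produces junk output: the kernel must be zero-padded to the same length as the image before its transform, and the frequency-domain multiply must be a proper complex multiply.

```python
import cmath

def dft(x, inverse=False):
    # Naive discrete Fourier transform, O(n^2); an FFT gives the same result.
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[k] * cmath.exp(sign * 2j * cmath.pi * j * k / n)
               for k in range(n)) for j in range(n)]
    return [v / n for v in out] if inverse else out

# A 1-D "scanline" and a 5-tap box-blur kernel, zero-padded to the same length.
signal = [0., 0., 0., 255., 0., 0., 0., 0.]
kernel = [0.2, 0.2, 0.2, 0.2, 0.2, 0., 0., 0.]

# Transform both, multiply pointwise (complex multiply), inverse-transform.
F, K = dft(signal), dft(kernel)
blurred = [v.real for v in dft([f * k for f, k in zip(F, K)], inverse=True)]

# This matches a direct circular convolution of the two.
direct = [sum(signal[(i - j) % 8] * kernel[j] for j in range(8)) for i in range(8)]
assert all(abs(a - b) < 1e-9 for a, b in zip(blurred, direct))
```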

Do you see any signs of errors in my conversion of the FFT from C code? Like I said, I DO get the image back when inverse-transforming, but I don't know if that means it works or if it's just a broken system undoing itself.


Amon(Posted 2010) [#4]
We need to clone warpy.


ImaginaryHuman(Posted 2010) [#5]
Good idea. :-) Personal assistants for everyone :-D Do you wash cars too, Warpy?


Warpy(Posted 2010) [#6]
DOES NOT COMPUTE


Pineapple(Posted 2010) [#7]
* pokes warpy with a stick *

What are you??


slenkar(Posted 2010) [#8]
-0-0
--+
--\/


ImaginaryHuman(Posted 2010) [#9]
So er, anyway, how do I get these fouriers to transform properly and multiply properly?


Warpy(Posted 2010) [#10]
Weirdly, I've just encountered a bug where I can use DrawPixmap or DrawRect in a frame but not both - if I use DrawRect the Pixmap disappears! I've had to switch to d3d9 because I can't use opengl, but that's weird.

Can't see where you're going wrong with the FFT, far too much code to trawl through. Maybe find another example and put it together piece by piece.


Floyd(Posted 2010) [#11]
If the goal here is to exactly undo a "very smooth blur" as in your other thread then I would judge that to be impossible. Although we don't have a precise definition of "smooth blur", the fact that you only save pixel values means we just don't have enough information at the end.

I think what you are trying to do here is essentially the same as before, but with FFT doing the grunt work of calculating the inverse. But the inverse is uniquely determined, no matter what technique you use to find it.

Here's one way to think about the problem and whether or not it is feasible. If the forward transformation maps two different inputs to the same output then inversion is impossible. The simplest output to consider is all zeroes. Consider taking the average of five consecutive pixels. What happens if the row of input pixels have color values of 0 or 1, in this pattern:

10001000100010001000...

Every "average of five" will be 1/5 or 2/5, which is 0.2 or 0.4 exactly. This output could be inverted. But now we take the additional step of converting the output to pixel values. That means every output value is 0. This is exactly the same as if the row of input pixels had been

0000000000000000...

Different input, same output, hence not invertible.
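Floyd's argument can be checked directly; here's a small Python sketch (the `blur5` helper is hypothetical, and the row is treated as circular to keep the windowing simple):

```python
# Two different input rows produce the SAME output once the
# "average of five" is rounded down to integer pixel values.
def blur5(row):
    n = len(row)
    return [int(sum(row[(i + k) % n] for k in range(5)) / 5) for i in range(n)]

a = [1, 0, 0, 0] * 5   # the 10001000... pattern from above
b = [0] * 20           # all zeroes

# Every window sums to 1 or 2, so every average is 0.2 or 0.4 -> rounds to 0.
assert blur5(a) == blur5(b) == [0] * 20
```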


ImaginaryHuman(Posted 2010) [#12]
Yes, you're right. I came to the same conclusion. If I don't store the exact precision of the result of the blur, whether as floats, doubles, or just by adding up all of the blur contributions (which actually fits into 2 bytes per pixel for a 256-pixel blur), then that data is lost and cannot be recovered precisely.

One thing I did do is add the `error` difference to the blurred pixel values, so that when the blur is undone the numbers add up and it produces the right output. However this means the blurred image is noisy, and I need the blurred image to be smoothly blurred at the time of it being saved to disk. The other issue is that simply doing the blur at all means that multiple pixel values are essentially being added together, and that needs more precision, at least 16-bit - and the lower 8 bits of that 16-bit value are very very high frequency noise, which is what I was trying to avoid. Just a no-go all round, so I'm giving up for now. I can see that the FFT would not help at all since it would be reversing an image that has missing data.


Floyd(Posted 2010) [#13]
If you're up for more adventures I just had an idea.

Assuming only color is of interest you could stuff the missing color information into the unused alpha channel. In the "average of seven neighbors" scheme the resulting value is an integer plus a remainder of 0,1,2,3,4,5 or 6. So for all R,G,B there are 6*6*6 = 216 possible sets of remainders. This fits into the 8-bit alpha channel.

The three remainders can be encoded as a "base 7" number. For example, remainders of 2,4,3 would be represented by the integer 2*7*7 + 4*7 + 3 = 129.


ImaginaryHuman(Posted 2010) [#14]
What would happen if you wanted to blur longer than 7 pixels, say, at least 64 or more? Would it have a modulo of up to 64?

I'm still concerned that the alpha data is still `noisy` with all these random modulos. ALL data stored has to be smooth. But I suppose 0..6 isn't too major.

I think I almost need like an `integer blur` if there is such a thing.


Floyd(Posted 2010) [#15]
This would actually be exact. You take seven color values, each in the range 0..255, and add them. Let's say they added up to 1500. Dividing by 7 gives 214, with a remainder of 2. The 214 is the color value and the 2 will be part of the information stored in the alpha channel.

By coincidence 7 is the largest number for which this could work. If we used 8 then each remainder would be 0..7, and there would be 7*7*7 = 343 possible remainders for the three colors. This won't fit in 8 bits.


ImaginaryHuman(Posted 2010) [#16]
How are you storing 3 modulos of 0..7 in a single byte?

By my figuring, each byte is going to have to have its own modulo. For a blur that's 256 bytes long, it would require a whole byte for the modulo of each. So 3 original bytes, plus 3 modulo bytes. Which is the same thing as just storing a Short total and not doing the divide at all.


Floyd(Posted 2010) [#17]
Oops. I was hallucinating for a moment. I said there were 216 possible remainders for the "average of seven pixels". Each remainder is 0 to 6. But that is seven values, not six, so there are 7*7*7 = 343 sets of remainders. That won't fit in a byte, as I stated in my similarly flawed analysis of eight pixels.

But this would work for blocks of six pixels. That feels wrong because of the lack of symmetry. You probably want an odd number of pixels, a middle one and neighbors to either side. So we drop down to five pixels.

Let's think about how this works. We use an average value based on five pixels to produce one pixel in the blurred image. The three color channels are handled separately. First we add five reds, getting an integer value in the range 0 to 5*255 = 1275. Suppose the sum was 942. The resulting color value is 942/5. As an integer this is 188. But that loses information. We really need to remember 188 + 2/5.

There are three color values and three remainders. But we don't need three bytes for the remainders since each is only in the range 0 to 4. If the remainders happened to be 2,4,3 then we encode all three of them as the single integer 2*5*5 + 4*5 + 3 = 73. This is stored in the alpha of the blurred image, along with the 188 red value and whatever green and blue were.

All of this can be reversed. Starting with the pixel value in the blurred image we pick out the colors and the alpha. Red is 188, alpha is 73. The remainder for red must have been the integer 73 / (5*5), which is 2. And that tells us the sum of five reds which produced this must have been 188*5 + 2 = 942. Nothing has been lost.

Similarly the exact green and blue sums are recovered. The remainder for blue is ( alpha mod 5 ). The one for green is ( alpha / 5 ) mod 5.
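Floyd's whole round trip works out; here it is as a Python sketch (the function names are illustrative). Each remainder is 0..4, so the three of them pack into one alpha byte in base 5 and unpack the same way:

```python
# Encode: the three average-of-five channel sums -> stored (r, g, b, alpha).
def encode(red_sum, grn_sum, blu_sum):
    r, rr = divmod(red_sum, 5)      # color value plus remainder 0..4
    g, gr = divmod(grn_sum, 5)
    b, br = divmod(blu_sum, 5)
    alpha = rr * 25 + gr * 5 + br   # base-5 packing; at most 4*25 + 4*5 + 4 = 124
    return r, g, b, alpha

# Decode: recover the exact sums - nothing has been lost.
def decode(r, g, b, alpha):
    rr, gr, br = alpha // 25, (alpha // 5) % 5, alpha % 5
    return r * 5 + rr, g * 5 + gr, b * 5 + br

# Floyd's red example: sum 942 -> value 188, remainder 2 -> back to 942.
assert encode(942, 600, 1275)[0] == 188
assert decode(*encode(942, 600, 1275)) == (942, 600, 1275)
```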


ImaginaryHuman(Posted 2010) [#18]
Interesting. So it works, but the problem still remains, how would this work with a 64-pixel blur? A short blur is not enough.

It's interesting though how you encoded three remainders in one value.


Floyd(Posted 2010) [#19]
how would this work with a 64-pixel blur?

It wouldn't. As stated before there is now too much extra information to be saved.

You could experiment with using a small number of pixels, such as 5, but spread out farther from the center. That would change the image more, but it wouldn't be so smooth.


ImaginaryHuman(Posted 2010) [#20]
What if the second set of blurring was done including output from the first set, ie blur a group of 5, overwrite the original 5 bytes (plus store the modulo thing somewhere), then move to the right? Would the accumulative effect of it work and be reversible?


Floyd(Posted 2010) [#21]
I'm not sure what you mean about overwriting the original five. Five pixels are averaged to create one new pixel value.

You can certainly apply the blur procedure to the newly blurred image as many times as you like. The image will continue to get fuzzier. But there is no going back. The color and alpha of a blurred image can be used to retrieve the color values of the image from which it came, but they can't encode the information needed to reverse through a series of images. That would be rather like running a file through a compression routine repeatedly, hoping it will get smaller and smaller.

I see no hope for this short of coming up with a new image format containing extra data, not visible on screen.


ImaginaryHuman(Posted 2010) [#22]
I guess what I meant wouldn't work, because if you output a blurred pixel to the source image and advance one pixel to the right for the next average of 5, it won't get included. That's what I meant - include the blurred pixel from the last averaging as part of the next averaging.


ImaginaryHuman(Posted 2010) [#23]
Hey Floyd, do you think it would be possible to somehow store the modulo of 5 bytes as part of the blurred result, and somehow extract it again later? ie instead of storing the modulo in a separate byte for every 1-3 blurred bytes, can we like add it to the blur value and do something clever to extract it just before the deblur, so that it is all still reversible?


Floyd(Posted 2010) [#24]
If you mean blur by taking the average of five pixels then that is exactly the example I gave earlier. Scroll up to message #17.


ImaginaryHuman(Posted 2010) [#25]
I follow what you said there, but what I'm asking is: instead of storing the modulos in the alpha channel, could you store them as part of the color channel, `on top of` the pixel values?

e.g.

Say you are doing a 2-pixel blur: (Pixel A + Pixel B) / 2 = blur value. Let's say the result was supposed to be 54.5, so we store 54 plus a remainder of 1, = 55. How can we then later take this 55 and know it to be really 54 + 1? I.e. to take 54.5*2 instead of 55*2?


Floyd(Posted 2010) [#26]
There is not enough space. Starting with 0 to 255 in steps of 1 the 256 possible values completely fill one byte.

The average of two such values is 0, .5, 1, 1.5, ... 255. The range is still 0 to 255, but in steps of .5 so there are 511 values. They can't all fit into one byte.

It gets worse with bigger averages. With five values the result is 0 to 255 in steps of .2 so we get 1276 values to encode.
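Those counts check out; here's a brute-force Python sketch using exact fractions to avoid any rounding:

```python
from fractions import Fraction

# Distinct averages of two bytes: every half-integer 0, 1/2, 1, ..., 255.
two = {Fraction(a + b, 2) for a in range(256) for b in range(256)}
assert len(two) == 511          # too many for the 256 slots of one byte

# With five bytes every sum 0..5*255 occurs, so averages step by 1/5.
five = {Fraction(s, 5) for s in range(5 * 255 + 1)}
assert len(five) == 1276
```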


ImaginaryHuman(Posted 2010) [#27]
How about a modulo 256?

And if we just blur 2 numbers the modulo is either going to be a 0 or a 1, so can we not add 0 or 1 to the 0..254 byte values?


Floyd(Posted 2010) [#28]
I don't know where you got those numbers.

The sum of two bytes covers every integer from 0 to 510. No amount of trickery will fit all of them into one byte.


ImaginaryHuman(Posted 2010) [#29]
What I meant is that if the result of 0..255 + 0..255 is more than 255, you can store it in a byte by wrapping it with modulo 256. I.e. if it's >255 you subtract 256, if it's <0 you add 256. It works out and is reversible. Problem is it adds noise to the results.


xlsior(Posted 2010) [#30]
What I meant is that if the result of 0..255 + 0..255 is more than 255, you can store it in a byte by wrapping it with modulo 256.


But how do you know if there was any wrapping at all when looking at the end result?


Floyd(Posted 2010) [#31]
Exactly. I calculate x+y, and if necessary subtract 256. The result is 100. What was x+y?


ImaginaryHuman(Posted 2010) [#32]
They do it in the PNG filters.

e.g. the `subtract` filter

ByteValue=OriginalValue-PreviousByteValue

If byte A is 251 and byte B is 88, 88-251=-163 ...+256=93

93 gets stored in the byte

Later when it comes to reverse the subtract filter, you go 251+93=344, then 344-256=88.

It works. It's just that this means that each time you add two bytes together to get a value 0..511, and it wraps into one byte value, that makes a lot of noise which works against the blurring effort.
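The PNG Sub filter round trip sketched in Python (illustrative names), which also shows why the wrap is reversible in that case: the value being recovered is known to be an integer 0..255, so mod-256 arithmetic loses nothing.

```python
# PNG-style "subtract" filter: store the wrapped difference from the
# previous byte, then reverse it with a wrapped add.
def sub_filter(prev, value):
    return (value - prev) % 256

def unfilter(prev, stored):
    return (prev + stored) % 256

stored = sub_filter(251, 88)     # 88 - 251 = -163, wraps to 93
assert stored == 93
assert unfilter(251, stored) == 88

# The round trip holds for every pair of byte values.
assert all(unfilter(p, sub_filter(p, v)) == v
           for p in range(256) for v in range(256))
```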


Floyd(Posted 2010) [#33]
Your "subtract" example is not at all the same. Going in reverse you know the final value is an integer 0 to 255. In the "average of two" blur the reverse is 0 to 255 in steps of 0.5. You can't recover the fractional part.

That being the case no special technique is required. We are back where we started: compute the average and round to integer. The remainder is lost.


ImaginaryHuman(Posted 2010) [#34]
Hmm.
Well anyway, no biggie, not planning to use that.

Am almost giving up on this whole blur thing, have tried many many different ideas and none of them work, lol