Implement Fast Fourier Transform ops · Issue #386 · tensorflow/tensorflow · GitHub

Implement Fast Fourier Transform ops #386


Closed
NickShahML opened this issue Dec 1, 2015 · 81 comments

Assignees
Labels
stat:contribution welcome (Status - Contributions welcome) · type:feature (Feature requests)

Comments

@NickShahML

Hey TF,

Trying to integrate the new unitary RNNs into TensorFlow. Unfortunately, they require Fast Fourier Transform (FFT) and Inverse Fast Fourier Transform (IFFT) ops.

For efficiency this needs to be done on the GPU. In the authors' code (Theano), they do this by making a custom Theano op that wraps scikit-cuda's FFT and IFFT, here:

https://github.com/amarshah/complex_RNN/blob/master/fftconv.py

How can this be done in TF? Upon completing the unitary RNN, I plan to share it with everyone! I asked on the Google group, but didn't get a reply.

@vrv
vrv commented Dec 2, 2015

We'll need to add a few FFT ops to get this done -- probably using cuFFT or fbfft for GPU. Assigning to @zffchen78 since he has experience with FFTs and adding ops :)

@NickShahML
Author

Thank you @vrv and @zffchen78. Basically, these new types of RNNs are very promising, and I hope to integrate them into TensorFlow over the next week if possible. I'm starting to work on it here:
https://github.com/LeavesBreathe/Seq2Seq_Upgrade_TensorFlow

However, it will not be possible to do without FFT and IFFT support so I really appreciate your help!

@adder
adder commented Dec 2, 2015

So would this imply that the FFT ops only work on GPU and not CPU? Or are they just very, very slow on CPU?

@NickShahML
Author

Well, the problem is that the actual unitary RNN uses FFT and IFFT within its cell. Ideally you want the cell's computation done in parallel on the GPU. If it's on the CPU, it's going to take forever.

Here's the paper: http://arxiv.org/pdf/1511.06464v1.pdf. Look at their main equation (it's equation 6).
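
For context (from memory, so please check the paper for the authoritative form), equation (6) parameterizes the recurrent matrix as a product of structured unitary factors, roughly

W = D_3 \, R_2 \, \mathcal{F}^{-1} \, D_2 \, \Pi \, R_1 \, \mathcal{F} \, D_1

where the D_i are diagonal matrices with unit-modulus entries, the R_i are Householder reflections, \Pi is a fixed permutation, and \mathcal{F} is the DFT matrix. That \mathcal{F} and \mathcal{F}^{-1} pair is exactly where the FFT/IFFT ops come in.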

@adder
adder commented Dec 2, 2015

By 'taking forever', do you mean compared to the uRNN GPU version, or also compared to a 'classical' RNN or an LSTM with the same number of parameters? I would at least hope the speed of convergence demonstrated in the paper would outweigh any extra computation time per iteration. :)

Anyway, I would expect all ops defined in TensorFlow to be available for both CPU and GPU (and any future device). I, and I guess others, don't have day-to-day access to a suitable GPU. Or is that not one of the design goals?

@vrv
vrv commented Dec 2, 2015

We'll try to provide an implementation for both.

@vrv vrv changed the title Unitary RNN --> Fast Fourier Transform on GPU Implement Fast Fourier Transform ops Dec 2, 2015
@NickShahML
Author

Perfect! Thanks! @adder, I primarily use GPUs, and if you're going to use unitary RNNs for some serious language modeling, you've got to use GPUs (10x faster training time).

I know they may converge more quickly (one of the benefits), but keep in mind that an epoch usually takes at least 24 hours with normal LSTMs and GRUs on big datasets. So we simply can't afford to put them on CPUs.

@zffchen
Contributor
zffchen commented Dec 2, 2015

@LeavesBreathe, can you elaborate a bit more on your requirements for FFT/IFFT?
Specifically,

  1. do you need r2c (real-to-complex), c2r (complex-to-real), or c2c (complex-to-complex) fft/ifft support?
  2. do you need 1D, 2D, or 3D fft/ifft? Or all of them?
  3. do you need support for batching?
  4. In your original email, you pointed to Theano's fftconv.py code, which uses fft/ifft to implement a conv2d operation. Do you need conv2d in the end? If that's all you want, TF already has a conv2d operation, and it runs on GPU using cuDNN. As I was told, newer versions of cuDNN already use FFT under the hood.

@adder
adder commented Dec 2, 2015

I guess you should ask these questions to @LeavesBreathe :)

@zffchen
Contributor
zffchen commented Dec 2, 2015

@LeavesBreathe, is this the paper you are trying to reproduce? http://arxiv.org/pdf/1511.06464v1.pdf And formula (6) is the core computation you need to do. AFAIK, you need 2D, non-batched, complex-to-complex fft/ifft. Is that correct?

@NickShahML
Author

Thanks @adder! @zffchen78, thanks for your help in this matter. I will do my best to answer your questions. The ultimate goal is to replicate this paper: http://arxiv.org/pdf/1511.06464v1.pdf

To do so, I was planning on making a new class in RNN_cell.py. Therefore, I believe we need complex-to-complex along with 2D support.

The reason I wanted a Pythonic TensorFlow op is so that we can assign multiple GPUs to multiple unitary RNNs when the whole integration is complete. So one uRNN would be on GPU 1, and another uRNN would be on GPU 2.

I don't know how hard it is for you to implement these FFTs and IFFTs, but I think 3D support would be nice for future users who may try unitary conv nets. I certainly don't want to ask too much of you though!

If this is of any help to you, the goal is to replicate "Complex_RNN" here: https://github.com/amarshah/complex_RNN/blob/master/models.py#L532

@psmit
psmit commented Dec 9, 2015

I would also very much like to see FFT support, not for training DNN models, but for feature extraction in a speech recognition pipeline. In this case only r2c, 1D would be required, preferably with batching on both CPU and GPU.

@NickShahML
Author

Hey @zffchen78, I just wanted to follow up and ask whether this has been implemented yet? I can't make any progress on the unitary RNNs without it. I don't mean to apply any pressure! Just wanted to ask.

@zffchen
Contributor
zffchen commented Dec 18, 2015

I've implemented 2D FFT in the TF code base already. Hopefully it'll be copied into OSS soon.

@NickShahML
Author

Okay, thanks for the heads-up. Looking forward to it!

@psmit
psmit commented Dec 18, 2015

@LeavesBreathe Why close this? It isn't there yet!

@NickShahML NickShahML reopened this Dec 18, 2015
@zffchen
Contributor
zffchen commented Dec 22, 2015

@LeavesBreathe, we pushed changes today which enable fft2d and ifft2d on GPU. You can take a look at python/kernel_tests/fft_ops_test.py to see if it works for you. We are still figuring out licensing issues with FFTW, which CPU support for fft2d/ifft2d needs. Hopefully GPU support is sufficient for you for now. Let me know if you see any problems.
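
For anyone following along, a minimal round-trip sketch of what using the new ops might look like on GPU (assuming they are exposed in Python as tf.fft2d/tf.ifft2d, as exercised by fft_ops_test.py):

import numpy as np
import tensorflow as tf

# Random complex 2-D input.
x = (np.random.randn(8, 8) + 1j * np.random.randn(8, 8)).astype(np.complex64)

with tf.device('/gpu:0'):
    y = tf.fft2d(tf.constant(x))   # forward 2-D FFT
    x_back = tf.ifft2d(y)          # inverse 2-D FFT (normalized by 1/N)

with tf.Session() as sess:
    x_rec = sess.run(x_back)
np.testing.assert_allclose(x_rec, x, atol=1e-4)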

@NickShahML
Author

Awesome, GPU support is really all I needed, so let me test it out and get back to you. I'm going to be pretty busy over the holidays, but come January 4 I should be back to testing this!

@zffchen
Contributor
zffchen commented Dec 22, 2015

@LeavesBreathe, I took a brief look at the paper you plan to replicate in TF. I suspect you'll encounter some nuances. E.g., some operations may not have support for complex64 for the time being. These are minor problems because they can easily be patched by updating a few .cc files to expand the types they support.

@NickShahML
Author

Okay, this is very good to know, @zffchen78. I will be sure to look into these formats.

@raingo
Contributor
raingo commented Jan 15, 2016

I think @LeavesBreathe's answer to @zffchen78's question was wrong.

What the uRNN needs is a batched 1D FFT. Both the paper and their Theano implementation indicate this.

Paper-wise, the matrix in Equation (6) is multiplied with a complex vector for each data instance.

Implementation-wise, check the following call stack:

  1. https://github.com/amarshah/complex_RNN/blob/master/models.py#L14
  2. https://github.com/amarshah/complex_RNN/blob/master/fftconv.py#L147

To the TF core developers: why not simply implement an interface like scikits.cuda's: https://github.com/lebedov/scikit-cuda/blob/master/skcuda/fft.py

@zffchen
Contributor
zffchen commented Jan 16, 2016

We welcome the community to contribute the code.

If one wants to make the FFT op support more complete in TensorFlow, feel free to send us changes. It shouldn't be too hard to copy the fft2d implementation and add fft1d, fft3d, and batched_fft1d, batched_fft2d, batched_fft3d, etc. as the need arises, plus testing, etc. These ops' implementations will pretty much look like

stream->CreatePlan(...);
stream->DoFft(...);


@vrv vrv added the enhancement and stat:contribution welcome labels Feb 19, 2016
@rryan
Member
rryan commented Apr 3, 2017

The current plan of record is that @tillahoffmann is working on a CPU-based FFT/RFFT using Eigen's TensorFFT module. No firm ETA but hopefully soon -- thanks @tillahoffmann!

@pawarrick

@tillahoffmann, do you have any updates on progress toward the CPU implementation of FFT? Many thanks for this.

@tillahoffmann
Contributor

@pawarrick, complex FFT and IFFT as well as the RFFT are now in master. The IRFFT still needs a bit of work.

@pawarrick

Yes, thanks very much for your contribution. I saw the discussion about it in this thread: #9029

@Czxck001
Contributor

@tillahoffmann Thanks very much for your contribution! I've added the IRFFT part, see #10127.

@rryan
Member
rryan commented May 24, 2017

Thanks to @tillahoffmann (#9029) and @Czxck001 (#10127), I think we can call this done!

@rryan rryan closed this as completed May 24, 2017
@beniroquai

Great work! Now I can finally hack on FFT operations on my laptop.
I'm a bit curious: do you see any chance that these ops could eventually be ported to mobile platforms such as Android?

@Androbin
Contributor
Androbin commented Jun 10, 2017

@beniroquai
As long as they are available for CPU, they can be compiled for Android. I don't know if they are in the default op list; otherwise you can use selective registration.
See #10254, #10299 and #10351

@uschmidt83
Contributor

Hi, I just tried running FFT ops on the CPU with the 1.2.1 release, but still get errors like this:
No OpKernel was registered to support Op 'FFT2D' with these attrs. Registered devices: [CPU], Registered kernels: device='GPU'

Am I missing something?

@tillahoffmann
Contributor

@uschmidt83, have you tried the nightly build?

@beniroquai
beniroquai commented Jul 21, 2017 via email

@rryan
Member
rryan commented Jul 21, 2017 via email

@uschmidt83
Contributor

Thanks, everyone! It does work for me with the 1.3.0-rc0 release.

@FedericoMuciaccia

There isn't an alias for the real FFTs in TF 1.3.0.

present functions:
tf.spectral.fft
tf.spectral.fft2d
tf.spectral.fft3d
tf.spectral.ifft
tf.spectral.ifft2d
tf.spectral.ifft3d
tf.spectral.irfft
tf.spectral.irfft2d
tf.spectral.irfft3d
tf.spectral.rfft
tf.spectral.rfft2d
tf.spectral.rfft3d

present aliases:
tf.fft
tf.fft2d
tf.fft3d
tf.ifft
tf.ifft2d
tf.ifft3d

Can someone please create the missing aliases for the next TF release?
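
In the meantime, a trivial workaround (just a sketch) is to bind the spectral functions to shorter names in your own code:

import tensorflow as tf

# Short aliases for the real-input transforms until official ones exist.
rfft, rfft2d, rfft3d = tf.spectral.rfft, tf.spectral.rfft2d, tf.spectral.rfft3d
irfft, irfft2d, irfft3d = tf.spectral.irfft, tf.spectral.irfft2d, tf.spectral.irfft3d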

@rryan
Member
rryan commented Sep 15, 2017 via email

@WangNuoWa

Hi everyone,

I would like to do some work on the basis of this article, but the original authors use the theano.tensor.fft module to implement batched 1-D FFT, batched 2-D FFT, batched 1-D IFFT, and batched 2-D IFFT. Now I want to replace the theano.tensor.fft module directly with TensorFlow's tf.fft and tf.fft2d. Is that OK?

https://github.com/ChihebTrabelsi/deep_complex_networks/blob/master/complexnn/fft.py

We look forward to your help, thank you very much!

@rryan
Member
rryan commented Oct 31, 2017

@WangNuoWa -- yes, tf.fft/tf.ifft and tf.fft2d/tf.ifft2d should be drop-in replacements for those (though I have never used theano.tensor.fft), perhaps with slightly different normalization strategies (I see an "ortho" norm in there).
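
Note that tf.fft/tf.fft2d transform the innermost dimension(s), so inputs shaped [batch, n] or [batch, h, w] are effectively batched transforms. If that "ortho" norm means a 1/sqrt(N) scaling (an assumption; check the Theano docs), a sketch of matching it on top of the unnormalized TF ops:

import tensorflow as tf

def fft_ortho(x):
    # 1-D FFT over the last axis with 1/sqrt(N) ("ortho") scaling; x is complex.
    n = tf.cast(tf.shape(x)[-1], tf.float32)
    scale = tf.cast(tf.sqrt(n), x.dtype)
    return tf.fft(x) / scale

def ifft_ortho(x):
    # tf.ifft already divides by N, so multiply by sqrt(N) to get "ortho" scaling.
    n = tf.cast(tf.shape(x)[-1], tf.float32)
    scale = tf.cast(tf.sqrt(n), x.dtype)
    return tf.ifft(x) * scale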

@absudabsu

Is tf.fft differentiable?

@rryan
Member
rryan commented Dec 14, 2017 via email

@absudabsu
absudabsu commented Dec 24, 2017

@rryan, is there anything special we would have to do? I get errors now, like these (you can copy-paste the whole thing):

import numpy as np
import tensorflow as tf

x = np.linspace(0,10,num=10,dtype=np.complex64)
w = np.linspace(0,1,num=10,dtype=np.complex64)
y = np.multiply(x,w)

x_tf = tf.Variable(x,dtype=tf.complex64)
w_tf = tf.Variable(w,dtype=tf.complex64)
y_tf = tf.multiply(x_tf,w_tf)

sess = tf.Session()
sess.run(tf.global_variables_initializer())
sess.run(y_tf)
g1 = tf.gradients(y_tf,x_tf)
'''
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/intel/intelpython3/lib/python3.5/site-packages/tensorflow/python/ops/gradients_impl.py", line 472, in gradients
    grad_ys = _DefaultGradYs(grad_ys, ys, colocate_gradients_with_ops)
  File "/opt/intel/intelpython3/lib/python3.5/site-packages/tensorflow/python/ops/gradients_impl.py", line 233, in _DefaultGradYs
    y.dtype)
TypeError: Gradients of complex tensors must set grad_ys (y.dtype = tf.complex64)
'''

a_tf = tf.fft(x_tf)
sess.run(a_tf)
g2 = tf.gradients(a_tf,x_tf)
'''
g2 = tf.gradients(a_tf,x_tf)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/opt/intel/intelpython3/lib/python3.5/site-packages/tensorflow/python/ops/gradients_impl.py", line 472, in gradients
    grad_ys = _DefaultGradYs(grad_ys, ys, colocate_gradients_with_ops)
  File "/opt/intel/intelpython3/lib/python3.5/site-packages/tensorflow/python/ops/gradients_impl.py", line 233, in _DefaultGradYs
    y.dtype)
TypeError: Gradients of complex tensors must set grad_ys (y.dtype = tf.complex64)
'''

print(x_tf)
print(w_tf)
print(y_tf)
'''
>>> print(x_tf)
<tf.Variable 'Variable:0' shape=(10,) dtype=complex64_ref>
>>> print(w_tf)
<tf.Variable 'Variable_1:0' shape=(10,) dtype=complex64_ref>
>>> print(y_tf)
Tensor("Mul:0", shape=(10,), dtype=complex64)
'''

If it's relevant:

pip list | grep tensorflow
tensorflow-gpu (1.4.1)

@rryan
Member
rryan commented Dec 24, 2017

@rryan, is there anything special we would have to do? I get errors now, like these (you can copy-paste the whole thing):

I think what you're seeing is independent of whether FFT is differentiable. Here's a colab where I copied your example. You need to do as the error message says and provide a grad_ys argument to tf.gradients if ys is complex, because the TF gradient infrastructure currently won't assume that grad_ys is all ones if the type is complex, as it does when the type is real (I'm not totally sure why this is... maybe the original author of that code didn't want to assume that 1+1j was the proper grad_ys for a complex number because its magnitude is greater than 1?).

You can see this logic here:

if grad_y is None:
  if y.dtype.is_complex:
    raise TypeError(
        "Gradients of complex tensors must set grad_ys (y.dtype = %r)" %
        y.dtype)
  new_grad_ys.append(array_ops.fill(
      array_ops.shape(y), constant_op.constant(
          1, dtype=y.dtype, name="grad_ys_%d" % i)))
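
So, reusing the tensors from the snippet above, passing an explicit grad_ys of ones with the matching complex dtype should make the calls go through:

# Supply grad_ys explicitly for complex-valued outputs.
g1 = tf.gradients(y_tf, x_tf, grad_ys=tf.ones_like(y_tf))
g2 = tf.gradients(a_tf, x_tf, grad_ys=tf.ones_like(a_tf))
sess.run([g1, g2])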

@zaccharieramzi
Contributor

Hi,

I was wondering whether you guys at TF were planning to implement a non-uniform FFT layer.
I don't know if this question belongs here, or if I should create a new issue for it.

Cheers

@rryan
Member
rryan commented Mar 25, 2019

@zaccharieramzi nobody on the TF team is currently considering that, as far as I know.

AIUI, the most efficient algorithms for it are defined in terms of the FFT -- is it an easy extension of the FFT ops that already exist? If so, I would be happy to code review an addition to tf.signal to add it if someone submits a pull request.

If it is more involved to implement, for example requiring custom kernels, then I think it might be a harder sell to include, since there has not been high demand for it, and adding custom kernels increases the binary size and compile time of TensorFlow.

Could you please file a GitHub issue requesting the feature? Anyone else who is interested would then have a nice place to add a +1 so we can better understand how many people would benefit from it being in TensorFlow itself (as opposed to being implemented as a standalone library).

@zaccharieramzi
Contributor
zaccharieramzi commented Mar 27, 2019

Hi @rryan ,

Unfortunately it's not a direct extension of the FFT AFAIK (I am still new to the topic), so I think it does require a custom kernel. I totally understand that it could be a hard sell, and I will file a GitHub issue requesting the feature (I am not at all familiar with C/C++, so I don't think I could implement it myself).

In the meantime I will use ODL to achieve my goals, although it's slower because the implementation is in Python.


EDIT: done in #27193 .

darkbuck pushed a commit to darkbuck/tensorflow that referenced this issue Jan 23, 2020
…kip-hcc-runtime-for-hipclang: Ignore HCC runtime library in case hip-clang toolchain is used
@itskobold

Hello,

I'm getting this not implemented error...

Gradient for IRFFT3D is not implemented. (I'm assuming I'll get a similar error for RFFT3D too).

Is there any way I could get this working myself? I've taken a look at where the ops for 1D/2D (I)RFFT are defined in fft_ops.py but I'm a bit lost as to what I'd need to do to implement it in 3D.

Thanks in advance for any advice!

@tillahoffmann
Contributor

I don't have TensorFlow installed at the moment to verify, but the three-dimensional RFFT is just a two-dimensional RFFT plus an FFT along the leading dimension. Here's a NumPy example:

import numpy as np

x = np.random.normal(0, 1, (15, 16, 17))
# Compute three-dimensional FFT.
fftn = np.fft.rfftn(x, x.shape)
# Do it in two steps.
fftn_two_step = np.fft.fft(np.fft.rfft2(x), axis=0)
np.testing.assert_allclose(fftn, fftn_two_step)

This should allow you to get the gradients by decomposing the three-dimensional transform into a one-dimensional and a two-dimensional transform (assuming those two have gradients implemented).
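
For example, a sketch of that decomposition in TF 1.x (the helper name is mine; this assumes tf.spectral.rfft2d and tf.spectral.fft both have gradients registered):

import tensorflow as tf

def rfft3d_via_2d(x):
    # x: real tensor of shape (d0, d1, d2).
    y = tf.spectral.rfft2d(x)          # transforms the last two axes -> (d0, d1, d2 // 2 + 1)
    y = tf.transpose(y, [1, 2, 0])     # move the leading axis to the innermost position
    y = tf.spectral.fft(y)             # 1-D FFT along what was axis 0
    return tf.transpose(y, [2, 0, 1])  # restore the original axis order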
