Feature Request: General translation-invariant filters #5618


Open · VHarisop opened this issue Feb 24, 2017 · 2 comments


@VHarisop commented Feb 24, 2017

Hello everyone,

I have been using Theano lately to experiment with morphological neural networks, the lattice-theoretic counterparts of "traditional" models like the perceptron. Instead of a dot product, as in a multiply-accumulate activation:

z = T.dot(W, x)   # letting x be a vector

the activation of those models is

z = T.max(W + x, axis=1)   # or T.min()

As these activations are essentially dilations and erosions, we would like to experiment with their performance as building blocks of convolutional layers. However, as far as I know, Theano does not provide a general way to define a translation-invariant filter on images, apart from the convolution operator.

My suggestion would be to add this functionality to the theano.tensor.nnet.abstract_conv module, possibly extending BaseAbstractConv2d so that it supports any type of translation-invariant filtering (not necessarily linear).
If you agree that this addition could be useful to others, I would also be willing to work on its implementation, provided I get some feedback or guidance on which parts of the codebase should be modified and how to do so efficiently.
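For concreteness, here is a minimal runnable version of the dense activation above (variable names and shapes are illustrative):

import numpy as np
import theano
import theano.tensor as T

# Dense morphological (dilation) layer: z_i = max_j (W_ij + x_j).
x = T.vector('x')
W = T.matrix('W')
z = T.max(W + x, axis=1)        # erosion variant: T.min(W + x, axis=1)

f = theano.function([W, x], z)
print(f(np.zeros((3, 4), dtype=theano.config.floatX),
        np.arange(4, dtype=theano.config.floatX)))   # -> [3. 3. 3.]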

@lamblin (Member) commented Feb 24, 2017

A generalization of Images2Neibs, together with an implementation of its gradient, could be a general base if you want to experiment with different activations. You would not get all the speed possible, but you would gain flexibility, since you would have all the patches explicitly and could apply any operation to them.
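A rough sketch of this patch-based approach, using the existing images2neibs (filter shape and names here are illustrative assumptions):

import theano
import theano.tensor as T
from theano.tensor.nnet.neighbours import images2neibs

img = T.tensor4('img')          # (batch, channels, rows, cols)
w = T.vector('w')               # flattened (fh, fw) structuring element
fh, fw = 3, 3                   # hypothetical filter shape

# Each row of `patches` is one flattened (fh, fw) window, so any
# reduction can be applied to it, not just a dot product.
patches = images2neibs(img, neib_shape=(fh, fw), neib_step=(1, 1))
z = T.max(patches + w, axis=1)  # dilation; T.min for erosion

# Caveat: with overlapping steps the gradient of Images2Neibs may not
# be implemented, which is the generalization mentioned above.
f = theano.function([img, w], z)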

For specific activations (for instance, max), starting from something like the implementation of Pool could make sense (you could see the operation as max pooling with a bias).
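To make the "max pooling with a bias" reading concrete, here is a reference version written with existing ops, for non-overlapping 2x2 windows (the window size and names are illustrative assumptions; a modified Pool would compute the same thing in one op):

import theano
import theano.tensor as T

x = T.tensor4('x')   # (batch, channels, rows, cols), rows and cols even
b = T.vector('b')    # one bias per position inside the 2x2 window (4 values)

n, c, r, k = x.shape[0], x.shape[1], x.shape[2], x.shape[3]
# Group each 2x2 window into the last axis, then reduce over it.
windows = (x.reshape((n, c, r // 2, 2, k // 2, 2))
            .dimshuffle(0, 1, 2, 4, 3, 5)
            .reshape((n, c, r // 2, k // 2, 4)))
z = T.max(windows + b, axis=4)   # plain max pooling would omit `+ b`

f = theano.function([x, b], z)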

For the moment, I'm not sure what a good abstract interface for that family of operations would be, or even whether it will end up being widely used, so I would say it is too early to start from AbstractConv or BaseAbstractConv (abstract ops are mostly useful when several implementations are available for the same kind of device); maybe we should start with a CPU or GPU implementation.

We can help you here, on theano-dev, or in the comments of a PR if you start one, but I can't guarantee we will always be responsive.

@khaotik (Contributor) commented Feb 28, 2017

Just a personal idea: if there were a way to generalize elemwise ops a bit, Theano should be able to generate a single efficient kernel for a nonlinear filter, hopefully for both CPU and GPU.

For example, see #5471; this could be done if pad were fused with elemwise.
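As a sketch of the kind of graph such fusion would target, a 'valid' dilation can be written purely with slicing and elemwise ops (filter shape and names are illustrative assumptions); a graph like this could in principle compile down to a single fused kernel:

import theano
import theano.tensor as T

x = T.tensor3('x')   # (batch, rows, cols)
w = T.matrix('w')    # (fh, fw) structuring element
fh, fw = 3, 3        # hypothetical filter shape

out_r = x.shape[1] - fh + 1
out_c = x.shape[2] - fw + 1
# One shifted, biased copy of the input per filter offset, then an
# elementwise max across the copies.
shifted = [x[:, i:i + out_r, j:j + out_c] + w[i, j]
           for i in range(fh) for j in range(fw)]
z = T.stack(shifted, axis=0).max(axis=0)

f = theano.function([x, w], z)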
