Full advanced Indexing support + gradient by jsalvatier · Pull Request #1083 · Theano/Theano

Full advanced Indexing support + gradient #1083

Closed
wants to merge 23 commits

Conversation

jsalvatier
Contributor

This patch implements full advanced indexing support as well as the gradient.

It currently relies on this package being installed (https://github.com/jsalvatier/advinc), which only has a C function. Obviously this should go in Theano, but it's not clear to me where it should go. I would like some input here.

Also, the patch currently relies on numpy having an interface to MapIter (which is a recent patch), so it's also necessary to check that this works.
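
For context (an editor's illustration, not part of the original description): what the gradient needs from MapIter is unbuffered accumulation over repeated indices, which plain fancy-index assignment in NumPy does not provide; NumPy 1.8 later exposed the same behaviour as ufunc.at.

import numpy

a = numpy.array([50., 42., 1.])
idx = numpy.array([1, 0, 1])
vals = numpy.array([1., 2., 3.])

# Plain fancy-index assignment is buffered: the repeated index 1 only
# receives the last increment.
b = a.copy()
b[idx] += vals
# b is now [52., 45., 1.]

# Unbuffered accumulation (what the gradient of advanced indexing needs):
# every occurrence of a repeated index contributes.  Exposed as ufunc.at
# in NumPy >= 1.8.
c = a.copy()
numpy.add.at(c, idx, vals)
# c is now [52., 46., 1.]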

Since the patch isn't strictly ready, I'm not sure if this is the right channel to go through, so let me know.

@goodfeli
Member

This is the right channel to go through. Thanks for submitting this! It will be great to have advanced indexing. When some more people get to the office we'll figure out where to put things / how to handle the dependencies.

discrete_dtypes = map(str, scal.discrete_types)
all_dtypes = map(str, scal.all_types)
int_dtypes = map(str, scal.int_types)
Member

Why did you remove the uint_dtypes and all_dtypes variables? Even if they are not used in this file, we need them.

Contributor Author

I'm not sure what happened here.

@nouiz
Member
nouiz commented Nov 20, 2012

Thanks for this PR. It will be really useful I think.

For the advinc repo, I think you should put the content of the advinc.c file in the c_support_code() method of the Op. In the c_code() of this Op, if it is an inc, you generate some C code that will call your function.

I probably won't have the time to do it before December, but if you want to do it, I can answer questions.

This documentation page could help you understand how we generate c code.

http://www.deeplearning.net/software/theano/extending/cop.html
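
A minimal sketch of that pattern follows (an editor's illustration, not code from this PR): the contents of advinc.c are reduced to a placeholder helper named adv_inc_helper, and the class name, signatures, and simplified output handling are all assumptions.

# Illustrative sketch: adv_inc_helper and the simplified output handling are
# placeholders standing in for the real advinc.c contents.
import theano.tensor as T
from theano.gof import Apply, Op


class AdvIncSubtensorSketch(Op):
    """Increment x at the integer indices idx by vals (sketch only)."""

    def make_node(self, x, vals, idx):
        x, vals, idx = map(T.as_tensor_variable, (x, vals, idx))
        return Apply(self, [x, vals, idx], [x.type()])

    def c_support_code(self):
        # In the real Op this string would be the contents of advinc.c.
        return """
        static int adv_inc_helper(PyArrayObject* x,
                                  PyArrayObject* vals,
                                  PyArrayObject* idx)
        {
            /* MapIter-based unbuffered increment would go here. */
            return 0;
        }
        """

    def c_code(self, node, name, inputs, outputs, sub):
        x, vals, idx = inputs
        out, = outputs
        fail = sub['fail']
        # A real Op would copy x (or declare a destroy_map) before
        # incrementing; this sketch simply aliases the output to the input.
        return """
        Py_XDECREF(%(out)s);
        %(out)s = %(x)s;
        Py_INCREF(%(out)s);
        if (adv_inc_helper(%(out)s, %(vals)s, %(idx)s) != 0)
        {
            %(fail)s;
        }
        """ % locals()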

@jsalvatier
Contributor Author

I've done that once before, but this seems like it will make the code significantly more complicated, as I would have to manually put the inplace arguments into a tuple. Isn't there a place I can put code to be compiled when theano is installed and thus simply import the function into python?

@nouiz
Member
nouiz commented Nov 22, 2012

There is only one function we currently compile the way you describe, and we want to get rid of it, but that won't happen soon. So if you choose to do it that way, I'll accept that. It is in the file theano/gof/cutils.py.

But why is building a tuple so complicated? Yes, a little, but I think much less than what you did for this PR. Or am I forgetting something? It has been a long time since I built a tuple in C.

abalkin mentioned this pull request Dec 9, 2012
@jsalvatier
Contributor Author

@nouiz Okay, this should be ready to go now.

@lamblin
Member
lamblin commented Dec 14, 2012

OK, so short of forking the development version of NumPy and bundling it with Theano, I guess this is as good as it's going to get.
I'm reluctant to encourage people to check out the development version of NumPy, though. Should we wait until NumPy's next release to publicize this feature?

@jsalvatier
Contributor Author

I think we can probably just say something like "advanced indexing now fully supported, but currently requires numpy dev version, which can be a pain". For me and probably some others advanced indexing is really valuable, and those people would like to know that it's there. Of course, if you don't need advanced indexing then updating numpy is probably not worth it.

@jaberg
Member
jaberg commented Dec 14, 2012

I like @jsalvatier's suggestion -- consider not even mentioning that it's a pain. Just say "advanced indexing and gradients are supported, but these currently require numpy dev version >= X (date)."

@jsalvatier
Contributor Author

@nouiz
Hey, I haven't been able to get the compiled-by-Theano version of the inplace increment to work. Could you check out the C code in cutils.py and tell me if there's anything obviously wrong? https://github.com/jsalvatier/Theano-1/blob/8d8be24746c656782cf1e7b8f23ad63c22cf8217/theano/gof/cutils.py

When I run the following program, I get a segfault (on the last line):

from theano import *
from theano.tensor import *
import numpy

a = dvector('a')

i = constant([1,0,1])
r = inc_subtensor(a[i], [1.,2.,3.])

f = function([a],[r])
print f(numpy.array([50., 42, 1]))  # segfaults here

I'd really like to get this patch settled soon.

@jsalvatier
Contributor Author

I should also say that the same C code works when I compile it in its own library. https://github.com/jsalvatier/advinc/blob/master3/advinc/advinc.c

@nouiz
Member
nouiz commented Feb 5, 2013

We have a deadline for ICML on Feb 15 and I need to help with optimization, so I probably won't be able to test it before Thursday/Friday. But I'll look at it.

@nouiz
Member
nouiz commented Feb 8, 2013

If I run your code with my current numpy version, it works, but it doesn't use your faster version.

If I try to install your numpy fork, the installation fails with an error about mapping.c not being available. How is this file supposed to be created from the mapping.c.src file?

I install it like this:
sudo python setup.py install

@jsalvatier
Contributor Author

By current numpy version, do you mean the current dev version? My numpy fork has been integrated into numpy master, so mapping.c.src is no longer relevant.

@nouiz
Member
nouiz commented Feb 8, 2013

OK, I used numpy master and I'm able to reproduce the crash.

But what about this PR: numpy/numpy#326? I thought it was what you needed.

I'll check why it segfaults.


@jsalvatier
Contributor Author

Oh, I should have closed that request; we decided to just add API support (see numpy/numpy#377).

@nouiz
Member
nouiz commented Feb 8, 2013

I made a PR to your branch with the fix. We need to import some stuff for numpy in the init of the python module.
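
(Editor's note, as an illustration of the kind of fix being described: a compiled extension that calls PyArray_* functions has to initialize the NumPy C API in its module init function. The module name and layout below are assumptions, not the actual patch.)

# Hedged sketch: C source of the kind a small compiled helper module needs.
# Without the NumPy C-API initialization in the init function, the first
# PyArray_* call dereferences a NULL function table and segfaults.
NUMPY_INIT_SKETCH = """
#include <Python.h>
#include <numpy/arrayobject.h>

/* ... helper functions using PyArray_* would go here ... */

static PyMethodDef module_methods[] = {
    {NULL, NULL, 0, NULL}   /* sentinel */
};

PyMODINIT_FUNC initcutils_ext(void)   /* module name chosen for illustration */
{
    Py_InitModule("cutils_ext", module_methods);
    import_array();   /* missing this is a classic cause of this kind of segfault */
}
"""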

Fix segfault as we didn't import numpy c module stuff.
@jsalvatier
Contributor Author

Awesome! Thanks :)

@@ -1095,7 +1095,8 @@ def output_types(self, *input_types):
         return upcast_out(*input_types[0])
 
     def grad(self, inputs, output_gradients):
-        return [None, None]
+        a,b = inputs
+        return [a.zeros_like().astype(theano.config.floatX), b.zeros_like().astype(theano.config.floatX)]
Member

@goodfeli can you confirm that both changes in this file are fine?

Contributor Author

Any word on this? I want to push out an alpha of pymc3, and this PR is an important feature for pymc3.

Member

I'm pretty sure it's correct.

@@ -4183,7 +4185,7 @@ def init_entry(entry, depth=0):
 rval = """
 #define PyArray_set_dim(obj, idx, d) PyArray_DIMS(obj)[idx]=d
 #define PyArray_set_stride(obj, idx, d) PyArray_STRIDES(obj)[idx]=d
-#define PyArray_set_data(obj, ptr, base) PyArray_BYTES(obj)=ptr
+#define PyArray_set_data(obj, ptr, base) ((PyArrayObject*)(obj))->data=ptr
Member

Why did you change that? With the new NumPy C-API, we won't have direct access to the ->data field. OK, the old behavior won't work either, but I'm curious why you changed it. This is the type of change that could break stuff on some other compiler/OS. So if there is no well-defined reason, I would prefer not to include it.

@nouiz
Member
nouiz commented Feb 12, 2013

I didn't review the code in cutils.py. I suppose it is copied from the numpy code, so it should work correctly.

I made a few small comments. Also, I didn't find existing tests for AdvancedSubtensor and AdvancedIncSubtensor. We need to have some. For example, I caught a case where, if we don't do an inplace AdvancedIncSubtensor, it would have crashed.

The tests should be in tensor/tests/test_basic.py. There are already the classes TestIncSubtensor1 and T_subtensor, but as said, they don't test the ops you changed. If you know of existing tests for these classes, we need to add tests for the new shapes/indices that you added support for. Don't change the T_subtensor class, as it is reused for GPU tests. You can make a new class TestAdvancedSubtensor in the same spirit as TestIncSubtensor1.
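
To make that concrete, here is a minimal sketch of what such a test could look like (an editor's illustration: the class name, shapes, and values are assumptions, not the tests that were actually added):

import unittest

import numpy

import theano
from theano import tensor


class TestAdvancedIncSubtensorSketch(unittest.TestCase):
    def test_inc_matrix_with_two_index_vectors(self):
        # Exercise the new multi-dimensional advanced indexing path:
        # increment m at the coordinates zip(rows, cols).
        m = tensor.dmatrix('m')
        rows = tensor.lvector('rows')
        cols = tensor.lvector('cols')
        r = tensor.inc_subtensor(m[rows, cols], [10., 20.])
        f = theano.function([m, rows, cols], r)

        out = f(numpy.zeros((2, 3)),
                numpy.asarray([0, 1], dtype='int64'),
                numpy.asarray([1, 2], dtype='int64'))
        expected = numpy.zeros((2, 3))
        expected[[0, 1], [1, 2]] += [10., 20.]
        numpy.testing.assert_allclose(out, expected)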

thanks for this big new feature.

@nouiz
Member
nouiz commented Feb 13, 2013

I found some indirect tests for advanced subtensor. In fact, they are tests for an optimization that removes it from the graph by creating some crossentropy ops. Those tests are in theano/tensor/nnet/tests/test_nnet.py.

But I don't think we can reuse them, and this is not a good place for the rest of the op's tests.

jsalvatier mentioned this pull request Feb 19, 2013
@lamblin
Member
lamblin commented Mar 7, 2013

Continued in gh-1269, closing.

lamblin closed this Mar 7, 2013