Tree decoding fix by DeNeutoy · Pull Request #1606 · allenai/allennlp · GitHub
This repository was archived by the owner on Dec 16, 2022. It is now read-only.

Tree decoding fix #1606

Merged
merged 7 commits into allenai:master on Aug 15, 2018

Conversation

DeNeutoy (Contributor)

No description provided.

DeNeutoy added this to the Release v0.5.2 milestone on Aug 14, 2018
DeNeutoy requested a review from matt-gardner on Aug 14, 2018 23:09
matt-gardner (Contributor) left a comment:
There are still a couple of questions around the indexing that aren't clear to me.

# Shape (batch_size, num_head_tags, sequence_length, sequence_length)
# This energy tensor expresses the following relation:
# energy[i,j] = "Score that j is the head of i". In this
matt-gardner (Contributor):

I think something's backwards in either this comment or the logic above. Because if I substitute j for ROOT and i for some_word, I get "Score that ROOT is the head of some_word", which you set to very negative above, with energy[:, 0, :] = -1e8. Right?

DeNeutoy (Contributor, Author):

No, that's the wrong way around! You replace j with each of the words at index j, so you get: "Score that word[j] is the head of ROOT". Therefore, if I want ROOT to never be a child of a word, I should zero out the first row.

matt-gardner (Contributor):

I'm just saying there's a mismatch between your comment and your code. Oh, wait, no, I was confusing myself because of the batch dimension. When I wrote my comment, I was thinking it was normalized_arc_logits[i, j, :], so j was ROOT, but it's actually normalized_arc_logits[:, i, :], so i is ROOT. Ok, all good.
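
To make the resolution concrete, here is a minimal sketch of the agreed convention (my own illustration, not the PR's code), assuming energy[b, i, j] = "score that word j is the head of word i", with index 0 reserved for ROOT:

import torch

# Hypothetical toy dimensions, not taken from the PR.
batch_size, seq_len = 1, 3
energy = torch.randn(batch_size, seq_len, seq_len)

# energy[b, 0, j] is the score that word j is the head of ROOT, so masking
# the first *row* (not the first column) forbids ROOT from ever being a
# child of any word:
energy[:, 0, :] = -1e8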

@@ -25,7 +25,7 @@ def test_uses_named_inputs(self):
assert head_tags is not None
assert isinstance(head_tags, list)
assert all(isinstance(x, int) for x in head_tags)

print(result)
matt-gardner (Contributor):

Remove print.

# This is the correct MST, but not desirable for dependency parsing.
assert heads.tolist()[0] == [-1, 0, 0]

energy[:, :, 0, :] = 0
matt-gardner (Contributor):

Is this example really doing what you think it's doing?

>>> t = torch.Tensor([[0, 1, 1], [10, 0, 1], [10, 1, 0]]).view(1, 1, 3, 3)
>>> t
tensor([[[[  0.,   1.,   1.],
          [ 10.,   0.,   1.],
          [ 10.,   1.,   0.]]]])
>>> t[:, :, 0, :] = 0
>>> t
tensor([[[[  0.,   0.,   0.],
          [ 10.,   0.,   1.],
          [ 10.,   1.,   0.]]]])

This is zeroing out the top row, not the first column. Is that what you expected? And in general, I'd recommend using different numbers for every non-zero value, so there are no ties and it's more obvious what the MST should be.
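
For comparison, here is a sketch of the two slices side by side (my own illustration, not the PR's test), using distinct non-zero weights as suggested so that the MST is unambiguous:

import torch

# Distinct weights on every edge, so there are no ties.
t = torch.Tensor([[0., 2., 3.],
                  [10., 0., 4.],
                  [11., 5., 0.]]).view(1, 1, 3, 3)

row_zeroed = t.clone()
row_zeroed[:, :, 0, :] = 0  # zeroes the top row (index 0 of dim 2)

col_zeroed = t.clone()
col_zeroed[:, :, :, 0] = 0  # zeroes the first column (index 0 of dim 3)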

DeNeutoy merged commit 9540125 into allenai:master on Aug 15, 2018
gabrielStanovsky pushed a commit to gabrielStanovsky/allennlp that referenced this pull request Sep 7, 2018
* initial fix

* correct approach

* fix and test

* fix predictor test

* fix pylint

* use unique edge weights