Fixes for seq2seq model by brendan-ai2 · Pull Request #1808 · allenai/allennlp · GitHub
This repository was archived by the owner on Dec 16, 2022. It is now read-only.

Fixes for seq2seq model #1808

Merged

brendan-ai2 merged 5 commits into allenai:master on Sep 24, 2018

Conversation

brendan-ai2 (Contributor)

@brendan-ai2 removed the request for review from matt-gardner on September 21, 2018, 18:47
@brendan-ai2 (Contributor Author)

(Needs an extra test before review.)

@matt-gardner (Contributor) left a comment


LGTM!

@brendan-ai2 (Contributor Author)

Thanks!

I added a simple test for another issue that using get_final_encoder_states also fixes (handling bidirectional encoders), and I manually verified that validation loss improves by training a trivial autoencoder. Do we have any established patterns for testing that we don't regress on training full models? That feels more like an integration test.
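For reference, here is a minimal sketch of what the get_final_encoder_states call does for a bidirectional encoder; the shapes, values, and variable names are illustrative assumptions, not the PR's actual code:

# Sketch only: shapes, values, and variable names are assumptions.
import torch
from allennlp.nn import util

batch_size, num_timesteps, encoding_dim = 2, 5, 8

# Outputs from a bidirectional encoder: the second half of the last
# dimension holds the backward direction's states.
encoder_outputs = torch.randn(batch_size, num_timesteps, encoding_dim)

# The second sequence is padded after three timesteps.
mask = torch.tensor([[1, 1, 1, 1, 1],
                     [1, 1, 1, 0, 0]])

# Selects the forward state at each sequence's last unmasked timestep
# and, with bidirectional=True, concatenates the backward state from
# timestep 0 rather than naively taking encoder_outputs[:, -1, :].
final_states = util.get_final_encoder_states(encoder_outputs, mask,
                                             bidirectional=True)
assert final_states.size() == (batch_size, encoding_dim)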

Let me know if it's worth having a test specific to the masking issue.

@matt-gardner (Contributor) left a comment


(Sorry, I think I started looking over the code before you removed the review request, and approved it before seeing that you wanted me to wait. It still looks good =).)

I don't think we need a more specific test for the masking issue. It would be far more work than it's worth to make that directly testable in this model code, and the function you're calling already has its own unit tests.

And no, we don't have any larger tests that train models on real data. That might be nice to have and to run occasionally (nightly?), but we definitely don't want something that big in our PR CI.
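For context, a rough illustration of the masking property that those unit tests cover; the tensors and names below are assumptions, not the library's actual test code:

# Sketch only: values are illustrative assumptions.
import torch
from allennlp.nn import util

encoder_outputs = torch.randn(1, 4, 6)
mask = torch.tensor([[1, 1, 1, 0]])  # the last timestep is padding

final = util.get_final_encoder_states(encoder_outputs, mask,
                                      bidirectional=False)

# With the mask respected, the final state comes from index 2 (the last
# real token), not index 3 (padding).
assert torch.allclose(final[0], encoder_outputs[0, 2])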

def test_encoder_decoder_can_train_save_and_load(self):
    # Re-runs the standard train/save/load check with the encoder
    # overridden to be bidirectional.
    self.ensure_model_can_train_save_and_load(
            self.param_file,
            overrides="{model: {encoder: {bidirectional: true}}}")
@matt-gardner (Contributor)

Why not just change this in experiment.json?

@brendan-ai2 (Contributor Author)

I'd prefer to test with bidirectional set to both true and false, and duplicating the config file just for that wasn't very compelling.

@matt-gardner (Contributor)

Oh, ok. It looks like you can just add this as another method on the existing class instead of making a whole new class for it — something like the sketch below.
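The class name and fixture paths here are assumptions modeled on AllenNLP's ModelTestCase conventions, not the PR's actual code:

# Sketch only: the class name and fixture paths are assumptions.
from allennlp.common.testing import ModelTestCase

class SimpleSeq2SeqTest(ModelTestCase):
    def setUp(self):
        super().setUp()
        self.set_up_model(self.FIXTURES_ROOT / "encoder_decoder" / "simple_seq2seq" / "experiment.json",
                          self.FIXTURES_ROOT / "data" / "seq2seq_copy.tsv")

    def test_model_can_train_save_and_load(self):
        # Default config: unidirectional encoder.
        self.ensure_model_can_train_save_and_load(self.param_file)

    def test_bidirectional_model_can_train_save_and_load(self):
        # The same config with the encoder flipped to bidirectional via
        # overrides, avoiding a duplicated experiment.json.
        self.ensure_model_can_train_save_and_load(
                self.param_file,
                overrides="{model: {encoder: {bidirectional: true}}}")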

@brendan-ai2 (Contributor Author)

(No worries!)

Thanks for the background on the tests. I filed #1813 so we can consider the periodic retraining.

@brendan-ai2 merged commit 546242f into allenai:master on Sep 24, 2018

brendan-ai2 added a commit that referenced this pull request on Sep 25, 2018