Conversation
Nice one, a couple of comments but basically LGTM.
allennlp/models/esim.py
Outdated
from allennlp.nn.util import get_text_field_mask, last_dim_softmax, weighted_sum, replace_masked_values
from allennlp.training.metrics import CategoricalAccuracy


class InputVariationalDropout(torch.nn.Dropout):
Could you pull this out into a module under allennlp.modules? It's generally useful (in fact, I was just about to implement the same thing for the dependency parser).
done
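For reference, a minimal sketch of such a module, assuming a 3D (batch, time, features) input; the actual implementation that landed under allennlp.modules may differ in detail:

import torch

class InputVariationalDropout(torch.nn.Dropout):
    """
    Applies the same dropout mask to every time step of a 3D tensor,
    as in variational dropout (Gal and Ghahramani, 2016).
    """
    def forward(self, input_tensor: torch.Tensor) -> torch.Tensor:
        # Sample one mask of shape (batch_size, num_features)...
        ones = input_tensor.data.new_ones(input_tensor.shape[0], input_tensor.shape[-1])
        dropout_mask = torch.nn.functional.dropout(ones, self.p, self.training)
        # ...and broadcast it over the time dimension, so the same
        # features are dropped at every time step of a sequence.
        return dropout_mask.unsqueeze(1) * input_tensor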
allennlp/predictors/esim.py
Outdated
def predict(self, sentence1: str, sentence2: str) -> JsonDict:
    """
    Predicts whether the sentence2 text is entailed by the sentence1 text.
I don't think this predictor is actually needed - it's exactly the same as the decomposable_attention predictor. You can just delete this file, I think. If you wanted to, you could rename the decomposable attention predictor to be called entailment or something more general.
Now I remember why I added it -- the decomposable attention predictor uses premise and hypothesis, but the SNLI / MultiNLI data uses sentence1 and sentence2. This means the existing predictor can't be used with the standard datasets (e.g. to write out predictions to a file for submitting to the leaderboard). Is there an easy way to override the existing tensor names?
The hacky alternative (to my hacky fix that cuts and pastes the decomposable attention predictor and changes the names) is to just modify the incoming JSON to change the keys, then modify it again to restore the original keys.
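A minimal sketch of that key-remapping idea, assuming the Predictor API of the time (where _json_to_instance turns the input JSON into an Instance) and that the decomposable attention predictor reads premise / hypothesis keys; the registered name and class name here are illustrative:

from allennlp.common.util import JsonDict
from allennlp.data import Instance
from allennlp.predictors.decomposable_attention import DecomposableAttentionPredictor
from allennlp.predictors.predictor import Predictor

@Predictor.register("snli-entailment")  # illustrative name
class SnliKeyRemappingPredictor(DecomposableAttentionPredictor):
    def _json_to_instance(self, json_dict: JsonDict) -> Instance:
        # Rename the SNLI-style keys to the ones the existing
        # predictor expects, then delegate to it.
        remapped = {"premise": json_dict["sentence1"],
                    "hypothesis": json_dict["sentence2"]}
        return super()._json_to_instance(remapped)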
@nelson-liu and I were discussing a related issue -- he wanted to use the same model with different dataset readers (which produced differently named fields). We kicked around several bad ideas but never got to any we liked (I don't think).
I'm all for removing the ESIM predictor; I'll do so now and open an issue about providing a general solution.
allennlp/models/esim.py
Outdated
Parameters
----------
input_tensor: torch.FloatTensor
Double backticks around the types here make them render nicely in the docs.
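For example, the parameter line quoted above would become:

input_tensor : ``torch.FloatTensor``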
done
if self.rnn_input_dropout:
    projected_enhanced_premise = self.rnn_input_dropout(projected_enhanced_premise)
    projected_enhanced_hypothesis = self.rnn_input_dropout(projected_enhanced_hypothesis)

v_ai = self._inference_encoder(projected_enhanced_premise, premise_mask)
Marginally more informative variable names would be better here.
These names follow the notation in the original paper, which is helpful if someone wants to align the code with the equations.
* WIP: ESIM model
* WIP: ESIM model for SNLI
* WIP: ESIM
* WIP: ESIM
* WIP: ESIM
* WIP: ESIM
* ESIM model with ELMo
* Add an ESIM predictor that works with SNLI formatted files
* Move ESIM predictor
* Clean up
* Add test for ESIM
* Add predictor for ESIM
* pylint
* pylint
* mypy
* fix the docs
* ESIM predictor
* Add comment to esim training config
* Move InputVariationalDropout
* pylint
* Fix the docs
* fix the docs
* Remove ESIM predictor
* Scrub all of ESIMPredictor
A modified version of the ESIM model used in "Deep contextualized word representations" (http://aclweb.org/anthology/N18-1202)