minor memory improvements in _joint_likelihood() of ConditionalRandomField with advanced indexing by LauraRuis · Pull Request #1686 · allenai/allennlp · GitHub
This repository was archived by the owner on Dec 16, 2022. It is now read-only.

minor memory improvements in _joint_likelihood() of ConditionalRandomField with advanced indexing #1686

Merged
merged 3 commits into allenai:master on Aug 29, 2018

Conversation

LauraRuis
Contributor

This PR makes minor memory improvements in the _joint_likelihood() function of the ConditionalRandomField class by using PyTorch's advanced indexing feature (https://github.com/pytorch/pytorch/releases/tag/v0.2.0).

It eliminates the need to expand the transitions matrix by batch size (lines 258-260) and to expand the tensor of indices to the last tags by batch size (line 289).
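As an illustrative sketch of the change described above (not the exact diff; the tensor names and shapes are assumed), advanced indexing lets the (num_tags, num_tags) transition matrix be indexed directly with the per-instance tag vectors, so it never has to be expanded to a (batch_size, num_tags, num_tags) copy and gathered from twice:

import torch

batch_size, num_tags = 4, 5
transitions = torch.randn(num_tags, num_tags)
current_tag = torch.randint(num_tags, (batch_size,))
next_tag = torch.randint(num_tags, (batch_size,))

# Old approach: copy the transition matrix per batch element and gather twice.
broadcast = transitions.view(1, num_tags, num_tags).expand(batch_size, num_tags, num_tags)
old_score = (broadcast
             .gather(1, current_tag.view(batch_size, 1, 1).expand(batch_size, 1, num_tags))
             .squeeze(1)
             .gather(1, next_tag.view(batch_size, 1))
             .squeeze(1))

# Advanced indexing: pick transitions[current_tag[b], next_tag[b]] for each b directly.
new_score = transitions[current_tag, next_tag]

assert torch.allclose(old_score, new_score)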

Contributor
@matt-gardner matt-gardner left a comment


LGTM, thanks for doing this! There are just a couple of minor issues that would be good to fix before merging.

@@ -286,10 +273,7 @@ def _joint_likelihood(self,
# Transition from last state to "stop" state. To start with, we need to find the last tag
# for each instance.
last_tag_index = mask.sum(0).long() - 1
last_tags = tags.gather(0, last_tag_index.view(1, batch_size).expand(sequence_length, batch_size))
Contributor


Can't you just switch the .expand() here to a .squeeze(0), and still remove the three lines below? .squeeze() is generally preferable to .view(-1), as it's easier to reason about (unless there's some efficiency reason to prefer .view(-1) that I don't know about...?).
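As a rough sketch of this suggestion (the shapes here are invented for illustration), gathering with a (1, batch_size) index already picks out one tag per instance, so the result only needs its leading singleton dimension squeezed away; there is no need to expand the index over the sequence dimension or to reshape with .view(-1):

import torch

sequence_length, batch_size = 7, 3
tags = torch.randint(5, (sequence_length, batch_size))
mask = torch.ones(sequence_length, batch_size)

# Index of the last unmasked timestep for each instance.
last_tag_index = mask.sum(0).long() - 1                                    # (batch_size,)

# Gather along the time dimension with a (1, batch_size) index, then drop the
# leading singleton dimension instead of expanding and calling .view(-1).
last_tags = tags.gather(0, last_tag_index.view(1, batch_size)).squeeze(0)  # (batch_size,)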

Contributor Author


Yes that makes more sense! I added your comments :)

@@ -255,26 +255,13 @@ def _joint_likelihood(self,
else:
score = 0.0

# Broadcast the transition scores to one per batch element
broadcast_transitions = self.transitions.view(1, num_tags, num_tags).expand(batch_size, num_tags, num_tags)
Contributor


You've removed the need for num_tags, so you should also drop it from the variable definition above on line 245 (i.e., batch_size, sequence_length, _ = logits.size()).
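A minimal sketch of that cleanup (the tensor shape is made up for illustration):

import torch

logits = torch.randn(4, 7, 5)  # (batch_size, sequence_length, num_tags)

# Before: num_tags was only needed to expand the transition matrix per batch element.
batch_size, sequence_length, num_tags = logits.size()

# After the advanced-indexing change num_tags is unused, so the unpacking can ignore it.
batch_size, sequence_length, _ = logits.size()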

@matt-gardner matt-gardner merged commit d1f6748 into allenai:master Aug 29, 2018
gabrielStanovsky pushed a commit to gabrielStanovsky/allennlp that referenced this pull request Sep 7, 2018
…Field with advanced indexing (allenai#1686)

* optimize memory/speed _joint_likelihood() function with new PyTorch features

* remove redundant var and change .view(-1) to .squeeze()

* Use tags.gather(...) instead of torch.gather(tags, ...)