Tweak conv-rnn model #75

daemon · 2017-10-29T03:08:53Z

Fix misplaced zero_grad()
Tweak model hyperparams and optimization algorithm

- Fix misplaced zero_grad() - Tweak model hyperparams and optimization algorithm

tuzhucheng · 2017-10-29T18:53:19Z

conv_rnn/test.py

@@ -17,19 +18,19 @@ def main():
    parser.add_argument("--gpu_number", default=0, type=int)
    args = parser.parse_args()

-    model.set_seed(5, no_cuda=args.no_cuda)
-    data_loader = data.SSTDataLoader(args.data_dir)
+    model.set_seed(3, no_cuda=args.no_cuda)


make seed configurable?

Actually the seed is useless in test.py, I'll remove it later.

tuzhucheng · 2017-10-29T18:54:32Z

conv_rnn/model.py

@@ -34,8 +34,6 @@ def __init__(self, word_model, **config):
        else:
            raise ValueError("RNN type must be one of LSTM or GRU")
        self.conv = nn.Conv2d(1, n_fmaps, (1, self.hidden_size * 2))
-        if dropout:


does this hurt performance? if we are removing it why are we adding --dropout_prob to train.py?

Paper says no dropout for SST1/2.

daemon added 3 commits October 28, 2017 23:06

Tweak conv-rnn model

6693e39

- Fix misplaced zero_grad() - Tweak model hyperparams and optimization algorithm

Fix typo

a47b506

Add new results

e77095f

tuzhucheng reviewed Oct 29, 2017

View reviewed changes

Clean up extraneous code

1466fa2

tuzhucheng approved these changes Oct 29, 2017

View reviewed changes

tuzhucheng merged commit 7957dc7 into castorini:master Oct 29, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tweak conv-rnn model #75

Tweak conv-rnn model #75

daemon commented Oct 29, 2017

tuzhucheng Oct 29, 2017

daemon Oct 29, 2017

tuzhucheng Oct 29, 2017

daemon Oct 29, 2017

Tweak conv-rnn model #75

Tweak conv-rnn model #75

Conversation

daemon commented Oct 29, 2017

tuzhucheng Oct 29, 2017

Choose a reason for hiding this comment

daemon Oct 29, 2017

Choose a reason for hiding this comment

tuzhucheng Oct 29, 2017

Choose a reason for hiding this comment

daemon Oct 29, 2017

Choose a reason for hiding this comment