torch.optim.lbfgs - added box constraint and line search methods (backtracking, goldstein, weak_wolfe); torch.tensor.prod - resolved the problem with zero element #938
Conversation
    return grad_output.new(self.input_size).fill_(0)
else:
    grad_input = grad_output.new(self.input_size).fill_(0)
    indexing_tuple = tuple(zero_loc[0].numpy())
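For context, the excerpt above special-cases zero elements in the Prod backward pass. Below is a minimal standalone sketch of that zero-aware gradient logic, assuming a full reduction; the helper name and the use of the modern tensor API are illustrative, not the PR's exact code.

```python
import torch

def prod_backward(input, grad_output):
    # Gradient of y = input.prod() w.r.t. element x_j is prod_{i != j} x_i.
    # The naive formula y / x_j divides by zero when x_j == 0, hence the special cases.
    zero_loc = (input == 0).nonzero()
    if zero_loc.numel() == 0:
        # No zeros: the product-divided-by-element shortcut is safe.
        return grad_output * input.prod() / input
    elif zero_loc.size(0) > 1:
        # Two or more zeros: every cofactor product contains a zero, so the gradient vanishes.
        return torch.zeros_like(input)
    else:
        # Exactly one zero: only that position gets a nonzero gradient,
        # equal to the product of all the other elements.
        grad_input = torch.zeros_like(input)
        idx = tuple(zero_loc[0].tolist())
        mask = torch.ones_like(input, dtype=torch.bool)
        mask[idx] = False
        grad_input[idx] = grad_output * input[mask].prod()
        return grad_input
```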
I took your prod fixes and refactored them slightly so that they reuse some intermediate results. Can you please remove this change from this PR? I'll try to get someone who knows line search methods to review your LBFGS changes soon. Thanks!
@apaszke Has anyone looked at the line search part of this MR yet?
@pytorchbot retest this please
Is there anything one can do to push this along?
@hassec we are blocked on finding competent reviewers for this stuff, who know the relevant literature and can verify the implemented code
I'll review this in the coming days.
What is the state of this PR?
Wondering what the status of this pull request is. Would be great to have constraint options.
@matthieuheitz @ianwilliamson would either of you be willing to review the PR for correctness against the papers? That's what it's blocked on at the moment.
Is anyone looking at this?
No, not at the moment. We need someone to review the PR for correctness (math), and it also needs some rebasing.
@vincentqb has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
    assert offset == self._numel()
    return deriv

def _max_alpha(self, d):
Reference for bound-constrained optimization: equation 16.71 in Numerical Optimization, 2nd Edition, by Nocedal and Wright.
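For readers without the book at hand, the role of a `_max_alpha`-style helper is to find how far one can move along the search direction before any coordinate leaves its box. A minimal sketch under that assumption; the names `x`, `d`, `l_bnd`, `u_bnd` are illustrative, not the PR's internals.

```python
import torch

def max_feasible_step(x, d, l_bnd=None, u_bnd=None):
    """Largest alpha such that x + alpha * d stays inside [l_bnd, u_bnd] in every coordinate."""
    max_alpha = torch.tensor(float("inf"))
    if u_bnd is not None:
        # Coordinates moving up (d > 0) are limited by the upper bound.
        hits_upper = ((u_bnd - x) / d)[d > 0]
        if hits_upper.numel() > 0:
            max_alpha = torch.min(max_alpha, hits_upper.min())
    if l_bnd is not None:
        # Coordinates moving down (d < 0) are limited by the lower bound.
        hits_lower = ((l_bnd - x) / d)[d < 0]
        if hits_lower.numel() > 0:
            max_alpha = torch.min(max_alpha, hits_lower.min())
    return max_alpha
```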
original_param_data_list = self._copy_param()
phi_0 = closure().data[0]
phi_0_prime = self._directional_derivative(d)
alpha_k = 1.0
Unlike `_goldstein` and `_weak_wolfe`, `_backtracking` does not invoke `_max_alpha` here.
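For comparison, a plain Armijo backtracking loop looks roughly like the sketch below; capping the initial trial step at the value `_max_alpha` computes would only change the starting `alpha`. The `phi_0` and `phi_0_prime` names mirror the excerpt above; everything else is illustrative.

```python
def backtracking_step(phi, phi_0, phi_0_prime, alpha_init=1.0, c1=1e-4, rho=0.5, max_iter=25):
    """Armijo backtracking: shrink alpha until phi(alpha) <= phi(0) + c1 * alpha * phi'(0)."""
    alpha = alpha_init  # starting from min(alpha_init, max_alpha) would also enforce the box bounds
    for _ in range(max_iter):
        if phi(alpha) <= phi_0 + c1 * alpha * phi_0_prime:
            return alpha  # sufficient decrease achieved
        alpha *= rho  # step rejected: shrink and retry
    return alpha
```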
if u_bnd is not None:
    from_u_bnd = ((u_bnd - p.data) / p_grad)[p_grad > 0]
    min_u_bnd = torch.min(from_u_bnd) if from_u_bnd.numel() > 0 else max_alpha
max_alpha = min(max_alpha, min_l_bnd, min_u_bnd)
This line takes the minimal alpha across all directions. This implies that, if the point is near the boundary, the next step is constrained to remain very close to it, and the descent could stall. I would instead expect a different step size in each direction; see equation 16.71 mentioned in the previous comment. Is there another reference for this?
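To make the concern concrete, equation 16.71 defines a separate breakpoint per coordinate, so components already sitting on a bound stop moving while the others keep descending, rather than a single global cap on the step. A rough sketch of that per-coordinate computation (illustrative names, not this PR's code):

```python
import torch

def breakpoints(x, g, l_bnd, u_bnd):
    """Per-coordinate breakpoints t_i along the steepest-descent direction d = -g,
    in the spirit of Nocedal & Wright eq. 16.71: the step at which coordinate i
    reaches its bound (inf if it never does)."""
    t = torch.full_like(x, float("inf"))
    # d_i = -g_i > 0 moves toward the upper bound, d_i < 0 toward the lower bound.
    up = g < 0
    down = g > 0
    t[up] = (x[up] - u_bnd[up]) / g[up]
    t[down] = (x[down] - l_bnd[down]) / g[down]
    return t
```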
The relevant part of scipy is the Fortran lbfgsb library's cauchy function. The gradient is scaled differently for each component, as done in Nocedal equation 16.71.
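For comparison, SciPy exposes that Fortran routine through scipy.optimize.minimize with method='L-BFGS-B', where the bounds are enforced per coordinate. A small usage example (the quadratic objective is only for illustration):

```python
import numpy as np
from scipy.optimize import minimize

# Minimize ||x - c||^2 subject to box constraints 0 <= x_i <= 1.
c = np.array([2.0, -1.0, 0.5])
fun = lambda x: np.sum((x - c) ** 2)
jac = lambda x: 2.0 * (x - c)

res = minimize(fun, x0=np.zeros(3), jac=jac, method="L-BFGS-B",
               bounds=[(0.0, 1.0)] * 3)
print(res.x)  # expected: approximately [1.0, 0.0, 0.5], i.e. the unconstrained optimum clipped to the box
```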
I will close this PR since
* pipe through index mode
* replace codegen strings
* cache index mode
* use std limit
* move definitions
* rename INDEX_TYPE
torch.optim.lbfgs - added box constraints and line search methods (backtracking, goldstein, weak_wolfe).
torch.autograd._function.reduce.Prod class - when the input contains a zero element, the gradient becomes inf, because the original implementation is done by [product of all elements] / [an element]. All these cases are taken care of separately. Still, tensor.prod() returns inf if there is only one element in a tensor. This should be fixed.
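To illustrate the zero-element case the description refers to, here is a small check written against the current PyTorch autograd API (the PR itself targeted the older Variable-based API):

```python
import torch

# With exactly one zero, the gradient at the zero position should be the
# product of the remaining elements, and zero everywhere else.
x = torch.tensor([2.0, 0.0, 3.0], requires_grad=True)
y = x.prod()
y.backward()
print(x.grad)  # tensor([0., 6., 0.]) -- the naive y / x formula would give nan/inf here
```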