Skip to content
Discussion options

You must be logged in to vote

@Huadangfan
Thanks for your interest in the difference between these two strategies. Here is some information for your reference:

For method 0, the stpe length will decrease at the (n+1)-th iteration if the objective function at the n-th iteration is greater than that at the (n-1)-th iteration. In the case, the step length is too large for linear approximation (or called first-order Tayler expansion). A decreased step length is preferred.

For method 1, the stpe length with decrease at the (n+1)-th iteration if the gradient at n-th iteration significantly differs from that at the (n-1)-th iteration. For example, the angle between two kernels is greater than the default value 120 degree, wh…

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@Huadangfan
Comment options

@JingChen-Thu
Comment options

Answer selected by xumi1993
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants