<?xml version="1.0" encoding='utf-8'?>
<!DOCTYPE wml PUBLIC "-//WAPFORUM//DTD WML 1.1//EN" "http://www.wapforum.org/DTD/wml_1.1.xml">
<wml>
<card id="card1" title="Gradient descent - Page 15 - Wikipedia">
<p>
<a accesskey="1" href="page.php?w=Gradient_descent&amp;p=14">1.Previous</a><br />
<a accesskey="3" href="page.php?w=Gradient_descent&amp;p=16">3.Next</a>
</p>
<p>the fairly weak assumption that  is continuously differentiable, we may prove that:This inequality implies that the amount by which we can be sure the function  is decreased depends on a trade off between the two terms in square brackets. The first term in square brackets measures the angle between the descent direction and the negative gradient. The second term measures how quickly the gradient changes along the descent direction.</p>

<p>In principle inequality  could be optimized over  and  to choose an optimal step size and direction. The</p><p>
<a accesskey="1" href="page.php?w=Gradient_descent&amp;p=14">1.Previous</a><br />
<a accesskey="3" href="page.php?w=Gradient_descent&amp;p=16">3.Next</a>
</p>

<do type="prev" label="Search">
        <go href="search.wml"/>
</do>

</card>
</wml>
