
Conjugate Gradient and Quasi-Newton

Optimization

Part of the book series: Springer Texts in Statistics ((STS,volume 95))


Abstract

Our discussion of Newton’s method has highlighted both its strengths and its weaknesses. Related algorithms such as scoring and Gauss-Newton exploit special features of the objective function \(f(\boldsymbol{x})\) in overcoming the defects of Newton’s method. We now consider algorithms that apply to generic functions \(f(\boldsymbol{x})\). These algorithms also operate by locally approximating \(f(\boldsymbol{x})\) by a strictly convex quadratic function. Indeed, the guiding philosophy behind many modern optimization algorithms is to see what techniques work well with quadratic functions and then to modify the best techniques to accommodate generic functions.
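The idea of first mastering quadratic objectives can be made concrete with the linear conjugate gradient method, which minimizes a strictly convex quadratic \(f(\boldsymbol{x}) = \frac{1}{2}\boldsymbol{x}^{T}A\boldsymbol{x} - \boldsymbol{b}^{T}\boldsymbol{x}\) (equivalently, solves \(A\boldsymbol{x} = \boldsymbol{b}\) for symmetric positive definite \(A\)) in at most \(n\) steps in exact arithmetic. The sketch below is a minimal illustration, not the chapter's own development; the function name and the small test matrix are my own choices.

```python
import numpy as np

def conjugate_gradient(A, b, x0=None, tol=1e-10, max_iter=None):
    """Minimize f(x) = x'Ax/2 - b'x for symmetric positive definite A,
    i.e. solve Ax = b, by the linear conjugate gradient method."""
    n = len(b)
    x = np.zeros(n) if x0 is None else x0.astype(float)
    r = b - A @ x                # residual = negative gradient of f at x
    p = r.copy()                 # first search direction is steepest descent
    rs_old = r @ r
    for _ in range(max_iter or n):
        Ap = A @ p
        alpha = rs_old / (p @ Ap)        # exact line search along p
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs_old) * p    # next direction, A-conjugate to p
        rs_old = rs_new
    return x

# Small illustrative system: A is symmetric positive definite.
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x = conjugate_gradient(A, b)    # converges in at most 2 iterations here
```

Because each step performs only matrix-vector products, the same scheme scales to large sparse systems, which is one reason the nonlinear extensions discussed in this chapter inherit its appeal.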




Copyright information

© 2013 Springer Science+Business Media New York


Cite this chapter

Lange, K. (2013). Conjugate Gradient and Quasi-Newton. In: Optimization. Springer Texts in Statistics, vol 95. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-5838-8_11
