Required or Recommended Resources |
• Sra, S., Nowozin, S., & Wright, S. J. (Eds.). (2012). Optimization for machine learning. MIT Press. • Schmidt, M., Le Roux, N., & Bach, F. (2017). Minimizing finite sums with the stochastic average gradient. Mathematical Programming, 162, 83-112. • Newton, D., Yousefian, F., & Pasupathy, R. (2018). Stochastic gradient descent: Recent trends. In INFORMS TutORials in Operations Research (pp. 193-220). Published online 19 Oct 2018. • Bottou, L., Curtis, F. E., & Nocedal, J. (2018). Optimization methods for large-scale machine learning. SIAM Review, 60(2), 223-311. • Netrapalli, P. (2019). Stochastic gradient descent and its variants in machine learning. Journal of the Indian Institute of Science, 99(2), 201-213. |