Convergence and efficiency of subgradient methods for quasiconvex minimization
Krzysztof C. Kiwiel
Momentum and Learning Rate Adaptation
Documentaries, videos and podcasts
- Machine learningA field of computer science enabling computers to learn.
- Deep learningBranch of machine learning based on learning data representations.
- Stochastic gradient descent (SGD)Gradient-based optimization algorithm used in machine learning and deep learning for training artificial neural networks.
- Vanishing gradient problemThe vanishing gradient problem can occur when training neural networks using gradient descent with backpropagation. When the derivative of the activation function tends to be very close to zero, the gradient used to updated the weights of the network may be too small for effective learning.