Gradient-Descent
Loss Functions: What a Model Is Really Optimizing
· ☕ 9 min read · âœī¸ k4i
A practical guide to loss functions: when to use MSE, MAE, Huber, binary cross entropy, cross entropy, KL divergence, hinge loss, contrastive loss, and triplet loss.
Loss Functions: What a Model Is Really Optimizing
Batch vs Stochastic Gradient Descent
· ☕ 4 min read · âœī¸ k4i
understand batch gradient descent, stochastic gradient descent, and mini-batch gradient descent.
Batch vs Stochastic Gradient Descent