
I have a few basic questions about tracking losses during training.

  1. If I am using mini-batch training, should I validate after each batch update or after I have seen the entire dataset?
  2. What should be the condition to stop the training to prevent overfitting? Do you save the model at that point?
  3. When I use mini-batch training, the losses fluctuate a lot depending on the random choice of training data, and sometimes the validation loss is lower than the training loss. Is this normal? I suspect my confusion here may be resolved by the answer to question 1.
David Masip
pg2455

1 Answer

  1. Both approaches can be done. I recommend validating after every batch when you are just playing around with your gradient method, to see whether the validation accuracy goes up or down and to figure out how everything is going. Once training is stable, validating once per epoch is usually enough and much cheaper.

  2. In this setting you can adopt early stopping, although there are many other ways to prevent overfitting. Early stopping checks the validation accuracy at the end of each epoch and saves the model if it is the best so far. If the validation accuracy does not improve within the next $n$ epochs (where $n$, often called the patience, is a parameter you choose), you stop the gradient method and keep the last model you saved.

  3. Yes, this is normal. Mini-batch losses fluctuate because each batch is a small random sample of the data, and validation loss can sometimes be lower than training loss, for instance when regularization such as dropout is active during training but switched off at validation time. In that case you can safely say that you are not overfitting.
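The early-stopping logic in point 2 can be sketched as follows. This is a minimal illustration, not a full training loop: the model placeholder, the hard-coded accuracy sequence, and the `patience` parameter are all made-up stand-ins for whatever your real loop computes.

```python
import copy

def train_with_early_stopping(model, val_accuracies, patience=3):
    """Stop when validation accuracy has not improved for `patience` epochs.

    `model` is any snapshot-able object; `val_accuracies` stands in for the
    per-epoch validation accuracy your real loop would compute.
    Returns (best saved model, best accuracy, number of epochs run).
    """
    best_acc = float("-inf")
    best_model = copy.deepcopy(model)
    epochs_without_improvement = 0

    for epoch, acc in enumerate(val_accuracies):
        # ... one epoch of mini-batch gradient updates would happen here ...
        if acc > best_acc:
            best_acc = acc                        # new best: save a snapshot
            best_model = copy.deepcopy(model)
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break                             # keep the saved model, stop

    return best_model, best_acc, epoch + 1

# Toy run: accuracy peaks at 0.85 in epoch 3, never improves again,
# so with patience 3 training stops after epoch 6.
model = {"weights": None}  # placeholder for a real model
_, best, stopped_at = train_with_early_stopping(
    model, [0.70, 0.80, 0.85, 0.84, 0.85, 0.83], patience=3)
print(best, stopped_at)  # 0.85 6
```

In a real loop you would replace the accuracy list with an actual validation pass and save the snapshot to disk instead of deep-copying in memory.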

David Masip