Questions tagged [loss]
37 questions
4
votes
0 answers
How Does the Reward Model in ChatGPT Calculate Losses?
Reading the InstructGPT paper(which seems to be what ChatGPT was built off of), I found this equation for the reward function.
However, I'm struggling to understand how this equation is used to translate rankings between different response to a…
itisyeetimetoday
- 141
- 1
2
votes
1 answer
HuggingFace Transformers is giving loss: nan - accuracy: 0.0000e+00
I am a HuggingFace Newbie and I am fine-tuning a BERT model (distilbert-base-cased) using the Transformers library but the training loss is not going down, instead I am getting loss: nan - accuracy: 0.0000e+00.
My code is largely per the boiler…
JasonExcel
- 21
- 2
2
votes
0 answers
How to calculate MAE and threshold in a multivariate time series
I'm trying to understand how to calculate the MAE in my time series and then the thresholds to understand which of my data in the test set are anomalies. I'm following this tutorial, which is based on a univariate time series, and they calculate it…
Fabio
- 53
- 9
2
votes
3 answers
Custom loss function for regression
I am trying to write a custom loss function for a machine learning regression task. What I want to accomplish is following:
Reward higher preds, higher targets
Punish higher preds, lower targets
Ignore lower preds, lower targets
Ignore lower preds,…
Filip
- 21
- 2
1
vote
0 answers
How to interpret high loss value from model.evaluate() on test data
I'm collecting some metrics for my model's performance using:
# Evaluate the model on the test data using `evaluate`
print("Evaluate on test data")
results = model.evaluate(test_images)
print("test loss, test acc:", results)
I have an very high…
TomSelleck
- 115
- 5
1
vote
1 answer
Accuracy graph of binary classification by CNN
Why in binary classification of images with CNN the loss and accuracy graph are so unstable? I mean accuracy of validation test does not increase smoothly, it goes to 80%, then comes to 60%, then again goes to 84% and so on. Same is the case with…
Nagh
- 129
- 5
1
vote
1 answer
Should Discriminator Loss increase or decrease?
This question is purely based on the theoretical aspect of GANs.
So, when training a GAN how should the discriminator loss look like?
Should the loss of discriminator increase (as the generator is successfully fooled discriminator)
Or should the…
Sai Sreenivas
- 188
- 1
- 8
1
vote
3 answers
Regression problem with Deep Learning
I'm working on the Housing Price dataset, where the target is to predict the housing price.
The price of the house will always be positive and according to me, it's possible that the model can predict a negative outcome for some of the samples.
If…
vipin bansal
- 1,252
- 9
- 17
1
vote
0 answers
Calculationg perplexity (in natural language processing) manually
I am trying to understand Perplexity within Natural Language Processing as a metric more fully. And I am doing so by creating manual examples to understand all the component parts. Is the following correctly understood:
Given a lists W of words (as…
Piskator
- 125
- 7
1
vote
1 answer
What does that mean if the loss looks like this?
I have a problem. I have trained a model. And as you can see, there is a zigzag in the loss. In addition, the validation loss is increasing.
What does this mean if you only look at the training curve? Is there an overfitting?
And the model does not…
Test
- 89
- 7
1
vote
1 answer
The proper loss function for regression that prediction values do not lie on one side of the real values
I'm doing a prediction task using machine learning. First I'm doing a regression task, then I use the values to predict its class.
I used MSE as loss function. However, my prediction values are generally smaller than real values. It will produce the…
user900476
- 165
- 6
1
vote
0 answers
Should the model be defined again before training it to new data?
I wanted to fit the LSTM model on new data set in a loop so I have implemented it like this
#................................define model...........................
model =Sequential()
model.add(LSTM(100, activation='relu',…
Stupid_Intern
- 157
- 1
- 8
1
vote
0 answers
The case of (1,478) dim and parameters of neural network to find out
colleagues, actually I am kind'a new to NN, but hard trying..
I have data:
Index: 40073 entries (excluded from training, UID)
Columns: 484 entries
dtypes: bool(468), float64(2), int64(13), object(1)
I used only 478 arguments. The Y is moneySpend…
Gleb Karpushkin
- 11
- 2
0
votes
0 answers
Is my CNN model overfitting or underfitting?
I would like to be sure of whether the model is overfitting or undercutting. Being new to this, is there any specific point to identify when to stop the training process. Any help in this regard would be helpful. Thanks.
Anna
- 1
0
votes
0 answers
Q values loss per episode and mean absolute error
I am new to deep reinforcement learning! I am following this code for my adaptation problem (doing actions)
https://github.com/jaromiru/AI-blog/blob/master/CartPole-DQN.py
I am wondering how I can evaluate the training, I already got the average…