Questions tagged [hyperparameter]

Hyperparameters of a model are parameters that cannot be learned directly during training but must be set beforehand. Hyperparameters can define, for example, the complexity of the model or its capacity to learn.

In contrast to regular parameters, which are learned during the training process, hyperparameters (such as the learning rate, the number of layers, or a regularization strength) are fixed before training begins.
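The idea behind tuning can be sketched in a few lines of pure Python (the validation loss below is a hypothetical toy function, not any particular library's API): search candidate settings and keep the one that scores best on held-out data.

```python
from itertools import product

# Toy "validation loss" with two hyperparameters (hypothetical function;
# by construction its optimum is lr=0.1, depth=3).
def validation_loss(lr, depth):
    return (lr - 0.1) ** 2 + (depth - 3) ** 2

# Candidate settings, fixed before any training happens.
grid = {"lr": [0.01, 0.1, 1.0], "depth": [1, 3, 5]}

# Exhaustive grid search: evaluate every combination, keep the best.
best = min(product(grid["lr"], grid["depth"]),
           key=lambda cfg: validation_loss(*cfg))
print(best)  # (0.1, 3)
```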

148 questions
114 votes, 10 answers

Choosing a learning rate

I'm currently working on implementing Stochastic Gradient Descent (SGD) for neural nets using back-propagation, and while I understand its purpose, I have some questions about how to choose values for the learning rate. Is the learning rate related…
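A minimal sketch of why this choice matters, using plain gradient descent on f(x) = x² (toy values, not a recipe for real networks): a small rate converges, a too-large one diverges.

```python
# Gradient descent on f(x) = x^2 (gradient 2x), starting from x0 = 1.0.
# Each step multiplies x by (1 - 2*lr), so |1 - 2*lr| < 1 is needed to converge.
def descend(lr, steps=50, x=1.0):
    for _ in range(steps):
        x -= lr * 2 * x
    return x

small = descend(0.1)  # factor 0.8 per step: shrinks toward the minimum at 0
large = descend(1.1)  # factor -1.2 per step: oscillates and blows up
print(abs(small), abs(large))
```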
49 votes, 7 answers

What is the difference between model hyperparameters and model parameters?

I have noticed that such terms as model hyperparameter and model parameter have been used interchangeably on the web without prior clarification. I think this is incorrect and needs explanation. Consider a machine learning model, an SVM/NN/NB based…
minerals • 2,137 • 3 • 17 • 19
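A toy sketch of the distinction (pure Python, made-up data following y = 2x): the weight w is a parameter learned from data, while the learning rate and epoch count are hyperparameters fixed beforehand.

```python
# Data generated from y = 2x, so the learned weight should approach 2.
data = [(x, 2 * x) for x in [1.0, 2.0, 3.0]]

lr, epochs = 0.05, 200  # hyperparameters: chosen before training, never learned
w = 0.0                 # parameter: learned from the data during training

for _ in range(epochs):
    for x, y in data:
        w -= lr * 2 * (w * x - y) * x  # gradient of the squared error (w*x - y)^2

print(round(w, 3))  # 2.0
```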
40 votes, 6 answers

How to set the number of neurons and layers in neural networks

I am a beginner to neural networks and have had trouble grasping two concepts: How does one decide the number of middle layers a given neural network should have? 1 vs. 10, or whatever. How does one decide the number of neurons in each middle layer? Is it…
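One small, concrete consequence of these choices is the network's parameter count; a pure-Python sketch for a fully connected network (the layer sizes below are illustrative):

```python
# Each fully connected layer from n_in to n_out neurons contributes
# n_in * n_out weights plus n_out biases.
def mlp_param_count(layer_sizes):
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

print(mlp_param_count([4, 8, 3]))  # 4*8 + 8  +  8*3 + 3  = 67
```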
21 votes, 4 answers

Hyperparameter search for LSTM-RNN using Keras (Python)

From Keras RNN Tutorial: "RNNs are tricky. Choice of batch size is important, choice of loss and optimizer is critical, etc. Some configurations won't converge." So this is more a general question about tuning the hyperparameters of an LSTM-RNN on…
wacax • 3,370 • 4 • 22 • 45
11 votes, 2 answers

What is the most efficient method for hyperparameter optimization in scikit-learn?

An overview of the hyperparameter optimization process in scikit-learn is here. Exhaustive grid search will find the optimal set of hyperparameters for a model. The downside is that exhaustive grid search is slow. Random search is faster than grid…
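The idea behind random search can be sketched in pure Python (hypothetical toy loss; scikit-learn's RandomizedSearchCV wraps roughly this loop together with cross-validation): sample a fixed budget of random configurations instead of enumerating a grid.

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical validation loss with optimum near lr=0.1, depth=3.
def validation_loss(lr, depth):
    return (lr - 0.1) ** 2 + (depth - 3) ** 2

best_cfg, best_loss = None, float("inf")
for _ in range(50):  # the search budget (n_iter in RandomizedSearchCV)
    # Sample lr log-uniformly in [1e-3, 1] and depth uniformly in 1..10.
    cfg = (10 ** random.uniform(-3, 0), random.randint(1, 10))
    loss = validation_loss(*cfg)
    if loss < best_loss:
        best_cfg, best_loss = cfg, loss

print(best_cfg, best_loss)
```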
10 votes, 2 answers

How do scientists come up with the correct Hidden Markov Model parameters and topology to use?

I understand how a Hidden Markov Model is used in genomic sequences, such as finding a gene. But I don't understand how to come up with a particular Markov model. I mean, how many states should the model have? How many possible transitions? Should…
ABCD • 3,510 • 2 • 18 • 30
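A minimal sketch of evaluating one candidate topology: a hypothetical two-state HMM (all probabilities below are invented for illustration) scored on a sequence with the forward algorithm.

```python
# Toy gene-finding HMM: two states, invented transition/emission tables.
states = ["gene", "intergenic"]
start = {"gene": 0.5, "intergenic": 0.5}
trans = {"gene": {"gene": 0.9, "intergenic": 0.1},
         "intergenic": {"gene": 0.2, "intergenic": 0.8}}
emit = {"gene": {"A": 0.1, "C": 0.4, "G": 0.4, "T": 0.1},
        "intergenic": {"A": 0.3, "C": 0.2, "G": 0.2, "T": 0.3}}

# Forward algorithm: total probability of the observed sequence,
# summed over all hidden state paths.
def forward(seq):
    alpha = {s: start[s] * emit[s][seq[0]] for s in states}
    for obs in seq[1:]:
        alpha = {s: emit[s][obs] * sum(alpha[p] * trans[p][s] for p in states)
                 for s in states}
    return sum(alpha.values())

p = forward("GCGC")
print(p)
```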
10 votes, 4 answers

Which comes first? Tuning the parameters or selecting the model

I've been reading about how we split our data into 3 parts; generally, we use the validation set to help us tune the parameters and the test set to get an unbiased estimate of how well our model performs, so that we can compare models based on…
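The usual order can be sketched with toy, hypothetical validation scores: tune each model family on the validation set first, then compare the tuned families; the test set is touched only once, at the very end.

```python
# Hypothetical validation scores for (model family, hyperparameter) pairs.
val_scores = {
    ("svm", "C=0.1"): 0.81, ("svm", "C=1.0"): 0.86,
    ("tree", "depth=3"): 0.84, ("tree", "depth=9"): 0.79,
}

# Step 1: per family, keep the best hyperparameter setting (validation set).
best_per_family = {}
for (family, hp), score in val_scores.items():
    if score > best_per_family.get(family, ("", -1.0))[1]:
        best_per_family[family] = (hp, score)

# Step 2: compare the tuned families and pick the winner;
# only the winner would then be scored once on the held-out test set.
winner = max(best_per_family.items(), key=lambda kv: kv[1][1])
print(winner)  # ('svm', ('C=1.0', 0.86))
```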
8 votes, 2 answers

XGBoost and Random Forest: ntrees vs. number of boosting rounds vs. n_estimators

So I understand the main difference between Random Forests and GB Methods. Random Forests grow parallel trees and GB Methods grow one tree for each iteration. However, I am confused on the vocab used with scikit's RF regressor and xgboost's…
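A toy sketch of why these names count the same thing: boosting is a sequential loop, and n_estimators / num_boost_round / ntrees is simply the number of iterations. The "weak learner" here is just the residual mean, for brevity; real libraries fit a tree per round.

```python
# Toy gradient boosting for a squared-error objective.
y = [1.0, 2.0, 3.0, 10.0]
pred = [0.0] * len(y)

learning_rate, n_estimators = 0.5, 25  # n_estimators == number of boosting rounds
for _ in range(n_estimators):
    residuals = [yi - pi for yi, pi in zip(y, pred)]
    step = sum(residuals) / len(residuals)      # stand-in for a fitted tree
    pred = [pi + learning_rate * step for pi in pred]

mse = sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / len(y)
print(round(mse, 4))  # predictions converge to the mean of y, so mse -> 12.5
```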
8 votes, 1 answer

Is it OK to try to find the best PCA k parameter as we do with other hyperparameters?

Principal Component Analysis (PCA) is used to reduce n-dimensional data to k-dimensional data to speed things up in machine learning. After PCA is applied, one can check how much of the variance of the original dataset remains in the resulting…
J. Doe • 81 • 1 • 2
8 votes, 1 answer

How can you decide the window size of a pooling layer?

In a convolutional neural network, one or more pooling layers are used. As far as I know, many tutorials instruct you to set the window size to either 2 or 3. For example, in this tutorial: Pooling Layers After some ReLU layers, programmers…
Blaszard • 901 • 1 • 13 • 29
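For reference, the common 2×2 window with stride 2 works like this (pure-Python sketch on a small feature map):

```python
# 2x2 max pooling with stride 2: each output cell is the maximum of a
# non-overlapping 2x2 block of the input feature map.
def max_pool_2x2(fm):
    return [[max(fm[i][j], fm[i][j + 1], fm[i + 1][j], fm[i + 1][j + 1])
             for j in range(0, len(fm[0]), 2)]
            for i in range(0, len(fm), 2)]

feature_map = [
    [1, 3, 2, 1],
    [4, 6, 5, 0],
    [7, 2, 8, 3],
    [1, 0, 4, 9],
]
print(max_pool_2x2(feature_map))  # [[6, 5], [7, 9]]
```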
7 votes, 3 answers

Regression model with variable number of parameters in dataset?

I work in physics. We have lots of experimental runs, with each run yielding a result, y, and some parameters that should predict the result, x. Over time, we have found more and more parameters to record. So our data looks like the following: Year 1…
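One common workaround (a sketch, not the only option) is to treat parameters that were not yet recorded as missing and impute them, e.g. with each column's mean over the runs that do have it:

```python
# Runs from different years record different parameter sets (toy data).
runs = [
    {"x1": 1.0, "x2": 2.0},             # year 1: two parameters recorded
    {"x1": 3.0, "x2": 4.0, "x3": 5.0},  # year 2: a third parameter added
    {"x1": 5.0, "x2": 6.0, "x3": 7.0},
]

# Union of all parameter names, then the per-column mean over runs that have it.
features = sorted({k for run in runs for k in run})
means = {f: sum(r[f] for r in runs if f in r) / sum(f in r for r in runs)
         for f in features}

# Fixed-width feature matrix: missing entries filled with the column mean.
X = [[run.get(f, means[f]) for f in features] for run in runs]
print(X)  # the year-1 row gets x3 imputed with mean(5.0, 7.0) = 6.0
```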
7 votes, 1 answer

Overfitting for minority class after SMOTE w/ random forests

I used SMOTE to build a predictive model, with class 1 having 1,800 samples and class 0 having 35,000+ samples. Hence, as per SMOTE, synthetic samples were created and the random forest was trained. However, I am now getting most results as class 1 when I…
TdBm • 423 • 1 • 5 • 15
6 votes, 1 answer

Why does BERT classification do worse with longer sequence lengths?

I've been experimenting with transformer networks like BERT for some simple classification tasks. My tasks are binary classification, the datasets are relatively balanced, and the corpora are abstracts from PubMed. The median number of tokens from…
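A sketch of what a fixed maximum sequence length does to short abstracts (token handling simplified; the "[PAD]" token name follows BERT's convention): a large maximum mostly adds padding, a small one truncates.

```python
# Every input is forced to exactly max_len tokens: pad if short, cut if long.
def pad_or_truncate(tokens, max_len, pad="[PAD]"):
    return (tokens + [pad] * (max_len - len(tokens)))[:max_len]

abstract = ["the", "median", "abstract", "is", "short"]
print(pad_or_truncate(abstract, 8))  # 3 trailing [PAD] tokens
print(pad_or_truncate(abstract, 3))  # truncated to the first 3 tokens
```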
6 votes, 1 answer

Neural Network Golf: smallest network for a certain level of performance

I am interested in any data, publications, etc about what is the smallest neural network that can achieve a certain level of classification performance. By small I mean few parameters, not few arithmetic operations (=fast). I am interested…
Alex I • 3,142 • 1 • 21 • 27
6 votes, 3 answers

Which parameters are hyperparameters in a linear regression?

Can the number of features used in a linear regression be regarded as a hyperparameter? Perhaps the choice of features?