1

In order to test an algorithm, I am looking for a reference data set for ridge regression in research papers. Kind of like the equivalent of MNIST but for regression.

Peter
  • 7,277
  • 5
  • 18
  • 47
Marie
  • 23
  • 3

1 Answers1

1

Try using the Boston house data set.

import pandas as pd
from sklearn.datasets import load_boston

boston = load_boston()
boston_df = pd.DataFrame(boston.data, columns = boston.feature_names)
boston_df.insert(0, 'Price', boston.target)

Then you can use Ridge Regression to predict the housing prices from the other features in the data set.

Source: https://towardsdatascience.com/ridge-and-lasso-regression-a-complete-guide-with-python-scikit-learn-e20e34bcbf0b

Derek O
  • 354
  • 1
  • 10
  • Thanks you very much but there is not enough data in this dataset, I was thinking more of a number of data in the same range as mnist – Marie Apr 03 '20 at 22:06
  • You might try looking through some of the datasets here: https://vincentarelbundock.github.io/Rdatasets/datasets.html – Derek O Apr 04 '20 at 01:19