2

I have a dataset of entities which each belong to a particular group (i.e. entity=schools and group=school district). I also have lots of auxiliary variables on each entity.

However, for my response variable I only have information at the group level (i.e. response variable only at the district level, but regressor variables at the school level).

Can someone recommend an algorithm or class of algorithms that might be appropriate here? I'd rather not aggregate all the auxiliary information up to the group level and throw away all info contained in the correlations among the auxiliary variables.

Ethan
  • 1,625
  • 8
  • 23
  • 39

1 Answers1

0

The two most common options:

  • Unsupervised machine learning at the individual school level.

  • Supervised machine learning with the target being school district level.

Brian Spiering
  • 20,142
  • 2
  • 25
  • 102