I am trying to create a pollution prediction LSTM. I've seen an example on the web to cater for a Multivariate LSTM to predict the pollution levels for one city (Beijing), but what about more than one city? I don't really want a separate network for every city, I'd like a single generalised model/network for all x cities. But how do I feed that data into the LSTM?
Say I have the same data for each city, do I...
1) Train on all data for one city, then the next city, and so on until all cities are done.
2) Train data for all cities on date t, then data for all cities on t+1, then t+2 etc.
3) Something completely different.
Any thoughts?