8

I am looking for a publicly available, easy to download, small data set of ensemble forecasts (or reforecasts), with corresponding observations or reanalysis. We are planning to illustrate some methods for a small software package, so ideally, the data set should be available for the foreseeable future, or even better, freely distributable. Further, ideally, the data set would be given as SV or text file (anything that can be read easily without requiring additional software packages, as it would be the case with GRIB for example). Any suggestions?

By easy to download I mean: Ideally, without creating any login, for example directly possible within Python or R; small in the sense of a couple of months and data for only one location are enough; there should be some verifying observation or re-analysis associated with the forecasts

hichris123
  • 1,920
  • 2
  • 23
  • 38
user3825755
  • 223
  • 1
  • 4
  • 2
    Welcome to Earth Science SE! Easy to download means 'without creating a login' or 'download by a web interface'? Please define 'small'. Should it be a global or a regional data set or should it be data at one specific location (where also measurements are available)? – daniel.heydebreck Jul 06 '16 at 07:59
  • @daniel.neumann I have added some details to the question – user3825755 Jul 06 '16 at 08:04
  • I've never seen "small data set" and "ensemble forecast" used in the same sentence before. I would suggest http://www.atmos.washington.edu/~ens/uwme.cgi but their verification page is still under construction. – f.thorpe Jul 06 '16 at 21:20
  • @farrenthorpe I have also never used these terms in one sentence before, the reason here is that we are only looking for a small illustrative example for some software package. Thanks for the link to the UWME data, however, I do not see any option to download any data there – user3825755 Jul 07 '16 at 07:30
  • 3
    NCEP's NOMADS server hosts the GFS ensemble forecasts output, you can get them here: http://nomads.ncdc.noaa.gov/data.php?name=access#EnsembProb. The data is in grib format, that is something you will have to deal with. Model output is almost never distributed as plain text for practical reasons. Observations from NOAA land stations and NDBC buoys are also publicly available. Google is your friend. – milancurcic Jul 07 '16 at 16:07
  • I have never heard about SV. What is it? I would expect GRIB or maybe netcdf as format... – BHF Jul 07 '16 at 20:35

3 Answers3

2

You can try if this looks simple enough. Still requires registration, but it is free for reasonable use: Global Ensemble Forecast System (GEFS).

Ken Graham
  • 277
  • 1
  • 3
  • 10
kakk11
  • 338
  • 2
  • 7
2

For the historic forecasts check out the Global Ensemble Forecast System (GEFS) from NOAA. It has historic forecasts going back to 2015. For validation try Daymet (North American only) or PRISM (continental USA only) for gridded temperature and precip from historic observations.

The GEFS data is in GRIB format. But thats just something you can't get around. The European climate center (ECMWF) have their own suite of commands in a program called grib_api, which includes a handy GRIB to netCDF converter if that's a format your more familiar with.

All of these are free to download without signing up for anything. PRISM has it's own R package prism.

The GEFS data was previously available through the R package rnoaa, but this functionality has been dropped. You can try the Rnoaa4cast and the noaaGEFSpoint packages package.

David LeBauer
  • 998
  • 7
  • 22
Shawn
  • 121
  • 4
0

Synthesise your own dataset.

Get some real data, and take some averages and add some noise (or some other simple munging), such that it's not possible to get back to the original raw data.

Then you've got a realistic data set that you have all the rights on, that's exactly the size that you want, in exactly the format you want, and you can distribute it however you see fit.

410 gone
  • 4,080
  • 18
  • 40