(This article was first published on Maxwell B. Joseph, and kindly contributed to R-bloggers)
The Maungawhau volcano dataset is an R classic, often used to illustrate 3D plotting.
Being on a Gaussian process kick lately, it seemed fun to try to interpolate the volcano elevation data using a subset of the full dataset as training data.
Even with only 1% of the data, a Gaussian process model with a squared exponential covariance function does a decent job of estimating the true elevation surface (code here):
The upper row of plots shows the true elevation surface, the estimated surface based on 1% of the data (53 of the 5307 cells), and the squared error of the estimate.
The lower plots show the same data in heatmap form, with the location of sampled points shown as crosses.
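For readers who want to experiment without following the link, here is a minimal base R sketch of the same idea. It is not the code behind the plots above: the hyperparameters (`sigma_f`, `ell`, `sigma_n`) are hand-picked assumptions rather than estimated values, and the helper `sq_exp` is a hypothetical name introduced only for this illustration. It computes the Gaussian process posterior mean in closed form from a random 1% sample of cells.

```r
# Illustrative sketch: hyperparameters are assumed, not fitted.
data(volcano)                        # 87 x 61 elevation matrix from the datasets package
set.seed(1)

# One row per grid cell, with coordinates rescaled to roughly [0, 1]
grid <- expand.grid(row = seq_len(nrow(volcano)), col = seq_len(ncol(volcano)))
X <- cbind(grid$row / nrow(volcano), grid$col / ncol(volcano))
y <- as.vector(volcano)              # column-major, same ordering as expand.grid

# Use about 1% of the cells as training data
n_train <- round(0.01 * length(y))
train <- sample(length(y), n_train)

# Squared exponential covariance between the rows of A and the rows of B
sq_exp <- function(A, B, sigma_f, ell) {
  d2 <- outer(rowSums(A^2), rowSums(B^2), "+") - 2 * A %*% t(B)
  sigma_f^2 * exp(-pmax(d2, 0) / (2 * ell^2))
}

sigma_f <- sd(y[train])              # assumed signal standard deviation
ell     <- 0.2                       # assumed length scale (in rescaled units)
sigma_n <- 1                         # assumed noise standard deviation (metres)

# Posterior mean at every cell, using a constant prior mean
mu     <- mean(y[train])
K      <- sq_exp(X[train, ], X[train, ], sigma_f, ell) + diag(sigma_n^2, n_train)
K_star <- sq_exp(X, X[train, ], sigma_f, ell)
pred   <- mu + K_star %*% solve(K, y[train] - mu)

est    <- matrix(pred, nrow = nrow(volcano))   # estimated surface
sq_err <- (est - volcano)^2                    # squared error per cell
```

Calling `persp(est)` or `image(sq_err)` gives rough analogues of the surface and heatmap panels. How closely the estimate tracks the true surface depends heavily on the assumed length scale, so it is worth trying a few values of `ell`.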