The forgotten semantics of regression modelling in Geography
This article is concerned with the semantics associated with the statistical analysis of spatial data. It takes the simplest case of the prediction of variable y as a function of covariate(s) x, in which predicted y is always an approximation of y and only ever a function of x, thus, inheriting many of the spatial characteristics of x, and illustrates several core issues using “synthetic” remote sensing and “real” soils case studies. The outputs of regression models and, therefore, the meaning of predicted y, are shown to vary due to (1) choices about data: the specification of x (which covariates to include), the support of x (measurement scales and granularity), the measurement of x and the error of x, and (2) choices about the model including its functional form and the method of model identification. Some of these issues are more widely recognized than others. Thus, the study provides definition to the multiple ways in which regression prediction and inference are affected by data and model choices. The article invites researchers to pause and consider the semantic meaning of predicted y, which is often nothing more than a scaled version of covariate(s) x, and argues that it is naive to ignore this.
| Item Type | Article |
|---|---|
| Open Access | Green |
| Additional information | This research was supported by the China-UK bilateral collaborative research on critical zone science (the Natural Environment Research Council Newton Fund NE/N007433/1, the National Natural Science Foundation of China NO. 41571130083), and the National Key Research and Development Program of China (No. 2016YFC0501601). All of the data preparation, analyses and mappings were undertaken in R 3.5.1, the open source software. The code and data used in this analysis are available from https://github.com/lexcomber/SematicsOfStats. The DOI for the data and code are at https://zenodo.org/badge/latestdoi/123798087. |
| Project | Modelling and managing critical zone relationships between soil, water and ecosystem processes across the Loess Plateau |
| Date Deposited | 05 Dec 2025 10:06 |
| Last Modified | 19 Dec 2025 14:45 |
Explore Further
- 10.1111/gean.12199 (DOI)


