Distance metric choice can both reduce and induce collinearity in geographically weighted regression
This paper explores the impact of different distance metrics on collinearity in local regression models such as geographically weighted regression. Using a case study of house price data collected in Hà Nội, Vietnam, and by fully varying both power and rotation parameters to create different Minkowski distances, the analysis shows that local collinearity can be both negatively and positively affected by distance metric choice. The Minkowski distance that maximised collinearity in a geographically weighted regression was approximate to a Manhattan distance with (power = 0.70) with a rotation of 30°, and that which minimised collinearity was parameterised with power = 0.05 and a rotation of 70°. The results indicate that distance metric choice can provide a useful extra tuning component to address local collinearity issues in spatially varying coefficient modelling and that understanding the interaction of distance metric and collinearity can provide insight into the nature and structure of the data relationships. The discussion considers first, the exploration and selection of different distance metrics to minimise collinearity as an alternative to localised ridge regression, lasso and elastic net approaches. Second, it discusses the how distance metric choice could extend the methods that additionally optimise local model fit (lasso and elastic net) by selecting a distance metric that further helped minimise local collinearity. Third, it identifies the need to investigate the relationship between kernel bandwidth, distance metrics and collinearity as an area of further work.
| Item Type | Article |
|---|---|
| Open Access | Green |
| Additional information | This work arose from a Newton Mobility Fund awarded to the National University of Civil Engineering, Vietnam and the University of Leeds, UK, ‘Developing a housing model based 22 on the status-quality trade off theory’, award no. NG150097. This research was also supported by the Natural Environment Research Council Newton Fund grant (NE/N007433/1) and a UK Biotechnology and Biological Sciences Research Council grant (BB/J004308/1). All of the analyses and mapping were undertaken in R 3.3.2 the open source statistical software. The GWR analyses used the GWmodel package, v2.0-1 (Gollini et al, 2015) and the data and codeused to develop this analysis are available at https://github.com/lexcomber/GW-dists. |
| Keywords | Geographically weighted regression - GWR, Distance metrics, Model fit, Collinearity, GWR |
| Project | Newton Fellowship, Modelling and managing critical zone relationships between soil, water and ecosystem processes across the Loess Plateau, The North Wyke Farm Platform [2012-2017] |
| Date Deposited | 05 Dec 2025 09:11 |
| Last Modified | 19 Dec 2025 14:10 |


