|
Titel |
On the verification of climate reconstructions |
VerfasserIn |
G. Bürger |
Medientyp |
Artikel
|
Sprache |
Englisch
|
ISSN |
1814-9324
|
Digitales Dokument |
URL |
Erschienen |
In: Climate of the Past ; 3, no. 3 ; Nr. 3, no. 3 (2007-07-11), S.397-409 |
Datensatznummer |
250001069
|
Publikation (Nr.) |
copernicus.org/cp-3-397-2007.pdf |
|
|
|
Zusammenfassung |
The skill of proxy-based reconstructions of Northern hemisphere
temperature is reassessed. Using an almost complete set of proxy and
instrumental data of the past 130 years a multi-crossvalidation is
conducted of a number of statistical methods, producing a distribution
of verification skill scores. Among the methods are multiple
regression, multiple inverse regression, total least squares, RegEM,
all considered with and without variance matching. For all of them the
scores show considerable variation, but previous estimates, such as a
50% reduction of error (RE), appear as outliers and more realistic
estimates vary about 25%. It is shown that the overestimation of
skill is possible in the presence of strong persistence (trends). In
that case, the classical "early" or "late" calibration sets are
not representative for the intended (instrumental, millennial)
domain. As a consequence, RE scores are generally inflated, and the
proxy predictions are easily outperformed by stochastic, a priori
skill-less predictions.
To obtain robust significance levels the multi-crossvalidation is
repeated using stochastic predictors. Comparing the score
distributions it turns out that the proxies perform significantly
better for almost all methods. The scores of the stochastic predictors
do not vanish, nonetheless, with an estimated 10% of spurious skill
based on representative samples. I argue that this residual score is
due to the limited sample size of 130 years, where the memory of the
processes degrades the independence of calibration and validation
sets. It is likely that proxy prediction scores are similarly inflated
and have to be downgraded further, leading to a final overall skill
that for the best methods lies around 20%.
The consequences of the limited verification skill for millennial
reconstructions is briefly discussed. |
|
|
Teil von |
|
|
|
|
|
|