1 Introduction

The Arctic region is expected to undergo rapid climatic changes in the 21st century in response to anthropogenic warming, for example rising temperatures (Chapman and Walsh 2007), reduced sea ice (Zhang and Walsh 2006) and snow cover (Symon et al. 2005), melting permafrost (Lawrence et al. 2008), increased precipitation (Kattsov et al. 2007), reduced stability in the wintertime Arctic atmosphere (Pavelsky et al. 2010) and increased cloud cover (Vavrus et al. 2009).

However, these studies also demonstrate that there is a wide variation in the size of these projected changes between climate models. In order to begin to improve and constrain future Arctic projections, the uncertainties associated with these projections need to be quantified, understood and potentially attributed to a particular source.

Uncertainties have been examined in detail for the future Arctic climate projections within the CMIP3 global climate model ensemble (Solomon et al. 2007) by a range of previous studies. Liu et al. (2008), for example, examined the variation in Arctic surface air temperature across the CMIP3 ensemble, Kattsov et al. (2007)—Arctic precipitation, Holland et al. (2010)—the Sea Ice mass budget and Eisenman et al. (2007)—Arctic cloud cover and Longwave radiation fluxes. However, all these studies use a diverse set of approaches and definitions to examine a particular aspect of Arctic climate projections. It is therefore difficult to draw clear comparisons between them in order to quantify and understand uncertainty in processes across the Arctic region. The goal of this paper, therefore, is to consistently and systematically assess and quantify the uncertainties in projections of Arctic climate change, across a range of local processes, and to attempt to discover and attribute the sources of these uncertainties. To put this analysis in contact, we first briefly discuss the factors affecting Arctic Climate and the sources of uncertainty in model projections of Arctic Climate.

1.1 Arctic climate

The mean climate of the Arctic arises as a balance between a range of Arctic processes (Fig. 1). The temperature and salinity profiles in the ocean and the temperature and humidity profiles in the atmosphere play a key role. The structures of these profiles are highly seasonal, and changes in the annual mean are likely to arise from a change in the balance of the seasonally varying forcings. For example, the atmospheric temperature inversion arises as a consequence of surface radiative cooling together with a number of other factors [warm air advection, subsidence, cloud processes, surface melt, topography—see Curry et al. (1996)]. In the ocean, temperature and salinity profiles arise from freshwater runoff from the land surface, sea ice melting and ocean advection. The resultant cold, low salinity waters can insulate the overlying sea ice from deeper, warmer waters (Bourgain et al. 2011).

Fig. 1
figure 1

Key processes governing Arctic climate. a Net shortwave (SW) solar radiation (modulated by scattering of radiation by clouds and surface albedo). b Net longwave radiation emitted by surface and by clouds. c Turbulent (sensible + latent) fluxes. d Atmosphere heat transport. e Ocean heat transport. f Atmosphere temperature profile. g Ocean temperature/salinity profile. h Sea ice, thickness and extent. i Land surface processes. j Precipitation. k Freshwater flux e.g. from rivers. Sarah Keeley/Rob Hine ECMWF

There are strong feedbacks within the system. As the ocean sea ice (high albedo) melts it reveals a darker (low albedo) ocean surface—resulting in increased absorption of solar (shortwave) radiation, causing an additional warming and increased melting [ice-albedo feedback-Winton (2006)]. As overlying sea ice insulates the atmosphere from the ocean surface, increased melting leads to increase heat and moisture fluxes into the atmosphere. These increased fluxes may in turn lead to increases in high altitude cloud cover, resulting in reduced outgoing longwave radiation losses from the surface—leading to surface that is warmer than otherwise [cloud-radiation feedback—Abbot et al. (2009)]. The increased high altitude clouds are accompanied by increased specific humidity, which traps more of the outgoing surface longwave radiation—also leading to increased surface warming [water-vapour feedback—Curry et al. (1995)].

The global response to increased atmospheric CO2 concentration can alter the northward heat transport into the Arctic, resulting in changing forcings from ocean and the atmosphere. A reduction in sea ice cover can also directly feedback on the heat transports by reducing the meridional global temperature gradient. Land processes also play a key role in Arctic climate—a retreat of the seasonal snow cover reduces the land surface albedo leading to anomalous warming, and melting of permafrost increases the availability of soil moisture for evaporation. These can result in changes to the Arctic freshwater budget, via precipitation and river runoff changes, which can ultimately alter the ocean surface salinity, and hence stability.

Some processes that occur within the climate system are not represented within the CMIP3 climate models [see Randall et al. (2007)]. For example land-fast ice and biological processes of the shelf seas of the Arctic are still not resolved in models. Land surface schemes, due to a lack of vertical depth in the soil, are not able to represent permafrost well. A number of processes are under development for inclusion in future Earth System models that may be important for Arctic Climate, e.g. interactive chemistry and biogeochemistry. Such missing processes represent an uncertainty in the modelling of Arctic climate-one we are currently unable to quantify or assess. The CMIP5 and future ensembles are likely to include improvements in these areas.

1.2 Sources of uncertainty

There are four sources of uncertainty in climate model projections of Arctic climate:

  • Scenario uncertainty—Uncertainty in future levels of greenhouse gases.

  • Structural uncertainty—The different methods different models use to represent the same physical process e.g. Model resolution, coordinate systems (e.g. constant density vs. constant height).

  • Parameter uncertainty—The value of a parameter within a parameterization of a process not explicitly physically resolved in a climate model (e.g. surface ice albedo) chosen from within the given observational range.

  • Intrinsic internal variability—The inherent variability, or noise, within the climate system, due to its chaotic nature.

The relative impact of these sources of uncertainty on Arctic climate over time can be seen in Fig. 2, following the methods of Hawkins and Sutton (2009, 2011) Model uncertainty (structural plus parameter uncertainty) dominates these quantities in the near term, but scenario uncertainty dominates in the long term.

Fig. 2
figure 2

The total uncertainty in projected Arctic temperature (a) and precipitation (b) is separated into its various components using the CMIP3 ensemble projections of Arctic climate, using the methods of Hawkins and Sutton (2009, 2011). The black line represents the observations for temperature from the NASA GISS dataset Hansen et al. (2010)

We do not consider scenario uncertainty in this paper, since this is defined by external factors such as population growth, technology and economics. We therefore eliminate this source of uncertainty by examining the behaviour of the Arctic climate when subjected to a concentration of CO2 double that of pre-industrial levels. This is reasonable, since any improvements that bring about a reduction in projection uncertainty are likely to come through the modifications to model structures and parameters rather than increased certainty about future greenhouse gas emissions. Any reduction in uncertainty is ultimately limited by the intrinsic model internal variability.

1.3 Model parameterization

Many important climate processes are not modelled explicitly in current climate models. Such processes occur on a spatial scale (e.g. convection) or timescale (e.g. cloud microphysics) that is below the resolution of the model, and so are represented in a simplified way—often as a statistical or generalised representation of a process for an entire model grid box. Such simplified models are termed parameterizations. For example, the surface albedo within the Arctic can vary dramatically within an area of the size of model grid box. Even if an entire grid box was full of sea ice there would be variations in albedo due to pooling of water on the ice, snow depth and leads etc. Therefore over the tens of kilometres represented in one model grid box a simplified or parameterized version of the actual physical processes is applied. Such parameterizations may be weakly constrained by observations and as such may potentially be a means to tune models to the mean observed climate.

As noted above, the two goals of this study are to:

  • systematically assess the uncertainty in simulations of current and future Arctic climate.

  • assess and understand the causes of the uncertainty in future projections of Arctic climate.

The structure of this paper is as follows. Section 2 describes the models, experiments and methods used in this study. Section 3 describes the results of the analysis of model uncertainty. Section 4 discusses the implications of the results and summarises our conclusions. (This study is a synthesis and summary of a systematic review of Arctic Uncertainty conducted for the UK National Environmental Research Council (Hodson et al. 2010).)

2 Models and methods

There are a variety of definitions of the Arctic region in use across the literature. Because relevant Arctic land processes (e.g. permafrost) are poorly represented in current models, we choose to define the Arctic as the region north of 70°N. This incorporates more ocean grid points (70 %) than land grid points (30 %) and hence will weight the assessment to consider the importance of ocean and sea ice processes over land processes. All area average quantities are therefore calculated between 70°N and 90°N. To simplify the analysis we will examine only annual mean Arctic variables in this study. In order to compare the different sources of model uncertainty discussed above, we examined two ensembles: a multi-model ensemble (CMIP3), and a perturbed-parameter ensemble of the Hadley Centre model HadCM3, to be referred to as the THC-QUMP ensemble.

2.1 CMIP3

The CMIP3 multi model ensemble contains 22 coupled climate models [see Table 1, Randall et al. (2007)]. The models vary in their structure, parameterization, resolution, and whether or not they employ flux adjustments to ensure a stable climate. The variance in model structure arises from the different approximations and numerical methods used to model the equations that describe physical process (e.g. fluid flow, radiation) and also the interaction with the different co-ordinate systems that may be employed (e.g. vertical levels vs. density levels, location of poles (over land or ocean) etc). Models also differ in the way in which they represent sub-gridscale processes (parameterization). The inter-model variations in these methods contribute to the structural and parameter uncertainty in climate projections.

Table 1 Models comprising the CMIP3 ensemble used in this study

2.2 THC-QUMP

The 22 member THC-QUMP ensemble was created by perturbing parameters within model parameterizations (e.g. cloud formation and precipitation and ice structure and albedo) to within likely ranges. The ensemble can be broadly described as atmospheric parameter changes without flux correction. The aim of the ensemble is to span the range of climate sensitivities consistent with a uniform prior on parameters but in the process maximise the chance of getting plausible model versions and span a wide range of parameter settings—in other words an attempt is made to uniformly sample the parameter space consistent with a stable climate—for more details see: Murphy et al. (2004), Stainforth et al. (2005), and Collins et al. (2006, 2011). The ensemble comprises of the unperturbed member, the base state which has the same parameter settings as HadCM3 in the CMIP3 ensemble, and other members that have been perturbed away from the base state through changes to multiple model parameters. The perturbations were chosen according to a “Latin Hypercube” design which maximises the number of potential interactions between perturbations. The parameter perturbations are shown in detail in Tables 2, 3, 4, 5, and 6.

Table 2 Perturbed parameter values for the THC-QUMP ensemble
Table 3 Perturbed parameter values for the THC-QUMP ensemble
Table 4 Perturbed parameter values for the THC-QUMP ensemble
Table 5 Perturbed parameter values for the THC-QUMP ensemble
Table 6 Description of parameters in Tables 2, 3, 4, and 5

The perturbations were chosen to span the range of observational uncertainty at the time of the experiment design, with the aim of producing a series of experiments with a wide range of climate sensitivities. (Note that this meant that the albedo of cold ice was not perturbed at all). In additional, several parametization schemes were switched on or off.

Some parameters were perturbed as a linked set, namely threshold for cloud to rain conversion rate, cloud fraction at saturation, sea ice albedo, surface gravity wave parameters, forest roughness lengths, number of soil levels accessed for evapotranspiration. Perturbations requiring a logical switch involved invoking an additional feature or process (non-spherical ice particles, shortwave water vapour continuum absorption, sulphur cycle, surface-canopy energy exchange), removing a process (dependence of stomatal conductance on CO2) or altering the method of representing a process (flow dependent Rhcrit, vertical gradient of cloud water in grid box).

Perturbations to these parameters causes some minor climate drift in some ensemble members. However, no flux adjustment was applied to nudge these members back to the climatology of the unperturbed model (Vellinga and Wu 2008). Some ensemble members exhibit climate drift in the Arctic resulting in low initial states of sea ice extent, such that the summer ice cover was lost early on in the 2 × CO2 experiments.

Each member was first integrated for 100 years as a spin-up, to assess the amount of climate drift and model stability. Model experiments were then initialized from the end of these spin-up integrations. Across the ensemble a gradual weakening of the Meridional Overturning Circulation (MOC) occurs as CO2 concentrations increase, within the range reported in the Third Assessment Report (Cubasch et al. 2001). No rapid shutdown of the MOC is seen.

Both ensembles allow sampling of parameter uncertainty and internal variability in future projections. The CMIP3 ensemble allows the sampling of both parameter and structural uncertainty (see Sect. 2.1). It is clear that parameter uncertainty and structural uncertainty are not independent quantities, since climate models are in part constrained by past observations of climate. Hence it is likely that some of the structural uncertainty is compensated by the parameter uncertainty, i.e. parameters are tuned within uncertainty to compensate for biases introduced by structural uncertainties—and hence are likely to be negatively correlated with each other. This means that we cannot deduce whether parameter or structural uncertainty plays a greater role in a given process from a simple comparison of the CMIP3 and THC-QUMP ensembles.

2.3 Model experiments

For each model in the ensemble, two experiments were performed: a Control integration and a 2 × CO2 experiment. In the Control integration the atmospheric concentration of carbon dioxide (CO2) is held at a constant level throughout the model experiment (usually pre-industrial levels of 290 ppmv—although three models used present day levels 348 ppmv—see Table 1); any variations in the climate system seen in this experiment are hence solely due to internal variability. In the 2 × CO2 experiment the carbon dioxide concentration is increased at a rate of 1 % per year starting from the CO2 concentration in the control integration. Hence at around year 70 of the 2 × CO2 experiment, the CO2 concentration is double that of the control integration.

2.4 Estimation of internal variability

The spread in climate variables across models (e.g. Fig. 3) may occur due to differences in parameterization and model structure, as discussed above. But it may also arise as a consequence of sampling the underlying internal climate variability. A rigorous treatment of the internal variability would involve a full Analysis of Variance (ANOVA) (see von Storch and Zwiers 2002; Yip et al. 2011; Hodson and Sutton 2008), but here we simply calculate an estimate of the internal variability contribution to the overall model spread. For a given variable X mt , where m is the ensemble member and t is time, we assume a normal distribution with a mean μ m and variance σ 2 int : X mt N m 2 int ). An estimate of internal variability \(\hat{\sigma}_{int}\) is given by:

$$ \hat{\sigma}_{int} = \left({\frac{1}{M}}\sum^{M}_{m=1}{\frac{1} {T-1}}\sum_{t=1}^T (X_{mt}-\overline{X}_{m})^2\right)^{1/2} $$
(1)

for M ensemble members and T time points. Here \(\overline{X}_{m}\) is the time mean:

$$ \overline{X}_{m} ={\frac{1}{T}} \sum_{t=1}^{T} X_{mt} $$
(2)

i.e. (1) is the square root of the mean of the time variance in the Control integrations, across the ensemble. In this case we used 80 years of the control run (T = 80). Given this estimate of internal variability, for an ensemble composed of 20 years means (sampled from the control integration), we would expect a spread in the means due to sampling of internal variability (σ v ) of:

$$ \sigma_v = {\frac{\hat{\sigma}_{int}}{\sqrt{20}}} $$
(3)
Fig. 3
figure 3

Climatology of sea ice fraction in the Control experiment for 11 of the CMIP3 models (including IAP-FGOALS) compared with the Observed Sea ice fraction (1980–1999) (upper left)

This is the spread we would expect between the 20 year means of each control integration if the true long-term means were identical. Hence it is the minimum contribution we expect from internal variability alone. If we assume that the magnitude of the internal variability is the same in the 2 × CO2 experiment as the Control integration, then minimum spread we expect between the mean projected changes (2 × CO2-Control) across an ensemble will be: \(\hat{\sigma}_{int}\sqrt{2}/\sqrt{20}\)—since the variances sum.

Does this measure capture the full effect of internal variability? Each model in the ensemble is initialised from different initial conditions. Since we remove the 80 year mean in computing (1) we may remove the impact of initial conditions. For example, each model may be in different phases of a longer multi-decadal oscillation, or minor differences in atmospheric initial conditions may trigger such oscillations. Perturbed initial condition ensembles would be required to refine this measure of internal variability further, for example Mahlstein and Knutti (2012) and Deser et al. (2012). For this reason we can only consider (3) to be a lower bound estimate for the contribution from internal variability.

3 Results

We now analyse the CMIP3 and THC-QUMP ensembles to assess the magnitude of the uncertainty (spread) in both the model control climatology and the climate change (2 × CO2-Control) response (the spread in the difference between 2 × CO2 and Control integrations) in a number of key Arctic variables.

3.1 Climatology and ensemble spread in key Arctic variables

In this study we are principally concerned with the spread in the projected changes in the Arctic under a doubling in the concentration of atmospheric CO2. However, there is also a spread in the climatologies of the control integrations of the models, from which the changes are measured. This spread in climatologies would be unimportant if each model responded linearly to a doubling of CO2. However, as will be shown, the initial model state (or climatology) can determine the magnitude of the projected changes. Hence we briefly outline the spread in the climatologies. As noted above, we only consider annual mean quantities here. The main differences in the CMIP3 model climatologies are well documented in the IPCC report (Randall et al. 2007). In Fig. 3 we present a 20 year mean sea ice fraction from the control integration of 11 of the CMIP3 models, together with the mean observed sea ice fractions (1980–1999). The Figure shows that there are a wide range of sea ice extents within the CMIP3 ensemble, with one model (IAP-FGOALS) having annual mean sea ice as far south as the UK. (For this reason, we exclude the IAP-FGOALS model from the remainder of our analysis). This Figure also highlights the differences in spatial resolution across the ensemble and the subtleties of regional differences that may be hidden by analysis of the climatology within our chosen definition of the Arctic (e.g. 70°N).

Table 7 shows estimates of the ensemble mean and ensemble spread for key Arctic variables in the control integrations of both the CMIP3 and THC-QUMP ensembles. The table also shows an estimate of the fraction of the ensemble spread that is simply due to sampling of the internal (or inter-annual) variability within each model. As noted above, this must be considered a lower bound, since we are unable to explicitly sample the uncertainty in initial conditions.

Table 7 Mean and Spread of key Arctic variables in the control integrations (20 years)

The mean sea ice extent and volume, surface air temperature and precipitation (Table 7) are consistent (within one standard deviation) between the THC-QUMP and CMIP3 ensembles. The mixed layer temperatures, however, are not consistent between ensembles. These means are comparable to the observational estimates that exist, although the observations do not generally lie within one standard deviation of the ensemble means. This is partly due to the varying definitions of the Arctic region used across different observational studies. For instance, observed surface air temperature estimates (Liu et al. 2008) were computed over a wider Arctic region than our chosen region (>70°N), hence resulting in a warmer mean surface temperature due to the inclusion of warmer lower-latitude regions.

Some quantities have a large ensemble spread (uncertainty, one standard deviation across the ensemble) compare to their mean, (e.g sea ice volume,Footnote 1 and mixed layer temperatures). Whilst others, notably surface air temperature and precipitation, have a smaller ensemble spread. Hence some Arctic quantities are modelled more preciselyFootnote 2 than others. Some of this spread will be due to systematic differences between models but a portion will be due to sampling of the intrinsic internal variability within each control. We have estimated this contribution for each model (Table 7). It is clear that, although this contribution is larger for some quantities than others, it is not the dominant source of spread in the ensemble (except perhaps for sea ice extent in the THC-QUMP ensemble (see foot note 1). In other words, the ensemble spread in these variables is mostly due to the parameter and structural uncertainties contained within the ensemble, and is likely not solely an artifact of the sampling of internal (inter-annual) variability (although see Sect. 2.4). Note also, that the spread in quantities within the THC-QUMP ensemble are always smaller in magnitude than those in the CMIP3 ensemble. This implies that the parameter uncertainty is not likely to entirely explain the spread in the CMIP3 ensemble, and that structural uncertainty must play a role.Footnote 3

3.2 Ensemble mean and spread in projected changes under 2 × CO2

The ensemble mean and spread (1σ) in the projected changes under 2 × CO2 in key Arctic variables are shown in Table 8. There is a considerable variation in the spread between variables. In both ensembles, the spread in sea ice volume changes is about half the size of the ensemble mean change. However, changes in surface air temperature are much better constrained, with uncertainties less than a third of the mean change in both ensembles.

Table 8 Change and Spread of key Arctic variables at double pre-industrial  CO2 concentration

For some variables (e.g. sea ice volume) the THC-QUMP spread is comparable to that in the CMIP3 ensemble: 55 versus 49 % (the sampling of internal/inter-annual variability only accounts for a small part of the spread in both cases). Hence the sampling of parameter uncertainty (THC-QUMP) results in a similar uncertainty range for projections of ice volume change as does the sampling of structural and parameter uncertainty (CMIP3). We cannot however, confidently conclude that parameter uncertainty is the dominant contributor to uncertainty in the CMIP3 projections, since, as noted previously 2 the parameter and structural uncertainties in the CMIP3 ensemble are very likely to be anti-correlated. Nevertheless, we can conclude that future constraints on model parameter uncertainty are likely to lead to a reduction in the uncertainty of future projections of sea ice volume.

The situation is somewhat different for surface air temperature—the sampling of parameter uncertainty (THC-QUMP) results in a much smaller uncertainty range than the CMIP3 ensemble. This implies that the sampled range of parameter uncertainty may not be sufficient to explain the projected spread in surface air temperatures in the CMIP3 ensemble, and that structural uncertainties, such as the boundary layer and cloud schemes along with the treatment of surface fluxes, may play a significant role. The caveats are that THC-QUMP may not have sampled the full range of parameter uncertainty present in the CMIP3 ensemble, although significant efforts were made to do so (Collins et al. 2006), and that some of the CMIP3 models may use different parameterization schemes.

Regardless of these caveats, it is clear from the table that, in both ensembles, the uncertainty in the spread is greater than the estimate of the contribution from the internal variability (except for sea ice extent in THC-QUMP—but see 1 ), where it never contributes more than half of the spread (although see Sect. 2.4). This implies that there may be scope for reducing uncertainties in Arctic projections by improving model structure and parameterizations, since projections are likely not dominated by interannual internal climate variability.

3.3 The impact of the model control climatology on projected changes

The time-mean response of each climate model to constant atmospheric greenhouse gas concentrations (control integration) varies between models—resulting in a spread in model climatologies (e.g. Fig. 3). Each CO2 doubling experiment is initialized from a state in the control integration. The spread in the control climates across the ensemble would be unimportant if the Arctic climate within the model responded linearly to the doubling of CO2. However, previous studies (e.g. Holland et al. 2010) have demonstrated that such a spread in the climatologies is partly responsible for the spread (hence uncertainty) in projections of Arctic climate change.

To determine the relationship of the spread in the climatologies with the spread in the projected changes under a doubling of CO2 we look at the correlations across each ensemble. For example, Fig. 4a shows the change in sea ice volume at the CO2 doubling time, against the sea ice volume climatology in the control integration, for each model in the CMIP3 ensemble. There is a clear negative relationship between the initial volume of sea ice (climatology) and the magnitude of the change in sea ice volume at CO2 doubling as first noted by Holland et al. (2010). That is, the ice volume at the start of the 2 × CO2 experiment, determines, in a large part, the magnitude of the sea ice volume reduction upon CO2 doubling. This behaviour could arise if a significant number of models were free of sea ice by CO2 doubling (hence the volume change would only be dependent on mean volume in the control). To test this we reexamine the relationship in September (the summer sea ice minimum) sea ice volumes half-way through the CO2 doubling experiment (Fig. 4b)—here we have explicitly excluded two models where the sea ice volumes fall below 103 km3 by CO2 doubling (years 60–79) to exclude sea-ice free conditions. The relationship is even stronger—suggesting that the effect is real.

Fig. 4
figure 4

a The relationship between the projected changes in annual mean sea ice volume under CO2 doubling and the climatology of the annual mean sea ice volume for all models in the CMIP3 ensemble. The correlation is −0.79. See also Holland et al. (2010). b as a But for September sea ice volume (years 40–59)— for all models except for two where the sea ice volume drops below 103 km3 in years 60–79. Correlation: −0.90

To further examine the dependence of Arctic climate change on the spread of model climatologies we repeat this correlation analysis for the change in seven key Arctic variables (as in Table 8) and their relation to a range of quantities in the control climatology (means of years 61–80 in the control integrations). The results are presented for both ensembles in Table 9. There are clear similarities and differences between the two ensembles. The large (negative) correlations between sea ice volume climatology (in the Control integration) and the sea ice volume change are clearly seen in both ensembles.

Table 9 Correlations between changes in Arctic variables (2 × CO2Control: columns) and a variable in the control integration climatology (Control: rows) (left:CMIP3, right:THC-QUMP)

As noted previously THC-QUMP ensemble differ in the values of certain parameters within key model parameterizations. The similarity of the correlation of the sea ice volume change and climatology between the two ensembles suggest that the relationship is fundamentally related to parameter uncertainty. Figure 5 shows the variation of the sea ice volume climatology (in the THC-QUMP control integration) as one of the model parameters varies: Ice albedo at 0 °C. The dataset is limited but ice volumes are significantly different (p < 0.03) between upper and lower albedo ranges. Taken together with the result that sea ice volume changes are negatively correlated with sea ice volumes in the control (Table 9), this may suggest that sea ice volume changes under a doubling of CO2 are directly related to the value of parameters contained within the sea ice albedo parameterization. Hence uncertainty in sea ice volume projections may be explained, in part, by the uncertainty in the sea ice albedo parameterization schemes. Physically this could arise because the surface albedo controls the climatological radiation balance within the region, and hence the ice energy budget and thus the climatological ice volume. Once thick sea ice is established, the top surface of the ice becomes well-insulated from the cold ocean underneath the ice (−1.8 °C for freezing seawater). This may allow the top surface of the ice to melt in response to a given surface heat flux, whereas a thinner ice—in closer thermal contact with a cold ocean ‘buffer’—would not. Hence explaining the greater sea ice loss in the models with greater initial sea ice volumes.

Fig. 5
figure 5

The variation of sea ice volume climatology and variations in the albedo parameter in the QUMP ensemble. The ice volumes in the upper half of the albedo range (0.56–0.62) are significantly different (p < 0.03) to the ice volumes in the lower half of the albedo range (0.5–0.56)

Examination of Table 9 reveals a number of other interesting relationships. In the CMIP3 ensemble, the northward ocean heat transport in the Control integration climatology is significantly correlated with changes in five of the seven Arctic variables. Some of these relationships can be seen more clearly in Fig. 6 where they are plotted together with the data for Atlantic only ocean heat transport (where available). These suggest that the climatology of the model ocean heat transport into the Arctic explains a substantial part of the spread (uncertainty) in key aspects of Arctic climate change (surface air temperature, precipitation, mixed layer temperatures and sea ice extent).

Fig. 6
figure 6

Relationships between changes (2 × CO2 − Control) in Arctic variables and the time mean Ocean Heat Transport at 70N (black: Global (Total N Ocean Heat Trans), red: Atlantic [Atl Ocean Heat Transport)]—across all available models. a Air Temperature (SAT), b Mixed Layer Temp, c Precip, d Sea Ice Extent. Correlations that are not significant (p ≥ 0.05) are given in brackets. Variables in bold as defined in Sect. 6.2

A similar relationship between surface air temperature and the climatology of the model ocean heat transport into the Arctic has also recently been documented in the A1B scenario experiments (Mahlstein and Knutti 2010). It is interesting to note that the correlation between sea ice volume changes and Atlantic Ocean Heat transport are the opposite sign to the correlation between sea ice extent and Atlantic Ocean Heat transport, although the former is not statistically significant. This opposition of signs persists even if we consider all northern hemisphere ice (0:90N) rather that that within our Arctic definition (70:90N).

These relationships are entirely absent from the THC-QUMP ensemble. This may suggest that this dependence is due to structural uncertainty, rather than parameter uncertainty. A study by Jackson et al. (2011) reveals that the spread of MOC strengths in THC-QUMP is less than that of the CMIP3 ensemble, with most THC-QUMP ensemble members having an MOC strength within the observational range. It is important to note that the model parameters that were varied to create the THC-QUMP ensemble were not chosen specifically to cause a spread in ocean heat transports—no ocean dynamics parameters were perturbed, for example (see Table 6)—hence they may under-sample any parameter uncertainty that exists. Hence, the lack of these relationships in the THC-QUMP table may be because these rely on the interaction of the spread in ocean heat transport and sea ice volume—an interaction that does not occur in the THC-QUMP ensemble.

The THC-QUMP ensemble does show a strong positive correlation between the control climatological surface air temperature and sea ice volume change. However, this is likely due to the link between control climatological sea ice volume and temperature. Higher temperatures in the control will be accompanied by a lower volume of sea ice. As noted previously, models with lower initial ice volumes result in smaller ice volume changes under 2 × CO2. Such changes are always negative (a reduction) in a warming climate, hence higher temperatures are related to smaller negative changes in ice volume. Therefore we would expect a positive correlation between control SAT and sea ice volume.

Similarly, we expect that many of the other significant correlations in the THC-QUMP ensemble are related to the spread in sea ice volume climatologies (for example: Shortwave (SW) flux at the top of the atmosphere (TOA Up SW)—High sea ice volumes are likely to have high sea ice coverage, increasing the upward top of the atmosphere flux in the shortwave band. High sea ice coverage results in less heat being stored in the mixed layer during summer—hence lower annual mean mixed layer temperatures).

The result shown in Fig. 5 and Table 9 imply that reductions in the spread in Arctic model climatologies are an important step in reducing uncertainty in future Arctic climate projections. Figure 5 suggests that for ice volume this uncertainty may be reduced by better constraining the ice albedo parameter in model parameterizations (although this may reveal underlying structural uncertainties—see Sect. 2). Furthermore, Table 9 shows the importance of a realistic ocean heat climatology when forming projections of future Arctic climate change.

4 Conclusions

The goal of this study is to assess and quantify the uncertainty in current Arctic climate projections in a consistent manner. We have examined the uncertainty (ensemble spread) in the response of Arctic Climate to a doubling of atmospheric CO2 concentrations in both a multimodel ensemble (CMIP3) and a single-model perturbed-parameter ensemble (THC-QUMP).

The range of models used in the CMIP3 ensemble have control climatologies that are generally consistent with historical observations of Arctic climate, although the spread of the climatologies can be quite large, particularly for some quantities (e.g. sea ice volume). Both structural and parameter uncertainties are likely to be significant sources of spread, since the spread of both ensembles (CMIP3 and THC-QUMP) is larger than our estimate of intrinsic climate noise (internal variability)—although further analysis will be required to robustly confirm this (see Sect. 2.4).

The ensemble spread of model climatologies is generally greater for the CMIP3 (multimodel) ensemble than the THC-QUMP (single model, multiple parameter) ensemble. This may suggest that structural uncertainty explains a notable fraction of the ensemble spread. However, although considerable efforts were made to uniformly sample parameter uncertainty (see Sect. 2.2), it is hard to directly relate this to parameter uncertainty within the CMIP3 ensemble

Our key findings from this study are that:

  • The uncertainty (ensemble spread) in the projections of changes in many Arctic quantities (e.g sea ice volume, mixed layer temperature) is large (∼30 − 50 %), when compared to the ensemble mean changes.

  • Sub-gridscale parameterizations and model structure variations are likely to be the most significant contributions to this uncertainty, rather than intrinsic internal variability. However, further work sampling internal variability using perturbed initial condition ensembles will be required to robustly test this conclusion (Sect. 2.4).

  • Climatological errors (mean state model biases) in both sea ice volume and northward ocean heat transport into the Arctic are critical factors in determining the climate response of the Arctic to a CO2 doubling [these findings mirror studies by Holland et al. (2010) and Mahlstein and Knutti (2010)].

  • Such climatological errors in northward ocean heat transport into the Arctic in a model may be ultimately due to structural variations between models (CMIP3) rather than uncertainties in sub-gridscale parameterizations (THC-QUMP).

One limitation of this study is that we have only considered variations in annual means. However, the Arctic is a strongly seasonal climate system driven by the large variation in solar shortwave radiation over the annual cycle. In winter, conditions are often dominated by high surface pressure and strong atmospheric inversion. The resulting heat loss to space leads to sea ice growth and and increased mixed layer depth. In summer, by contrast, the near-permanent sunlight induces surface melting of the sea ice, a well-mixed boundary layer and 90 % cloud cover. Consequently, very different processes determine Arctic climate during summer and winter and model ensemble spread will likely vary with season. We have partly analysed the seasonal dependence in the analysis of shortwave (SW) and longwave (LW) fluxes, both which dominate at different parts of the year—hence capture some of the seasonality. But a full analysis will require seasonally resolved data. Such an analysis may reveal stronger signals in the relationships between processes than we have documented here.

This caveat aside, we have demonstrated that there is considerable uncertainty in climate model projections of Arctic climate and that this uncertainty likely has its origins in model structural and parameter uncertainty, rather than internal variability, although further work with multiple-initial conditions ensembles is required to robustly test this. A large part of this uncertainty results from model biases (inter-model differences in the mean state). Addressing these biases, through technical model improvements together with better constraints on models from improved observations (most notably, heat transports into the Arctic and Arctic sea ice volume), will likely lead to significant reductions in the uncertainty of model projections of future Arctic climate.