1 Introduction

The Mediterranean Sea is a semi-enclosed basin connected to the Atlantic Ocean at the Strait of Gibraltar. It is also connected to the Red Sea at the Suez channel and to the Black Sea at the Strait of Bosphorus. There is increasing evidence that hydrographic conditions in several areas of the Mediterranean basin are changing. The western basin deep waters, among the most observed and studied water masses of the Mediterranean Sea, show increase in temperature (~+3.5 × 10−3 °C year−1, Bethoux et al. 1998; Bethoux and Gentili 1999). Corresponding changes in the eastern Mediterranean are less evident albeit Manca et al. (2004) showed a temperature increase in the Ionian (~+2.8 × 10−3 °C year−1) and Levantine basins (~+3.7 × 10−3 °C year−1). In the latter basin a lower increase (~+1.2 × 10−3 °C year−1) was found by Tsimplis and Baker (2000). Changes in the Mediterranean water characteristics are shown to impact the water leaving the Mediterranean Sea through the Strait of Gibraltar. Millot et al. (2006) indeed found a large warming of the densest Mediterranean water outflowing through the Strait of Gibraltar (~+0.3 °C in early 2000s compared to 20 years earlier). An increase in the Mediterranean Sea temperature during the last decades was also shown by Criado-Aldeanueva et al. (2012). The increase is deduced from an unbalance between the net heat flux at the Strait of Gibraltar (based on a combination of climatological data sets and in situ measurements), and a long-term average of the net heat flux at the sea surface. Despite the above mentioned evidences of changes in the thermal properties of the Mediterranean basin, a full understanding of the mechanisms that produce them and of the contribution of the heat budget components to these changes is still not reached.

The Mediterranean heat budget equation can be written as follows:

$$\frac{\Delta HC}{\Delta t} = SHF + GHF + BHF + SCHF + RHF$$
(1)

where HC is the heat content (the thermal energy stored in the basin) and \(\frac{\Delta HC}{\Delta t}\) is the heat content rate. \(\frac{\Delta HC}{\Delta t}\) results from an unbalance between the different heat fluxes at the basin boundaries: the net heat flux at the sea surface (SHF), at the Strait of Gibraltar (GHF), at the Bosphorus Strait (BHF), in the Suez channel (SCHF) and the heat input by the rivers (RHF). The contribution of the rivers, the Bosphorus Strait and the Suez channel are much less important in terms of the water flux exchange, and thus in the heat exchange (e.g. Bethoux 1979; Harzallah et al. 1993; Garrett et al. 1993; Mariotti et al. 2002). Hence, the heat content rate in the Mediterranean Sea is mainly controlled by the net heat fluxes at the Strait of Gibraltar (GHF) and at the sea surface (SHF).

Several estimates of the heat budget components GHF and SHF were obtained based on observations and model simulation (e.g. Garrett et al. 1993; Astraldi et al. 1999; Pettenuzzo et al. 2010; Criado-Aldeanueva et al. 2012; Sevault et al. 2014). The net heat flux through the Strait of Gibraltar (GHF) depends on the water exchanged at this strait. The flow at this strait has two branches: an inflow of Atlantic waters into the Mediterranean operating in the upper layers and an outflow towards the opposite direction operating in the lower layers. The observation estimates for the mean inward and outward fluxes are respectively +0.72 and −0.68 Sv (Bryden et al. 1994), +0.81 and −0.76 Sv (Baschek et al. 2001) and +0.78 and −0.67 Sv (Tsimplis and Bryden 2000). For the period 1994–1996 Astraldi et al. (1999) found higher values +1.01 and −0.97 Sv. More recently, Soto-Navarro et al. (2010) obtained +0.81 ± 0.06 Sv and −0.78 ± 0.05 Sv based on direct current meter observations during the period 2004–2009 and a combination of different products for the surface freshwater flux in the Mediterranean. In the frame of the European project CIRCE five RCSMs were used to simulate the climate evolution of the Mediterranean region during the period 1950–2050 (Gualdi et al. 2013a, b; Dubois et al. 2012). The ensemble average of the mean inward and outward fluxes for the 1961–1990 period were +0.85 and −0.81 Sv, respectively. Hence the amount of water being exchanged at the Strait has been quantified to be of the order of 0.8–1 Sv in both layers. The net water flux corresponding to the above mentioned estimates ranges between +0.04 and +0.11 Sv.

Returning to the heat flux through the Strait of Gibraltar, comparatively fewer estimates are available. Based on current meter measurements at the strait, Garrett et al. (1993) and Macdonald et al. (1994) respectively obtained +7 ± 3 Wm−2 and +5.2 ± 1.3 Wm−2 as net heat flux. Combining climatological values with in situ measurements Criado-Aldeanueva et al. (2012) proposed +3.2 ± 1.5 Wm−2. Astraldi et al. (1999) estimated the net, inward and outward heat fluxes from measurements for the period 1994–1996 as corresponding to +5.0, +32.3 and −27.2 Wm−2, respectively. From the CIRCE simulations the ensemble average mean values of these fluxes are found +4.31, +19.59 and −15.27 Wm−2 for the period 1961–1990. However the CIRCE simulations showed large inter-model spread and, as stated by Gualdi et al. (2013b), their heat flux estimates should be carefully considered.

An accurate estimate of the net heat flux at the sea surface (SHF) is still not reached due to the lack of observations over the sea to constrain the atmospheric products. Thus, the closure of the heat budget remained a concern in several studies (e.g. Harzallah 1990; Garrett et al. 1993; Josey et al. 1999). Sevault et al. (2014) compared different estimates of the surface heat flux components for the period 1985–2004 based on in situ and satellite observations and found net heat flux values ranging from −10 to 0 Wm−2. The well known ERA40 (Uppala et al. 2005) and ERA-Interim reanalyses (Dee et al. 2011) provided surface and atmospheric fields widely used in several modelling studies of the Mediterranean Sea. However these analyses present some non negligible biases. The ERA40 net heat flux at the sea surface is shown to be too large (−13 Wm−2 for the period 1958−2001, Pettenuzzo et al. 2010; −15 ± 6 Wm−2 for the period 1961−2000, Sanchez-Gomez et al. 2011). Pettenuzzo et al. (2010) showed that such a too large heat flux is basically the result of underestimation of the shortwave radiation. They also showed that despite being few, severe winter episodes induce important heat losses, thus largely contributing to the total heat flux biases. The authors used satellite and in situ observational data sets to obtain a corrected net heat flux of −7 Wm−2. The European project ENSEMBLES generated an ensemble of regional dynamical downscaling sets of ERA40. In particular, twelve high-resolution (25 km) atmospheric RCSMs were forced by the same ERA40 forcing for the period 1961–2000. Sanchez-Gomez et al. (2011) compared the simulated net heat fluxes at the sea surface and obtained an ensemble average value of −9 Wm−2 but the spread was large (from −40 to +21 Wm−2). They also showed large differences in the net heat flux values obtained from a combination of different data sets (−3, +5 and +9 Wm−2). The CIRCE simulations provided net heat flux values corresponding to a heat loss by the sea and satisfying the heat budget closure (−6 to −2 Wm−2 for the period 1961–1990; Gualdi et al. 2013b; Dubois et al. 2012). The last authors concluded that ocean–atmosphere coupling is a necessary approach for consistent climate simulations of the Mediterranean Sea.

Recently a new ensemble of simulations has been set up in order to simulate the state and the evolution of the Mediterranean domain. The simulations are conducted in the frame of the Mediterranean coordinated experiment Med-CORDEX (Ruti et al. 2015) and supported by the HyMeX program (Drobinski et al. 2014). The simulations use the RCSMs and are intended to reproduce in a model the functioning of the Mediterranean region including for most of them the sea, the atmosphere, the land and the river components. The added-value of this new set of simulations is the use of such multi-component models with high resolution downscaled atmospheric conditions. In this paper we use the results from this new data set to characterize the mean values and long term changes of the heat budget components of the Mediterranean Sea. The results are compared to available observations and to previous simulations. The goal is twofold: on one hand we want to analyse the similarities and discrepancies among the Med-CORDEX ensemble and to do a first assessment of the quality of the model results; on the other hand we want to identify which is the main source of uncertainty responsible for the model spread in terms of the basin average temperature.

The paper is organised as follows: the next section summarizes the models and the simulations used in the study; a detailed description is given in the ‘Appendix’. Section 3 presents mean values and trends of the heat budget components from the simulations with a comparison to available observations. Section 4 discusses the impacts of model similarities and differences on the results. The major conclusions of this paper are presented in Sect. 5.

2 Models and data used

The Med-CORDEX data used in this study are RCSM outputs from six coupled models all forced laterally by the ERA-Interim reanalyses and six ocean forced models that use dynamically downscaled versions of ERA40/ERA-Interim reanalyses as surface forcing. The simulations cover the Mediterranean region and the entire data set is available from the Med-CORDEX data base (www.medcordex.eu/medcordex.php). The models used and the simulations performed are detailed in the ‘Appendix’.

The simulated temperature time series are compared to two observational data sets based on in situ profiles. The first set is the RIXEN gridded data set (Rixen et al. 2005). It provides a reconstruction of the Mediterranean Sea temperature for the period 1950–2000 on a 0.2° × 0.2° grid using a variational inverse model. Formal errors at each grid point were obtained from this approach (Rixen et al. 2001). The second one is EN4 (EN4.0.1, Good et al. 2013). It is an update of EN3 (Ingleby and Huddleston 2007) obtained by optimal interpolation on a 1° × 1° grid. The data used cover the period 1910–2013. The basin averaged temperature and its uncertainty for these two data sets are obtained by averaging the temperature and its uncertainty at each data grid point over the Mediterranean Sea volume.

The quantitative comparison between model outputs and observations use simple statistics. The period 1980–2010 common to most data sets is considered as a reference period. However some simulations start after 1980 or end before 2010. In this case the statistics are performed based on the shorter simulated period. Time averages are shown with their 95 % confidence intervals and with significance tests (null hypothesis of zero mean). Significance tests are also shown for correlations at the same confidence level (null hypothesis of absence of correlation). The statistics also include trends, their 95 % level confidence intervals and significance tests (null hypothesis of trend absence) using the two-tailed Mann–Kendall test (Mann 1945; Kendall 1975). The heat content rate is calculated from the time derivative of the basin average temperature yearly values using a constant water density.

3 The heat budget components

3.1 The basin average temperature

The average temperature over the basin volume is one of the most important parameters that permit to investigate the state and the evolution of the Mediterranean Sea. The temporal evolution of the average temperature from MedCORDEX models and Rixen and EN4 observational products are shown in Fig. 1. The corresponding mean values over the reference period 1980–2010 are summarized in Table 2. Most models show a significant positive bias relative to observations in particular at the end of the simulated period. Without considering CMCC-MED, the bias in the mean value is less than ~0.1 °C in 1980 and increases up to ~0.2 °C in 2010. Concerning the variability, most models show similar interannual variations. Noteworthy, the abrupt cooling during 1980–1985, 1991–1993 and 2002–2005 and the sudden warming during 1989–1991 (e.g. Brankart and Pinardi, 2001) are particularly well reproduced. The average correlations with the two observational data sets (using detrended series, not shown) are 0.55 for RIXEN and 0.65 for EN4. On the other hand correlations are larger for the forced models (0.60 for RIXEN and 0.73 for EN4) compared to the coupled ones (0.50 for RIXEN and 0.57 for EN4). As it will be shown below this illustrates the stronger constraints set in the forced models.

Fig. 1
figure 1

Annual series of the average temperature over the Mediterranean Sea (°C) from the Med-CORDEX simulations. Series from the two observational products (RIXEN and EN4) are also shown with their associated uncertainty (shaded areas)

Time series of the simulated average temperature clearly show positive trends during the period 1980–2010. The trend estimates from model results and observations are shown in Table 4. All trends are positive and, except INSTMED06-ARPERA2, are significant at the 95 % level. The simulated trends are higher than the observed ones except those of the two INSTMED06 simulations. The low trends for these two simulations may be related to the use of the correction of atmospheric heat fluxes. Also, these simulations use a monthly climatology rather than monthly varying values when relaxing the sea surface temperature (see the ‘Appendix’ and Table 1). This induces damping of the surface temperature variations and hence a lowering of the average temperature trend. The ensemble average trend is +8.1 × 10−3 °C year−1 which is higher than the reported trends in the western basin, the Inonian Sea and the Levantine Basin, +1.2 to +3.7 × 10−3 °C year−1 (Bethoux et al. 1998; Bethoux and Gentili 1996, 1999; Tsimplis and Baker 2000; Manca et al. 2004), although the periods considered are different. On the other side, the coupled simulations show larger trends than the forced ones. The average trend for the coupled models is +12.1 × 10−3 °C year−1 while for the forced models it is +4.3 × 10−3 °C year−1. Such a difference will be discussed in view of differences in the boundary heat fluxes SHF and GHF.

Table 1 Description of the Med-CORDEX models used in the present study
Table 2 Mediterranean Sea mean temperature and heat budget components from the Med-CORDEX simulations and from previous estimates
Table 3 Correlation coefficients between the heat content rate and the net heat flux at the sea surface and between the heat content rate and the net heat flux at the Strait of Gibraltar based on the Med-CORDEX simulations

3.2 Surface heat flux and heat content rate

In order to get insights into the processes that lead to the spread among models we look at the different components in the right hand side of (1). The mean net heat flux at the sea surface varies between −4.8 and +2.2 Wm−2. On average, coupled models simulate lower heat loss than forced models (the average value of the net heat flux at the sea surface is −0.7 Wm−2 for coupled models while it is −4.1 Wm−2 for the forced ones). This result is in agreement with the stronger warming simulated by the coupled models and partly explains the corresponding higher temperature trends shown in Table 4. The time series of the net heat flux at the sea surface and the heat content rate from the different models are shown in Fig. 2a, b, respectively. For the net heat flux, apart from differences in the mean values, interannual variations are similar (interproduct correlations range from 0.8 to 0.95). Such a high resemblance is attributed to that the same atmospheric fields (ERA40/ERA-Interim) are used as atmospheric boundary conditions in all the simulations (see Sect. 2). For the heat content rate the resemblance between models is similarly high. In addition the correlations with the observational products also show high values, ~0.7 for RIXEN and ~0.8 for EN4 and all series are most of the time within the confidence interval of these two observational data sets (Fig. 2b). It is also important to note that variations of the heat content rate closely follow those of the net heat flux at the sea surface. The correlation between both variables ranges between 0.4 and 0.7 (Table 3) in most of the simulations with a mean value of 0.53. Variations of the heat content rate show higher correlations with the net heat flux at the sea surface in coupled models (0.59) than in forced ones (0.48). At the opposite, correlations of the heat content rate with observations are higher in forced models (0.68 vs. 0.61 for Rixen and 0.78 vs. 0.69 for EN4). Hence compared to coupled modes, forced models simulate heat content rate variations closer to observations but less coherent with the surface fluxes. This may be related to the relaxation procedure in the forced models, which brings the simulated sea surface temperatures to values closer to the observed ones, hence not necessary coherent with the surface fluxes.

Table 4 Trend estimates of the Mediterranean Sea average temperature and of the heat budget components from the Med-CORDEX simulations and from previous estimates
Fig. 2
figure 2

a Annual series of the net heat flux at the sea surface of the Mediterranean Sea based on the Med-CORDEX simulations. b Annual series of the heat content rate in the Mediterranean Sea from the same simulations. Series from the two observational products (RIXEN and EN4) with their associated uncertainty (shaded areas) are also shown. Units are Wm−2

Trends of the net heat flux at the sea surface and of the heat content rate (i.e. acceleration of the heat content) are shown in Table 4. Although most trends are not significantly different from zero at the 95 % level, it is useful to examine if there are similarities in the different models. The net heat flux at the sea surface shows that the sign and magnitude of the trends differ among models. At the opposite, most models show positive trends of the heat content rate in agreement with the two gridded observation data sets. The average trend from the simulations is +58 × 10−3 Wm−2 year−1 which is close to the trend value form the RIXEN data set, +52 × 10−3 Wm−2 year−1 and lower than the trend value from the EN4 data set, +188 × 10−3 Wm−2 year−1.

3.3 Role of the exchanges at the Strait of Gibraltar

For the Strait of Gibraltar, all models simulate a positive and significant (at the 95 % level) net heat gain (Table 2). Values range between +1.4 and +6.7 Wm−2 with a mean value of +4.9 Wm−2 and no significant differences between coupled and forced models. Looking at the mean inward and outward heat fluxes there is a rather important spread between models. The mean values range between +14.5 and +32.4 Wm−2 for the inward heat flux and between −10.2 and −26.6 Wm−2 for the outward heat flux. It is worth noting that the exchanged fluxes are larger in the coupled models than in the forced ones; the mean inward flux is +23.4 (+18.8) Wm−2 and the mean outward flux is −18.6 (−13.8) Wm−2 for the coupled (forced) models. It is interesting to note the relatively weak interannual variability of the modelled heat flux at the Strait of Gibraltar; the ensemble average annual standard deviation (std) is 0.4 Wm−2 while it is 4.7 Wm−2 for the net heat flux at the sea surface. This is in agreement with the weak variability shown by Astraldi et al. (1999) in a nearly two-year heat flux series. The time series of the heat fluxes at the Strait are shown in Fig. 3. For most models the simulations show different time evolutions even for the net heat flux (Fig. 3c). In opposition to the high correlations between the heat content rate and the net heat flux at the sea surface, most correlations between the heat content rate and the net heat flux at the Strait of Gibraltar (Table 3) are not significant except for the NEMOMED12 and the CNRM-RCSM4 simulations. It would be interesting to further examine the reasons for such significant correlations for these two simulations in particular the dependence on the 3D temperature distribution inside the basin.

Fig. 3
figure 3

Annual series of inward (a), outward (b) and net (c) heat fluxes through the Strait of Gibraltar from the Med-CORDEX simulations. Units are Wm−2

The model results show large differences in the simulated heat fluxes at the Strait of Gibraltar. More insight can be gained looking at what determines these fluxes: the water flux and the temperature of the exchanged masses. Mean values of the water fluxes at the Strait of Gibraltar are shown in Table 5. All models, except MORCE-LMD show very close net water fluxes ranging from +0.042 to +0.076 Sv. It is worth recalling that the net water flux at the Strait is mainly determined by the net water flux through the Mediterranean Sea surface which equals the difference between the evaporation and the precipitation (E–P). This quantity is provided in all models by a downscaling of ERA40/ERA-Interim fields. The low values in MORCE-LMD are due to an excess of precipitation over evaporation. The inward and outward fluxes show a larger spread (ranging from +0.44 to +1.08 Sv for the inflow and −0.45 to −1.02 Sv for the outflow). Although these values are within the generally accepted values, they can be up to a 20 % higher or lower than recent estimates based on observations (e.g., +0.81 and −0.78 Sv for the inflow and outflow, respectively; Soto-Navarro et al. 2010). The relatively low values simulated by the two INSTMED06 simulations are probably due to the limited depth of the Strait of Gibraltar which reduces the area across the Strait and thus reduces the exchanged transport.

Table 5 Water fluxes and temperatures at the Strait of Gibraltar from the Med-CORDEX simulations and from previous estimates

Time series of the simulated water fluxes at the Strait of Gibraltar are shown in Fig. 4. Corresponding series reconstructed from the observations made at the Strait for the period 2004–2010 (Jordà et al. 2016) are also shown. The length of the observations (6 years) prevents drawing any conclusion, but a first comparison suggests higher fluxes in the observational products. The time evolution of the inward and outward water fluxes (Fig. 4a, b) closely resemble the corresponding ones for the heat fluxes (Fig. 3a, b) and also show no significant correlations among the different models. In opposition to the net heat flux (Fig. 3c) the net water flux (Fig. 4c) shows highly coherent series (expect for a shift of series for the MORCE-LMD model). This was somehow expected for the forced models, as far as the net water flux at the sea surface is derived from a downscaling of the same reanalysis. Interestingly, the coupled models also reproduce similar net water flux series as the forced ones.

Fig. 4
figure 4

Annual series of inward (a), outward (b) and net (c) water fluxes through the Strait of Gibraltar from the Med-CORDEX simulations. Units are in Sv (1 Sv = 106 m3 s−1). Estimates based on observations of Astraldi et al. (1999) and from Jordà et al. (2016) are also shown

Table 5 shows the mean temperature values obtained dividing inward and outward heat fluxes by the corresponding water fluxes. This provides the equivalent average temperatures of the inflowing and outflowing water masses. For the inflow component the temperature corresponds to that of the Atlantic Ocean water entering the Mediterranean Sea. For the outflow component it represents the temperature of the Mediterranean water leaving the basin towards the Atlantic Ocean. Hence the latter is an indication of the Mediterranean basin thermal state at intermediate and deep layers. Most models present a value of the mean temperature of the inflowing water around ~16.8 °C. The ENEA-PROTHEUS and MITGCM-REMO-ERA40 show the highest values, 17.7 and 19.1 °C, respectively whereas MORCE-LMD shows the lowest value, 15.3 °C. These values are strongly determined by the lateral boundary conditions imposed in the Atlantic side. Similarly, the temperature of the leaving water, which is defined by the initial conditions and by the heat redistribution simulated by each model, shows large variations among models. The lowest value is 12.2 °C for MORCE-LMD and the highest one is 15.4 °C for ENEA-PROTHEUS. It is interesting to mention that the NEMOMED12-ARPERA2 simulations show lower outflow temperatures compared to the NEMOMED8 in spite of using the same atmospheric forcing. This was already shown by Soto-Navarro et al. (2015) who attributed the lower temperatures in NEMOMED12 to a better representation of the proportion of deep waters in the outflow. Also the CNRM-RCSM4-SN model simulates cooler outflowing water than the CNRM-RCSM4-AN model, in agreement with the lower basin average temperature simulated by the CNRM-RCSM4-SN model shown in Fig. 1. The difference between the mean temperatures of the inflowing and outflowing waters is ~3.2 °C with the weakest value (1.6 °C) for MORCE-LMD and the highest one for NEMOMED12-ALDERA (4.1 °C). From the observations typical mean values of the temperature at the layers (0–150 m) and (150–300 m) are 15.4 and 13.4 °C, respectively. Thus the difference is 2 °C (Bethoux and Gentili 1999). The inflow and outflow temperatures derived following the same approach but based on the mean water and heat fluxes of Astraldi et al. (1999) give higher values, 19.2 and 16.8 °C respectively, with a difference of 2.4 °C.

The time series of the temperature of the inflowing water (Fig. 5a) show different behaviours that reflect different approaches used for modelling the Atlantic Ocean zone. The MITGCM-REMO-ERA40, CMCC-MED, MORCE-LMD, ENEA-PROTHEUS and INSTMED06 models show small interannual variability; those models use a relaxation towards the climatology in the Atlantic zone. The remaining models show a rather similar behaviour with a relatively high interannual variability. Those models use a relaxation towards time-varying observed conditions in the Atlantic zone (see Beuvier et al. 2010). The temperature of the outflowing water (Fig. 5b) shows a much weaker interannual variability, a signature of the slowly varying intermediate and deep waters.

Fig. 5
figure 5

Annual series of the mean temperature of the inflowing water (a), of the outflowing water (b) and of their difference (c) from the Med-CORDEX simulations. The mean temperatures are deduced from the ratio of the heat and the water fluxes through the Strait of Gibraltar for the inward and outward components. Units are °C

Trends of the heat and water fluxes at the Strait of Gibraltar are presented in Tables 4 and 6, respectively. The inward and outward fluxes show no coherent trends for both heat and water. Similarly, the net heat flux shows different trend signs, although trends are significant for four coupled models. For the net water flux, although trends are weak and not significant at the 95 % level (except MORCE-LMD), most models (nine) show positive values. The ensemble average is +0.2 × 103 m3 s−1 year−1 with values ranging from −0.2 to +1.3 × 103 m3 s−1 year−1. Fenoglio-Marc et al. (2013) found an increase in the net water flux at the Strait of Gibraltar of ~+0.8 × 103 m3 s−1 year−1 based on the results from one single model. The net water flux increase was explained by Fenoglio-Marc et al. (2013) as resulting from an increase in the net evaporation at the sea surface which has also been demonstrated by Mariotti (2010) but with a significant multi-decadal variability. Table 6 also shows that most models simulate positive trends of the temperatures of the inflowing and outflowing waters. For the outflowing water the trends are significant for eleven models and the average value is around +8.4 × 10−3 °C year−1 about half the trend estimate of +15 × 10−3 °C year−1 proposed by Millot et al. (2006). In agreement with those authors, such temperature increase is suggested to be related to a change of the hydrographic conditions inside the Mediterranean basin.

Table 6 As Table 5 but for the trend estimates

4 Discussion

In opposition to the highly resembling series of the net heat flux at the sea surface (the annual correlations between pairs of simulations are greater than 0.7 for all cases), the agreement on the series of the net heat flux through the Strait of Gibraltar is low and significant correlations (0.5–0.8) are found only between models which are using the same boundary conditions in the Atlantic (namely the NEMOMED8, NEMOMED12 and CNRM models). In order to get an insight of the reasons behind this intermodel spread, we assumed a two layer system in Gibraltar, with Atlantic water flowing towards the Mediterranean in the upper layer and Mediterranean water flowing towards the Atlantic in the lower layer. Thus, we can formulate the net heat flux through the Strait of Gibraltar as

$$GHF = \phi_{IN} \Delta T_{GIB} + \phi_{NET} T_{OUT}$$
(2)

where \(\phi_{IN}\) and \(\phi_{NET}\) are the inward and net water fluxes and \(\Delta T_{GIB} = T_{IN} - T_{OUT}\) is the difference of temperatures of the inflowing and outflowing waters. Next, we can do a simple estimate of the expected spread in the Strait of Gibraltar heat flux using the intermodel spread of each component (Tables 2, 5) and keeping the rest of the terms constant. Thus, it can be seen that the spread in GHF induced by the spread in \(\phi_{IN}\), in \(\Delta T_{GIB}\) and in \(\phi_{NET}\) are 0.96, 0.92 and 0.85 Wm−2, respectively. The spread induced by the spread in \(T_{OUT}\) is only 0.05 Wm−2. The spread in \(\phi_{IN}\) is related to that in the difference of densities of the inflowing and outflowing waters, mainly due to the model initialization but also to the particular way the Strait is solved. Here the bottom drag and the topography play a key role in determining the exchange transports (e.g. Soto-Navarro et al. 2015). The spread in \(\Delta T_{GIB}\) can be associated to the initial state of the models. This can be illustrated in the large \(\Delta T_{GIB}\) difference between the NEMOMED12 and NEMOMED8 simulations (Fig. 5c) that foremost results from the difference in \(T_{OUT}\) (Fig. 5b) which is strongly dependent on the initial conditions. The evolution of the temperature inside the basin can also modify \(\Delta T_{GIB}\), but the change during the analysed period is very slow. Finally, the spread in the \(\phi_{NET}\) has a strong impact in spite of being relatively small (intermodel std = 0.04 Sv). The small intermodel spread in \(\phi_{NET}\) has to be linked to the E–P imposed at the basin surface. It is worth mentioning that the interannual variability of the net water flux \(\phi_{NET}\) is probably underestimated as most analysed models do not include the atmospheric pressure as a forcing variable. Therefore, most models miss the impact of the atmospheric pressure gradient, which is a non-negligible contributor to the interannual variability of the flow through the Strait of Gibraltar (e.g. Gomis et al. 2006).

For most models a significant warming of the outflow is found. Such warming is probably linked to a change in the Mediterranean thermal conditions. It agrees with the warming of the densest Mediterranean outflowing water due to changes in its composition (Millot et al. 2006). The lower basin average temperature and the cooler outflowing water in CNRM-RCSM4-SN compared to CNRM-RCSM4-AN (Tables 2, 5) may reflect an effect of surface fluxes on the thermal conditions in the basin and on the water outflowing at the Strait. Indeed the CNRM-RCSM4-SN presents a stronger net heat loss (Table 2) due to a stronger latent heat loss (Sevault et al. 2014). Hence hydrographic changes induced by changes in the atmospheric conditions may impact the exchange at the Strait. It is interesting to further investigate these changes in particular the water masses involved and the consequences on the thermohaline circulation.

Most models agree with observations on the Mediterranean warming. However, the model spread is large and the rates are higher than in the observations, especially for the coupled models. The trend in the Mediterranean average temperature results from an imbalance between the different terms in (1), and especially between the net heat fluxes at the sea surface and at the Strait of Gibraltar. The model ensemble spread in the mean value of those two components, in particular in the net heat flux at the sea surface, explains the different warming rates among models (see Table 2). Unfortunately, there are no independent observational data sets that could be used to constrain with enough accuracy neither the mean heat loss at the surface nor the mean heat gain through the Strait of Gibraltar (MedMAHB report 2015, http://marine-climate.uib.es/medworkshop.html).

The present study showed differences between the forced and coupled model simulations, in particular the larger spread in the coupled ones. The larger spread in the mean net heat flux at sea surface in coupled models (range from −3.5 to +2.2 Wm−2 while −4.8 to −2.9 Wm−2 in the forced models), and the lower correlations of the average temperature with observations also in coupled models (Sect. 3.1) may be related, at least partly, to the atmospheric internal variability generated inside the domain of the regional models (e.g. Laprise et al. 2012; Herrmann et al. 2011). The trace of the internal variability remains even for averages over a long period (Laprise et al. 2012; de Elía et al. 2007). Hence, the atmospheric patterns simulated by the regional models may differ from the patterns of the driving analyses. In the present simulations, despite the high intermodel correlation values shown above, some intermodel variability exists in the net heat flux at the sea surface series (std of 2.7 Wm−2) and is almost the same for coupled and forced models. It is suggested that part of it is attributed to the internal variability generated inside the model domain. In the coupled models such internal variability is transmitted to (and feedbacks with) the ocean component through the two-way fluxes at sea-atmosphere interface. In the forced models the ocean component is constrained by the relaxation to the observed or climatic sea surface temperatures (see Table 1) damping the effect of the internal variability. This gives an explanation to the lower correlations of the heat content rate with observations in the coupled models than in the forced ones.

The initial conditions and spin-up may have an effect on the average temperature mean value and evolution. Simulations showing higher mean temperature value (e.g. CMCC-MED, ENEA-PROTHEUS, CNRM-RCSM4-AN) also have higher temperature value at the start of the simulation. Overall, the correlation between the temperature bias relative to observations at the start of the simulation and the mean temperature during the simulation is 0.78, significant at the 99 % level. This shows the crucial role of the starting state especially for coupled models (e.g. CMCC-MED). The impact of the initial conditions and spin-up on the average temperature interannual variability is less important as temperature variations are driven mainly by the heat fluxes at the sea surface (see Fig. 2). We also mention that the effect of the spin-up duration is difficult to investigate as several simulations use nearly the same length whereas no spin-up is used in other simulations.

Different approaches are used in the models shown in this study to simulate the Atlantic zone (see Table 1). In particular the internannual variability of the temperature and salinity are considered in some models whereas others use climatological values. The sea level is also used with climatological or interannually varying values but in addition some models consider open boundary conditions permitting explicit mass exchange (INSTMED06) or a mass exchange deduced by relaxing the sea level towards an observational data set (e.g. NEMOMED12). Attempts were made to investigate the impacts of the varying Atlantic conditions but due to the limitation in the number of cases no conclusive results were reached.

The models used have different resolutions both in the horizontal and vertical coordinates. In addition most models use spatially varying coordinates with increased resolution when approaching the Strait of Gibraltar. The width of the strait of Gibraltar is represented by two or three grid points in the different models. The INSTMED06 model which has a comparatively coarse resolution in the Mediterranean basin uses three grid points. Hence it is not straightforward to investigate the impacts of the resolution on the heat budget of the Mediterranean Sea. On the other hand there is some evidence that models with a deeper Strait of Gibraltar simulate stronger water and heat exchange. Again, although this is needed to be confirmed as it is based on only four different values of the strait depth (see Table 1), it highlights the importance of reproducing, as accurately as possible, the strait geometry in the models, in particular its depth.

5 Conclusions

The heat budget components of the Mediterranean Sea are studied based on a set of numerical simulations performed using the RCSMs which include both forced and coupled models. The first ones are Mediterranean Sea models forced by downscaled versions of the ERA40/ERA-Interim reanalyses. For the coupled ones the atmospheric components are driven outside the Mediterranean domain by the ERA-Interim reanalysis meanwhile they are free to evolve inside the domain. The study focuses on mean values and trends of the heat budget components for the 1980–2010 period (31 years). Model results are compared to available observations and to previous simulations.

The different models simulate similar evolutions of the Mediterranean Sea average temperature in agreement with observations. Major climate events, as the cooling after 1980 and the sudden warming in 1990, are well reproduced by the models. This consistency is attributed to the use of the same atmospheric reanalyses (ERA40/ERA-Interim) in the atmospheric components of the RCSMs used. The heat content rate presents a similarity among models and is highly related to the surface heat fluxes. On the other hand, the heat fluxes through the Strait of Gibraltar show a large intermodel spread and little impact on the heat content rate variability. In addition the initial conditions determine the mean value of the average temperature and have an impact on the temperature trends. However it does not seem to affect the interannual variability.

Average temperature trends are positive and significant in most models. They are larger than the trends computed from observational products. Higher temperature trends and lower surface heat losses are simulated by the coupled models compared to the forced ones. At the opposite, the mean values of the net heat flux at the Strait of Gibraltar are nearly the same in the forced and coupled models. Hence the higher temperature trends in the coupled models are attributed to the larger heat budget imbalance simulated in those models, resulting mainly from their lower surface heat loss. The resemblance between the simulated and observed heat content rates is stronger in the forced models than in the coupled ones. This is attributed to the effect of the sea surface temperature relaxation in the forced models, which reduces the impact of the internal variability generated inside the domain and brings the simulated temperatures closer to the observed ones.

At the Strait of Gibraltar, all models simulate a positive net heat gain with values ranging between +1.5 and +6.8 Wm−2 and a mean of 4.9 Wm−2. The intermodel spread is due to the different values in the temperature difference between inflowing and outflowing waters at the Strait, in the exchange transports and in the net water flux. Unfortunately, the observational data sets are not accurate enough to constrain the mean values of the net heat fluxes at the sea surface and at the Strait of Gibraltar. It is also worth mentioning that the net water flux at the Strait of Gibraltar is similar for most models with intermodel spread relatively small (0.04 Sv) in both forced and coupled models. This is expected for the forced models that are forced by the same atmospheric water fluxes at the sea surface but was not obvious for the coupled models. Further studies are needed to better understand the water cycle simulated by the different models and its impact on the heat budget of the Mediterranean Sea.

Large differences between models are found in the temperatures of the inflowing and outflowing waters, which reflect differences in the way the Atlantic Ocean is considered and differences in the initial conditions and spin-up periods. However, due to the limitation in the number of cases no conclusive results were reached. The temperature of the outflowing water shows positive and significant trends. The trends are higher in the coupled models than in the forced ones. It is assumed that the Mediterranean Sea warming, found in most models and in particular in the coupled ones, induces changes in the hydrographic conditions that impact the Strait of Gibraltar.