Introduction

Flooding poses an ever-present economic, societal1,2 and environmental3 risk that is likely to increase in the future4,5,6,7,8. An improved understanding of historical changes in flood hazard and the underlying driving mechanisms is, therefore, critical for predicting future changes for better adaptation strategies.

Flood estimation and flood management have traditionally been based on flood frequency analysis, following traditional stationary or non-stationary flood distributions accounting for shifts in flood extremes, climate and scale9,10,11,12,13,14. Recent accelerations in population growth, together with changes in the economy and land use patterns have also underlined how humans contribute to the complexity of the flood-river system15, as they are inextricably linked to water resources, and are active agents in altering atmosphere, hydrology and geomorphology16,17,18,19,20,21.

Nonetheless, while individual attribution studies linking either atmospheric or landscape drivers to floods are valuable, many times they fail to consider that massive channels widening or narrowing have been recorded22,23. These changes in river conveyance are driven by the interactions between hydrology, geomorphology, atmospheric drivers, and human activities, and by the interdependencies of processes at different spatiotemporal scales24,25,26,27,28,29,30,31,32,33,34,35,36,37.

Changes in channel geometry at short and intermediate time scales (1 to 100 years) have clear relevance for flood hazard prediction38. Fluvial systems function in feedback loops, where processes and their alteration influence one another (Fig. 1), and the cycle may either amplify or reset, continuing over time.

Figure 1
figure 1

The Colorado’s flood of 2013 changed the geomorphology of the St. Vrain Creek (CO) (a in 2012- and b in 2017) (Image from Google Earth, 2019 Digital Globe) and altered the stage-discharge relationship (c). The map was arranged using ArcGis 10.7 [www.arcgis.com].

Changes in river conveyance capacity do not impact the amount of water that flows through the river system during a flood, but they have a bearing on the probability of a flood event overtopping the banks and flood defences. However, practice typically assumes that an equilibrium relationship exists between the long‐term average geometry of the channel (i.e., its width, depth, and slope) and the supply of water and sediment to the stream. Although the universality of the equilibrium concept has been challenged (e.g.39) it generally approaches the status of canonical law.

Ignoring interdependencies of flood drivers, and the mutual effects of river morphology implicitly promotes a simplified view of the challenges inherent to flood management. Nevertheless, significant effort is still needed to determine how the connections between river conveyance and other flood-drivers look like under different boundary conditions, such as underlying climate, water and sediment characteristics, and in response to internal thresholds and feedbacks.

This paper proposes a framework to investigate the circular causality of channel and floods dynamics, to identify those variables that take part in the behaviour of the river system. It aims to conceptualise and quantify processes and feedbacks between flood properties and rivers. The novelty of the framework is that it considers the broader connectivity between hillslope, floodplain, and stream channel to advance our understanding of flood risk. We propose an analysis that includes historical field-measured flow data, and benefits also from current remotely sensed topographic information.

By characterising the longitudinal river variability together with atmospheric, hydrologic and geomorphologic indices, this research contributes to: (i) a broader discussion about the role of compound drivers in flood trends (ii) identify what catchment‐ and reach‐scale factors influence the interaction among flood drivers, (iii) understand the importance of including river properties and geomorphology in flood analysis.

This study is not meant to develop a general model relating different properties, but rather to document the possible existence of an effect. We offer a starting point upon which to build future research, including a more extensive number of samples and variables. Flood studies typically separate slowly varying boundary conditions of the Earth System (by using static channel properties measured at the outlet of the watershed) from the fast varying hydrological processes (representing the full range of flows). We suggest this approach may no longer be appropriate for estimating future flood risk in a changing world - i.e. for critical infrastructure with a horizon typically within 10–30 yrs.

The remainder of this paper will describe in detail (as independent parameters) the variability of river geometry, connectivity, and flows across basins of interest, as a critical first step in characterising the historical, modern, and future behaviour of these fluvial systems (Chapter 3). Chapter 4 will describe the interdependence of variables. Chapter 5 will show how this transdisciplinary framework making use of remote sensing and historical measured flow allows for a better understanding of flood risk

Ultimately, quantifying the circular causality and understanding the interaction of the hydrologic and geomorphic drivers on flood will allow for revising and refining our estimation of future flood risk and thus leading to better climate adaptation and increase in the resilience of communities and critical infrastructure.

The Landscape Framework

The investigated landscape framework includes the variables and acronyms summarised in Table 1, and it considers:

  1. (I).

    Hydraulic Scaling Function(s) (HSF), as power-law functions representing the longitudinal variability of river bankfull geometry. HSFs provide information on the overall river morphology and storage capacity40. They offer a quantitative description of how channel width and related properties (depth, velocity) vary with changing discharge along the river course (‘downstream’ hydraulic geometry) and between rivers at a comparable discharge frequency. Specifically, we focus on the coefficients of the power-law that relates bankfull-channel dimensions (width wbkf) to drainage area (ALidar) (Chapt 7.1, and 1.2 in supplementary material).

  2. (II).

    The concept of topographic ‘sediment connectivity’ (IC), understood as ‘the continuity of sediment transfer from a source to a sink’, as a new framework to unravel provenance, pathways and fate of sediments, as well as variability of erosion rates within a catchment41,42 (Chapt 7.2, and 1.3 in supplementary material).

  3. (III).

    River flows and characteristic properties of the daily flow distribution43,44, and the decadal trend of exceedance of specific flow quantiles38 (Chapt. 7.3 and 1.5 in supplementary material)

  4. (IV).

    Climatologic characteristics of precipitation, investigated in the form of daily Concentration Index (CI, Chapt. 7.4 and 1.4 in supplementary material). This index offers an indicator for temporal precipitation distribution, and it allows an assessment of seasonal precipitation changes. We will generally refer to ‘climate’ speaking about this specific element.

Table 1 Acronyms considered in this study, their meaning and measurement units.

River geometry

Current studies in geomorphic drivers of floods38 focus on variability considering channel measurements at a single site. Yet, the channel width is related in a remarkably regular way to bankfull discharge as it varies along the river course45,46,47. The average coefficients obtained for the rivers from Lidar (Fig. 2a) are α = 3.6 ± 2.3 and β = 0.39 ± 0.21 and they are in line with the regional HSF reported in the literature [1.0 < α < 5.83 and 0.1 < β < 0.6]48,49,50,51,52,53,54. They also well capture field-surveyed bankfull widths at the outlet [standard deviation of percentage error ±10%, average absolute error ~0.60 m, standard deviation of absolute errors ± ~2 m], with higher accuracy (15%) than downstream regressions generally obtained from field survey (around 30%).

Figure 2
figure 2

The wide range of flows that each watershed experiences leads to the formation of a unique degree of adjustment of bankfull channel geometry (α and β). The variability of the coefficients across watersheds suggests that there are a set of width-drainage area values (and therefore, possibly, discharge) that are shared by cross-sections across watersheds (a), and that paired scaling and exponents exhibit very strong semi-log relationships (a straight line when one variable is logged and the other variable is not) over the investigated region. Displayed plots show: (a) local HSF for each watershed as compared to the regional regression from field survey (Dataset S2 supplements): (b) coefficients of the HSF (α and β) in the log-linear domain, with the derived regression as compared to those of the Snake, Missouri, Connecticut, Clark Fork, Chattahoochee and Mohawk rivers 60. Points and HSF provided by our study are colour-coded according to the watershed drainage area (from blue to red for increasing values).

By aggregating local HSF parameter pairs from many distributed locations and plotting the coefficients in a log-linear domain55, it is possible to identify a regional trend that does not follow a downstream direction [slope = −0.70 and intercept = 0.73 (RMSE = 0.013 m, R2 = 0.78) (Fig. 2b)]. Despite being derived using drainage area rather than discharge, the log-linear equation for this study well overlaps with literature values55, presenting, however, steeper slopes for the watersheds within the Connecticut river domain (Fig. 2b). These differences pose more emphasis on exploring the correspondence between HSF and discharge-area relationship56 to investigate the causes for a significant departure from linearity in the log-domain.

Flow variation plays a vital role in channel evolution and maintenance. The wide range of flows that each watershed experience leads to the formation of unique bankfull channel geometry. In turn, observing the degree of the adjustment process (coefficients of the HSF, Fig. 2a) implicitly contains information about the combined effects of erosion and deposition of both the channel and the floodplain, which should be considered as parts of a single unit, and further contains a description of the variability of flood-river interactions along the river course. Comparing HSF coefficients for the considered watersheds (Fig. 2) suggests that there are a set of width-drainage area values that are shared by cross-sections across watersheds. The range of values highlights the existence of a ‘geomorphological allometry’57 [static changes in stream channel size, relative to similar values of drainage area, at a variety of watershed scales]. Generalising, the coefficients of the HSF represents the implicit signature of additional variables, other than drainage area, including flow regimes, regional climatic and physiographic factors, geological characteristics, the responsiveness of the catchment, and human activities across the investigated watersheds.

With field surveys, the uncertainty in the determination of a bankfull width and discharge is significant58. As well, the determination of bankfull discharge frequency is not easy because the classic hydrological methods developed for floods reach their limits. For flood hazard estimation, the proposed Lidar-based analysis leads to the conclusion that only one of the two traditional calibration parameters of the HSF may suffice to infer bankfull widths from the drainage area if a regional trend is well constrained. Channel geometry alters flood hazard locally38, yet the proposed investigating HSFs across watersheds suggests that transient changes in channel dimensions at a point are minor compared to the variability of downstream trends. Investigating the relationship between HSF coefficients and other flood drivers will allow i) investigating the response of a river system to climate and landform settings (sediment connectivity) and ii) diagnosing the causal agency of river channel and flow properties.

Sediment connectivity

This chapter evaluates the concept of sediment connectivity as a proxy for the processes involved in sediment transfer across multiple scales.

The watersheds present different degrees of connectivity to the investigated reach (Fig. S2 supplements, Fig. 3), from fully linked to fully unlinked [low-connected (low and medium-low) areas covering from 25% to 70% and highly connected areas (Medium-High and high) from 29% to about 70%]. More efficient sediment connectivity is observed in smooth steep catchments rather than from dissected or stepped landscapes [as also in (Baartman et al., 2013)]. Nevertheless, various landscapes features, such as drastic changes in slope (Fig. 3b, 01174900 –Cadwell Creek near Belchertown, MA) or human activities (01199050 – Salmon Creek, CT, Fig. 3d) increase landscape connectivity (connectivity index -IC- within the high cluster, as shown in Fig. 3a,c respectively), determining the amount of sediment to be potentially delivered to the river reach, and therefore, possibly, influencing flood risk.

Figure 3
figure 3

Various landscapes features [high slope (b), and agricultural fields (d)] increase landscape connectivity (more extensive areas with high values of the connectivity index -IC-), determining the amount of rainfall to be transformed into runoff and sediment to be possibly delivered to the river reach, and therefore influencing flood risk. Examples are shown for the Cadwell (a,b) and Salmon Creek (c,d) watersheds. The figure was arranged using ArcGis 10.7 [www.arcgis.com]. Slope was evaluated using customized scripts in Matlab 2018b [https://www.mathworks.com/release2018b], while the connectivity index was derived using the stand-alone tool by133.

Notwithstanding the intra-watershed variability, within each watershed, large portions of the landscape tend to present low values of connectivity (they are exceeded >70% of times, Fig. 4a). Clusters of disconnected landscapes can be seen [decreasing slope in the Cumulative Distribution Function -CDF-, where the gradient increases again later], highlighting the existence of diverse (across watersheds) but consistent (within watersheds) morphodynamics units, acting as sinks of sediments. The high variability of connectivity appears independent from the scaling (drainage area).

Figure 4
figure 4

Cumulative (CDF, (a) and complementary cumulative distribution (cCDF, b) function for the connectivity index (IC), obtained for each watershed (colour-coded according to the watersheds drainage area, from blue to red for increasing values). Within each watershed large portions of the landscape tend to present low values of connectivity (a), while generally smaller extents show comparable high connectivity (b). Clusters of disconnected (a) or connected (b) landscapes can be seen [decreasing slope in the CDF or cCDF, where the gradient increases again later], highlighting the existence of diverse (across watersheds) but consistent (within watersheds) morphodynamics units, acting as sinks or source of sediments.

The complementary CDF (cCDF, Fig. 4b, that describes the probability that a particular value for a random variable will be exceeded) highlights that fully connected landscapes generally cover the lowest extent of the watershed. Nevertheless, clusters of high connectivity values, possibly representing various morphodynamics units, can be seen for the connected landscape as well (Fig. 4b, decreasing slope in the cCDF where the gradient increases again later). Whether these clusters are proximally located to potential sediment sources upstream, channel aggradation might be likely if sediment supply and runoff increases from upstream areas. Channel bed aggradation might also induce further channel avulsions as it forces floodwaters across the floodplain.

Notwithstanding essential assumptions of the effectiveness of local measures on inferring catchment related erosion rates, monitoring potential sediment delivery at a river section can provide an order of magnitude of erosion and depositional processes within a catchment59,60. The presence and spatial configuration of different linkages and blockages as well as their capacity to store or remove sediment within different spatial scales, as highlighted by Figs. 3 and 4, determines the efficiency of the sediment cascades61,62 and the potential effects on flood hazard. We can expect catchments with a higher rate of connectivity, and in sediment-prone landscapes, to be more prone to dramatic floodings. When and if sediments are delivered, a single flooding happening at the local scale can be affected by the presence of sediments, which are moved by water from the main channel and deposited along the floodplains, changing the channel capacity and therefore driving to a possible increase in the flood risk and the residence time of water on floodplains63.

Flows

The distribution of daily specific discharge (Fig. 5) for the analysed catchments presents considerable variability in low and high flow values. Low flow conditions exhibit a distinct separation between the cold and warm colours (Fig. 5a), suggesting that catchments with larger drainage area exhibit higher values of specific discharge. Watersheds showing a CDF highly different from the others (i.e. 01187300 and 01109070 in Fig. 5b) present lakes and wetlands, and coarse soils. These ‘wetter’ catchments might have a damping effect on water variability and dry periods, and they have a greater ability to maintain soil moisture, store precipitation and thus increase base flows43.

Figure 5
figure 5

Cumulative (CDF, (a) and complementary cumulative distribution (cCDF, (b) function for the specific discharge, obtained for each watershed (colour-coded according to the watersheds drainage area, from blue to red for increasing values). The CDF highlights the existence of ‘wetter’ catchments (i.e. 01187300 and 01109070) with greater ability to maintain soil moisture, store precipitation and thus increase the base flow. The cCDF provides a compact signature of a catchment functioning that appears to be not related to changes in watershed scale (warm and cold colours in (b) are not well separated), but rather to elements such as land-use (i.e. urban areas and agricultural lands rather than forested watersheds).

The cCDF (Fig. 5b), as flow duration curve, provides a compact signature of a catchment functioning. The analysis shows that flow signature is not related to changes in watershed scale (warm and cold colours in Fig. 5b are not well separated), but rather to elements such as land-use (i.e. urban areas and agricultural lands rather than forested watersheds).

Collectively, the information derived from the comparison of the specific discharge distributions among watersheds testifies that runoff generation (e.g. surface and subsurface contributions) varies considerably across the different catchments.

Almost 90% of the sites (14/16) show increasing trends in low-flow exceedance (Fig. 6a). Nearly 70% of them (11/16) show significant increasing trends in exceedance of extreme flows (Q95) as well. Flow frequency trends for extreme flows (an average decadal increase of 6.6% and decrease of 3.0%, Fig. 6b) are typically larger than that of low-flows (an average decadal increase of 2.2% and decrease of 0.9%, Fig. 6a).

Figure 6
figure 6

Flow frequency mean-unbiased exponential trend (in units of exceedance of Q30 (a) and Q95 (b), n/yr) versus year obtained for each watershed (colour-coded according to the watersheds drainage area, from blue to red for increasing values). The slope of each curve identifies increasing (positive) or decreasing (negative) trends during the years. These curves demonstrate that nonstationarity in flood frequency is common. Trends in exceedance of low (a) and extreme flows (b) are widespread across scales and landscape units, challenging existing paradigms of flood frequency analysis and channel design.

About 30% of the watersheds shows asynchronous trends in flow exceedance (decadal increasing in exceedance of low-flows vs decadal decrease in extreme flow exceedance). Statistically significant low-flow trends are nearly three times more common than statistically significant high flows (43% vs 12%), suggesting that trends in low flows might be more widespread and easier to detect over decadal time scales.

These findings suggest that flood hazard is generally nonstationary, and undermines most efforts to characterise flood hazard over decadal time scales by fitting theoretical probability distribution functions to historical flood records38.

The Interdependence of Variables

Connecting changes in river flood hazard to its drivers requires two main assumptions: (i) the resulting signal is a mixture of component signals, and (ii) the patterns of the component signals are known to some degree14.

For this chapter, the observed signal will be the trend in exceedance of Q95 (xQ95), while the component processes are flow properties (Q), landscape properties (connectivity -IC-, physiographic section Phsec), climate (CI) and river geometry (α,β), whose correlation can be considered to be a fingerprint associated with many catchments within the region. As a consequence, these variables can be used for regional flood change investigation, rather than for attribution in individual catchments.

Flow, landscape, climate and river properties identify dominant processes in flood hazard, and the strength of correlation amongst them (lower triangular matrix in Fig. 7, where values of correlation -dCor- close to 0 represents no correlation, and 1 represents perfect correlation) determine the strength, speed and spatiotemporal variability of the rainfall-runoff response. As a consequence, the interaction between them reinforces or offsets the decadal trends of exceedance of extreme flows (xQ95, upper triangular matrix in Fig. 7, where shades of red represent progressively increasing decadal trend, white represent no trend, and shades of blue represent decreasing trends).

Figure 7
figure 7

Atmospheric (CI), landscape (IC, Phsec), river (α,β) and flow (Q, xQ) properties are correlated to different degrees (values of correlation -dCor- close to 0 represent no correlation, and 1 represent perfect correlation) [Lower triangular matrix in the image; shades of blue highlights the correlation strength, from weak (light) to strong (dark)], with some strong non-linear dependence emerging between specific variables [dCor values marked with ** are statistically significant at the 0.05 level, values marked with * are significant at the 0.1 level, values marked with ° are significant at the 0.2 level.] The interaction between variables [open circles in the upper triangular matrix] reinforces or offsets the decadal trends of exceedance of extreme flows [upper triangular matrix; shades of red represent progressively increasing decadal trend, white represent no trend, and shades of blue represent decreasing trends]. We used a kernel density estimation (KDE138,) to display in a non-parametric way the decadal trend for each 2 way combination of landscape indicators, using the start positions of each variable within the scatterplot. The correlation strength allows to define cluster of variables (dendrogram) that are more closely related. Elements that are merged at a lower height (shorter branches in Fig. 7) are more similar (dependent) than Elements that merge at greater height. It is possible to identify four separate clusters, including flow properties (A), reach properties, climate and flood hazard (B), morphodynamics properties (C), and scale-dependent parameters (D). Figure was arranged using customized scripts on Rstudio Version 1.2.1335. See Data and software availability for details.

One characteristic fingerprint emerging is that of ‘drainage allometry’: drainage density, drainage frequency, watershed order, the length of the analysed channel, and length of the longest trunk of the network presents strong statistically significant changes relative to similar values of drainage area, at a variety of watershed scales (Fig. 7).

Flow properties also well represent a specific fingerprint, displaying strong correlations (Q95, Qmean, QCV, Qbkf) (Fig. 7).

Further fingerprint is given by sediment connectivity at the catchment scale (ICmean, IC30, IC95) and physiographic settings (Phsec). The dispersion of sediment connectivity (ICcv), however, appears to be mostly related to scale (Alidar), with a nonlinear behavior and smaller CVs in larger catchments (Fig. 7).

Reach scale connectivity (the percentage of high or low connection to the investigated stream ICH% and ICL%) is strongly related to the trend in low flows (xQ30) and channel geometry (a,b).

The distance correlation confirms the non-linear nature of the relationship between the coefficients of the HSF (α and β, as shown previously in Fig. 2). The HSF coefficients also appear to be related with some network properties, connectivity and the decadal trend in extreme flows (xQ95).

Low-flows exceedance trends are highly correlated with landscape properties (connectivity -IC-, physiographic section Phsec). Exceedance of extreme flows (xQ95) is conversely related to some network parameters (drainage density and frequency), climate (CI) and HSF.

It is crucial to notice, however, that it should never be concluded there is ‘no association’ just because a p value is larger than a threshold64. The use of the correlation distance in a clustering approach offers a different angle to observe the relationships among parameters. Fingerprints that are merged at a lower height (shorter branches in Fig. 7) are more similar (dependent) than fingerprints that merge at greater height. One can observe, for example, that the flow properties (A in Fig. 7) cluster together, and that reach properties, climate and flood hazard (B in Fig. 7) also cluster together, but these two clusters are well separated and not merged until the second to last step when there are four clusters.

This highlights that, if on the one hand, flood hazard signals essentially reflect observed changes in precipitation13,65,66,67,68, reach-scale connectivity and channel geometry might also adjust to rainfall erosivity, as well as to the repercussions of climate on water cycle.

One can also observe that morphodynamics properties (C in Fig. 7, connectivity -IC-, physiographic section -Phsec- and trends in low-flows) cluster together, as do the drainage allometry signature together with catchment scale connectivity dispersion (D in Fig. 7).

The morphodynamics properties cluster confirms how catchments geologic characteristics impact infiltration and thresholds of overland flows, modifying the landscape and determining connectivity to the network. As well, it highlights how feedbacks between landscape components (connectivity, physiographic regions) are a critical component of watershed discharge69.

The drainage allometry signature cluster confirms the fact that river networks have long been recognized as possessing self-similar structures over a considerable range of scales70,71,72,73,74,75.

The identification of fingerprint clusters (Fig. 7) offers the basis to identify multiple mechanisms that typically contribute to flood risk. These mechanisms can be characterized as a set of interconnected components (or network, Fig. 8, Table 2). Each objects (e.g., morphodynamics properties and scale-dependent parameters in Fig. 7), regimes (e.g. flow properties in Fig. 7) or phenomena (e.g. reach properties, climate and flood hazard in Fig. 7) are connected by fluxes of matter and energy, feedbacks, spatial or temporal sequencing or adjacency, statistical correlations, and process-response relationships.

Figure 8
figure 8

Components of the various clusters (Flow properties (A), reach properties, climate and flood hazard (B), morphodynamics properties (C) and scale-dependent parameters (D)) assume different roles within the network depending on the centrality measure considered [larger importance is highlighted by symbols with sizes proportional to the centrality measure: betweenness (a), closeness (b), degree (c), and strength (d)]. Mean connectivity and rate of dis-connectivity (ICmean, IC30 in a) have more control over the network, because more information pass through these nodes (highest betweenness). Physiographic properties, low-flows and climate (Phsec, Q30, CI in b) have more direct influence on other vertices (lowest closeness). The rate of channel adjustment, decadal trend in extreme flows and the rate of disconnectivity (slope of the HSF β, xQ95 and IC30 in (c)) are connected to lots of nodes at the heart of the network (highest degree), however the decadal trend in extreme flows (xQ95, d) is the node with the higher level of correlation (highest strength). Figure was arranged using customized scripts on Rstudio Version 1.2.1335. See Data and software availability for details.

Table 2 Flow properties (A), reach properties, climate and flood hazard (B), morphodynamics properties (C) and scale-dependent parameters (D) presents present different roles in the network (most important roles highlighted in bold). Clusters with higher betweenness centrality (C) have more control over the network, because more information pass through their nodes. Their role is related to the network’s connectivity, in so much as high betweenness vertices have the potential to disconnect graphs if removed. Assuming that vertices can only pass messages to or influence their existing connections, a clusters with low closeness centrality (B) means that are directly connected or “just a hop away” from most others in the network. In contrast, clusters in very peripheral locations may have high closeness centrality scores (D), indicating the high number of hops or connections they need to take to connect to distant others in the network. Degree centrality shows how many connections a cluster has. They may be connected to lots of nodes at the heart of the network (D), but they might also be far off on the edge of the network. Clusters with high strength (B) present connections with a higher level of correlation.

A first point that the network allows to discuss is the importance of network components for the whole network (betweenness). Clusters with higher betweenness centrality (such as the morphodynamics properties, C, in Figs. 7, 8a and Table 2) have more control over the network, because more information passes through their nodes. Among the nodes within this cluster, mean connectivity and rate of dis-connectivity (ICmean, IC30 in Fig. 8a) emerge as prominent. Their role is related to the network’s connectivity, in so much as high betweenness vertices have the potential to disconnect graphs if removed. Describing sediment connectivity in a landscape allows to consider the spatial organization of various physiographic units and their contribution to sediment production, transport and deposition, and emphasizes the importance of landscape features to enhance the transfer of water and sediment towards flooded areas.

Further aspect to consider relates to paths through the network, i.e., their length, cost, or routing capacity (closeness in Fig. 8b, Table 2). The shortest (geodesic) path is that which contains the smallest number of edges; it can also be a least-cost path in terms of the lowest cumulative cost or friction. Reach properties, climate and flood hazard are directly connected to most others in the network (low closeness, Fig. 8b, Table 2). Amongst the nodes of this cluster, physiographic properties, low-flows and climate (Phsec, Q30, CI in Fig. 8b) have more direct influence on other vertices (lowest closeness).

In this complex network, it is important to identify what nodes are connected or reachable from one another. The degree of a node is the number of edges that connect it to other nodes. If the edges are weighted, and the weight attribute is considered in adding up incoming and outgoing edges, the corresponding node property is its strength. Scale-dependent parameters (D in Figs. 7, 8c, Table 2) may be (in average) connected to lots of nodes at the heart of the network, but they might also be far off on the edge of the network. However, the rate of channel adjustment (slope of the HSF β), decadal trend in extreme flows (xQ95) and the rate of disconnectivity (IC30 in Fig. 8c) are the nodes with the highest degree. The same nodes have increased importance in terms of strength (Fig. 8d), with the decadal trend in extreme flows (xQ95, d) being the node with the higher level of correlation (highest strength).

Centrality depends on the way the graph-theoretical model of flood drivers is constructed, although even the simplest network representation, not taking directionality of flows into account, still provides a coarse-grained assessment of the most important nodes according to their contribution to the regional variability, and highlights how trends in flood hazard (xQ95) are the element connected to lots of nodes at the heart of the network. Their role implies that spreading power is determined by both the degree of the node and the degree of its neighbors (nodes of the reach properties, climate and flood hazard cluster). This implies that changes in climate, channel geometry and the degree of connectivity to the investigated river are prominent, and they have multiple alternative ways to modulate and transfer changes to flood hazard.

The observed network (Fig. 8) is (intentionally) undirected. Future analysis of network components should include directional edges, to identify sets of connected entities within a database containing many watersheds distributed across areas and at different points in time76. Similar networks/sub-networks, as appropriate, should also be formulated for different temporal and spatial scales of interest. This would provide a framework for assessing the historical contingency and stability of the network, and would allow to understand how contemporary features are sensitive to changes in an environmental characteristic of the past.

River and flood: processes and feedbacks

The investigated landscape shows geomorphological allometry (Fig. 2), clusters of morphodynamics units (Fig. 4), signatures of a catchment functioning (Fig. 5) that are linked to trends in exceedance of low and extreme flows (Fig. 6) widespread across scales and landscape units, challenging existing paradigms of flood frequency analysis and channel design.

Hydraulic forces, climate or geomorphic variables, if considered alone, are not sufficient to explain the flood response (Figs. 7 and 8) [see e.g.4,29,77,78,79,80].

Non-linear relationships, depending on which processes dominated under a particular hydrologic regime, emerge across watersheds (Fig. 7)4,14,81. The long line of recent major floods across the world highlights risks of new climate reality5,6,13,82. Nevertheless, climate impact on flood hazard is complex and depends on the river flood generation mechanism, and difficulties exist in disentangling the climatic component from substantial natural variability and direct human impacts on flows78,83.

Looking at climate (Fig. 7), higher precipitation concentration (CI) represented by greater percentages of the yearly total precipitation in a few rainy days, has the potential to cause flood and it is indeed correlated with higher discharge (Q95) and positively correlated with increasing trends in flood hazard (xQ95). Nevertheless, increasing trends in flood hazard also depend on the overall river morphology and storage capacity, as well as landscape connectivity (cluster C in Figs. 7, 8, Table 2). Observing climate together with other variables shows high temporal concentration of precipitation (high CI) is related to increasing trends in flood hazard especially when channels narrows (decreasing exponent of the HSF), or if sediment connectivity increases CV.

More information pass through the nodes representing sediment connectivity (highest betweenness). Sediment supply can change with altered connectivity upstream and changes in hillslope–channel coupling60,84,85. Seasonal timing and sequence of events can affect the watershed response: extreme rainfall events can lead to significant soil loss86,87, and modify depression storage changing the connectivity of overland flow88, with implications for downstream flood risk and sediment‐related flood damages33,63,89.

Considerable narrowing and decrease in channel conveyance over short timescales might substantially increase the potential for floodplain inundation, as increasing trends in exceedance of extreme flows are registered for rivers with lower values of the HSF coefficients (Fig. 7). This might be even more evident for landscapes where rivers present lower capacity (small exponent of HSF) and high sediment delivery potential to the investigated channel (high levels of ICH%) (Fig. 7), or when watersheds present a very extensive network (high Dd in Fig. 7) but with low level of conveyance in their main reach (low b values).

Among the catchment-scale properties, drainage network structure also emerged as having a critical role (drainage density, drainage frequency, Figs. 7, 8). One must consider that river planform structure is one of the elements mostly modified in anthropogenic landscapes28,29,30,31. The centrality of these parameters in the graph, and their correlation with decadal trends in flood hazard, hints to the fact that these changes that might happen relatively quickly compared to the long-standing life of a river, might be a very sensitive trigger to further flood-feedbacks.

From a network perspective (Fig. 8), longitudinal channel adjustment might be able to disperse changes quickly to many other components of the network (low closeness and high strength/degree). This strengthen the idea that river geometry is not a static collector that accommodate and convey (or otherwise) the runoff generated by precipitation distributions. Instead, it dynamically adjusts to/and adjusts flows, which means that altered channel properties due to changes in drivers will, in turn, alter the risk of future flooding (Fig. 1). One should consider that channel geometry adjusts to climate34,90,91 or anthropogenic pressure15,29,92, and according to channel-maintaining93 and channel-changing discharges26,62,94,95. This form of feedback means that channel geomorphic response could cause a legacy of altered flood risk, that might be comparable to extreme events that might occur in the future.

The analysis further indicates the importance of morphological variables and trajectory of river longitudinal adjustment as compared to different drivers or receiver of changes. Sediment connectivity emerges as a potentially critical factor (Fig. 7, Fig. 8) [as highlighted by62,89,96], being connected to both changes in channel properties (HSF), and increasing decadal trends in flood hazard, independently from scaling (Fig. 7). The network and clustering analysis underlines how trends in flood events are ultimately governed by a balance of energy associated with atmosphere, flows of water combined with erosion, and transport.

The Way Forward

Identify relationships between flows, catchment drivers, and downstream hydraulic geometry is currently complex. The current existing relationships are hard to compare exhaustively because they are derived from extensive field surveys or gaging stations that changed during the years, they are estimated using hydraulic scaling principles developed for larger regions, and they have been derived with varying flow levels in different studies, and through several different methods of fitting lines to relationships.

The proposed approach produces a robust and consistent estimate of longitudinal river adjustment across multiple watersheds. This describes how the dynamic properties of a river channel accommodate increases in discharge. Nevertheless, the capacity of a flood to modifying or being modified by channel properties is also strongly influenced by the availability of sediments in the landscape. With this in mind, sediment data might not always be available or be very hard to determine. This work shows how investigating the topography-based index of sediment connectivity allows providing additional information into the variability of flood hazard.

The HSF and sediment connectivity within this framework are derived from a Lidar capturing the landscape at one point in time, but the grouping of data indicates that neither general climatic setting nor the size of the drainage basin has any consistent influence on the downstream relations or the value of connectivity. HSFs do, however, seem to be related to changes in discharge and potential sediment load, and both parameters essentially correspond to an integral result of the long-term history of processes that have acted over the landscape. Overall, our findings suggest that climate or landscape alone do not explain changes in flood hazard. Rather, they act together with sediment connectivity and river network properties, fundamentally altering the frequency and magnitude of flood events.

The nature of non-linearity of the analysed relationships suggests that one should also consider that there is significant variability in the response of different geomorphic and extreme events across watersheds. While all considered watersheds will likely respond to extreme climatological events, they will react differently to the same magnitude of forcing, and the same geomorphic system may itself respond differently, depending on its condition at the time of the forcing.

Trends in flood hazard are often addressed considering climatic, hydrologic or geomorphological processes as independent flood drivers. Our results highlight that this is not enough to understand hazards that are intertwined with the complex dynamics of river systems and hillslope processes. Studying independent drivers alone might be insufficient and misleading for projecting flood risk over long timescales, especially when shifts in river morphology may be significant. Although it is widely acknowledged that flood change may be caused by several drivers that act at the same time, multidriver attribution studies are rare. In this paper we suggest a holistic view of flood risk, where we differentiate between drivers representing the three compartments potentially responsible for river flood change: atmosphere, landscape, and river conveyance. Our findings pinpoint the importance of considering river longitudinal variability and sediment properties in drawing flood trends, suggesting that these elements together with atmospheric and flow drivers, should be considered in modelling future scenarios and drawing associated management strategies.

Material and Methods

The study considers sixteen watersheds in Connecticut and Massachusetts (USA, Fig. 9). The selected catchments belong to the physiographic division of the Appalachian Highland, and the physiographic province of New England. Most of them belongs to the New England Upland section (01187300, 01181000, 01171500, 01169900, 01174600, 01174900, 01163200, 01175670, 01184100, 01096000, 01170100), five sites belong to the Seaboard Lowland (01101000, 01105870, 01109070, 01109000, 01105600) and one to the Taconic section (01199050).

Figure 9
figure 9

study areas and available lidar data, and drainage network with Strahler order. USGS Station ID is reported for each watershed, as described in dataset S1, S2 in the supplements. The bar near each watershed is scaled to 1 km. Acquisition dates for the Lidar range from 2010 to 2015 (for the sites in Massachusetts) and 2016 (for the sites in Connecticut). The declared vertical accuracy for the data range between 0.06 and 1.10 m. The figure has been arranged using Matlab 2018b [https://www.mathworks.com/release2018b].

Available information for each watershed include (i) field surveyed bankfull width and drainage area at the outlet, bankfull discharge and its return period (Dataset S1, supplements, retrieved from97); (ii) daily discharge (Q) records for United States Geological Survey (USGS) gauging stations, (iii) Lidar high-resolution topography, in the form of Digital Terrain Models (DTMs) (Fig. 9).

Additionally, for each watershed, according to a geomorphologically extracted network98 (Supplement, Chapt 1.1), we defined drainage density Dd, the watershed order Ω, drainage frequency (Df) and the length of the main stem of the network, where the main stem is defined as the network reach that at each junction drains the greatest portion of the watershed (Dataset S1, supplements). These parameters are often used to compare basins of different size and to establish catchment‐scale hydrological parameters99.

This chapter offers a brief description and an explanation of the rationale behind the choice of the various drivers. The reader should refer to the supplementary materials for a detailed description of the considered techniques.

River geometry

Aspects of the river channel that relate to sediment, planform, and flow resistance are key to understanding conveyance capacity and overbank flooding potential100, as conveyance is reflected in the meander length and slope of the river channel100.

When trying to understand the linkage with flooding, an important parameter to focus on is the bankfull hydraulic geometry, which provides information on the channel’s morphology and storage capacity40. This concept emerges from the field evidence that rivers are in a perpetual state of flux and constantly adapt to recent floods and changing sediment loads101. The study of hydraulic geometry has been prominent starting from the 1960s and highlighted how channel capacity and widths scale with bankfull discharge, the latter typically being the discharge with a one to two-year recurrence interval40,47,102,103,104,105,106,107,108. While regime models based on a single flood discharge yield meaningful predictions of average conditions in many systems, formative flow and effective discharge diverge in some circumstances101,109,110,111, calling into question the veracity of the underlying assumption of equilibrium between a single discharge (however, it is defined) and the reach‐average channel dimensions. Differences across watersheds regarding flow regimes, catchment size, regional climatic and physiographic factors, geological characteristics, the responsiveness of the catchment, and human activities40,47,102,103,104,105,106,107,108 can be described observing adjustment of hydraulic geometry as power-law functions (Hydraulic Scaling Function HSF) relating bankfull properties and flows in the downstream direction along the river profile.

This paper focuses on HSF derived automatically from Lidar DTMs112,113 (Supplement, Chapt 1.2, Dataset S2). The proposed lidar-based algorithm uses a statistical approach to delineate threshold landscape curvatures114,115,116 for defining the hydrologic floodplain where the river can flow (overall valley shape)98,112,113,117, and thresholds of local curvature for defining homogeneous reaches along the river and to measure their bankfull width.

Sediment connectivity

Among catchment characteristics, an important catchment driver that enhance our comprehension of landscape processes and water modeling is sediment connectivity61. As a proxy, we investigated the topography-based index of sediment connectivity (IC42,118) (Supplement, chapt 1.2). This index provides information about the percentage of the topographically defined catchment area that contributes to sediment runoff, and thus relates to the geomorphic effectiveness of floods. In particular, IC evaluates the potential connection between hillslopes and features (in this case the investigated channel), which act as targets for the potentially transported sediment. The main advantage of this index is that it considers impedance to sediment flows according to a weight factor that can be based on different methods. For a detailed analysis, we considered different parameters of the IC, including the mean, the 30 and 95 percentiles, the coefficient of variation and the percentage of areas with different degrees of connectivity (Dataset S2 in the supplement).

Flows

Understanding flow signatures as they relate to catchment properties summarize river flow dynamics119, offering a way to understand flood generation processes and hence their flood frequency response43. Different parts of the flow spectrum have different degrees of influence on hydro-geomorphic processes. High-magnitude low-frequency flows, associated with overbank flows and thus flood hazard are often relevant to significant and abrupt geomorphic changes. Low-magnitude flows determine habitat conditions for the survival and the functioning of indigenous riverine biota and therefore are relevant to long-term changes in eco-hydrology of river systems; and intermediate events which, being formative events, conduct most of the geomorphic work and are in dynamic equilibrium with the current morphological and geometric configuration of a given channel reach. Flow signatures can be identified by several different properties of the river flow120. In this work we focus primarily on characteristic properties of the daily flow distribution. Specifically, to characterize hydrologic flow regime in each catchment, we estimated the low (30th), high (95th), average and coefficient of variation of discharge and the bankfull discharge estimates. To permit comparison among catchments of different drainage area, we expressed flow quantities as specific discharge (i.e. flow normalized by catchment area). The flow data were retrieved from all current and discontinuous water data for the study sites from the USGS National Water Information System (NWIS) and corresponded to various record lengths but with more than 30 yrs of record for all stations (Dataset S2 in the supplement).

Climate (Rainfall)

Precipitation is one of the most significant water cycle components and the primary driver for floods. Any changes in precipitation (as direct water input) and temperature (as controlling factor of snowmelt and evapotranspiration) impacts the magnitude-frequency of water discharges entering a channel reach, possibly leaving a signature on river properties. Precipitation concentration is an important parameter of the climatic description of a single location or region, which complements the information provided by other, more common, variables such as the annual precipitation or the seasonality121. The CI (Concentration Index)122 is able to relate the magnitude of the precipitation events to the time period in which they occur. The CI can act as an estimator of rainfall erosivity123 as well as of the repercussions of climate change on water cycle124 and floods78. For this reason, the analysis of daily precipitation concentration is an important aspect to consider in order to evaluate both hydrological and geomorphological drivers linked to flood hazard. For this work, we will consider the CI values published by125 (Dataset S1 in the supplement) to characterize the climatology of rainfall intensity regime in our study areas and will thus refer to CI when we mention “Climate”, and we refer to a climatologic characteristics of precipitation.

Trends on flow exceedance

We considered trends for two distinct flow thresholds representing low flows (Q30) and high flows (Q95). For each site, we estimated the Q30 and Q95 values and counted for each year the total number of events exceeding the considered threshold for at least 5 consecutive days. Trends in flow exceeding Q30 are considered as a proxy of changes in low flow (baseflow) regime, while trends of events exceeding the Q95 thresholds are considered in this study as a proxy of trends in “flood hazard”, acknowledging, however, that the events identified in this way do not necessarily lead to water outside the river banks (i.e. flood inundation).

For identification of temporal trends in flow exceedance and their significance, we applied a mean unbiased exponential least squares curve38, which allowed us to avoid predicting negative values.

Connectivity and flow distribution analysis

Statistical analysis of connectivity and flow properties is carried out to identify similarities and differences and classify hydrologic regimes and landscapes across the analyzed watersheds. When one wants to estimate the probability of a flood and develop suitable means to mitigate its effects, one needs to determine a threshold value and calculate the probability that the river discharge data do not exceed the chosen threshold value126 or equivalently estimate the probability that the threshold will be exceeded127. Similarly, to identify fully disconnected landscape, one has to estimate the probability of a connectivity value, determine a threshold value (i.e17,60) and calculate the probability that the IC does not exceed the chosen threshold value.

In statistics, the probability of non-exceedance is expressed based on the cumulative distribution function (CDF), while equivalently the probability of exceedance is expressed using the complementary CDF (or cCDF), which is also termed “flow duration curve” in hydrology44. In this work we present both CDF and cCDF because they allow us to visually emphasize the left and right hand tails (i.e. extremes) of the distributions.

Interdependence of variables

Recognizing the potential impact of outliers in deriving relationships based on a relatively small sample size, we emphasize that this study is not meant to develop a general model, but rather to carry out a forensic investigation and document the possible existence of a relationship among the different factors examined. On the long run, this work will offer a starting point to aggregating enough studies to provide a clearer understanding of the complex interactions under analysis. In the process of changing the hydro-geomorphological organization, systems traverse various phases, following (possibly) non-linear rules. To investigate the relationships between variables, we used the distance correlation index, dCor128. This index has the advantage of being able to capture non-monotonic associations and ensure that the correlation is zero if and only if the variables are statistically independent, which is not the case for, e.g. Pearson’s product-moment correlation. The range of dCor is 0 to 1, and the closer dCor is to 1, the stronger the dependence between the variables. The statistical significance was assessed using the distance covariance test with 1000 replicates128, and significance was tested at the 5, 10 and 25% significance level.

The distance correlation has been then further used as a measure of dissimilarity (1-dCorr) to identify a hierarchical clustering of the different variables129. This analysis has been done to identify groups of variables mostly connected, rather than to define a model for the variables interaction.

The same dissimilarity measure has been used to create a undirected network130 to represent the system of interacting units in the hydrological and geomorphological realm. The network structure can visually formulate nonlinearity, solely based on the considered data without requiring knowledge about the underlying physical processes. The network has been characterized by some centrality measures, named Betweenness (the number of shortest paths that the focal node lies on), Closeness (the mean shortest path between a focal node and all other nodes in the network), Degree (the number of edges that connect the focal node to other nodes), and Strength (measures the strength of vertices in terms of the total weight of their connections).

Data and software availability

The considered Lidar data and discharge data are of public domain from the open web-services of131 and132. The climatic data were provided by125 and are freely available on GitHub. The analysis of sediment connectivity was carried out using the open-source codes provided on GitHub by133. The correlation analysis and clustering was performed in R using the package by134. Network analysis was also implemented in R using the package135. KDE in Fig. 7 has been evaluated using the R package by136 and interpolated using137. Flow trends have been evaluated considering the work of38 and the codes available on GitHub (https://github.com/LouiseJSlater/GRL2015). The codes for the network extraction and the HSF computation from lidar are available upon request to the main author of this paper.