Skip to main content

Advertisement

Log in

LAI estimation across California vineyards using sUAS multi-seasonal multi-spectral, thermal, and elevation information and machine learning

  • Original Paper
  • Published:
Irrigation Science Aims and scope Submit manuscript

Abstract

In agriculture, leaf area index (LAI) is an important variable that describes occurring biomass and relates to the distribution of energy fluxes and evapotranspiration components. Current LAI estimation methods at subfield scale are limited not only by the characteristics of the spatial data (pixel size and spectral information) but also by the empiricity of developed models, mostly based on vegetation indices, which do not necessarily scale spatiality (among different varieties or planting characteristics) or temporally (need for different LAI models for different phenological stages). Widely used machine learning (ML) algorithms and high-resolution small unmanned aerial system (sUAS) information provide an opportunity for spatial and temporal LAI estimation addressing the spatial and temporal limitations. In this study, considering both accuracy and efficiency, a point-cloud-based feature-extraction approach (Full Approach) and a raster-based feature-extraction approach (Fast Approach) using sUAS information were developed based on multiple growing seasons (2014–2019) to extract and generate vine-scale information for LAI estimation in commercial vineyards across California. Three known ML algorithms, Random Forest (RF), eXtreme Gradient Boosting (XGB), and Relevance Vector Machine (RVM), were considered, along with hybrid ML schemes based on those three algorithms, coupled with different feature-extraction approaches. Results showed that the hybrid ML technique using RF and RVM and the Fast Approach with 9 input variables, called RVM-RFFast model, performs better than others in a visual and statistical assessments of the generated LAI being also computationally efficient. Furthermore, using the generated LAI products in the quantification of energy balance using the two-source energy balance Priestley-Taylor version (TSEB-PT) model and EC tower data, the results indicated excellent estimation of net radiation (Rn) and latent heat flux (LE), good estimation of surface heat flux (G), and poor estimation of sensible heat flux (H). Additionally, TSEB-PT sensitivity analysis performed by regenerating LAI maps based on the generated LAI map (from − 15% of the original LAI map to + 15% with a 5% gap) showed that LAI uncertainty had a major impact on G, followed by evapotranspiration partitioning (T/ET), H, LE, and Rn. When considering the annual growth cycle of grapevines, the impact of LAI uncertainty on the T/ET in the veraison stage was larger than in the fruit set stage.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.

References

Download references

Acknowledgements

This study was possible thanks to support from USDA-Agricultural Research Service, NASA Applied Sciences Water Resources Grant NNX17AF51G and the Utah Water Research Laboratory Student Fellowship. The authors are also grateful for the extraordinary support from the Utah State University AggieAir sUAS program staff and E&J Gallo scientific teams for data collection support and analysis. The authors would like to thank Dr. Ayman Nassar for his preliminary work in TSEB model and footprint-area calculation; Wasim Akram Khan for helping with the computer parallelization; and Carri Richards for editing the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rui Gao.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix

Variables from the full feature extraction approach mentioned in Feature extraction approaches

See Table 4

Table 4 Variables considered in this study for the Full approach gained from the spectral, thermal, DSM, and point cloud data

Important variables supporting each ML algorithm mentioned in LAI model via the fast approach and Model selection for LAI estimation

See Tables 5, 6 and 7

Table 5 Identified important variables for each ML algorithm gained from the full feature extraction approach
Table 6 Identified important variables for each ML algorithm gained from the fast feature extraction approach
Table 7 Metrics for the evaluation of the RVM-RFFull and RVM-RFFast models in training and test datasets

The ANOVA and Tukey tests mentioned in Model selection for LAI estimation

The ANOVA and Tukey tests are used again to check whether the mean of different ML models, considering the input variables from both the Full and Fast approach, is statistically the same as the mean from the ground LAI measurements. Table 8 shows the results from both the ANOVA test and the Tukey test. The most important question at this point is whether the null hypothesis (the same as previously explained) is accepted when comparing the mean value of the ground measurements with the mean value estimated by the hybrid RVM-RF algorithm for both situations (input variables obtained from the Full approach and the Fast approach).

When evaluating the eight experiments based on the inputs from the Full approach, Table 5 reports different results compared with that shown in Fig. 7. Only three experiments, “RVM-XGBFull-Cover,” “RVM-RFFull,” and “RVM-XGBFull-Weight,” report the same mean value as the ground measurement. Evaluating the eight experiments based on the inputs from the Fast approach,

Table 6 shows a similar signal to Fig. 8. Except for the experiment called “RVM-XGBFast-Gain,” all of the combination models can provide the same mean estimation as the ground LAI measurements. Six experiments accepted the null hypothesis, which is more than when the Full approach is considered as the feature-extraction approach.

See Table 8

Table 8 ANOVA and Tukey test for the ML algorithms considering a few variables as the input (color figure online)

During the evaluation of each experiment, the most important point is verified: the null hypothesis is accepted for the hybrid RVM-RF algorithm for both situations. The difference shown in this table suggests that the ANOVA and Tukey tests are necessary as a confirmation tool to double-check the candidate ML algorithm.

Relevant locations mentioned in Model selection for LAI estimation for both RVM-RFFull and RVM-RFFast

Considering the characteristic of the RVM algorithm providing RVs, which have non-zero weights, from the training dataset (Tipping 2001; Fletcher 2010; Torres-Rua et al. 2012; Khader and McKee 2014), this section explores RV information based on the hybrid ML algorithm (RVM-RF). Since this structure provides accurate LAI estimation when considering the inputs from the Full approach and efficient LAI estimation when considering the Fast approach, the same process is explored under both situations: RVM-RFFull and RVM-RFFast.

RVM-RFFull model

Figure 12 shows the geographic distribution of the training dataset (red circles) and the RVs (green circles). The green circles obtained from the RVM algorithm illustrate where the ground measurements are more important. For SLM, the RVs concentrate on the right side of the field, and one RV is located on the bare soil area. RIP 760 shows a pair of parallel lines and more RVs from the south line. RIP 720 recognizes only two positions on the northeast corner as important locations, and one of them provides the RV three times (another is two times). BAR recognizes around seven out of 49 positions as important locations, and most of them come from the south block. Taking more ground LAI measurements at the green-circle sites shown in Fig. 12 (a large variability is shown at these locations) would be recommended in the future if the combination of the RVM-RFFull model is adopted.

See Fig. 12

Fig. 12
figure 12

The geographic distribution of the RVs from the RVM-RFFull model

RVM-RFFast model

Similar to Figs. 12 and 13 shows the geographic distribution of the training dataset (red circles) and the RVs (green circles). One difference compared to Fig. 12 is that no position provides RVs three times. SLM identifies more RV locations. RIP 720 and RIP 760 each provide six RV positions. BAR identifies five positions corresponding to five RVs.

See Fig. 13

Fig. 13
figure 13

The geographic distribution of the RVs from the RVM-RFFast model

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gao, R., Torres-Rua, A.F., Aboutalebi, M. et al. LAI estimation across California vineyards using sUAS multi-seasonal multi-spectral, thermal, and elevation information and machine learning. Irrig Sci 40, 731–759 (2022). https://doi.org/10.1007/s00271-022-00776-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00271-022-00776-0

Navigation