Abstract
Any large, multifaceted data collection that is challenging to handle with traditional management practices can be branded ‘Big Data.’ Any big data containing geo-referenced attributes can be considered big geospatial data. The increased proliferation of big geospatial data is currently reforming the geospatial industry into a data-driven enterprise. Challenges in the big spatial data domain can be summarized as the ‘Big Vs’ – variety, volume, velocity, veracity and value. Big spatial data sources can be considered in two broad classes, active and passive, as each is impacted to varying degrees. Some of these challenges may be alleviated by reducing unprocessed, or minimally processed, (raw) data to features, which we refer to as the extraction of Elements of Interest (EOI). In fact, many applications require EOI extraction from raw data to enable their basic employment. This chapter presents current state-of-the-art methods to create EOI from some types of georeferenced big data. We classify the data types into two realms: active and passive. Active data are those collected specifically for the purpose to which they are applied. Passive data are those collected for purposes other than those for which they are utilized, included those ‘collected’ for no particular purpose at all. The chapter then presents use cases from both the active and passive spatial realms, including the active applications of terrain feature extraction from digital elevation models and vegetation mapping from remotely-sensed imagery and passive applications like building identification from VGI and point-of-interest data mining from social networks for land use classification. Finally, the chapter concludes with future research needs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adhikari H, Valbuena R, Pellikka PKE, Heiskanen J (2020) Mapping forest structural heterogeneity of tropical montane forest remnants from airborne laser scanning and Landsat time series. Ecol Indic 108:105739. https://doi.org/10.1016/j.ecolind.2019.105739
Al Nuaimi E, Al Neyadi H, Mohamed N, Al-Jaroodi J (2015) Applications of big data to smart cities. J Internet Serv Appl 6:1–15. https://doi.org/10.1186/s13174-015-0041-5
Anders NS, Seijmonsbergen AC, Bouten W (2009) Multi-scale and object-oriented image analysis of high-res LiDAR data for geomorphological mapping in Alpine mountains. Geomophometry 2009:61–65
Arundel ST (2016) Pairing semantics and object-based image analysis for national terrain mapping – a first-case scenario of cirques. In: Kerle N, Gerke M, Lefevre S (eds) GEOBIA 2016: solutions and synergies. University of Twente Faculty of Geo-Information and Earth Observation (ITC), Enscede
Baatz M, Schäpe A (2000) Multiresolution Segmentation – an Optimization Approach for High Quality Multi-Scale Image Segmentation. In: Strobl J, Blaschke T, Griesebner, G (eds) Proceedings of Angewandte Geographische Informationsverarbeitung XII. Wichmann, Heidlelberg, pp. 12–23
Barbier G (2011) Social network data analytics. Soc Netw Data Anal. https://doi.org/10.1007/978-1-4419-8462-3
Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens 65:2–16. https://doi.org/10.1016/j.isprsjprs.2009.06.004
Blaschke T, Strobl J (2001) What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS. Geo-Inf-Syst 14:12–17
Blaschke T, Hay GJ, Kelly M, Lang S, Hofmann P, Addink E, Queiroz Feitosa R, van der Meer F, van der Werff H, van Coillie F, Tiede D (2014) Geographic object-based image analysis – towards a new paradigm. ISPRS J Photogramm Remote Sens 87:180–191. https://doi.org/10.1016/j.isprsjprs.2013.09.014
Boyd D, Crawford K (2012) Critical questions for big data: provocations for a cultural, technological, and scholarly phenomenon. Inf Commun Soc 15:662–679. https://doi.org/10.1080/1369118X.2012.678878
Cambria E, Rajagopal D, Olsher D, Das D (2013) Big social data analysis. Big Data Comput:401–414. https://doi.org/10.1201/b16014-19
Camiz S, Poscolieri M (2015) Geomorpho: a methodology for the classification of terrain units. In: Jasiewicz J, Zwoliński Z, Mitasova H, Hengl T (eds) Geomorphometry for geosciences. Adam Mickiewicz University, Poznań, pp 149–152
Cánovas-García F, Alonso-Sarría F (2015) A local approach to optimize the scale parameter in multiresolution segmentation for multispectral imagery. Geocarto Int 30. https://doi.org/10.1080/10106049.2015.1004131
Castilla G, Hay GJ (2008) Image objects and geographic objects. In: Object-based image analysis. Springer, Berlin/Heidelberg, pp 91–110
Chen CLP, Zhang C (2014) Data-intensive applications, challenges , techniques and technologies: a survey on Big Data. Inf Sci 275:314–347
Chen G, Weng Q, Hay GJ, He Y (2018) Geographic object-based image analysis (GEOBIA): emerging trends and future opportunities. GISci Remote Sens 55:159–182. https://doi.org/10.1080/15481603.2018.1426092
Csillik O, Evans IS, Drăguţ L (2015) Transformation (normalization) of slope gradient and surface curvatures, automated for statistical analyses from DEMs. Geomorphology 232:65–77. https://doi.org/10.1016/j.geomorph.2014.12.038
Cui K, Jiang Y, Li Y, Pfoser D (2019) A vocabulary recommendation method for spatiotemporal data discovery based on Bayesian network and ontologies. Big Earth Data 3:220–231. https://doi.org/10.1080/20964471.2019.1652431
Drăguţ L, Eisank C (2012) Automated object-based classification of topography from SRTM data. Geomorphology 141–142:21–33. https://doi.org/10.1016/j.geomorph.2011.12.001
Drăguţ L, Tiede D, Levick SR (2010) ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data. Int J Geogr Inf Sci 24:859–871. https://doi.org/10.1080/13658810903174803
Ehsani AH, Quiel F (2008) Geomorphometric feature analysis using morphometric parameterization and artificial neural networks. Geomorphology 99:1–12. https://doi.org/10.1016/j.geomorph.2007.10.002
Eisank C, Drăguț LD (2010) Detecting characteristic scales of slope gradient. In: Car A, Griesebner G, Strobl J (eds) Geospatial Crossroads@GI_Forum’10. Wichman, Berlin, pp 48–57
Emani CK, Cullot N, Nicolle C (2015) ScienceDirect understandable big data: a survey. Comput Sci Rev 17:70–81. https://doi.org/10.1016/j.cosrev.2015.05.002
Gerçek D, Toprak V, Strobl J (2011) Object-based classification of landforms based on their local geometry and geomorphometric context. Int J Geogr Inf Sci 25:1011–1023. https://doi.org/10.1080/13658816.2011.558845
Gonçalves J, Pôças I, Marcos B, Mücher CA, Honrado JP (2019) SegOptim—a new R package for optimizing object-based image analyses of high-spatial resolution remotely-sensed data. Int J Appl Earth Obs Geoinf 76:218–230. https://doi.org/10.1016/j.jag.2018.11.011
Gong P, Wang J, Yu L, Zhao Y, Zhao Y, Liang L, Niu Z, Huang X, Fu H, Liu S, Li C, Li X, Fu W, Liu C, Xu Y, Wang X, Cheng Q, Hu L, Yao W, Zhang H, Zhu P, Zhao Z, Zhang H, Zheng Y, Ji L, Zhang Y, Chen H, Yan A, Guo J, Yu L, Wang L, Liu X, Shi T, Zhu M, Chen Y, Yang G, Tang P, Xu B, Giri C, Clinton N, Zhu Z, Chen J, Chen J (2013) Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data. Int J Remote Sens 34:2607–2654. https://doi.org/10.1080/01431161.2012.748992
Goodchild MF (2013) The quality of big ( geo ) data. Dialogues Hum Geogr 3:280–284. https://doi.org/10.1177/2043820613513392
Graff LH, Usery EL (1993) Automated classification of generic terrain features in digital elevation models. Photogramm Eng Remote Sensing 59:1409–1417
Graham M, Shelton T (2013) Geography and the future of big data, big data and the future of geography. Dialogues Hum Geogr 3:255–261
Hayat MK, Daud A, Alshdadi AA, Banjar A, Abbasi RA, Bao Y, Dawood H (2019) Towards deep learning prospects: insights for social media analytics. IEEE Access 7:36958–36979. https://doi.org/10.1109/ACCESS.2019.2905101
Hecht R, Meinel G, Buchroithner M (2015) Automatic identification of building types based on topographic databases – a comparison of different data sources. Int J Cartogr 1:18–31. https://doi.org/10.1080/23729333.2015.1055644
Huang H, Liu C, Wang X, Biging GS, Chen Y, Yang J, Gong P (2017) Mapping vegetation heights in China using slope correction ICESat data, SRTM, MODIS-derived and climate data. ISPRS J Photogramm Remote Sens 129:189–199. https://doi.org/10.1016/j.isprsjprs.2017.04.020
Huesca M, Riaño D, Ustin SL (2019) Spectral mapping methods applied to LiDAR data: application to fuel type mapping. Int J Appl Earth Obs Geoinf 74:159–168. https://doi.org/10.1016/j.jag.2018.08.020
Janowicz K, Gao S, Mckenzie G, Hu Y (2019) GeoAI: spatially explicit artificial intelligence techniques for geographic knowledge discovery and beyond. Int J Geogr Inf Sci 00:1–12. https://doi.org/10.1080/13658816.2019.1684500
Jensen JR (2015) Introductory digital image processing: a remote sensing perspective, 4th edn. Prentice Hall Press, Upper Saddle River
Jiang S, Alves A, Rodrigues F, Ferreira J, Pereira FC (2015) Mining point-of-interest data from social networks for urban land use classification and disaggregation. Comput Environ Urban Syst 53:36–46. https://doi.org/10.1016/j.compenvurbsys.2014.12.001
Kanevski M, Maignan M (2004) Analysis and modelling of spatial environmental data. EPFL Press, Lausanne
Kanevski M, Pozdnukhov A, Timonin V (2008) Machine learning algorithms for geospatial data. Appl Soft Tools. Iemss.org 320–327
Keim D, Andrienko G, Fekete JD, Görg C, Kohlhammer J, Melançon G (2008) Visual analytics: definition, process, and challenges. LectNotes ComputSci(including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 4950 LNCS, 154–175. https://doi.org/10.1007/978-3-540-70956-5_7
Khadim FK, Su H, Xu L, Tian J (2019) Soil salinity mapping in Everglades National Park using remote sensing techniques and vegetation salt tolerance. Phys Chem Earth 110:31–50. https://doi.org/10.1016/j.pce.2019.01.004
Killeen P, Ding B, Kiringa I, Yeap T (2019) IoT-based predictive maintenance for fleet management. 2nd international conference on emerging data and industry 4.0 (EDI40) Leuven, Belgium
Kim M, Madden M, Warner T (2008) Estimation of optimal image object size for the segmentation of forest stands with multispectral IKONOS imagery. Object-Based Image Anal. 291–307. https://doi.org/10.1007/978-3-540-77058-9_16
Kim M, Madden M, Xu B (2010) GEOBIA vegetation mapping in Great Smoky Mountains National Park with spectral and non-spectral ancillary information. Photogramm Eng Remote Sensing 76:137–149. https://doi.org/10.14358/PERS.76.2.137
Kitchin R (2013) Big data and human geography: opportunities, challenges and risks. Dialogues Hum Geogr 3:262–267. https://doi.org/10.1177/2043820613513388
Koyama CN, Watanabe M, Hayashi M, Ogawa T, Shimada M (2019) Mapping the spatial-temporal variability of tropical forests by ALOS-2 L-band SAR big data analysis. Remote Sens Environ 233:111372. https://doi.org/10.1016/j.rse.2019.111372
Lang N, Schindler K, Wegner JD (2019) Country-wide high-resolution vegetation height mapping with Sentinel-2. Remote Sens Environ 233:111347. https://doi.org/10.1016/j.rse.2019.111347
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Lee J, Kang M (2015) Geospatial Big Data: challenges and opportunities ⋆. Big Data Res 2:74–81. https://doi.org/10.1016/j.bdr.2015.01.003
Lekamalage L, Kasun C, Zhou H, Huang G, Vong CM (2013) Representational learning with extreme learning machine for big data. IEEE Intell Syst 28:31–34
Leuenberger M, Kanevski M (2015) Extreme learning machines for spatial environmental data. Comput Geosci 85:64–73. https://doi.org/10.1016/j.cageo.2015.06.020
Li W, Goodchild MF, Church R (2013) An efficient measure of compactness for two-dimensional shapes and its application in regionalization problems. Int J Geogr Inf Sci 27:1227–1250. https://doi.org/10.1080/13658816.2012.752093
Li S, Dragicevic S, Castro FA, Sester M, Winter S, Coltekin A, Pettit C, Jiang B, Haworth J, Stein A, Cheng T (2016) Geospatial big data handling theory and methods: a review and research challenges. ISPRS J Photogramm Remote Sens 115:119–133. https://doi.org/10.1016/j.isprsjprs.2015.10.012
Lillesand T, Kiefer RW, Chipman J (2015) Remote Sensing and Image Interpretation, Seventh Edition. John Wiley & Sons, Inc, Hoboken pp 736
Liu Y, Bian L, Meng Y, Wang H, Zhang S, Yang Y, Shao X, Wang B (2012) Discrepancy measures for selecting optimal combination of parameter values in object-based image analysis. ISPRS J Photogramm Remote Sens 68:144–156. https://doi.org/10.1016/j.isprsjprs.2012.01.007
Liu Y, Liu X, Gao S, Gong L, Kang C, Zhi Y, Chi G, Shi L (2015) Social sensing: a new approach to understanding our socioeconomic environments. Ann Assoc Am Geogr 105:512–530. https://doi.org/10.1080/00045608.2015.1018773
Manfré LA, de Albuquerque Nóbrega RA, Quintanilha JA (2015) Regional and local topography subdivision and landform mapping using SRTM-derived data: a case study in southeastern Brazil. Environ Earth Sci 73:6457–6475. https://doi.org/10.1007/s12665-014-3869-2
Martin H, Bucher D, Suel E, Zhao, Perez-Cruz F, Raubal M, Martin H, Bucher D, Suel E, Zhao P, Perez-Cruz F (2018) Graph Convolutional Neural Networks for Human Activity Purpose Imputation from GPS-based Trajectory Data 1–6. https://doi.org/10.3929/ethz-b-000310251
Maxwell AE, Warner TA, Fang F (2018) Implementation of machine-learning classification in remote sensing: an applied review. Int J Remote Sens. https://doi.org/10.1080/01431161.2018.1433343
Michel J, Youssefi D, Grizonnet M (2015) Stable mean-shift algorithm and its application to the segmentation of arbitrarily large remote sensing images. IEEE Trans Geosci Remote Sens 53:952–964. https://doi.org/10.1109/TGRS.2014.2330857
Miliaresis GC, Argialas DP (2002) Quantitative representation of mountain objects extracted from the global digital elevation model (GTOPO30). Int J Remote Sens 23:949–964. https://doi.org/10.1080/01431160110070690
Miller HJ, Goodchild MF (2015) Data-driven geography. GeoJournal 80:449–461. https://doi.org/10.1007/s10708-014-9602-6
Ming D, Li J, Wang J, Zhang M (2015) Scale parameter selection by spatial statistics for GeOBIA: using mean-shift based multi-scale segmentation as an example. ISPRS J Photogramm Remote Sens 106. https://doi.org/10.1016/j.isprsjprs.2015.04.010
Mitchell L, Frank MR, Harris KD, Dodds PS, Danforth CM (2013) The geography of happiness: connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PLoS One 8. https://doi.org/10.1371/journal.pone.0064417
Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E (2015) Deep learning applications and challenges in big data analytics. J Big Data 2:1. https://doi.org/10.1186/s40537-014-0007-7
Qi W, Saarela S, Armston J, Ståhl G, Dubayah R (2019) Forest biomass estimation over three distinct forest types using TanDEM-X InSAR data and simulated GEDI lidar data. Remote Sens Environ 232:111283. https://doi.org/10.1016/j.rse.2019.111283
Robinne FN, Bladon KD, Miller C, Parisien MA, Mathieu J, Flannigan MD (2018) A spatial evaluation of global wildfire-water risks to human and natural systems. Sci Total Environ 610–611:1193–1206. https://doi.org/10.1016/j.scitotenv.2017.08.112
Scrucca L (2013) GA: a package for genetic algorithms in R. J Stat Softw 53:1–37. https://doi.org/10.18637/jss.v053.i04
Shekhar S, Evans MR, Kang JM, Mohan P (2011) Identifying patterns in spatial information: a survey of methods. Wiley Interdiscip Rev Data Min Knowl Discov 1:193–214. https://doi.org/10.1002/widm.25
Shekhar S, Gunturi V, Evans MR, Yang K (2012) Spatial Big-data challenges intersecting mobility and cloud computing. In: Proceedings of the eleventh ACM international workshop on data engineering for wireless and mobile access, MobiDE’12. ACM, New York, pp 1–6. https://doi.org/10.1145/2258056.2258058
Sinha G, Mark DM (2010) Cognition-based extraction and modelling of topographic eminences. Cartogr Int J Geogr Inf Geovisualization 45:105–112. https://doi.org/10.3138/carto.45.2.105
Smith R, Reid N (2013) Carbon storage value of native vegetation on a subhumid-semi-arid floodplain. Crop Pasture Sci 64:1209–1216. https://doi.org/10.1071/CP13075
Stanton C, Starek MJ, Elliott N, Brewer M, Maeda MM, Chu T (2017) Unmanned aircraft system-derived crop height and normalized difference vegetation index metrics for sorghum yield and aphid stress assessment. J Appl Remote Sens 11:026035. https://doi.org/10.1117/1.jrs.11.026035
Takahashi S (2006) Algorithms for extracting surface topology from digital elevation models. Topol Data Struct Surfaces Introd Geogr Inf Sci 31–51. https://doi.org/10.1002/0470020288.ch3
Xie Y, Sha Z, Yu M (2008) Remote sensing imagery in vegetation mapping: a review. J Plant Ecol 1:9–23. https://doi.org/10.1093/jpe/rtm005
Yang C, Huang Q, Li Z, Liu K, Hu F (2017) Big Data and cloud computing: innovation opportunities and challenges. Int J Digit Earth 10:13–53. https://doi.org/10.1080/17538947.2016.1239771
Yao X, Li G (2018) Big spatial vector data management: a review. Big Earth Data 2:108–129. https://doi.org/10.1080/20964471.2018.1432115
Yuan J, Roy Chowdhury PK, McKee J, Yang HL, Weaver J, Bhaduri B (2018) Exploiting deep learning and volunteered geographic information for mapping buildings in Kano, Nigeria. Sci Data 5:1–8. https://doi.org/10.1038/sdata.2018.217
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Arundel, S.T., Usery, E.L. (2021). Spatial Data Reduction Through Element-of-Interest (EOI) Extraction. In: Werner, M., Chiang, YY. (eds) Handbook of Big Geospatial Data. Springer, Cham. https://doi.org/10.1007/978-3-030-55462-0_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-55462-0_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-55461-3
Online ISBN: 978-3-030-55462-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)