A new approach to a legacy concern: Evaluating machine-learned Bayesian networks to predict childhood lead exposure risk from community water systems

https://doi.org/10.1016/j.envres.2021.112146
Under a Creative Commons license
open access

Highlights

  • Low blood lead levels may be attributable to drinking water in community water systems.

  • The current Lead and Copper Rule action level is not low enough to detect health risk.

  • Machine-learned Bayesian networks may be used to predict lead exposure risk.

Abstract

Lead in drinking water continues to put children at risk of irreversible neurological impairment. Understanding drinking water system characteristics that influence blood lead levels is needed to prevent ongoing exposures. This study sought to assess the relationship between children's blood lead levels and drinking water system characteristics using machine-learned Bayesian networks. Blood lead records from 2003 to 2017 for 40,742 children in Wake County, North Carolina were matched with the characteristics of 178 community water systems and sociodemographic characteristics of each child's neighborhood. Bayesian networks were machine-learned to evaluate the drinking water variables associated with blood lead levels ≥2 μg/dL and ≥5 μg/dL. The model was used to predict geographic areas and water utilities with increased lead exposure risk. Drinking water characteristics were not significantly associated with children's blood lead levels ≥5 μg/dL but were important predictors of blood lead levels ≥2 μg/dL. Whether 10% of water samples exceeded 2 ppb of lead in the most recent year prior to the blood test was the most important water system predictor and increased the risk of blood lead levels ≥2 μg/dL by 42%. The model achieved an area under the receiver operating characteristic curve of 0.792 (±0.8%) during ten-fold cross validation, indicating good predictive performance. Water system characteristics may thus be used to predict areas that are at risk of higher blood lead levels. Current drinking water regulatory thresholds for lead may be insufficient to detect the levels in drinking water associated with children's blood lead levels.
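The most important water system predictor named in the abstract, whether 10% of water samples exceeded 2 ppb of lead, can be illustrated with a minimal sketch. The function name `exceeds_share`, its parameters, and the sample values are hypothetical and are not taken from the study. The sketch also assumes that "10% of samples exceeded" means "at least 10% of samples are strictly above the threshold"; the study may define the indicator differently.

```python
from typing import Sequence


def exceeds_share(samples_ppb: Sequence[float],
                  threshold_ppb: float = 2.0,
                  share: float = 0.10) -> bool:
    """Return True when at least `share` of the samples exceed `threshold_ppb`.

    Assumption: "exceed" is a strict comparison (> threshold) and the share
    test is inclusive (>= share). Neither detail is stated in the abstract.
    """
    if not samples_ppb:
        raise ValueError("at least one water sample is required")
    n_over = sum(1 for s in samples_ppb if s > threshold_ppb)
    return n_over / len(samples_ppb) >= share


# Hypothetical lead results (ppb) for one water system in one year.
year_samples = [0.5, 1.0, 3.1, 0.2, 0.0, 1.9, 0.7, 0.3, 0.1, 0.4]
flagged = exceeds_share(year_samples)  # 1 of 10 samples is above 2 ppb
```

Under these assumptions, the same function evaluated with `threshold_ppb=15.0` would give a comparable indicator at the 15 ppb level of the current Lead and Copper Rule action level, which makes the gap between the two thresholds easy to inspect for a given set of samples.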

Keywords

Bayesian networks
Drinking water
Blood lead levels
Machine learning
Risk assessment
Health disparity

1 Present address: RTI International, Center for Environmental Health, Risk, and Sustainability, 3040 East Cornwallis Road, P.O. Box 12194, Research Triangle Park, NC 27709.

2 Present address: Hazen & Sawyer, 4011 WestChase Boulevard, Suite 500, Raleigh, NC 27607, USA.

3 Present address: Columbia Water Center, Columbia University, 500 West 120th St, New York, New York, 10027, USA.