Inferring heterogeneous treatment effects of crashes on highway traffic: A doubly robust causal machine learning approach

You are here: Homepage > Search

Inferring heterogeneous treatment effects of crashes on highway traffic: A doubly robust causal machine learning approach

Li, Shuang / Pu, Ziyuan / Cui, Zhiyong / Lee, Seunghyeon / Guo, Xiucheng / Ngoduy, Dong

Highlights • A doubly robust causal machine learning framework is proposed for estimating causal effect of traffic crashes on speed reduction. • A Conditional Shapley Value Index (CSVI) is proposed for adverse variables elimination to improve the efficiency of estimation. • Heterogeneous treatment effects of crashes are estimated based on high-resolution spatial–temporal traffic dataset. • Results based on real-world data reveal the impact mechanisms of different types of crashes along with the upstream road segments in the post-crash time period.

Abstract Accurate estimating causal effects of crashes on highway traffic is crucial for mitigating the negative impacts of crashes. Previous studies have built up a series of methods via traditional causal inference theory and machine learning methods to estimate the impacts of crashes. Since the structures and variable dimensions of traditional causal inference models are pre-defined, they can not accommodate the characteristics of individual crashes. They only can estimate the average causal effects for the crashes in certain categories, e.g., crash types, crash severity, and occurring locations. For machine learning-based algorithms, they cannot be used for causal reasoning due to their reliance on correlation rather than causation. However, considering the impacts of crashes on traffic status vary across influential factors, such as time periods and locations, heterogeneous causal effects are essential for a better understanding of the effects on traffic status and crash intervention strategy development. To address the aforementioned issues, this study proposes a novel doubly robust causal machine learning framework to infer heterogeneous treatment effects of crashes on highway traffic status. Doubly Robust Learning (DRL), integrating machine learning techniques to perform predictive tasks, is applied into the framework due to its stronger robustness. Considerning treatment predictors and colliders may bring bias in estimation results, Conditional Shapley Value Index (CSVI) is proposed for selecting confounders from numerous factors. A 3-year crah dataset collected by 3594 real highway crashes in Washington is utilized for demonstrating the designed experiments, including construting confidence intervals, estimated errors evaluation, and sensitivity analysis of variable selection for various thresholds of CSVI. According to the results, the distinctive propagation and dissipation processes of congestion caused by various types of crashes can be achieved. The results also validate the effectiveness of variable selection, and the superiority in estimation accuracy compared to the selected baseline models. Future study includes considering spatial–temporal causal relationships and predicting counterfactual real-time traffic conditions.