TY - JOUR
T1 - Handling unobserved confounding for continuous sequential data via extracting key sub-intervals
AU - Ren, Jianmei
AU - Ma, Tiefeng
AU - Liu, Shuangzhe
N1 - Publisher Copyright:
© 2025 Elsevier Inc.
PY - 2025
Y1 - 2025
N2 - Existing methods for addressing unobserved confounding (UC) face practical limitations. Instrumental variable (IV) methods require additional variables, while “instrument-free” (IF) methods rely on specific model assumptions. This paper introduces a novel causal effect estimation method, CanKeSie (Causal Analysis based on Key Sub-interval Extraction), based on key sub-interval extraction to suppress UC in continuous sequential data. The key steps of CanKeSie for estimating fixed causal effects include: (1) extracting sub-intervals centered on change-points of the target variable (i.e., the explanatory variable of interest), (2) performing window regression to obtain a sub-estimator of the regression coefficient for the target variable within each sub-interval, and (3) combining these sub-estimators with appropriate weights to obtain an overall estimator. Among these steps, change-point detection is a core component of CanKeSie. To improve its accuracy, this paper proposes a novel algorithm, WASCC-CCPR (Cutting and Clustering combined with Peak Recognition based on Weighted Adjustment for Shape Context Cost), which incorporates three key techniques: data denoising, data-driven thresholding, and the construction of WASCC statistic. Furthermore, we enhance CanKeSie with spline fitting to enable time-varying causal estimation. Simulations confirm its effectiveness and highlight the superior performance of WASCC-CCPR. Two empirical studies show that CanKeSie's findings align with real-world survey data.
AB - Existing methods for addressing unobserved confounding (UC) face practical limitations. Instrumental variable (IV) methods require additional variables, while “instrument-free” (IF) methods rely on specific model assumptions. This paper introduces a novel causal effect estimation method, CanKeSie (Causal Analysis based on Key Sub-interval Extraction), based on key sub-interval extraction to suppress UC in continuous sequential data. The key steps of CanKeSie for estimating fixed causal effects include: (1) extracting sub-intervals centered on change-points of the target variable (i.e., the explanatory variable of interest), (2) performing window regression to obtain a sub-estimator of the regression coefficient for the target variable within each sub-interval, and (3) combining these sub-estimators with appropriate weights to obtain an overall estimator. Among these steps, change-point detection is a core component of CanKeSie. To improve its accuracy, this paper proposes a novel algorithm, WASCC-CCPR (Cutting and Clustering combined with Peak Recognition based on Weighted Adjustment for Shape Context Cost), which incorporates three key techniques: data denoising, data-driven thresholding, and the construction of WASCC statistic. Furthermore, we enhance CanKeSie with spline fitting to enable time-varying causal estimation. Simulations confirm its effectiveness and highlight the superior performance of WASCC-CCPR. Two empirical studies show that CanKeSie's findings align with real-world survey data.
KW - Continuous sequential data
KW - Key sub-interval extraction
KW - Unobserved confounding
KW - Window regression
UR - http://www.scopus.com/inward/record.url?scp=105005946056&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2025.122325
DO - 10.1016/j.ins.2025.122325
M3 - Article
AN - SCOPUS:105005946056
SN - 0020-0255
VL - 717
SP - 1
EP - 20
JO - Information Sciences
JF - Information Sciences
M1 - 122325
ER -