TY - JOUR
T1 - Exploratory and Interpretable Approach to Estimating Latent Health Risk Factors Without Using Domain Knowledge
AU - Cong, Ruichen
AU - Nishimura, Shoji
AU - Ogihara, Atsushi
AU - Jin, Qun
N1 - Publisher Copyright:
© 2018 Tsinghua University Press.
PY - 2025
Y1 - 2025
N2 - The identification of latent risk factors that can induce to health risks or an abnormal status is an important task in healthcare data analyses. In recent years, health analyses based on neural network models have been applied widely. However, such analysis processes are blackbox and the results lack explainability. Some approaches by constructing a domain model may tackle these issues. However, domain knowledge from an expert is required. In this study, we propose an exploratory and interpretable approach to estimating latent health risk factors without relying on domain knowledge, in which feature selection and causal discovery are used to construct a domain model for uncovering complex relationships in health and medical data. An evaluation experiment conducted on two datasets by comparing the proposed approach with four baselines demonstrated that the proposed approach outperformed the baselines in terms of model fitness. Furthermore, the number of model parameters in our method was smaller than that in the baselines, which reduced model complexity. Moreover, the analysis process of the proposed approach was visible and explainable, which improved the interpretability of the analysis processes.
AB - The identification of latent risk factors that can induce to health risks or an abnormal status is an important task in healthcare data analyses. In recent years, health analyses based on neural network models have been applied widely. However, such analysis processes are blackbox and the results lack explainability. Some approaches by constructing a domain model may tackle these issues. However, domain knowledge from an expert is required. In this study, we propose an exploratory and interpretable approach to estimating latent health risk factors without relying on domain knowledge, in which feature selection and causal discovery are used to construct a domain model for uncovering complex relationships in health and medical data. An evaluation experiment conducted on two datasets by comparing the proposed approach with four baselines demonstrated that the proposed approach outperformed the baselines in terms of model fitness. Furthermore, the number of model parameters in our method was smaller than that in the baselines, which reduced model complexity. Moreover, the analysis process of the proposed approach was visible and explainable, which improved the interpretability of the analysis processes.
KW - health data analysis
KW - health risk estimation
KW - interpretable approach
KW - latent factor exploration
UR - http://www.scopus.com/inward/record.url?scp=85209826416&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85209826416&partnerID=8YFLogxK
U2 - 10.26599/BDMA.2024.9020081
DO - 10.26599/BDMA.2024.9020081
M3 - Article
AN - SCOPUS:85209826416
SN - 2096-0654
VL - 8
SP - 447
EP - 457
JO - Big Data Mining and Analytics
JF - Big Data Mining and Analytics
IS - 2
ER -