Please use this identifier to cite or link to this item:
http://dx.doi.org/10.25673/122856| Title: | Interventional Deep Generative Models for Scalable Causal Discovery and Counterfactual Analysis |
| Author(s): | Abbood, Saif Hameed |
| Granting Institution: | Hochschule Anhalt |
| Issue Date: | 2025-12 |
| Extent: | 1 Online-Ressource (13 Seiten) |
| Language: | English |
| Abstract: | Causal reasoning and “what-if” analysis allow us to predict the outcomes of hypothetical changes and are fundamental to decision support in high-stakes domains such as healthcare, economics, and robotics. Traditional causal-discovery methods can find cause-and-effect graphs under simple assumptions but struggle with large, complex datasets and cannot predict what might happen after a hypothetical change. Algorithms like PC, FCI, and NOTEARS reliably infer directed acyclic graphs (DAGs) under linear or simple nonlinear assumptions but fail to scale to high-dimensional data and lack mechanisms for counterfactual simulation. Conversely, deep generative models learn to reproduce complex data patterns but do not capture cause-and-effect relationships, so they cannot answer “what-if” questions. We propose Interventional Structural Deep Generative Models (IS-DGM), a unified framework that embeds a learnable DAG into the latent space of a variational autoencoder. We prove that, under realistic conditions, our approach can uniquely recover the true causal structure and generate reliable counterfactual predictions. IS-DGM enforces acyclicity via a continuous matrix-exponential penalty, encourages sparsity through 𝐿1 regularization, and introduces alatent-space intervention operator to clamp selected factors and propagate effects through the graph. Under mild exponential-family priors and with diverse interventional data, IS-DGM recovers the true DAG up to element-wise reparameterization. Empirically, on synthetic benchmarks (latent dimensions up to 100), IS-DGM reduces structural Hamming distance by 30–55% and achieves over 50% lower counterfactual RMSE than state-of-the-art baselines. On real clinical data (MIMIC-III), it halves prediction error of treatment-response simulations relative to identifiable VAEs and NOTEARS. Ablation studies confirm the necessity of each loss component, and scalability analyses quantify runtime and memory trade-offs. IS-DGM thus offers a principled, scalable solution for joint causal discovery and counterfactual inference in complex, high-dimensional settings. |
| URI: | https://opendata.uni-halle.de//handle/1981185920/124799 http://dx.doi.org/10.25673/122856 |
| Open Access: | Open access publication |
| License: | (CC BY-SA 4.0) Creative Commons Attribution ShareAlike 4.0 |
| Appears in Collections: | International Conference on Applied Innovations in IT (ICAIIT) |
Files in This Item:
| File | Size | Format | |
|---|---|---|---|
| 3-12-ICAIIT_2025_13(5).pdf | 1.57 MB | Adobe PDF | View/Open |
Open access publication