论文标题
在强大因果推理中求解非线性二进制优化问题的计算框架
A Computational Framework for Solving Nonlinear Binary OptimizationProblems in Robust Causal Inference
论文作者
论文摘要
确定变量之间的因果关系是决策过程中的关键步骤。尽管因果推断需要随机实验,但由于观察数据的广泛可用性和实验的不可行性,研究人员和决策者越来越多地使用观察性研究来检验因果假设。匹配方法是从观察数据中进行因果推断的最常用技术。但是,由于实验者做出的不同选择,一对一匹配中的对分配过程会在推理中产生不确定性。最近,提出了离散的优化模型来解决这种不确定性。尽管离散优化模型可以进行鲁棒的推断,但它们会产生非线性问题并且缺乏可扩展性。在这项工作中,我们提出了贪婪的算法,以从具有连续结果的观察数据中求解强大的因果推理测试实例。我们提出了一个独特的框架,将非线性二进制优化问题重新制定为可行性问题。通过利用可行性配方的结构,我们开发了有效解决可靠测试问题的贪婪方案。在许多情况下,提出的算法实现了全球最佳解决方案。我们在三个现实世界数据集上执行实验,以证明所提出的算法的有效性,并将我们的结果与最新的求解器进行比较。我们的实验表明,所提出的算法在计算时间方面显着优于确切方法,同时得出因果测试的相同结论。数值实验和复杂性分析都表明,所提出的算法确保在决策过程中利用大数据的能力所需的可伸缩性。
Identifying cause-effect relations among variables is a key step in the decision-making process. While causal inference requires randomized experiments, researchers and policymakers are increasingly using observational studies to test causal hypotheses due to the wide availability of observational data and the infeasibility of experiments. The matching method is the most used technique to make causal inference from observational data. However, the pair assignment process in one-to-one matching creates uncertainty in the inference because of different choices made by the experimenter. Recently, discrete optimization models are proposed to tackle such uncertainty. Although a robust inference is possible with discrete optimization models, they produce nonlinear problems and lack scalability. In this work, we propose greedy algorithms to solve the robust causal inference test instances from observational data with continuous outcomes. We propose a unique framework to reformulate the nonlinear binary optimization problems as feasibility problems. By leveraging the structure of the feasibility formulation, we develop greedy schemes that are efficient in solving robust test problems. In many cases, the proposed algorithms achieve global optimal solutions. We perform experiments on three real-world datasets to demonstrate the effectiveness of the proposed algorithms and compare our result with the state-of-the-art solver. Our experiments show that the proposed algorithms significantly outperform the exact method in terms of computation time while achieving the same conclusion for causal tests. Both numerical experiments and complexity analysis demonstrate that the proposed algorithms ensure the scalability required for harnessing the power of big data in the decision-making process.