论文标题
贝叶斯多级多元逻辑回归,用于可观察治疗异质性下的优势决策
Bayesian multilevel multivariate logistic regression for superiority decision-making under observable treatment heterogeneity
论文作者
论文摘要
在医学,社会和行为研究中,我们经常遇到具有多层结构和多个相关因素的数据集。这些数据经常是从研究人群中收集的,该研究人群区分了干预措施的几个子群体具有不同(即异质性)效应的亚群。尽管经常出现此类数据,但分析它们的方法较不常见,研究人员经常诉诸忽略多层次和/或异质结构,仅分析单个因变量或这些组合的组合。这些分析策略是次优的:忽略多级结构会膨胀I型错误率,同时忽略了多元或异质结构掩盖的详细见解。为了全面分析此类数据,当前论文介绍了一种新型的贝叶斯多层次多元逻辑回归模型。考虑了多级数据的群集结构,因此可以通过准确的错误率进行后验推断。此外,该模型在估计平均和条件平均多元治疗效果时共享不同亚群之间的信息。为了促进解释,多元逻辑回归参数被转化为后验成功概率及其之间的差异。数字评估将我们的框架与较不全面的替代方案进行了比较,并强调了对多级结构进行建模的需求:基于多级模型的治疗比较具有目标I型错误率,而单层替代方案导致I型I型错误。此外,当簇数较高时,多级模型比单层模型更强大。 ...
In medical, social, and behavioral research we often encounter datasets with a multilevel structure and multiple correlated dependent variables. These data are frequently collected from a study population that distinguishes several subpopulations with different (i.e., heterogeneous) effects of an intervention. Despite the frequent occurrence of such data, methods to analyze them are less common and researchers often resort to either ignoring the multilevel and/or heterogeneous structure, analyzing only a single dependent variable, or a combination of these. These analysis strategies are suboptimal: Ignoring multilevel structures inflates Type I error rates, while neglecting the multivariate or heterogeneous structure masks detailed insights. To analyze such data comprehensively, the current paper presents a novel Bayesian multilevel multivariate logistic regression model. The clustered structure of multilevel data is taken into account, such that posterior inferences can be made with accurate error rates. Further, the model shares information between different subpopulations in the estimation of average and conditional average multivariate treatment effects. To facilitate interpretation, multivariate logistic regression parameters are transformed to posterior success probabilities and differences between them. A numerical evaluation compared our framework to less comprehensive alternatives and highlighted the need to model the multilevel structure: Treatment comparisons based on the multilevel model had targeted Type I error rates, while single-level alternatives resulted in inflated Type I errors. Further, the multilevel model was more powerful than a single-level model when the number of clusters was higher. ...