If explanatory variables and a response variable of interest are simultaneously observed, then fitting a joint multivariate density to all variables would enable prediction via conditional distributions. Regular vines or vine copulas with arbitrary univariate margins provide a rich and flexible class of multivariate densities for Gaussian or non-Gaussian dependence structures. The density enables calculation of all regression functions for any subset of variables conditional on any disjoint set of variables, thereby avoiding issues of transformations, heteroscedasticity, interactions, and higher-order terms. Only the question of finding an adequate vine copula remains. Heteroscedastic prediction inferences based on vine copulas are illustrated with two data sets, including one from the National Longitudinal Study of Youth relating breastfeeding to IQ. Some usual methods based on linear and quadratic equations are shown to have some undesirable inferences.
Vine Copula Regression for Observational Studies
Vine regression proposes a new approach based on estimating the joint density with vine copulae and computing regression functions directly with a best fitting density.
Journal Article by Roger Cooke, Harry Joe, and Bo Chang — June 5, 2019View Journal Article
Roger M. Cooke
Chauncey Starr Senior Fellow
Working Paper — Nov 24, 2015
Vine regression is illustrated with the National Longitudinal Study of Youth to estimate the effect of the duration of breastfeeding on IQ. The effect varies according to an individual’s IQ and the amount of breastfeeding received.
Dispelling the Epicycles of Regression
Which covariates should I include/exclude? Should I include interaction terms? higher order terms? What about the “noise”, is it really white? Whic...
PG&E Power Outages Reduce Just a Portion of Wildfire Risk
Power outages imposed by PG&E will impact consumers, but won't necessarily mitigate wildfire risk.