Predicting phenotype from genotype CALS Impact Statement uri icon

abstract

  • At the intersection of our work on human cardiovascular disease and our work on immune responses in Drosophila is the problem of prediction. Given genotypic attributes of flies or people, how can we best build a model that allows prediction of phenotype (disease risk, immune competence) given nothing other than the genotype data? Many individuals in the field of systems biology are trying to do this with machine-learning sorts of methods, and we wanted to instead apply knowledge of the underlying pathways to make predictions. Our work on the reverse cholesterol transport pathway has given very sobering insight into the nature of cardiovascular disease, and places limits on how well ANY model could predict this disease. In particular, about half the risk is already known to be induced by diet and exercise, and so at best genotype explains the other half. In fact, genetics explains well under half, because most of that remaining variance is explained by interactions between diet, excerise and genotype. The Drosophila data are much more amenable to this kind of modeling, and the control of the genetic variation seems to be largely why (in controlled backgrounds, single SNPs often explain 15 percent or more of the variance in immune phenotypes). In the end we hope to be able to identify which phenotypes are most easily predicted by genotypic variation, and to better understand why.