Training Gene Expression Prediction Models


Haky Im


July 9, 2021

PrediXcan, and TWAS methods in general, correlate genetically predicted levels of gene expression traits with complex traits to understand the mechanisms behind GWAS loci. A key component is training the prediction models for gene expression traits. A tutorial on how to generate elastic net models can be found at this link.
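To make the idea concrete, here is a minimal sketch of fitting an elastic net model that predicts one gene's expression from SNP dosages. It is not the tutorial's pipeline: the data are simulated, the function names (`fit_elastic_net`, `predict`) are hypothetical, and the penalty settings (`lam`, `alpha`) are illustrative assumptions rather than recommended values. In practice one would use an established implementation and choose the penalty by cross-validation.

```python
import random

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma (the lasso part of the penalty)."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def fit_elastic_net(X, y, lam=0.05, alpha=0.5, n_iter=100):
    """Coordinate descent for
    (1/2n)||y - b0 - Xb||^2 + lam * (alpha*||b||_1 + (1-alpha)/2 * ||b||_2^2).
    X is a list of rows (individuals) of SNP dosages; y is expression."""
    n, p = len(X), len(X[0])
    y_mean = sum(y) / n
    x_means = [sum(row[j] for row in X) / n for j in range(p)]
    # Center predictors and outcome so the intercept can be recovered at the end.
    Xc = [[row[j] - x_means[j] for j in range(p)] for row in X]
    resid = [yi - y_mean for yi in y]
    col_sq = [sum(Xc[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # monomorphic SNP carries no information
            # Covariance of SNP j with the partial residual (SNP j's effect added back).
            rho = sum(Xc[i][j] * resid[i] for i in range(n)) / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam * alpha) / (col_sq[j] + lam * (1 - alpha))
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= Xc[i][j] * delta
                beta[j] = new
    intercept = y_mean - sum(b * m for b, m in zip(beta, x_means))
    return intercept, beta

def predict(intercept, beta, dosages):
    """Genetically predicted expression for one individual."""
    return intercept + sum(b * g for b, g in zip(beta, dosages))

# Simulated toy data (assumption): 200 individuals, 20 SNPs, two causal SNPs.
random.seed(0)
n_ind, n_snp = 200, 20
genotypes = [[sum(random.random() < 0.3 for _ in range(2)) for _ in range(n_snp)]
             for _ in range(n_ind)]
expression = [0.8 * g[0] - 0.5 * g[3] + random.gauss(0, 0.5) for g in genotypes]

intercept, beta = fit_elastic_net(genotypes, expression)
n_selected = sum(1 for b in beta if b != 0.0)
print("SNPs with nonzero weight:", n_selected)
print("weights for the two simulated causal SNPs:", beta[0], beta[3])
```

The nonzero entries of `beta` play the role of the per-SNP weights that a prediction model stores for a gene; `predict` then applies them to a new individual's dosages.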

Elastic net is a good all-purpose prediction approach for complex traits and has been shown to perform well for gene expression traits. Depending on your goals, you may want to use a different approach. For example, if the goal is to maximize the reliability (low false positive rate) of putatively causal genes, we showed that a method that uses genetic variants more likely to be causal may work better, as explained in this paper. In the GTEx GWAS subgroup, we chose the models that are based on fine-mapping, called mashr-based models.