The Atherosclerosis Risk in Communities Study (ARIC) generated genotype and proteomic data for a total of 9,084 participants (7,213 European Americans and 1,871 African Americans). The relative concentrations of plasma proteins or protein complexes were measured from blood samples using an aptamer-based approach. Genotypes were imputed using the TOPMed reference panel (GRCh38).

Nilan Chatterjee et al. analyzed cis-genetic regulation of the plasma proteome and generated PWAS models with the TWAS/FUSION pipeline. Their study involved 4,665 SOMAmers measuring 4,491 unique plasma proteins or protein complexes encoded by 4,445 autosomal genes.

We created a prediction model compatible with the MetaXcan software from the weights generated in the PWAS study by Nilan Chatterjee et al. The steps are documented below:
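As an illustration of the target format only (this is not the conversion script used for this model), a MetaXcan-style model is a SQLite file with a `weights` table and an `extra` table. The sketch below assumes the PredictDB-style column names shown; the function name `build_model_tables` and all example rows are made up for illustration and are not taken from the ARIC weights.

```python
import sqlite3


def build_model_tables(con, weights, extra):
    """Create PredictDB-style `weights` and `extra` tables (assumed schema).

    weights: iterable of (gene, rsid, varID, ref_allele, eff_allele, weight)
    extra:   iterable of (gene, genename, n_snps, r2, pval, qval)
    """
    cur = con.cursor()
    cur.execute(
        "CREATE TABLE weights (gene TEXT, rsid TEXT, varID TEXT, "
        "ref_allele TEXT, eff_allele TEXT, weight REAL)"
    )
    # Column names containing dots must be quoted in SQL.
    cur.execute(
        'CREATE TABLE extra (gene TEXT, genename TEXT, '
        '"n.snps.in.model" INTEGER, "pred.perf.R2" REAL, '
        '"pred.perf.pval" REAL, "pred.perf.qval" REAL)'
    )
    cur.executemany("INSERT INTO weights VALUES (?, ?, ?, ?, ?, ?)", weights)
    cur.executemany("INSERT INTO extra VALUES (?, ?, ?, ?, ?, ?)", extra)
    # Index by gene so per-gene weight lookups stay fast.
    cur.execute("CREATE INDEX weights_gene ON weights (gene)")
    con.commit()


# Hypothetical rows, for illustration only.
example_weights = [
    ("GENE_A", "rs0000001", "chr1_1000_A_G_b38", "A", "G", 0.12),
    ("GENE_A", "rs0000002", "chr1_2000_C_T_b38", "C", "T", -0.05),
]
example_extra = [("GENE_A", "GENE_A", 2, 0.10, 1e-6, 1e-5)]

con = sqlite3.connect(":memory:")  # use a file path for a real model
build_model_tables(con, example_weights, example_extra)
n_weights = con.execute("SELECT COUNT(*) FROM weights").fetchone()[0]
n_genes = con.execute("SELECT COUNT(*) FROM extra").fetchone()[0]
```

In practice the rows would come from the per-protein weight files of the PWAS study, with effect and reference alleles checked against the variant identifiers before loading.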

We validated the model by running SPrediXcan on height and coronary artery disease GWAS, then comparing the results to the associations found with Whole Blood mashr models.
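One simple way to make such a comparison is to correlate z-scores for genes present in both result sets. The sketch below is an assumption about how this could be done, not the procedure actually used: it assumes SPrediXcan-style CSV output with `gene_name` and `zscore` columns, matches protein and expression models by gene name, and uses made-up values in place of real results. The helper names `read_zscores` and `pearson` are hypothetical.

```python
import csv
import io
import math


def read_zscores(handle):
    """Map gene_name -> zscore from an SPrediXcan-style CSV handle."""
    out = {}
    for row in csv.DictReader(handle):
        try:
            out[row["gene_name"]] = float(row["zscore"])
        except ValueError:
            continue  # skip rows whose z-score is missing (e.g. "NA")
    return out


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(sxx * syy)


# Made-up results standing in for the two SPrediXcan output files.
protein_csv = "gene,gene_name,zscore\np1,A,2.0\np2,B,-1.0\np3,C,0.5\np4,D,NA\n"
expression_csv = "gene,gene_name,zscore\ne1,A,1.5\ne2,B,-0.5\ne3,C,1.0\ne5,E,3.0\n"

protein_z = read_zscores(io.StringIO(protein_csv))
expression_z = read_zscores(io.StringIO(expression_csv))

# Compare only genes that have a z-score in both result sets.
shared = sorted(set(protein_z) & set(expression_z))
r = pearson([protein_z[g] for g in shared], [expression_z[g] for g in shared])
print(f"{len(shared)} shared genes, z-score correlation r = {r:.3f}")
```

With real output, `io.StringIO(...)` would be replaced by opened result files, and a scatter plot of the paired z-scores is usually more informative than the correlation alone.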

