Comment[GEAAccession] E-GEAD-420 MAGE-TAB Version 1.1 Investigation Title nominal eQTL summary of ImmuNexUT Experiment Description Nominal eQTL summary table of 28 immune cell subsets from the ImmuNexUT cohort. Experimental Design case control design Experimental Factor Name enter experiment factor name here Experimental Factor Type enter experiment factor name here Person Last Name Nagafuchi Person First Name Yasuo Person Affiliation Department of Allergy and Rheumatology, Graduate School of Medicine, The University of Tokyo Person Roles submitter Public Release Date 2021-06-09 Protocol Name ESUB000727_Protocol_1 ESUB000727_Protocol_2 ESUB000727_Protocol_3 ESUB000727_Protocol_4 ESUB000727_Protocol_5 ESUB000727_Protocol_6 Protocol Type sample collection protocol nucleic acid extraction protocol nucleic acid labeling protocol nucleic acid hybridization to array protocol array scanning and feature extraction protocol normalization data transformation protocol Protocol Description Whole blood from 416 ImmunexUT cohort were collected. Genomic DNA was isolated from peripheral blood using QIAmp DNA Blood Midi kit (QIAGEN). Libraries for whole genome sequencing were prepared using TruSeq DNA PCR-Free Library prep kit (Illumina). Libraries for whole genome sequencing were prepared using TruSeq DNA PCR-Free Library prep kit (Illumina). Whole genomes were sequenced on Illumina HiSeq X with 151-bp pair-end reads. WGS data processing was performed based on the standardized best-practice method proposed by GATK. Samples with genotyping call rates < 99% were removed. We used BEAGLE to impute missing genotypes. Variants with minor allele frequency < 1% were excluded. Genes expressed at low levels (< 5 count in more than 80% samples or < 0.5 CPM in more than 80% samples) were filtered out in each cell subset. The residual expression data were normalized between samples with TMM, converted to CPM and then normalized across samples using an inverse normal transform. A Probabilistic Estimation of Expression Residuals method was applied to normalized expression data to infer hidden covariates. The top 2 genetic principal components, sample collection phase, clinical diagnosis, sex and latent factors were utilized as covariates for eQTL analysis. Mem CD8s, which were collected in phase1 and divided into CM CD8 and EM CD8 in phase2, were analyzed jointly with EM CD8 for eQTL analysis because the majority of the Mem CD8 population consisted of EM CD8. For each cell subset, we used a QTLtools nominal pass and tested for the association of the variants located within 1Mbp from the TSS of the genes. SDRF File ESUB000727.sdrf.txt Comment[Number of channel] single-channel Comment[Array Design REF] A-GEAD-11 Comment[AEExperimentType] transcription profiling by array Comment[BioProject] PRJDB10991 Comment[Last Update Date] 2021-06-09