Non-Hispanic White Linkage Disequilibrium Reference Panel#
LD matrices calculated from whole genome sequencing data from 16571 non-Hispanic white individuals obtained from the Genome Center for Alzheimer’s Disease (GCAD). Correlation matrices were calculated between SNPs within 1361 LD blocks which were obtained from this Github page (generated from 1000 Genomes EUR samples).
Contact#
Oluwatosin Olayinka
Output Format#
Each LD block contains two files of interest:
- an xz-compressed file containing the correlation values, suffixed by
.cor.xz- this file is a compressed file where the matrix is encoded in a space-separated format
- the data is stored in the upper triangle of the matrix
- a Plink
.bimfile suffixed by.cor.xz.bimcontaining unique IDs for each variant
ld_block_ref_file = "/path/to/matrix.cor.xz"
var_names = read.table(paste0(ld_block_ref_file, ".bim"), header = F)$V2
ld <- scan(xzfile(ld_block_ref_file))
ld <- matrix(ld, ncol = sqrt(length(ld)), byrow = TRUE)
ld <- ld + t(ld)
diag(ld) = 1
rownames(ld) = var_names
colnames(ld) = var_namesData Availability#
The generated files can be found on Synapse.
Analysis Notebook Link#
- Generating LD Reference Panel: https://github.com/cumc/xqtl-pipeline/blob/main/code/reference_data/ld_reference_generation.ipynb
ADSP Whole Genome Sequencing LD Reference Panel#
The FunGen-xQTL flagship paper used a large LD reference panel derived from the Alzheimer’s Disease Sequencing Project (ADSP) whole genome sequencing data for fine-mapping analyses.
- Samples: 16,905 individuals of European ancestry
- Variant call: WGS, high-quality SNPs and indels
- Use: LD reference for SuSiE-RSS fine-mapping of xQTL and AD GWAS loci
| Resource | Synapse ID |
|---|---|
| ADSP WGS LD reference (genotype data) | syn69670651 |
| ADSP WGS LD reference (LD matrices) | syn69670652 |