ROSMAP AC alternative splicing QTL#

Religious Orders Study (ROS) or the Rush Memory and Aging Project (MAP) AC alternative splicing.

Please refer to this document for an overview of the ROSMAP project.

Contact#

Ru Feng

Data Descriptions#

Verifying current avaliable location for the dataset now

Samples were extracted using Qiagen’s miRNeasy mini kit (cat. no. 217004) and the RNase free DNase Set (cat. no. 79254), and quantified by Nanodrop and quality was evaluated by Agilent Bioanalyzer.

BrainDescription
Batch 1638
Batch 2252 (117 for PolyA selection & 135 for rRNA depletion)
Batch 3251
Total1141

Notes regarding sample naming: Sequencing samples that were sequenced more than once include the sequencing batch number as the last 2 digits of the id. For example, sample 123_456789_02 is sample “123_456789”, from batch 2. The sample IDs are otherwise randomly-assigned identifiers. Some samples were re-sequenced due to poor quality or low output, they may have “redo” in the sample identifier.

See notebooks in:

The notebooks in this folder contain the commands and data wrangling codes for analysis of the eQTL and sQTL data in ROSMAP. (data wrangling exist because not all data are processed using the xqtl-pipeline from the beginning and need to be reformatted to fit one intermediate step of the pipeline). Since the sQTL phenotype pre-processing used eQTL STAR alignment output, some of the early stage code are shared.

Association data preprocessing#

Association scan using TensorQTL and summary statistics standardization#

  • TensorQTL.ipynb provides the pipeline to generate TensorQTL cis association results for all QTLs.
  • ROSMAP_sQTL_LeafCutter2_commands provides information about the input files for AC/DLPFC/PCC TensorQTL cis association in the base_params variable in [generate_command_1].

Analysis Description#

Brain multi-region RNA-seq data using LeafCutter2 was proccessed with the xQTL-pipeline, for analysis detial please check records here: sQTL_LeafCutter2

  • Genotype used in this analysis on AWS: s3://statfungen/ftp_fgc_xqtl/ROSMAP/genotype/analysis_ready/geno_by_chrom/
  • Phenotype used in this analysis on AWS: s3://statfungen/ftp_fgc_xqtl/sQTL/ROSMAP/AC/leafcutter2/analysis_ready/phenotype_preprocessing/
  • Covariates used in this analysis on AWS: s3://statfungen/ftp_fgc_xqtl/sQTL/ROSMAP/AC/leafcutter2/analysis_ready/covariate_preprocessing/ROSMAP_AC_perind.counts.noise_by_intron.QCed_minc1_mins10.gz_raw_data.qqnorm.imputed.bed.formated.rosmap_cov.ROSMAP_NIA_WGS.leftnorm.bcftools_qc.plink_qc.plink_qc.prune.pca.Marchenko_PC.gz

Results#

In this analysis we performed two methods: leafcutter2 and psichomics.

leafcutter2#

  • Output summary statistics are uploaded to the FTP server: /ftp_fgc_xqtl/projects/rna-seq/BU/ROSMAP_DLPFC/sQTL/association_scan/
  • Original files: s3://statfungen/ftp_fgc_xqtl/analysis_result/cis_association/ROSMAP/sQTL/AC/leafcutter2/

psichomics#

  • data generation using psichomics still in progress