Single-trait Fine-mapping with FineBoost
Source:vignettes/FineBoost_Special_Case.Rmd
FineBoost_Special_Case.Rmd
This vignette demonstrates how to perform single-trait fine-mapping
analysis using FineBoost, a specialized single-trait version of
ColocBoost, with both individual-level data and summary statistics.
Specifically focusing on the 2nd trait with 2 causal variants (644 and
2289) from the Ind_5traits
and Sumstat_5traits
datasets included in the package.
1. Fine-mapping with individual-level data
In this section, we demonstrate how to perform fine-mapping using
individual-level genotype (X
) and phenotype
(Y
) data. This approach uses raw data directly to identify
causal variants.
# Load example data
data(Ind_5traits)
X <- Ind_5traits$X[[2]]
Y <- Ind_5traits$Y[[2]]
res <- colocboost(X = X, Y = Y)
#> Validating input data.
#> Starting gradient boosting algorithm.
#> Gradient boosting for outcome 1 converged after 44 iterations!
#> Performing inference on colocalization events.
colocboost_plot(res)
2. Fine-mapping with summary statistics
This section demonstrates fine-mapping analysis using summary statistics along with a proper LD matrix.
# Load example data
data(Sumstat_5traits)
sumstat <- Sumstat_5traits$sumstat[[2]]
LD <- get_cormat(Ind_5traits$X[[2]])
res <- colocboost(sumstat = sumstat, LD = LD)
#> Validating input data.
#> Starting gradient boosting algorithm.
#> Gradient boosting for outcome 1 converged after 44 iterations!
#> Performing inference on colocalization events.
colocboost_plot(res)
3. LD-free fine-mapping with one causal variant assumption
In scenarios where LD information is unavailable, FineBoost can still perform fine-mapping under the assumption that there is a single causal variant. This approach is less computationally intensive but assumes that only one variant within a region is causal.
# Load example data
res <- colocboost(sumstat = sumstat)
#> Validating input data.
#> Warning in colocboost(sumstat = sumstat): Providing the LD for summary
#> statistics data is highly recommended. Without LD, only a single iteration will
#> be performed under the assumption of one causal variable per outcome.
#> Additionally, the purity of CoS cannot be evaluated!
#> Starting gradient boosting algorithm.
#> Running ColocBoost with assumption of one causal per outcome per region!
#> Performing inference on colocalization events.
colocboost_plot(res)