Assembles data for all the regions
a data frame of region definitions.
A data frame with columns: "id", "z", giving the z-scores for SNPs.
A data frame with columns: "id", "z", giving the z-scores for genes.
a list of preprocessed weights.
a list of data frames with SNP-to-region map for the reference.
The proportion of SNPs to be used for the parameter estimation and
initial screening region steps.
Smaller thin
parameters reduce runtime at the expense of accuracy.
The fine mapping step is rerun using full SNPs for regions with strong gene signals.
Inf or integer. Maximum number of SNPs in a region. Default is Inf, no limit. This can be useful if there are many SNPs in a region and you don't have enough memory to run the program. This applies to the last rerun step (using full SNPs and rerun susie for regions with strong gene signals) only.
Minimum number of genes for a group to be included.
remove SNPs if the total number of SNPs exceeds limit, options: "random", or "z" (trim SNPs with lower |z|). See parameter `maxSNP` for more information.
options for thinning SNPs, "reference": thin reference SNPs, "gwas": thin GWAS SNPs.
If TRUE, identify cross-boundary genes, and adjust region_data.
The number of cores used to parallelize susie over regions
seed for random sampling
path to the log file, if NULL will print log info on screen.
a list with region_data, updated weights, and cross-bounary genes