R/RcppExports.R
f2_blocks_from_Relate.Rd
This function will calculate f2 statistics in blocks of prespecified size for all pairs of populations specified in the poplabels file. Please refer to the Relate documentation for input file formats (https://myersgroup.github.io/relate/). The output is in a format that is directly accepted by the admixtools R package to calculate f3, f4, f4ratio, D statistics and more (https://uqrmaie1.github.io/admixtools/).
f2_blocks_from_Relate(
file_anc,
file_mut,
poplabels,
file_map = NULL,
chrs = NULL,
blgsize = NULL,
mu = NULL,
tmin = NULL,
t = NULL,
transitions = NULL,
use_muts = NULL,
minMAF = NULL,
dump_blockpos = NULL,
apply_corr = NULL
)
Filename of anc file. If chrs is specified, this should only be the prefix, resulting in filenames of ${file_anc}_chr${chr}.anc(.gz).
Filename of mut file. If chrs is specified, this should only be the prefix, resulting in filenames of ${file_anc}_chr${chr}.anc(.gz).
Filename of poplabels file
(Optional) File prefix of recombination map. Not needed if blgsize is given in base-pairs, i.e. blgsize > 100
(Optional) Vector of chromosome IDs
(Optional) SNP block size in Morgan. Default is 0.05 (5 cM). If blgsize is 100 or greater, if will be interpreted as base pair distance rather than centimorgan distance. If blgsize is negative, every tree is its own block.
(Optional) Per base per generation mutation rate to scale f2 values. Default: 1.25e-8
(Optional) Minimum time cutof in generations. Any lineages younger than tmin will be excluded from the analysis. Default: t = 0.
(Optional) Time cutoff in generations. Default: Inf
(Optional) Set this to FALSE to exclude transition SNPs. Only meaningful with use_muts
(Optional) Calculate traditional f2 statistics by only using mutations mapped to Relate trees. Default: false.
(Optional) Minimum frequency cutoff. Default: 1 (i.e. excl singletons)
(Optional) Filename of blockpos file.
(Optional) Use small sample size correction. Default: true.
3d array of dimension #groups x #groups x #blocks. Analogous to output of f2_from_geno in admixtools.
file_anc <- system.file("sim/msprime_ad0.8_split250_1_chr1.anc.gz", package = "twigstats")
file_mut <- system.file("sim/msprime_ad0.8_split250_1_chr1.mut.gz", package = "twigstats")
poplabels <- system.file("sim/msprime_ad0.8_split250_1.poplabels", package = "twigstats")
file_map <- system.file("sim/genetic_map_combined_b37_chr1.txt.gz", package = "twigstats")
#Calculate f2s between all pairs of populations
f2_blocks <- f2_blocks_from_Relate(file_anc, file_mut, poplabels, file_map)
f4_ratio(f2_blocks, popX="PX", popI="P1", pop1="P2", pop2="P3", popO="P4")
#> popO popI pop1 pop2 popX val se
#> 1 P4 P1 P2 P3 PX 0.9974217 0.3076228
#Use a cutoff of 500 generations
f2_blocks <- f2_blocks_from_Relate(file_anc, file_mut, poplabels, file_map, t = 500)
f4_ratio(f2_blocks, popX="PX", popI="P1", pop1="P2", pop2="P3", popO="P4")
#> popO popI pop1 pop2 popX val se
#> 1 P4 P1 P2 P3 PX 0.8110266 0.02085804