Fix vignette

mikkoch · mikkoch · commit af46c6876831 · 2025-01-18T13:34:23.000-05:00
diff --git a/doc/superadmixture.Rmd b/doc/superadmixture.Rmd
@@ -234,7 +234,7 @@ data("fam_amr", package = "superadmixture")
 
 Since the number of loci of the AMR subset of 1000 Genomes dataset is too large for a quick analysis, we created a subset of this dataset by first applying allele frequency filters and LD-pruning to the AMR dataset. We then randomly selected 10,000 SNPs out of LD-pruned SNP sets. We also re-ordered individuals according to their pairwise kinship level. This subset is available in `data/X_amr.rda`. This data has `r nrow(X_amr)` individuals and `r ncol(X_amr)` loci. The associated fam file can be found in `data/fam_amr.rda`. These data can be reproduced by scripts `data-raw/{amr.bash,amr.R}`. 
 
-We adopt the `popkin` package to obtain the Ochoa-Storey (OS) estimate of coancestry among individuals. The following chunk of the code estimates the individual-level coancestry according to the Ochoa-Storey (OS) method by `popkin` package. It should noted that the `popkin` function returns the kinship coefficients instead of the coancestry coefficients. Therefore, we use the `inbr_diag` function in the `popkin` package to map kinship coefficients $\phi_{jk}$'s to coancestry coefficients $\theta_{jk}$'s:
+We adopt the `popkin` package to obtain the Ochoa-Storey (OS) estimate of coancestry among individuals. The following code chunk estimates the individual-level coancestry $\hat{\boldsymbol{\Theta}}^{\text{OS}}$ with the OS method implemented in the `popkin` package. It should be noted that the `popkin` function returns kinship coefficients rather than coancestry coefficients. Therefore, we use the `inbr_diag` function in the `popkin` package to map the kinship coefficients $\phi_{jk}$ to the coancestry coefficients $\theta_{jk}$:
 
 \[
 \theta_{jk} = 
@@ -308,7 +308,7 @@ legend_color_categories(colors = colors_subpops, categories = subpop_order, labe
 
 ## Estimating admixture proportions and coancestry among antecedent populations
 
-The following chunk of the code estimates the admixture proportions $\bQ$ from genotypes. We first estimate the individual specific allele frequencies $\boldsymbol{\Pi}$ using the `est_p_indiv` function in the `superadmixture` package. We then estimate $\bQ$ by decomposing $\boldsymbol{\Pi}$ with `factor_p_indiv` function in the `superadmixture` package.
+The following code chunk estimates the admixture proportions $\bQ$ from genotypes. We first estimate the individual-specific allele frequencies $\boldsymbol{\Pi}$ using the `est_p_indiv` function in the `superadmixture` package. We then estimate $\bQ$ by decomposing $\boldsymbol{\Pi}$ with the `factor_p_indiv` function in the `superadmixture` package.
 
 ```{r estimate_admix_props_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 library(superadmixture)
@@ -323,7 +323,7 @@ obj <- factor_p_indiv(p_indiv, k_antepops = 3, rowspace = rowspace, verbose = FA
 Q_hat <- obj$Q_hat
 ```
 
-After obtaining individual-level coancestry `coanc_indiv` and admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate population coancestry under the super admixture model and under the standard admixture model. 
+After obtaining the individual-level coancestry `coanc_indiv` and the admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate the population coancestry $\boldsymbol{\Lambda}$ under the super admixture model and under the standard admixture model. In our manuscript, `coanc_pops_sup` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{sup}}$ and `coanc_pops_std` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{std}}$.
 
 ```{r estimate_coanc_pops_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 # estimate population coancestry under the super admixture model
@@ -381,7 +381,7 @@ heatmap_coanc_antepops(coanc_pops_sup, tl.offset = 1)
 
 ## Calculating the individual-level coancestry under the super admixture and standard admixture
 
-We then obtain the corresponding individual-coancestry under the super admixture model and under the standard admixture model. 
+We then obtain the corresponding individual-level coancestry under the super admixture model and under the standard admixture model. In our manuscript, `coanc_sup` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{sup}}$ and `coanc_std` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{std}}$.
 
 ```{r estimate_coanc_supadmix_stdadmix_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 coanc_sup <- t(Q_hat) %*% coanc_pops_sup %*% Q_hat
@@ -489,7 +489,7 @@ kinship <- ifelse(kinship< 0, 0, kinship)
 coanc_indiv <- ifelse(coanc_indiv < 0, 0, coanc_indiv)
 ```
 
-We can visualize the individual-level coancestry of the simulated data using `plot_popkin` function in the `popkin` function. We use the following helper function `plot_colors_subpops` to label the sub-populations.
+We can visualize the individual-level coancestry of the simulated data using the `plot_popkin` function in the `popkin` package. We use the following helper function `plot_colors_subpops` to label the sub-populations.
 
 ```{r}
 plot_colors_subpops <- function(pops, srt = 0, cex = 0.6, y = FALSE) {
@@ -538,7 +538,7 @@ obj <- factor_p_indiv(p_indiv, k_antepops = 7, rowspace = rowspace, verbose = FA
 Q_hat <- obj$Q_hat
 ```
 
-After obtaining individual-level coancestry `coanc_indiv` and admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate population coancestry under the super admixture model and under the standard admixture model. The output `coanc_pops_sup` is the coancestry of antecedent populations $\boldsymbol{\Lambda}$ in our manuscript. 
+After obtaining the individual-level coancestry `coanc_indiv` and the admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate the population coancestry $\boldsymbol{\Lambda}$ under the super admixture model and under the standard admixture model. In our manuscript, `coanc_pops_sup` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{sup}}$ and `coanc_pops_std` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{std}}$.
 
 ```{r estimate_coanc_antepops_hgdp, eval=!fast_run, message=FALSE, warning=FALSE}
 # estimate population coancestry under the super admixture model
@@ -606,7 +606,7 @@ heatmap_coanc_antepops(coanc_pops_sup, tl.offset = 1)
 
 ## Calculating individual-level coancestry under the super admixture and standard admixture
 
-We then obtain the corresponding individual-coancestry under the super admixture model and under the standard admixture model. 
+We then obtain the corresponding individual-level coancestry under the super admixture model and under the standard admixture model. In our manuscript, `coanc_sup` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{sup}}$ and `coanc_std` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{std}}$.
 
 ```{r estimate_coanc_supadmix_stdadmix_hgdp, eval=!fast_run, message=FALSE, warning=FALSE}
 coanc_sup <- t(Q_hat) %*% coanc_pops_sup %*% Q_hat
diff --git a/doc/superadmixture.html b/doc/superadmixture.html
diff --git a/vignettes/superadmixture.Rmd b/vignettes/superadmixture.Rmd
@@ -234,7 +234,7 @@ data("fam_amr", package = "superadmixture")
 
 Since the number of loci of the AMR subset of 1000 Genomes dataset is too large for a quick analysis, we created a subset of this dataset by first applying allele frequency filters and LD-pruning to the AMR dataset. We then randomly selected 10,000 SNPs out of LD-pruned SNP sets. We also re-ordered individuals according to their pairwise kinship level. This subset is available in `data/X_amr.rda`. This data has `r nrow(X_amr)` individuals and `r ncol(X_amr)` loci. The associated fam file can be found in `data/fam_amr.rda`. These data can be reproduced by scripts `data-raw/{amr.bash,amr.R}`. 
 
-We adopt the `popkin` package to obtain the Ochoa-Storey (OS) estimate of coancestry among individuals. The following chunk of the code estimates the individual-level coancestry according to the Ochoa-Storey (OS) method by `popkin` package. It should noted that the `popkin` function returns the kinship coefficients instead of the coancestry coefficients. Therefore, we use the `inbr_diag` function in the `popkin` package to map kinship coefficients $\phi_{jk}$'s to coancestry coefficients $\theta_{jk}$'s:
+We adopt the `popkin` package to obtain the Ochoa-Storey (OS) estimate of coancestry among individuals. The following code chunk estimates the individual-level coancestry $\hat{\boldsymbol{\Theta}}^{\text{OS}}$ with the OS method implemented in the `popkin` package. It should be noted that the `popkin` function returns kinship coefficients rather than coancestry coefficients. Therefore, we use the `inbr_diag` function in the `popkin` package to map the kinship coefficients $\phi_{jk}$ to the coancestry coefficients $\theta_{jk}$:
 
 \[
 \theta_{jk} = 
@@ -308,7 +308,7 @@ legend_color_categories(colors = colors_subpops, categories = subpop_order, labe
 
 ## Estimating admixture proportions and coancestry among antecedent populations
 
-The following chunk of the code estimates the admixture proportions $\bQ$ from genotypes. We first estimate the individual specific allele frequencies $\boldsymbol{\Pi}$ using the `est_p_indiv` function in the `superadmixture` package. We then estimate $\bQ$ by decomposing $\boldsymbol{\Pi}$ with `factor_p_indiv` function in the `superadmixture` package.
+The following code chunk estimates the admixture proportions $\bQ$ from genotypes. We first estimate the individual-specific allele frequencies $\boldsymbol{\Pi}$ using the `est_p_indiv` function in the `superadmixture` package. We then estimate $\bQ$ by decomposing $\boldsymbol{\Pi}$ with the `factor_p_indiv` function in the `superadmixture` package.
 
 ```{r estimate_admix_props_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 library(superadmixture)
@@ -323,7 +323,7 @@ obj <- factor_p_indiv(p_indiv, k_antepops = 3, rowspace = rowspace, verbose = FA
 Q_hat <- obj$Q_hat
 ```
 
-After obtaining individual-level coancestry `coanc_indiv` and admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate population coancestry under the super admixture model and under the standard admixture model. 
+After obtaining the individual-level coancestry `coanc_indiv` and the admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate the population coancestry $\boldsymbol{\Lambda}$ under the super admixture model and under the standard admixture model. In our manuscript, `coanc_pops_sup` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{sup}}$ and `coanc_pops_std` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{std}}$.
 
 ```{r estimate_coanc_pops_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 # estimate population coancestry under the super admixture model
@@ -381,7 +381,7 @@ heatmap_coanc_antepops(coanc_pops_sup, tl.offset = 1)
 
 ## Calculating the individual-level coancestry under the super admixture and standard admixture
 
-We then obtain the corresponding individual-coancestry under the super admixture model and under the standard admixture model. 
+We then obtain the corresponding individual-level coancestry under the super admixture model and under the standard admixture model. In our manuscript, `coanc_sup` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{sup}}$ and `coanc_std` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{std}}$.
 
 ```{r estimate_coanc_supadmix_stdadmix_amr, eval=!fast_run, message=FALSE, warning=FALSE}
 coanc_sup <- t(Q_hat) %*% coanc_pops_sup %*% Q_hat
@@ -489,7 +489,7 @@ kinship <- ifelse(kinship< 0, 0, kinship)
 coanc_indiv <- ifelse(coanc_indiv < 0, 0, coanc_indiv)
 ```
 
-We can visualize the individual-level coancestry of the simulated data using `plot_popkin` function in the `popkin` function. We use the following helper function `plot_colors_subpops` to label the sub-populations.
+We can visualize the individual-level coancestry of the simulated data using the `plot_popkin` function in the `popkin` package. We use the following helper function `plot_colors_subpops` to label the sub-populations.
 
 ```{r}
 plot_colors_subpops <- function(pops, srt = 0, cex = 0.6, y = FALSE) {
@@ -538,7 +538,7 @@ obj <- factor_p_indiv(p_indiv, k_antepops = 7, rowspace = rowspace, verbose = FA
 Q_hat <- obj$Q_hat
 ```
 
-After obtaining individual-level coancestry `coanc_indiv` and admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate population coancestry under the super admixture model and under the standard admixture model. The output `coanc_pops_sup` is the coancestry of antecedent populations $\boldsymbol{\Lambda}$ in our manuscript. 
+After obtaining the individual-level coancestry `coanc_indiv` and the admixture proportions `Q_hat`, we can use the function `est_coanc` to estimate the population coancestry $\boldsymbol{\Lambda}$ under the super admixture model and under the standard admixture model. In our manuscript, `coanc_pops_sup` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{sup}}$ and `coanc_pops_std` is denoted as $\hat{\boldsymbol{\Lambda}}^{\text{std}}$.
 
 ```{r estimate_coanc_antepops_hgdp, eval=!fast_run, message=FALSE, warning=FALSE}
 # estimate population coancestry under the super admixture model
@@ -606,7 +606,7 @@ heatmap_coanc_antepops(coanc_pops_sup, tl.offset = 1)
 
 ## Calculating individual-level coancestry under the super admixture and standard admixture
 
-We then obtain the corresponding individual-coancestry under the super admixture model and under the standard admixture model. 
+We then obtain the corresponding individual-level coancestry under the super admixture model and under the standard admixture model. In our manuscript, `coanc_sup` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{sup}}$ and `coanc_std` is denoted as $\hat{\boldsymbol{\Theta}}^{\text{std}}$.
 
 ```{r estimate_coanc_supadmix_stdadmix_hgdp, eval=!fast_run, message=FALSE, warning=FALSE}
 coanc_sup <- t(Q_hat) %*% coanc_pops_sup %*% Q_hat