The forensicpopdata package provides allele frequency data for human genetic markers (STRs) commonly used in forensic genetics and kinship analysis. Currently two data sets are included:
NIST1036freqs
)UKDNA17freqs
)In addition, it is possible to access the STRidER
database through the read_STRidER_xml
function.
The forensicpopdata package is not yet available
from CRAN. You can install the forensicpopdata package
using the pak
package
in R:
# install.packages("pak")
::pak("mkruijver/forensicpopdata") pak
The data
function is used to load a data set.
require(forensicpopdata)
#> Loading required package: forensicpopdata
data(NIST1036freqs)
The data set is a named list for each population:
names(NIST1036freqs)
#> [1] "AfAm" "Asian" "Cauc" "Hisp"
For each population, a list of frequencies by locus name is available:
$AfAm$TH01
NIST1036freqs#> 5 6 7 8 9 9.3
#> 0.004385965 0.131578947 0.407894737 0.195906433 0.159356725 0.096491228
#> 10
#> 0.004385965
The read_STRidER_xml
function downloads the most recent
STRidER
database from
https://strider.online/
.
<- read_STRidER_xml() freqs
The data set is a named list for each population:
names(freqs)
#> [1] "Entire Database" "Africa" "America"
#> [4] "Asia" "Europe" "AUSTRIA"
#> [7] "BELGIUM" "BOSNIA AND HERZEGOWINA" "CZECH REPUBLIC"
#> [10] "DENMARK" "DOMINICAN REPUBLIC" "FINLAND"
#> [13] "FRANCE" "GERMANY" "GREECE"
#> [16] "HUNGARY" "IRELAND" "KENYA"
#> [19] "MONTENEGRO" "NORWAY" "POLAND"
#> [22] "SAUDI ARABIA" "SLOVAKIA" "SLOVENIA"
#> [25] "SOUTH AFRICA" "SPAIN" "SWEDEN"
#> [28] "SWITZERLAND" "THAILAND" "VIETNAM"
For each population, a list of frequencies by locus name is available:
$`Entire Database`$TH01
freqs#> 8.3 10 7 7.3 8 10.3
#> 3.58038e-04 2.72467e-02 2.26924e-01 3.58038e-05 1.13426e-01 3.58038e-05
#> 9.3 12 6 11 9 4
#> 2.01862e-01 1.43215e-04 1.93305e-01 6.44468e-04 2.35231e-01 3.58038e-05
#> 5 13.3
#> 7.16076e-04 3.58038e-05