Checks if all the values are maximally distributed in the several batches. Aimed for categorical variables.
Arguments
- report
A data.frame which must contain a batch column. Which can be obtained with
inspect()
.- column
The name of the column one wants to inspect.
Examples
data(survey, package = "MASS")
columns <- c("Sex", "Age", "Smoke")
index <- design(pheno = survey[, columns], size_subset = 70,
iterations = 10)
batches <- inspect(index, survey[, columns])
distribution(batches, "Sex")
#> [1] TRUE
distribution(batches, "Smoke")
#> Warning: Smoke: 1 categories not totally distributed in all batches
#> [1] FALSE