• TWO-SAMPLE TESTS FOR HIGH DIMEMSIONAL MEANS WITH PREPIVOTING and DATA TRANSFORMATION

      Hellebuyck, Rafael Adriel; Department of Biostatistics and Epidemiology (2019-01-08)
      Within the medical field, the demand to store and analyze small sample, large variable data has become ever-abundant. Several two-sample tests for equality of means, including the revered Hotelling’s T2 test, have already been established when the combined sample size of both populations exceeds the dimension of the variables. However, tests such as Hotelling’s T2 become either unusable or output small power when the number of variables is greater than the combined sample size. We propose a test using both prepivoting and Edgeworth expansion that maintains high power in this higher dimensional scenario, known as the “large p small n ” problem. Our test’s finite sample performance is compared with other recently proposed tests designed to also handle the “large p small n ” situation. We apply our test to a microarray gene expression data set and report competitive rates for both power and Type-I error.