Loading...
Multiple Testing in the Presence of Correlations
Banerjee, Bhramori
Banerjee, Bhramori
Citations
Altmetric:
Genre
Thesis/Dissertation
Date
2011
Advisor
Committee member
Group
Department
Statistics
Subject
Permanent link to this record
Collections
Research Projects
Organizational Units
Journal Issue
DOI
http://dx.doi.org/10.34944/dspace/716
Abstract
Simultaneous testing of multiple null hypotheses has now become an integral part of statistical analysis of data arising from modern scientific investigations. Often the test statistics in such multiple testing problem are correlated. The research in this dissertation is motivated by the scope of improving or extending existing methods to incorporate correlation in the data. Sarkar (2008) proposes controlling the pairwise false discovery rate (Pairwise-FDR), which inherently takes into account the dependence among the p-values, thereby making it a more robust, less conservative and more powerful under dependence than the usual notion of FDR. In this dissertation, we further investigate the performance of Pairwise-FDR under a dependent mixture model. In particular, we consider a step-up method to control the Pairwise-FDR under this model assuming that the correlation between any two p-values is the same (exchangeable). We also suggest improving this method by incorporating an estimate of the number of pairs of true null hypotheses developed under this model. Efron (2007, Journal of the American Statistical Association 102, 93-103) proposed a novel approach to incorporate dependence among the null p-values into a multiple testing method controlling false discoveries. In this dissertation, we try to investigate the scope of utilizing this approach by proposing alternative versions of adaptive Bonferroni and BH methods which estimates the number of true null hypotheses from the empirical null distribution introduced by Efron. These newer adaptive procedures have been numerically shown to perform better than existing adaptive Bonferroni or BH methods within a wider range of dependence. A gene expression microarray data set has been used to highlight the difference in results obtained upon applying the proposed and other adaptive BH methods. Another approach to address the presence of correlation is motivated by the scope of utilizing the dependence structure of the data towards further improving some multiple testing methods while maintaining control of some error rate. The dependence structure of the data is incorporated using pairwise weights. In this dissertation we propose a weighted version of the pairwise FDR (Sarkar, 2008) using pairwise weights and a method controlling the weighted pairwise- FDR. We give a discussion on the application of such weighted procedure and suggest some weighting schemes that generates pairwise weights.
Description
Citation
Citation to related work
Has part
ADA compliance
For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu