dc.contributor.advisor: Sarkar, S. K. (Sanat K.)
dc.contributor.advisor: Han, Xu
dc.creator: Xu, Yanhui
dc.date.accessioned: 2020-11-05T19:50:43Z
dc.date.available: 2020-11-05T19:50:43Z
dc.date.issued: 2019
dc.identifier.uri: http://hdl.handle.net/20.500.12613/4068
dc.description.abstract: False discovery control in high-dimensional multiple testing is frequently encountered in many areas of scientific research. Under the multivariate normal assumption, Fan et al. (2012) proposed an approximate expression for the false discovery proportion (FDP) in large-scale multiple testing when a common threshold is used, and provided a consistent estimate of the realized FDP when the covariance matrix is known. They later extended this work to the case where the covariance matrix is unknown (Fan and Han, 2017). In practice, however, the multivariate normal assumption is often violated. In this thesis, we relax the normal assumption by developing a testing procedure for the nonparanormal distribution, which extends the Gaussian family to a much larger class of distributions. The nonparanormal distribution is a high-dimensional Gaussian copula with nonparametric marginals. Estimating the underlying monotone functions is key to a good FDP approximation. In simulation studies, our procedure achieved the smallest mean error in approximating the FDP among the methods compared. We give theoretical results on the performance of the estimated covariance matrix and on false rejections. On a real dataset, our method detected more differentially expressed genes while still keeping the FDP at a low level. This thesis provides an important tool for approximating the FDP in a given experiment where the normal assumption may not hold. We also developed a dependence-adjusted procedure that provides more power than the fixed-threshold method. In numerical studies, our procedure also shows robustness to heavy-tailed data under a variety of distributions.
dc.format.extent: 107 pages
dc.language.iso: eng
dc.publisher: Temple University. Libraries
dc.relation.ispartof: Theses and Dissertations
dc.rights: IN COPYRIGHT- This Rights Statement can be used for an Item that is in copyright. Using this statement implies that the organization making this Item available has determined that the Item is in copyright and either is the rights-holder, has obtained permission from the rights-holder(s) to make their Work(s) available, or makes the Item available under an exception or limitation to copyright (including Fair Use) that entitles it to make the Item available.
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject: Statistics
dc.title: Large Scale Multiple Testing for High-Dimensional Nonparanormal Data
dc.type: Text
dc.type.genre: Thesis/Dissertation
dc.description.department: Statistics
dc.relation.doi: http://dx.doi.org/10.34944/dspace/4050
dc.ada.note: For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
dc.description.degree: Ph.D.
refterms.dateFOA: 2020-11-05T19:50:43Z


Files in this item

Name: TETDEDXXu-temple-0225E-13616.pdf
Size: 4.277Mb
Format: PDF
