Show simple item record

dc.creatorToulis, P
dc.creatorAiroldi, EM
dc.date.accessioned2021-02-03T00:19:57Z
dc.date.available2021-02-03T00:19:57Z
dc.date.issued2017-08-01
dc.identifier.issn0090-5364
dc.identifier.doihttp://dx.doi.org/10.34944/dspace/5673
dc.identifier.otherEZ0LL (isidoc)
dc.identifier.urihttp://hdl.handle.net/20.500.12613/5691
dc.description.abstract© 2017 Institute of Mathematical Statistics. Stochastic gradient descent procedures have gained popularity for parameter estimation from large data sets. However, their statistical properties are not well understood, in theory. And in practice, avoiding numerical instability requires careful tuning of key parameters. Here, we introduce implicit stochastic gradient descent procedures, which involve parameter updates that are implicitly defined. Intuitively, implicit updates shrink standard stochastic gradient descent updates. The amount of shrinkage depends on the observed Fisher information matrix, which does not need to be explicitly computed; thus, implicit procedures increase stability without increasing the computational burden. Our theoretical analysis provides the first full characterization of the asymptotic behavior of both standard and implicit stochastic gradient descent-based estimators, including finite-sample error bounds. Importantly, analytical expressions for the variances of these stochastic gradient-based estimators reveal their exact loss of efficiency. We also develop new algorithms to compute implicit stochastic gradient descent-based estimators for generalized linear models, Cox proportional hazards, M-estimators, in practice, and perform extensive experiments. Our results suggest that implicit stochastic gradient descent procedures are poised to become a workhorse for approximate inference from large data sets.
dc.format.extent1694-1727
dc.language.isoen
dc.relation.haspartAnnals of Statistics
dc.relation.isreferencedbyInstitute of Mathematical Statistics
dc.subjectStochastic approximation
dc.subjectimplicit updates
dc.subjectasymptotic variance
dc.subjectgeneralized linear models
dc.subjectCox proportional hazards
dc.subjectM-estimation
dc.subjectmaximum likelihood
dc.subjectexponential family
dc.subjectstatistical efficiency
dc.subjectnumerical stability
dc.titleAsymptotic and finite-sample properties of estimators based on stochastic gradients
dc.typeArticle
dc.type.genreJournal Article
dc.relation.doi10.1214/16-AOS1506
dc.ada.noteFor Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
dc.creator.orcidAiroldi, Edoardo|0000-0002-3512-0542
dc.date.updated2021-02-03T00:19:54Z
refterms.dateFOA2021-02-03T00:19:58Z


Files in this item

Thumbnail
Name:
euclid.aos.1498636871.pdf
Size:
572.0Kb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record