IDEPI: Rapid Prediction of HIV-1 Antibody Epitopes and Other Phenotypic Features from Sequence Data Using a Flexible Machine Learning Platform
Genre
Journal ArticleDate
2014-01-01Author
Hepler, NLScheffler, K
Weaver, S
Murrell, B
Richman, DD
Burton, DR
Poignard, P
Smith, DM
Kosakovsky Pond, SL
Subject
AIDS Dementia ComplexAlgorithms
Antibodies, Neutralizing
Computational Biology
Drug Resistance, Viral
Epitopes
HIV Antibodies
HIV Infections
HIV-1
Human Immunodeficiency Virus Proteins
Humans
Machine Learning
Phenotype
Sequence Analysis, Protein
Software
Permanent link to this record
http://hdl.handle.net/20.500.12613/5333
Metadata
Show full item recordDOI
10.1371/journal.pcbi.1003842Abstract
© 2014 Hepler et al. Since its identification in 1983, HIV-1 has been the focus of a research effort unprecedented in scope and difficulty, whose ultimate goals — a cure and a vaccine – remain elusive. One of the fundamental challenges in accomplishing these goals is the tremendous genetic variability of the virus, with some genes differing at as many as 40% of nucleotide positions among circulating strains. Because of this, the genetic bases of many viral phenotypes, most notably the susceptibility to neutralization by a particular antibody, are difficult to identify computationally. Drawing upon open-source general-purpose machine learning algorithms and libraries, we have developed a software package IDEPI (IDentify EPItopes) for learning genotype-to-phenotype predictive models from sequences with known phenotypes. IDEPI can apply learned models to classify sequences of unknown phenotypes, and also identify specific sequence features which contribute to a particular phenotype. We demonstrate that IDEPI achieves performance similar to or better than that of previously published approaches on four well-studied problems: finding the epitopes of broadly neutralizing antibodies (bNab), determining coreceptor tropism of the virus, identifying compartment-specific genetic signatures of the virus, and deducing drug-resistance associated mutations. The cross-platform Python source code (released under the GPL 3.0 license), documentation, issue tracking, and a pre-configured virtual machine for IDEPI can be found at https://github.com/veg/idepi.Citation to related work
Public Library of Science (PLoS)Has part
PLoS Computational BiologyADA compliance
For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.eduae974a485f413a2113503eed53cd6c53
http://dx.doi.org/10.34944/dspace/5315