Show simple item record

dc.creatorGhalwash, MF
dc.creatorCao, XH
dc.creatorStojkovic, I
dc.creatorObradovic, Z
dc.date.accessioned2021-01-27T21:49:41Z
dc.date.available2021-01-27T21:49:41Z
dc.date.issued2016-04-08
dc.identifier.issn1471-2105
dc.identifier.issn1471-2105
dc.identifier.doihttp://dx.doi.org/10.34944/dspace/5047
dc.identifier.other27059502 (pubmed)
dc.identifier.urihttp://hdl.handle.net/20.500.12613/5065
dc.description.abstract© 2016 Ghalwash et al. Background: Existing feature selection methods typically do not consider prior knowledge in the form of structural relationships among features. In this study, the features are structured based on prior knowledge into groups. The problem addressed in this article is how to select one representative feature from each group such that the selected features are jointly discriminating the classes. The problem is formulated as a binary constrained optimization and the combinatorial optimization is relaxed as a convex-concave problem, which is then transformed into a sequence of convex optimization problems so that the problem can be solved by any standard optimization algorithm. Moreover, a block coordinate gradient descent optimization algorithm is proposed for high dimensional feature selection, which in our experiments was four times faster than using a standard optimization algorithm. Results: In order to test the effectiveness of the proposed formulation, we used microarray analysis as a case study, where genes with similar expressions or similar molecular functions were grouped together. In particular, the proposed block coordinate gradient descent feature selection method is evaluated on five benchmark microarray gene expression datasets and evidence is provided that the proposed method gives more accurate results than the state-of-the-art gene selection methods. Out of 25 experiments, the proposed method achieved the highest average AUC in 13 experiments while the other methods achieved higher average AUC in no more than 6 experiments. Conclusion: A method is developed to select a feature from each group. When the features are grouped based on similarity in gene expression, we showed that the proposed algorithm is more accurate than state-of-the-art gene selection methods that are particularly developed to select highly discriminative and less redundant genes. In addition, the proposed method can exploit any grouping structure among features, while alternative methods are restricted to using similarity based grouping.
dc.format.extent158-
dc.language.isoen
dc.relation.haspartBMC Bioinformatics
dc.relation.isreferencedbySpringer Science and Business Media LLC
dc.rightsCC BY
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectStructured feature selection
dc.subjectBlock coordinate gradient descent
dc.subjectGene expression
dc.subjectMicroarray analysis
dc.subjectPrior knowledge
dc.titleStructured feature selection using coordinate descent optimization
dc.typeArticle
dc.type.genreJournal Article
dc.relation.doi10.1186/s12859-016-0954-4
dc.ada.noteFor Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
dc.date.updated2021-01-27T21:49:37Z
refterms.dateFOA2021-01-27T21:49:41Z


Files in this item

Thumbnail
Name:
Structured feature selection ...
Size:
1.693Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

CC BY
Except where otherwise noted, this item's license is described as CC BY