Show simple item record

dc.creatorMidic, U
dc.creatorObradovic, Z
dc.date.accessioned2021-01-31T22:49:15Z
dc.date.available2021-01-31T22:49:15Z
dc.date.issued2012-01-01
dc.identifier.issn1477-5956
dc.identifier.issn1477-5956
dc.identifier.doihttp://dx.doi.org/10.34944/dspace/5465
dc.identifier.other970XX (isidoc)
dc.identifier.other22759577 (pubmed)
dc.identifier.urihttp://hdl.handle.net/20.500.12613/5483
dc.description.abstract© 2012 Midic and Obradovic; licensee BioMed Central Ltd. Background: Intrinsically disordered proteins (IDPs) and regions (IDRs) perform a variety of crucial biological functions despite lacking stable tertiary structure under physiological conditions in vitro. State-of-the-art sequencebased predictors of intrinsic disorder are achieving per-residue accuracies over 80%. In a genome-wide study of intrinsic disorder in human genome we observed a big difference in predicted disorder content between confirmed and putative human proteins. We investigated a hypothesis that this discrepancy is not correct, and that it is due to incorrectly annotated parts of the putative protein sequences that exhibit some similarities to confirmed IDRs, which lead to high predicted disorder content. Methods: To test this hypothesis we trained a predictor to discriminate sequences of real proteins from synthetic sequences that mimic errors of gene finding algorithms. We developed a procedure to create synthetic peptide sequences by translation of non-coding regions of genomic sequences and translation of coding regions with incorrect codon alignment. Results: Application of the developed predictor to putative human protein sequences showed that they contain a substantial fraction of incorrectly assigned regions. These regions are predicted to have higher levels of disorder content than correctly assigned regions. This partially, albeit not completely, explains the observed discrepancy in predicted disorder content between confirmed and putative human proteins. Conclusions: Our findings provide the first evidence that current practice of predicting disorder content in putative sequences should be reconsidered, as such estimates may be biased.
dc.format.extentS19-S19
dc.language.isoen
dc.relation.haspartProteome Science
dc.relation.isreferencedbySpringer Science and Business Media LLC
dc.rightsCC BY
dc.rights.urihttp://creativecommons.org/licenses/by/2.0
dc.subject0604 Genetics
dc.subjectBiomedical
dc.subjectGenetics
dc.subjectHuman Genome
dc.subject1.1 Normal biological development and functioning
dc.titleIntrinsic disorder in putative protein sequences
dc.typeArticle
dc.type.genreConference Proceeding
dc.relation.doi10.1186/1477-5956-10-S1-S19
dc.ada.noteFor Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu
dc.date.updated2021-01-31T22:49:12Z
refterms.dateFOA2021-01-31T22:49:16Z


Files in this item

Thumbnail
Name:
Intrinsic disorder in putative ...
Size:
1.436Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

CC BY
Except where otherwise noted, this item's license is described as CC BY