DESIGN OF AN AUTOMATIC WORD BOUNDARY DETECTION SYSTEM USING THE COUNTING RULE
dc.contributor.advisor | Yantorno, Robert E. | |
dc.creator | Kanneganti, Sandeep | |
dc.date.accessioned | 2020-10-26T19:19:43Z | |
dc.date.available | 2020-10-26T19:19:43Z | |
dc.date.issued | 2011 | |
dc.identifier.other | 864885006 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12613/1573 | |
dc.description.abstract | Word boundary detection is the stepping stone for many applications like keyword spotting, speech recognition, etc. It is proved that fifty percent of the speech recognition errors are due to the errors in the word boundary detector. Efficient word boundary detection can reduce the recognition errors and improve the performance of keyword spotting algorithms. Word boundary detection also helps in reducing the search space in the keyword spotting algorithm. Speech is non-stationary in nature and most of the time no utterance of the same word will be same as another utterance of same word. This makes it challenging to develop any speech processing algorithm. Many algorithms, to detect word boundaries, use acoustic features, lexical cues, energy, pitch etc. Acoustic features of energy, pitch and Teager Energy were used in this research to detect word boundaries. The strengths and drawbacks of each of the techniques are identified and the information from all the techniques was fused to obtain improved word boundary detection. Energy was able to detect word boundaries with 56% of the time, pitch with 68% of the time and Teager Energy with 72% of the time. Simple counting rule, which is based on reinforcement learning, was used in this research to fuse the detections from the three techniques to make a final decision on the word boundaries. This system does not need prior knowledge about the detection and false alarm probabilities of the techniques. The weights are adapted with the outcome in every iteration. Fusion of the decisions from energy, Teager Energy and pitch yielded a 79% hit rate on spontaneous speech using counting rule whereas linear opinion pool and log opinion pool produced 73% and 74% hit rate respectively. | |
dc.format.extent | 65 pages | |
dc.language.iso | eng | |
dc.publisher | Temple University. Libraries | |
dc.relation.ispartof | Theses and Dissertations | |
dc.rights | IN COPYRIGHT- This Rights Statement can be used for an Item that is in copyright. Using this statement implies that the organization making this Item available has determined that the Item is in copyright and either is the rights-holder, has obtained permission from the rights-holder(s) to make their Work(s) available, or makes the Item available under an exception or limitation to copyright (including Fair Use) that entitles it to make the Item available. | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | Electrical Engineering | |
dc.title | DESIGN OF AN AUTOMATIC WORD BOUNDARY DETECTION SYSTEM USING THE COUNTING RULE | |
dc.type | Text | |
dc.type.genre | Thesis/Dissertation | |
dc.contributor.committeemember | Picone, Joseph | |
dc.contributor.committeemember | Silage, Dennis | |
dc.description.department | Electrical and Computer Engineering | |
dc.relation.doi | http://dx.doi.org/10.34944/dspace/1555 | |
dc.ada.note | For Americans with Disabilities Act (ADA) accommodation, including help with reading this content, please contact scholarshare@temple.edu | |
dc.description.degree | M.S.E. | |
refterms.dateFOA | 2020-10-26T19:19:43Z |