Submissions to Scholars Junction will be closed starting Monday, December 21, as we begin migrating to a new platform.

    • Login
    View Item  
    •   Scholars Junction
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    •   Scholars Junction
    • Theses and Dissertations
    • Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Search

    My Account

    Login Register

    About

    About This Repository Deposit Your Work Policies and Terms of Use Contact Us More Scholarly Communication Services

    Browse

    Entire Repository Communities & Collections Issue Date Authors Titles Subjects This Collection Issue Date Authors Titles Subjects

    Performance Analysis of Advanced Front Ends on the Aurora Large Vocabulary Evaluation

    View/ Open
    etd-11042003-101837.pdf (1.202 Mb )
    Author
    Parihar, Naveen.
    Item Type
    Thesis
    Advisor
    Picone, Joseph
    Committee
    Lazarou, Georgios Y.
    Younan, Nicolas H.
    Johnson, Corlis
    Jonkman, Jeff
    Metrics
    
    Abstract
    Over the past few years, speech recognition technology performance on tasks ranging from isolated digit recognition to conversational speech has dramatically improved. Performance on limited recognition tasks in noise-free environments is comparable to that achieved by human transcribers. This advancement in automatic speech recognition technology along with an increase in the compute power of mobile devices, standardization of communication protocols, and the explosion in the popularity of the mobile devices, has created an interest in flexible voice interfaces for mobile devices. However, speech recognition performance degrades dramatically in mobile environments which are inherently noisy. In the recent past, a great amount of effort has been spent on the development of front ends based on advanced noise robust approaches. The primary objective of this thesis was to analyze the performance of two advanced front ends, referred to as the QIO and MFA front ends, on a speech recognition task based on the Wall Street Journal database. Though the advanced front ends are shown to achieve a significant improvement over an industry-standard baseline front end, this improvement is not operationally significant. Further, we show that the results of this evaluation were not significantly impacted by suboptimal recognition system parameter settings. Without any front end-specific tuning, the MFA front end outperforms the QIO front end by 9.6% relative. With tuning, the relative performance gap increases to 15.8%. Finally, we also show that mismatched microphone and additive noise evaluation conditions resulted in a significant degradation in performance for both front ends.
    Degree
    Master of Science
    Major
    Electrical Engineering
    College
    College of Engineering
    Department
    Department of Electrical and Computer Engineering.
    URI
    https://hdl.handle.net/11668/19116
    Collections
    • Theses and Dissertations
    Show full item record
    Mississippi State University Libraries
    395 Hardy Rd
    P.O. Box 5408, Mississippi State, MS 39762-5408
    (662) 325-7668
    (662) 325-0011
    (662) 325-8183
    Contact repository admin Report a problem Terms of use Privacy policy Accessibility MSU Legal
     

     

    Mississippi State University Libraries
    395 Hardy Rd
    P.O. Box 5408, Mississippi State, MS 39762-5408
    (662) 325-7668
    (662) 325-0011
    (662) 325-8183
    Contact repository admin Report a problem Terms of use Privacy policy Accessibility MSU Legal