This paper describes modifications and improvements to the acoustic recognition component of the SPICOS system for the DARPA naval resource management task. They include the modeling of the continuous mixture densities of the acoustic vectors, the choice of suitable context-dependent phoneme units and the construction of generalized context phoneme units, and the modeling of transitional information in the acoustic vector. The experimental results show that the critical factors are the acoustic resolution of the probability distributions and the context information captured in the acoustic vectors. With these enhancements, the system attained word error rates of 23.6% and 26.5% on two test sets in speaker-independent recognition mode when trained on 80 speakers. The word pair grammar reduced the word error rates to 7.1% and 9.3%, respectively.





    Title:

    Experiments on mixture-density phoneme-modelling for the speaker-independent 1000-word speech recognition DARPA task


    Additional title information:

    Experimente für eine Ansatz-verteilte Lautmodellierung für die Sprecher-unabhängige 1000-Wort Spracherkennung DARP


    Contributors:
    Ney, H. (author)


    Publication date:

    1990


    Format / extent:

    4 pages, 9 references


    Media type:

    Article (conference)


    Format:

    Print


    Language:

    English




    Mathematical Analysis and Speaker-Independent Speech Recognition

    Gur'yanov, A. E. | British Library Online Contents | 1996


    The DARPA PerceptOR evaluation experiments

    Krotkov, E. / Fish, S. / Jackel, L. et al. | British Library Online Contents | 2007


    Air traffic control speech recognition system cross-task & speaker adaptation

    de Cordoba, R. / Ferreiros, J. / San-Segundo, R. et al. | IEEE | 2006



    An Improved HMM/VQ Training Procedure for Speaker-Independent Isolated Word Recognition

    Zhang, Y. / Alder, M. / IEEE; Hong Kong Chapter of Signal Processing | British Library Conference Proceedings | 1994