Accéder directement au contenu Accéder directement à la navigation
Article dans une revue

Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training

Abstract : Phoneme pronunciations are usually considered as basic skills for learning a foreign language. Practicing the pronunciations in a computer-assisted way is helpful in a self-directed or long-distance learning environment. Recent researches indicate that machine learning is a promising method to build high-performance computer-assisted pronunciation training modalities. Many data-driven classifying models, such as support vector machines, backpropagation networks, deep neural networks and convolutional neural networks, are increasingly widely used for it. Yet, the acoustic waveforms of phoneme are essentially modulated from the base vibrations of vocal cords, and this fact somehow makes the predictors collinear, distorting the classifying models. A commonly-used solution to address this issue is to suppressing the collinearity of predictors via partial least square regressing algorithm. It allows to obtain high-quality predictor weighting results via predictor relationship analysis. However, as a linear regressor, the classifiers of this type possess very simple topology structures, constraining the universality of the regressors. For this issue, this paper presents an heterogeneous phoneme recognition framework which can further benefit the phoneme pronunciation diagnostic tasks by combining the partial least square with support vector machines. A French phoneme data set containing 4830 samples is established for the evaluation experiments. The experiments of this paper demonstrates that the new method improves the accuracy performance of the phoneme classifiers by 0.21 − 8.47% comparing to state-of-the-arts with different data training data density.
Type de document :
Article dans une revue
Liste complète des métadonnées

https://hal-univ-bourgogne.archives-ouvertes.fr/hal-03529206
Contributeur : Yannick Benezeth Connectez-vous pour contacter le contributeur
Soumis le : lundi 17 janvier 2022 - 14:57:46
Dernière modification le : jeudi 4 août 2022 - 17:07:33
Archivage à long terme le : : lundi 18 avril 2022 - 20:49:28

Fichier

pone_2021.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

Collections

Citation

Yanjing Bi, Chao Li, Yannick Benezeth, Fan Yang. Impacts of multicollinearity on CAPT modalities: An heterogeneous machine learning framework for computer-assisted French phoneme pronunciation training. PLoS ONE, Public Library of Science, 2021, 16 (10), pp.e0257901. ⟨10.1371/journal.pone.0257901⟩. ⟨hal-03529206⟩

Partager

Métriques

Consultations de la notice

13

Téléchargements de fichiers

18