Learner 1 (L1) - is a 5-year old – pre-school bilingual (90% German, 10% Slovak) daughter of the main author of this article
three HMPL-C2 exercise 1 (E1) sessions were executed on days 1, 3 and 5 of the study
each HMPL-C2-E1 session consisted of human-testing phase followed by a mutual human-machine learning phase
in each phase, sequences consisted of 5 repetitions of syllables started with occlusive labial consonant M or B and followed by the vowel A, E, I, O or U, thus yielding sequences from “MA MA MA MA MA” to “BU BU BU BU BU"
speech recordings collected during the learning phase subsequently provided input for the acoustic-model fine-tuning process
As of 2023, there exists no publicly available ASR model which could accurately and reliably process child speech.
In our IHIET 2023 article, we introduce two innovations with which the problem can be partially bypasssed in context of digitally supported reading acquisition app:
In concrete terms, we have shown that after three sessions focusing on acquisition of grapheme-vowel and CV-bigrapheme correspondences had lead, in case of one particular learner, to decrease of WER from 96% to 48%.