Login / Signup

Simulating Speech Error Patterns Across Languages and Different Datasets.

Sofia StrömbergssonJana GötzeJens EdlundKristina Nilsson Björkenstam
Published in: Language and speech (2021)
Children's speech acquisition is influenced by universal and language-specific forces. Some speech error patterns (or phonological processes) in children's speech are observed in many languages, but the same error pattern may have different effects in different languages. We aimed to explore phonological effects of the same speech error patterns across different languages, target audiences and discourse modes, using a novel method for large-scale corpus investigation. As an additional aim, we investigated the face validity of five different phonological effect measures by relating them to subjective ratings of assumed effects on intelligibility, as provided by practicing speech-language pathologists. Six frequently attested speech error patterns were simulated in authentic corpus data: backing, fronting, stopping, /r/-weakening, cluster reduction and weak syllable deletion-each simulation resulting in a "misarticulated" version of the original corpus. Phonological effects were quantified using five separate metrics of phonological complexity and distance from expected target forms. Using Swedish child-speech data as a reference, phonological effects were compared between this reference and a) child speech in Norwegian and English, and b) data representing different modes of discourse (spoken/written) and target audiences (adults/children) in Swedish. Of the speech error patterns, backing-the one atypical pattern of those included-was found to cause the most detrimental effects, across languages as well as across modes and speaker ages. However, none of the measures reflects intuitive rankings as provided by clinicians regarding effects on intelligibility, thus corroborating earlier reports that phonological competence is not translatable into levels of intelligibility.
Keyphrases
  • working memory
  • hearing loss
  • electronic health record
  • palliative care
  • big data
  • emergency department
  • machine learning
  • high resolution
  • deep learning
  • adverse drug
  • psychometric properties
  • high speed
  • virtual reality