Login / Signup

Decoding and synthesizing tonal language speech from brain activity.

Yan LiuZehao ZhaoMinpeng XuHaiqing YuYanming ZhuJie ZhangLinghao BuXiaoluo ZhangJunfeng LuYuanning LiDong MingJin-Song Wu
Published in: Science advances (2023)
Recent studies have shown that the feasibility of speech brain-computer interfaces (BCIs) as a clinically valid treatment in helping nontonal language patients with communication disorders restore their speech ability. However, tonal language speech BCI is challenging because additional precise control of laryngeal movements to produce lexical tones is required. Thus, the model should emphasize the features from the tonal-related cortex. Here, we designed a modularized multistream neural network that directly synthesizes tonal language speech from intracranial recordings. The network decoded lexical tones and base syllables independently via parallel streams of neural network modules inspired by neuroscience findings. The speech was synthesized by combining tonal syllable labels with nondiscriminant speech neural activity. Compared to commonly used baseline models, our proposed models achieved higher performance with modest training data and computational costs. These findings raise a potential strategy for approaching tonal language speech restoration.
Keyphrases
  • neural network
  • autism spectrum disorder
  • hearing loss
  • climate change
  • functional connectivity
  • resting state
  • subarachnoid hemorrhage
  • cerebral ischemia