Journal of Shanghai Jiao Tong University (Medical Science) ›› 2024, Vol. 44 ›› Issue (10): 1279-1286.doi: 10.3969/j.issn.1674-8115.2024.10.010

• Techniques and methods • Previous Articles    

Establishment and verification of auditory brainstem implant vocoder model

ZHANG Qinjie1,2(), HUANG Sui3, TAN Haoyue1,2, ZHOU Xiang1,2, WANG Junyi1,2, LIU Yuzi1,2, WEN Wen1,2, GUO Jia1,2, WU Hao1,2(), JIA Huan1,2()   

  1. 1.Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China
    2.Ear Institute, Shanghai Jiao Tong University School of Medicine; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai 200125, China
    3.Nurotron Biotechnology Co. , Ltd. , Hangzhou 311100, China
  • Received:2024-02-29 Accepted:2024-04-25 Online:2024-10-28 Published:2024-10-28
  • Contact: WU Hao,JIA Huan E-mail:zqj0727@sjtu.edu.cn;wuhao@shsmu.edu.cn;huan.jia.orl@shsmu.edu.cn
  • Supported by:
    Program of Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases(14DZ2260300);Shanghai Huangpu District Industrial Support Fund(XK2019015);Shanghai Talent Development Fund(2019047);Collaborative Innovation Project for Translational Medicine at Shanghai Jiao Tong University School of Medicine(TM202011)

Abstract:

Objective ·To develope an auditory brainstem implant (ABI) vocoder based on cochlear implant (CI) vocoder characteristics and ABI electrode array topology, and to verify its reliability. Methods ·An "n-of-m" coding strategy CI/ABI vocoder was constructed based on MATLAB. Within each frame, only the envelopes of the n channels with the highest energy were selected. The interaction coefficient (IC) (range: 1?3), channel numbers (range: 5?22), and electrode array topology (CI/ABI) were adjustable parameters, allowing for the synthesis of simulated speech. Psychoacoustic evaluation was employed, recruiting normal hearing subjects to perform closed-set simulated phoneme perception. The phoneme recognition accuracy (20 vowel questions/condition, 11 consonant questions/condition) was compared with the corresponding conditions of CI and ABI from reference literature to determine the IC value of the vocoder and verify its reliability. Results ·The vocoder successfully synthesized all test stimuli. In the closed-set CI-simulated speech recognition, the simulated vowel and consonant recognition accuracy for IC2 and IC3 conditions showed no significant difference compared to the accuracy reported in the CI reference literature (P>0.05). The difference in vowel and consonant accuracy between IC2 and the literature was smaller than that between IC3 and the literature (vowel |d|=1.6% vs. 20%, consonant |d|=8.4% vs. 9.9%), thus determining the optimal interaction coefficient of this model as 2. Subsequently, when modifying the electrode array topology to ABI, it was found that the simulated phoneme recognition accuracy for a 16-channel ABI was significantly lower than that for the 16-channel CI group, consistent with the reported literature. The simulated vowel and consonant accuracy within the 5?8 channel range for ABI showed no significant difference (P>0.05), also aligning with the trend reported in the literature. Conclusion ·A CI/ABI vocoder based on "n-of-m" coding strategy is established and the optimal IC is determined. The established ABI encoder has been evaluated for high reliability through psychoacoustic experiments. It provides suitable technical means for validating ABI-specific coding strategies.

Key words: auditory brainstem implant, vocoder, phoneme recognition, psychoacoustic, electrode array topology

CLC Number: