上海交通大学学报(医学版) ›› 2024, Vol. 44 ›› Issue (10): 1279-1286.doi: 10.3969/j.issn.1674-8115.2024.10.010

• 论著 · 技术与方法 • 上一篇    

听觉脑干植入声码器模型的开发及验证

张钦杰1,2(), 黄穗3, 谭皓月1,2, 周祥1,2, 王君怡1,2, 刘雨滋1,2, 文雯1,2, 郭嘉1,2, 吴皓1,2(), 贾欢1,2()   

  1. 1.上海交通大学医学院附属第九人民医院耳鼻咽喉头颈外科,上海 200011
    2.上海交通大学医学院耳科学研究所,上海市耳鼻疾病转化医学重点实验室,上海 200125
    3.浙江诺尔康神经电子科技股份有限公司,杭州 311100
  • 收稿日期:2024-02-29 接受日期:2024-04-25 出版日期:2024-10-28 发布日期:2024-10-28
  • 通讯作者: 吴皓,贾欢 E-mail:zqj0727@sjtu.edu.cn;wuhao@shsmu.edu.cn;huan.jia.orl@shsmu.edu.cn
  • 作者简介:张钦杰(1998—),男,硕士生;电子信箱: zqj0727@sjtu.edu.cn
  • 基金资助:
    上海市耳鼻疾病转化医学重点实验室项目(14DZ2260300);上海市黄浦区产业扶持基金(XK2019015);上海市人才发展基金(2019047);上海交通大学医学院转化医学协同创新项目(TM202011)

Establishment and verification of auditory brainstem implant vocoder model

ZHANG Qinjie1,2(), HUANG Sui3, TAN Haoyue1,2, ZHOU Xiang1,2, WANG Junyi1,2, LIU Yuzi1,2, WEN Wen1,2, GUO Jia1,2, WU Hao1,2(), JIA Huan1,2()   

  1. 1.Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China
    2.Ear Institute, Shanghai Jiao Tong University School of Medicine; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai 200125, China
    3.Nurotron Biotechnology Co. , Ltd. , Hangzhou 311100, China
  • Received:2024-02-29 Accepted:2024-04-25 Online:2024-10-28 Published:2024-10-28
  • Contact: WU Hao,JIA Huan E-mail:zqj0727@sjtu.edu.cn;wuhao@shsmu.edu.cn;huan.jia.orl@shsmu.edu.cn
  • Supported by:
    Program of Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases(14DZ2260300);Shanghai Huangpu District Industrial Support Fund(XK2019015);Shanghai Talent Development Fund(2019047);Collaborative Innovation Project for Translational Medicine at Shanghai Jiao Tong University School of Medicine(TM202011)

摘要:

目的·基于人工耳蜗(cochlear implant,CI)声码器及听觉脑干植入(auditory brainstem implant,ABI)电极阵列排布特征,开发ABI声码器并验证其可靠性。方法·通过MATLAB软件构建“n-of-m”编码策略的CI/ABI声码器。每一帧内仅选取能量最大的n个通道的包络,其中串扰系数(interaction coefficient,IC)(范围1~3)、通道数(范围5~22)、电极阵列拓扑模式(CI/ABI)为可调节参数,可合成仿真语音。招募正常听力受试者进行闭合式仿真音素的测听,并将音素识别准确率(元音20题/条件,辅音11题/条件)与参考文献中对应条件的CI及ABI植入者的音素准确率进行比较,明确声码器的IC值并验证其可靠性。结果·声码器可成功合成所有测试用仿真声。IC2、IC3 2个条件的仿真元音及辅音识别准确率与参考文献相应数据比较,差异均无统计学意义(P>0.05);IC2与参考文献中元/辅音准确率的差值较IC3与参考文献数据的差值更小(元音|d|:1.6% vs. 20% ;辅音|d|:8.4% vs. 9.9%),确定本模型的最优IC为2。修改电极阵列拓扑模式为ABI,显示16通道ABI仿真音素识别准确率显著低于16通道CI。5~8通道间的ABI仿真元/辅音正确率比较,差异无统计学意义(P>0.05),与参考文献结论一致。结论·成功建立了基于“n-of-m”编码策略的CI/ABI声码器,并确定最佳IC。建立的ABI声码器经心理声学实验评估可靠性较高,可为ABI专属编码策略的模型验证提供合适的技术手段。

关键词: 听觉脑干植入, 声码器, 音素识别, 心理声学, 电极阵列拓扑

Abstract:

Objective ·To develope an auditory brainstem implant (ABI) vocoder based on cochlear implant (CI) vocoder characteristics and ABI electrode array topology, and to verify its reliability. Methods ·An "n-of-m" coding strategy CI/ABI vocoder was constructed based on MATLAB. Within each frame, only the envelopes of the n channels with the highest energy were selected. The interaction coefficient (IC) (range: 1?3), channel numbers (range: 5?22), and electrode array topology (CI/ABI) were adjustable parameters, allowing for the synthesis of simulated speech. Psychoacoustic evaluation was employed, recruiting normal hearing subjects to perform closed-set simulated phoneme perception. The phoneme recognition accuracy (20 vowel questions/condition, 11 consonant questions/condition) was compared with the corresponding conditions of CI and ABI from reference literature to determine the IC value of the vocoder and verify its reliability. Results ·The vocoder successfully synthesized all test stimuli. In the closed-set CI-simulated speech recognition, the simulated vowel and consonant recognition accuracy for IC2 and IC3 conditions showed no significant difference compared to the accuracy reported in the CI reference literature (P>0.05). The difference in vowel and consonant accuracy between IC2 and the literature was smaller than that between IC3 and the literature (vowel |d|=1.6% vs. 20%, consonant |d|=8.4% vs. 9.9%), thus determining the optimal interaction coefficient of this model as 2. Subsequently, when modifying the electrode array topology to ABI, it was found that the simulated phoneme recognition accuracy for a 16-channel ABI was significantly lower than that for the 16-channel CI group, consistent with the reported literature. The simulated vowel and consonant accuracy within the 5?8 channel range for ABI showed no significant difference (P>0.05), also aligning with the trend reported in the literature. Conclusion ·A CI/ABI vocoder based on "n-of-m" coding strategy is established and the optimal IC is determined. The established ABI encoder has been evaluated for high reliability through psychoacoustic experiments. It provides suitable technical means for validating ABI-specific coding strategies.

Key words: auditory brainstem implant, vocoder, phoneme recognition, psychoacoustic, electrode array topology

中图分类号: