51黑料吃瓜在线观看,51黑料官网|51黑料捷克街头搭讪_51黑料入口最新视频

設(shè)為首頁 |  加入收藏
首頁首頁 期刊簡介 消息通知 編委會 電子期刊 投稿須知 廣告合作 聯(lián)系我們
基于語音轉(zhuǎn)換技術(shù)的普通話電子喉語音增強(qiáng)方法研究

Enhancement of mandarin electrolarynx speech based on voice conversion technology

作者: 董睿  李立峰  牛海軍  史晚晴  李陽                         
單位:                                 北京航空航天大學(xué)生物醫(yī)學(xué)工程學(xué)院(北京100191)            
關(guān)鍵詞:                               電子喉;普通話;語音轉(zhuǎn)換;語音增強(qiáng)             
分類號:
出版年·卷·期(頁碼):2015·34·4(361-366)
摘要:

目的 電子喉是喉切除患者使用最多的語音恢復(fù)工具,但是電子喉語音存在發(fā)聲機(jī)械、音調(diào)單一、輻射噪聲大等缺點(diǎn),本文擬運(yùn)用語音轉(zhuǎn)換技術(shù)改善電子喉語音的發(fā)聲效果,提高語音自然度和可懂度。方法 選擇200句分別以自然發(fā)聲和電子喉發(fā)聲的標(biāo)準(zhǔn)普通話日常用語作為訓(xùn)練語料,采用基于混合高斯模型(Gaussian mixed model,GMM)的語音轉(zhuǎn)換方法對電子喉語音進(jìn)行轉(zhuǎn)換,轉(zhuǎn)換參數(shù)為基頻軌跡和聲道譜參數(shù)(0~24階梅爾倒譜系數(shù)),然后對轉(zhuǎn)換后的語音質(zhì)量進(jìn)行主客觀評價。結(jié)果 轉(zhuǎn)換語音的高頻輻射噪聲得到了有效抑制,基頻變化出現(xiàn)。主觀分析結(jié)果顯示,轉(zhuǎn)換語音的自然度和可接受度有所提高,但可懂度變化不大。結(jié)論 使用語音轉(zhuǎn)換技術(shù)可以降低電子喉語音的高頻輻射噪聲,改變聲調(diào)和韻律信息,提高自然度和可接受度,對改善電子喉語音的聽覺質(zhì)量有較大幫助。

Objective Electrolarynx (EL) is the most common assistant device to provide a voice for laryngectomees. However, EL still has several severe problems, such as the extremely unnaturalness and the non-ignorable radiation noises. In this paper, we conduct a study of enhancement of EL speech based on voice conversion (VC) technology in order to improve the naturalness and intelligibility of EL speech. Methods In this article, 200 mandarin daily utterance pairs, recorded as normal speech and EL speech, were served as training data. A Gaussian mixed model (GMM) based method was used to improve the quality of EL speech, and subjective and objective estimation were used to evaluate converted speech. The converting features were F0 and spectrum parameters (0th through 24th Mel-cepstral coefficients). Results The objective results demonstrated that the VC-based method could greatly reduce the radiation noises and improve the F0 contour of mandarin EL speech, closer to that of the target speech. The subjective results indicated that the naturalness and acceptability of mandarin EL speech were upgraded and the intelligibility had no significant difference after converting. Conclusions The VC technology can effectively reduce the high frequency radiation noises, complement tone and rhythm information, upgrade naturalness and acceptability of EL speech, which are greatly helpful to improve speech quality.

參考文獻(xiàn):

服務(wù)與反饋:
文章下載】【加入收藏
提示:您還未登錄,請登錄!點(diǎn)此登錄
 
友情鏈接  
地址:北京安定門外安貞醫(yī)院內(nèi)北京生物醫(yī)學(xué)工程編輯部
電話:010-64456508  傳真:010-64456661
電子郵箱:[email protected]