Patent Quick Search - Quickly Search Global Patents, Free Commercial-Use Patent Database - IPRDB

1. Invention application

US20110087488A1 SPEECH SYNTHESIS APPARATUS AND METHOD (Status: in force)
Publication (announcement) number: US20110087488A1
Publication (announcement) date: 2011-04-14
Application number: US12970162
Filing date: 2010-12-16
Applicant: Ryo Morinaka, Takehiko Kagoshima
Inventor: Ryo Morinaka, Takehiko Kagoshima
IPC classification: G10L11/04, G10L13/06
CPC classification: G10L13/06, G10L13/033, G10L19/097, G10L25/15, G10L2021/0135
Abstract: According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
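
The mapping and interpolation steps in this abstract can be illustrated with a small Python sketch. It is not the patented method: the `Formant` class, the cost weights, the exhaustive search over pairings, the restriction to two speakers with equal formant counts, and the plain linear interpolation (including of phase) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from itertools import permutations
from typing import List, Sequence, Tuple


@dataclass
class Formant:
    """Hypothetical container for one formant of a pitch waveform."""
    freq: float          # formant frequency in Hz
    phase: float         # formant phase in radians
    power: float         # formant power (arbitrary linear units)
    window: List[float]  # sampled window function for this formant


def cost(a: Formant, b: Formant, w_freq: float = 1.0, w_pow: float = 1.0) -> float:
    # Assumed cost: weighted absolute differences in frequency and power.
    return w_freq * abs(a.freq - b.freq) + w_pow * abs(a.power - b.power)


def map_formants(xs: Sequence[Formant], ys: Sequence[Formant]) -> List[Tuple[int, int]]:
    """Pair each formant in xs with one in ys so that the total cost is minimal.

    Exhaustive search over permutations; only practical for a handful of formants.
    """
    if len(xs) != len(ys):
        raise ValueError("this sketch assumes equal formant counts")
    best = min(
        permutations(range(len(ys))),
        key=lambda perm: sum(cost(xs[i], ys[j]) for i, j in enumerate(perm)),
    )
    return list(enumerate(best))


def interpolate(a: Formant, b: Formant, ratio: float) -> Formant:
    """Linear blend of two mapped formants; ratio=0 gives a, ratio=1 gives b."""
    if len(a.window) != len(b.window):
        raise ValueError("this sketch assumes windows of equal length")

    def lerp(u: float, v: float) -> float:
        return (1.0 - ratio) * u + ratio * v

    return Formant(
        freq=lerp(a.freq, b.freq),
        phase=lerp(a.phase, b.phase),  # naive: ignores phase wrap-around
        power=lerp(a.power, b.power),
        window=[lerp(u, v) for u, v in zip(a.window, b.window)],
    )


def interpolate_speakers(xs: Sequence[Formant], ys: Sequence[Formant], ratio: float) -> List[Formant]:
    # Map first, then interpolate each mapped pair.
    return [interpolate(xs[i], ys[j], ratio) for i, j in map_formants(xs, ys)]
```

Mapping before interpolating matters because the formants of two speakers need not be listed in the same order; blending by list position alone could average a low formant of one speaker with a high formant of the other.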

2. Invention grant

US08175881B2 Method and apparatus using fused formant parameters to generate synthesized speech (Status: in force)
Publication (announcement) number: US08175881B2
Publication (announcement) date: 2012-05-08
Application number: US12222725
Filing date: 2008-08-14
Applicant: Ryo Morinaka, Masatsune Tamura, Takehiko Kagoshima
Inventor: Ryo Morinaka, Masatsune Tamura, Takehiko Kagoshima
IPC classification: G10L13/06
CPC classification: G10L13/07, G10L13/04
Abstract: A phoneme sequence corresponding to a target speech is divided into a plurality of segments. A plurality of speech units for each segment is selected from a speech unit memory that stores speech units having at least one frame. The plurality of speech units has a prosodic feature accordant or similar to the target speech. A formant parameter having at least one formant frequency is generated for each frame of the plurality of speech units. A fused formant parameter of each frame is generated from formant parameters of each frame of the plurality of speech units. A fused speech unit of each segment is generated from the fused formant parameter of each frame. A synthesized speech is generated by concatenating the fused speech unit of each segment.
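
The per-frame fusion and per-segment concatenation described in this abstract can be sketched in Python as follows. The abstract does not say how the fused formant parameter is computed, so the averaging used here is an assumption, as are the `Frame` and `Unit` type aliases, the function names, and the truncation to the shortest unit. The sketch stops at formant tracks and does not generate a waveform.

```python
from statistics import fmean
from typing import List

Frame = List[float]  # formant frequencies (Hz) of one frame
Unit = List[Frame]   # a speech unit is a sequence of frames


def fuse_frames(frames: List[Frame]) -> Frame:
    """Fuse the same frame of several speech units by averaging each formant.

    Averaging is an assumed fusion rule; formants are matched by list position.
    """
    n_formants = min(len(f) for f in frames)
    return [fmean([f[k] for f in frames]) for k in range(n_formants)]


def fuse_units(units: List[Unit]) -> Unit:
    """Build one fused speech unit from the units selected for a segment."""
    n_frames = min(len(u) for u in units)  # truncate to the shortest unit
    return [fuse_frames([u[t] for u in units]) for t in range(n_frames)]


def synthesize_tracks(segments: List[List[Unit]]) -> Unit:
    """Concatenate the fused unit of every segment into one formant track."""
    track: Unit = []
    for units in segments:
        track.extend(fuse_units(units))
    return track


if __name__ == "__main__":
    # Two segments, each with two candidate speech units.
    segments = [
        [[[500.0, 1500.0]], [[700.0, 1700.0]]],
        [[[300.0, 2000.0], [320.0, 2100.0]], [[500.0, 2200.0], [340.0, 2300.0]]],
    ]
    print(synthesize_tracks(segments))
```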

3. Invention application

US20090048844A1 Speech synthesis method and apparatus (Status: in force)
Publication (announcement) number: US20090048844A1
Publication (announcement) date: 2009-02-19
Application number: US12222725
Filing date: 2008-08-14
Applicant: Ryo Morinaka, Masatsune Tamura, Takehiko Kagoshima
Inventor: Ryo Morinaka, Masatsune Tamura, Takehiko Kagoshima
IPC classification: G10L13/06
CPC classification: G10L13/07, G10L13/04
Abstract: A phoneme sequence corresponding to a target speech is divided into a plurality of segments. A plurality of speech units for each segment is selected from a speech unit memory that stores speech units having at least one frame. The plurality of speech units has a prosodic feature accordant or similar to the target speech. A formant parameter having at least one formant frequency is generated for each frame of the plurality of speech units. A fused formant parameter of each frame is generated from formant parameters of each frame of the plurality of speech units. A fused speech unit of each segment is generated from the fused formant parameter of each frame. A synthesized speech is generated by concatenating the fused speech unit of each segment.

4. Invention grant

US09002711B2 Speech synthesis apparatus and method (Status: in force)
Publication (announcement) number: US09002711B2
Publication (announcement) date: 2015-04-07
Application number: US12970162
Filing date: 2010-12-16
Applicant: Ryo Morinaka, Takehiko Kagoshima
Inventor: Ryo Morinaka, Takehiko Kagoshima
IPC classification: G10L13/06, G10L13/033, G10L19/097, G10L25/15, G10L21/013
CPC classification: G10L13/06, G10L13/033, G10L19/097, G10L25/15, G10L2021/0135
Abstract: According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.

5. Invention application

US20090326951A1 SPEECH SYNTHESIZING APPARATUS AND METHOD THEREOF (Status: under examination, published)
Publication (announcement) number: US20090326951A1
Publication (announcement) date: 2009-12-31
Application number: US12423233
Filing date: 2009-04-14
Applicant: Ryo Morinaka, Takehiko Kagoshima
Inventor: Ryo Morinaka, Takehiko Kagoshima
IPC classification: G10L13/06, G10L13/00, G10L11/04
CPC classification: G10L13/06
Abstract: Ratios of powers at the peaks of respective formants of the spectrum of a pitch-cycle waveform and powers at boundaries between the formants are obtained and, when the ratios are large, bandwidth of window functions are widened and the formant waveforms are generated by multiplying generated sinusoidal waveforms from the formant parameter sets on the basis of pitch-cycle waveform generating data by the window functions of the widened bandwidth, whereby a pitch-cycle waveform is generated by the sum of these formant waveforms.
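
As a rough illustration of this abstract, the Python sketch below builds a pitch-cycle waveform as a sum of windowed sinusoids and widens a formant's window bandwidth when its peak-to-boundary power ratio is large. The threshold, the widening factor, the choice of a Hann window, and all function and key names are assumptions, not values from the patent. Widening the bandwidth is modelled by shortening the window in time, since a shorter window has a wider main lobe in frequency.

```python
import math
from typing import Dict, List


def window_length(peak_power: float, boundary_power: float, base_len: int,
                  threshold: float = 10.0, widen: float = 2.0) -> int:
    """Pick a window length from the peak-to-boundary power ratio.

    threshold and widen are made-up values; a large ratio gives a shorter
    window, which corresponds to a wider bandwidth.
    """
    ratio = peak_power / boundary_power
    return max(2, int(base_len / widen)) if ratio > threshold else base_len


def hann_window(length: int, total: int) -> List[float]:
    """Hann window of `length` samples centred in a frame of `total` samples."""
    start = (total - length) // 2
    return [
        0.5 - 0.5 * math.cos(2.0 * math.pi * (n - start) / (length - 1))
        if start <= n < start + length else 0.0
        for n in range(total)
    ]


def formant_waveform(freq: float, phase: float, amp: float,
                     window: List[float], fs: float) -> List[float]:
    # Sinusoid at the formant frequency multiplied by the window function.
    return [amp * math.cos(2.0 * math.pi * freq * n / fs + phase) * w
            for n, w in enumerate(window)]


def pitch_cycle_waveform(formants: List[Dict[str, float]], total: int, fs: float) -> List[float]:
    """Sum the windowed formant waveforms into one pitch-cycle waveform."""
    out = [0.0] * total
    for f in formants:
        length = window_length(f["peak_power"], f["boundary_power"], total)
        wave = formant_waveform(f["freq"], f["phase"], f["amp"],
                                hann_window(length, total), fs)
        out = [a + b for a, b in zip(out, wave)]
    return out


if __name__ == "__main__":
    formants = [
        {"freq": 500.0, "phase": 0.0, "amp": 1.0, "peak_power": 100.0, "boundary_power": 1.0},
        {"freq": 1500.0, "phase": 0.0, "amp": 0.5, "peak_power": 5.0, "boundary_power": 1.0},
    ]
    wave = pitch_cycle_waveform(formants, total=64, fs=16000.0)
    print(len(wave))
```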
