会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • SPEECH SYNTHESIS APPARATUS AND METHOD
    • 语音合成设备和方法
    • US20110087488A1
    • 2011-04-14
    • US12970162
    • 2010-12-16
    • Ryo MorinakaTakehiko Kagoshima
    • Ryo MorinakaTakehiko Kagoshima
    • G10L11/04G10L13/06
    • G10L13/06G10L13/033G10L19/097G10L25/15G10L2021/0135
    • According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
    • 根据实施例,语音合成装置包括:选择单元,被配置为逐个选择说话者的参数,并且获得多个扬声器的参数;所述说话者的参数是针对对应于说话者的语音的各个音调波形而准备的, 参数包括共振峰频率,共振峰相位,共振峰功率,以及相关螺旋波形中包含的各共振峰的窗函数。 该装置包括:映射单元,其被配置为使用基于共振峰频率和共振峰功率的成本函数在多个扬声器的参数之间使得共振峰彼此对应。 该装置包括:生成单元,被配置为通过以期望的内插比率内插使彼此对应的共振峰的共振峰频率,共振峰相位,共振峰功率和窗函数来生成内插说话者的参数。
    • 2. 发明授权
    • Method and apparatus using fused formant parameters to generate synthesized speech
    • 使用融合共振峰参数产生合成语音的方法和装置
    • US08175881B2
    • 2012-05-08
    • US12222725
    • 2008-08-14
    • Ryo MorinakaMasatsune TamuraTakehiko Kagoshima
    • Ryo MorinakaMasatsune TamuraTakehiko Kagoshima
    • G10L13/06
    • G10L13/07G10L13/04
    • A phoneme sequence corresponding to a target speech is divided into a plurality of segments. A plurality of speech units for each segment is selected from a speech unit memory that stores speech units having at least one frame. The plurality of speech units has a prosodic feature accordant or similar to the target speech. A formant parameter having at least one formant frequency is generated for each frame of the plurality of speech units. A fused formant parameter of each frame is generated from formant parameters of each frame of the plurality of speech units. A fused speech unit of each segment is generated from the fused formant parameter of each frame. A synthesized speech is generated by concatenating the fused speech unit of each segment.
    • 对应于目标语音的音素序列被分成多个段。 从存储具有至少一个帧的语音单元的语音单元存储器中选择用于每个段的多个语音单元。 多个语音单元具有与目标语音一致或相似的韵律特征。 为多个语音单元的每个帧生成具有至少一个共振峰频率的共振峰参数。 从多个语音单元的每个帧的共振峰参数生成每帧的融合共振峰参数。 从每个帧的融合共振峰参数生成每个段的融合语音单元。 通过连接每个段的融合语音单元来生成合成语音。
    • 3. 发明申请
    • Speech synthesis method and apparatus
    • 语音合成方法和装置
    • US20090048844A1
    • 2009-02-19
    • US12222725
    • 2008-08-14
    • Ryo MorinakaMasatsune TamuraTakehiko Kagoshima
    • Ryo MorinakaMasatsune TamuraTakehiko Kagoshima
    • G10L13/06
    • G10L13/07G10L13/04
    • A phoneme sequence corresponding to a target speech is divided into a plurality of segments. A plurality of speech units for each segment is selected from a speech unit memory that stores speech units having at least one frame. The plurality of speech units has a prosodic feature accordant or similar to the target speech. A formant parameter having at least one formant frequency is generated for each frame of the plurality of speech units. A fused formant parameter of each frame is generated from formant parameters of each frame of the plurality of speech units. A fused speech unit of each segment is generated from the fused formant parameter of each frame. A synthesized speech is generated by concatenating the fused speech unit of each segment.
    • 对应于目标语音的音素序列被分成多个段。 从存储具有至少一个帧的语音单元的语音单元存储器中选择用于每个段的多个语音单元。 多个语音单元具有与目标语音一致或相似的韵律特征。 为多个语音单元的每个帧生成具有至少一个共振峰频率的共振峰参数。 从多个语音单元的每个帧的共振峰参数生成每帧的融合共振峰参数。 从每个帧的融合共振峰参数生成每个段的融合语音单元。 通过连接每个段的融合语音单元来生成合成语音。
    • 4. 发明授权
    • Speech synthesis apparatus and method
    • 语音合成装置及方法
    • US09002711B2
    • 2015-04-07
    • US12970162
    • 2010-12-16
    • Ryo MorinakaTakehiko Kagoshima
    • Ryo MorinakaTakehiko Kagoshima
    • G10L13/06G10L13/033G10L19/097G10L25/15G10L21/013
    • G10L13/06G10L13/033G10L19/097G10L25/15G10L2021/0135
    • According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
    • 根据实施例,语音合成装置包括:选择单元,被配置为逐个选择说话者的参数,并且获得多个扬声器的参数;所述说话者的参数是针对与扬声器的语音对应的各个音调波形而准备的, 参数包括共振峰频率,共振峰相位,共振峰功率,以及相关螺旋波形中包含的各共振峰的窗函数。 该装置包括:映射单元,其被配置为使用基于共振峰频率和共振峰功率的成本函数在多个扬声器的参数之间使得共振峰彼此对应。 该装置包括:生成单元,被配置为通过以期望的内插比率内插使彼此对应的共振峰的共振峰频率,共振峰相位,共振峰功率和窗函数来生成内插说话者的参数。