    • 2. Granted invention patent
    • Content creation support apparatus, method and program
    • US09304987B2
    • 2016-04-05
    • US14301378
    • 2014-06-11
    • KABUSHIKI KAISHA TOSHIBA
    • Kosei Fume; Masahiro Morita
    • G10L15/00; G06F17/27; G10L13/08; G10L15/26; G10L13/033
    • G06F17/2755; G10L13/033; G10L13/08; G10L15/26
    • According to one embodiment, a content creation support apparatus includes a speech synthesis unit, a speech recognition unit, an extraction unit, a detection unit, a presentation unit and a selection unit. The speech synthesis unit performs a speech synthesis on a first text. The speech recognition unit performs a speech recognition on the synthesized speech to obtain a second text. The extraction unit extracts feature values by performing a morphological analysis on each of the first and second texts. The detection unit compares a first feature value of a first difference string and a second feature value of a second difference string. The presentation unit presents correction candidate(s) according to the second feature value. The selection unit selects one of the correction candidates in accordance with an instruction from a user.
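The round trip described in this abstract (synthesize a text, recognize the synthesized speech, then compare the two texts) can be sketched in Python. This is only an illustration under stated assumptions: `fake_synthesize` and `fake_recognize` are hypothetical stand-ins for real speech engines, and whitespace tokenization with `difflib` stands in for the morphological analysis and feature-value comparison that the abstract names.

```python
import difflib


def difference_strings(first_tokens, second_tokens):
    """Return (first_diff, second_diff) token spans where the sequences disagree."""
    matcher = difflib.SequenceMatcher(a=first_tokens, b=second_tokens, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            pairs.append((first_tokens[i1:i2], second_tokens[j1:j2]))
    return pairs


def round_trip_check(first_text, synthesize, recognize):
    """Synthesize the first text, recognize it back, and list the differing spans."""
    speech = synthesize(first_text)
    second_text = recognize(speech)
    # Assumption: whitespace split replaces morphological analysis.
    return difference_strings(first_text.split(), second_text.split())


def fake_synthesize(text):
    # Hypothetical stand-in: the "speech" is just the text itself.
    return text


def fake_recognize(speech):
    # Hypothetical stand-in: simulates one recognition mismatch.
    return speech.replace("read", "red")


pairs = round_trip_check("I read the book", fake_synthesize, fake_recognize)
print(pairs)
```

Each pair marks a span of the first text that came back differently after the round trip. In a fuller system, such a span is where correction candidates could be offered to the user.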
    • 4. Granted invention patent
    • Speech synthesis dictionary generation apparatus, speech synthesis dictionary generation method and computer program product
    • US09484012B2
    • 2016-11-01
    • US14606089
    • 2015-01-27
    • KABUSHIKI KAISHA TOSHIBA
    • Masahiro Morita
    • G10L13/00; G10L13/033
    • G10L13/033
    • According to an embodiment, a speech synthesis dictionary generation apparatus includes an analyzer, a speaker adapter, a level designation unit, and a determination unit. The analyzer analyzes speech data and generates a speech database containing characteristics of utterance by an object speaker. The speaker adapter generates the model of the object speaker by speaker adaptation of converting a base model to be closer to characteristics of the object speaker based on the database. The level designation unit accepts designation of a target speaker level representing a speaker's utterance skill and/or a speaker's native level in a language of the speech synthesis dictionary. The determination unit determines a parameter related to fidelity of reproduction of speaker properties in the speaker adaptation, in accordance with a relationship between the target speaker level and a speaker level of the object speaker.
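One way to picture a fidelity parameter that depends on the two speaker levels is the Python sketch below. Everything in it is an assumption made for illustration: the abstract does not say that levels are numeric, that the mapping is linear, or that adaptation is an interpolation between a base model and speaker statistics. The names `fidelity_weight`, `adapt`, and `max_gap` are hypothetical.

```python
def fidelity_weight(target_level, object_level, max_gap=4):
    """Map the level relationship to a weight in [0, 1].

    Assumption: the further the object speaker falls below the target
    level, the less faithfully the speaker's properties are reproduced.
    """
    gap = max(0, target_level - object_level)
    return max(0.0, 1.0 - gap / max_gap)


def adapt(base_model, speaker_stats, weight):
    """Move each base-model value toward the speaker's value by `weight`."""
    return [b + weight * (s - b) for b, s in zip(base_model, speaker_stats)]


weight = fidelity_weight(target_level=4, object_level=2)
adapted = adapt([0.0, 10.0], [4.0, 20.0], weight)
print(weight, adapted)
```

With a weight of 1.0 the adapted values equal the speaker's own statistics, and with 0.0 they stay at the base model.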
    • 5. Invention patent application
    • TEXT-TO-SPEECH DEVICE, TEXT-TO-SPEECH METHOD, AND COMPUTER PROGRAM PRODUCT
    • US20160300564A1
    • 2016-10-13
    • US15185259
    • 2016-06-17
    • KABUSHIKI KAISHA TOSHIBA
    • Yu NASU; Masatsune Tamura; Ryo Morinaka; Masahiro Morita
    • G10L13/10; G10L13/06; G10L13/033
    • G10L13/10; G10L13/033; G10L13/06
    • According to an embodiment, a text-to-speech device includes a context acquirer, an acoustic model parameter acquirer, a conversion parameter acquirer, a converter, and a waveform generator. The context acquirer is configured to acquire a context sequence affecting fluctuations in voice. The acoustic model parameter acquirer is configured to acquire an acoustic model parameter sequence that corresponds to the context sequence and represents an acoustic model in a standard speaking style of a target speaker. The conversion parameter acquirer is configured to acquire a conversion parameter sequence corresponding to the context sequence to convert an acoustic model parameter in the standard speaking style into one in a different speaking style. The converter is configured to convert the acoustic model parameter sequence using the conversion parameter sequence. The waveform generator is configured to generate a voice signal based on the acoustic model parameter sequence acquired after conversion.
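The acquire-then-convert flow in this abstract can be sketched in Python as follows. The sketch assumes, purely for illustration, that each context maps to a short list of acoustic parameters and that a conversion parameter is a (scale, shift) pair applied element-wise. The abstract does not specify the form of either, and the tables `STANDARD_MODEL` and `CONVERSION` are hypothetical. Waveform generation is left out.

```python
from typing import Dict, List, Tuple

# Hypothetical lookup tables keyed by context label.
STANDARD_MODEL: Dict[str, List[float]] = {"a": [1.0, 2.0], "k": [0.5, 0.0]}
CONVERSION: Dict[str, Tuple[float, float]] = {"a": (2.0, 0.5), "k": (1.0, -0.5)}


def convert_sequence(contexts: List[str]) -> List[List[float]]:
    """Convert standard-style parameters to another style, context by context."""
    converted = []
    for ctx in contexts:
        params = STANDARD_MODEL[ctx]   # acoustic model parameter acquirer
        scale, shift = CONVERSION[ctx] # conversion parameter acquirer
        # Assumption: the converter applies an element-wise affine map.
        converted.append([scale * p + shift for p in params])
    return converted


result = convert_sequence(["k", "a"])
print(result)
```

Keeping the conversion parameters in a separate table mirrors the split in the abstract between the standard-style model and the style conversion.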
    • 6. Granted invention patent
    • Speech synthesis device, speech synthesis method, and computer program product
    • US09135910B2
    • 2015-09-15
    • US13765012
    • 2013-02-12
    • KABUSHIKI KAISHA TOSHIBA
    • Masatsune Tamura; Masahiro Morita
    • G10L13/00; G10L13/08; G10L13/06; G10L13/033; G10L15/00
    • G10L13/08; G10L13/033; G10L13/06
    • According to an embodiment, a speech synthesis device includes a first storage, a second storage, a first generator, a second generator, a third generator, and a fourth generator. The first storage is configured to store therein first information obtained from a target uttered voice. The second storage is configured to store therein second information obtained from an arbitrary uttered voice. The first generator is configured to generate third information by converting the second information so as to be close to a target voice quality or prosody. The second generator is configured to generate an information set including the first information and the third information. The third generator is configured to generate fourth information used to generate a synthesized speech, based on the information set. The fourth generator is configured to generate the synthesized speech corresponding to input text using the fourth information.
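The data flow among the first three generators can be sketched in Python. The sketch rests on assumptions that are not in the abstract: the stored information is reduced to a list of scalar feature values, "converting so as to be close to the target" is a simple mean shift, and the fourth information is a pair of summary statistics. The function name `convert_toward_target` is hypothetical, and synthesis itself is omitted.

```python
from statistics import mean


def convert_toward_target(second_info, first_info):
    """Shift the arbitrary-voice values so their mean matches the target's."""
    shift = mean(first_info) - mean(second_info)
    return [value + shift for value in second_info]


first_info = [5.0, 5.5]          # hypothetical values from the target voice
second_info = [4.0, 4.5, 4.25]   # hypothetical values from an arbitrary voice

# First generator: third information, converted toward the target.
third_info = convert_toward_target(second_info, first_info)
# Second generator: information set combining first and third information.
information_set = first_info + third_info
# Third generator: fourth information derived from the whole set.
fourth_info = {"mean": mean(information_set), "count": len(information_set)}
print(third_info, fourth_info)
```

The point of the combined set is that the fourth information is estimated from more data than the target voice alone provides.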
    • 7. Invention patent application
    • SPEECH PROCESSING DEVICE, SPEECH PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT
    • US20140350922A1
    • 2014-11-27
    • US14194976
    • 2014-03-03
    • KABUSHIKI KAISHA TOSHIBA
    • Yamato Ohtani; Masahiro Morita
    • G10L25/18
    • G10L25/18; G10L21/038
    • According to an embodiment, a speech processing device includes an extractor, a detector, a generator, a converter, and a compensator. The extractor is configured to extract a speech parameter from a spectral envelope of input speech. The detector is configured to detect a missing band in which a component is missing in the spectral envelope. The generator is configured to generate a parameter for the missing band on the basis of a position of the missing band, statistical information created by using a parameter extracted from a spectral envelope of speech with no missing component, and the extracted speech parameter. The converter is configured to convert the generated parameter to a spectral envelope of the missing band. The compensator is configured to generate a spectral envelope supplemented with the missing band by combining the spectral envelopes of the missing band and of the input speech.
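The detect-generate-combine sequence in this abstract can be sketched in Python. The sketch makes several assumptions for illustration: the spectral envelope is a list of per-band levels in dB, a band counts as missing when its level is at or below a hypothetical `floor`, and the statistical information is a mean envelope from speech with no missing component. Filling a band with the mean plus the average observed offset is a stand-in for the generator and converter, not the method of the application.

```python
def detect_missing_bands(envelope, floor=-60.0):
    """Return indices of bands whose level is at or below the floor."""
    return [i for i, level in enumerate(envelope) if level <= floor]


def compensate(envelope, mean_envelope, floor=-60.0):
    """Fill missing bands using the mean envelope, offset to match the input."""
    missing = set(detect_missing_bands(envelope, floor))
    observed = [i for i in range(len(envelope)) if i not in missing]
    if not observed:
        return list(mean_envelope)
    # Average difference between the input and the statistics where both exist.
    offset = sum(envelope[i] - mean_envelope[i] for i in observed) / len(observed)
    return [
        mean_envelope[i] + offset if i in missing else envelope[i]
        for i in range(len(envelope))
    ]


envelope = [-10.0, -12.0, -80.0, -80.0]       # hypothetical input, upper bands lost
mean_envelope = [-8.0, -10.0, -14.0, -18.0]   # hypothetical full-band statistics
filled = compensate(envelope, mean_envelope)
print(detect_missing_bands(envelope), filled)
```

Bands that were present in the input pass through unchanged, so only the missing region is synthesized from the statistics.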