会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • CODING AND DECODING A TRANSIENT FRAME
    • 编码和解码一个瞬态帧
    • US20120065980A1
    • 2012-03-15
    • US13228210
    • 2011-09-08
    • Venkatesh KrishnanAnanthapadmanabhan Arasanipalai Kandhadai
    • Venkatesh KrishnanAnanthapadmanabhan Arasanipalai Kandhadai
    • G10L13/00
    • G10L19/20G10L19/025G10L19/097G10L19/22G10L25/93
    • An electronic device for coding a transient frame is described. The electronic device includes a processor and executable instructions stored in memory that is in electronic communication with the processor. The electronic device obtains a current transient frame. The electronic device also obtains a residual signal based on the current transient frame. Additionally, the electronic device determines a set of peak locations based on the residual signal. The electronic device further determines whether to use a first coding mode or a second coding mode for coding the current transient frame based on at least the set of peak locations. The electronic device also synthesizes an excitation based on the first coding mode if the first coding mode is determined. The electronic device also synthesizes an excitation based on the second coding mode if the second coding mode is determined.
    • 描述用于对瞬态帧进行编码的电子设备。 电子设备包括处理器和存储在与处理器电子通信的存储器中的可执行指令。 电子设备获得当前瞬态帧。 电子设备还基于当前瞬态帧获得残留信号。 另外,电子设备基于剩余信号确定一组峰值位置。 电子设备还基于至少一组峰值位置来确定是否使用第一编码模式或第二编码模式来编码当前瞬态帧。 如果确定了第一编码模式,则电子设备还基于第一编码模式合成激励。 如果确定了第二编码模式,则电子设备还基于第二编码模式合成激励。
    • 2. 发明申请
    • SPEECH SYNTHESIS APPARATUS AND METHOD
    • 语音合成设备和方法
    • US20110087488A1
    • 2011-04-14
    • US12970162
    • 2010-12-16
    • Ryo MorinakaTakehiko Kagoshima
    • Ryo MorinakaTakehiko Kagoshima
    • G10L11/04G10L13/06
    • G10L13/06G10L13/033G10L19/097G10L25/15G10L2021/0135
    • According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
    • 根据实施例,语音合成装置包括:选择单元,被配置为逐个选择说话者的参数,并且获得多个扬声器的参数;所述说话者的参数是针对对应于说话者的语音的各个音调波形而准备的, 参数包括共振峰频率,共振峰相位,共振峰功率,以及相关螺旋波形中包含的各共振峰的窗函数。 该装置包括:映射单元,其被配置为使用基于共振峰频率和共振峰功率的成本函数在多个扬声器的参数之间使得共振峰彼此对应。 该装置包括:生成单元,被配置为通过以期望的内插比率内插使彼此对应的共振峰的共振峰频率,共振峰相位,共振峰功率和窗函数来生成内插说话者的参数。
    • 4. 发明申请
    • Apparatus, method and program for vioce signal interpolation
    • 用于vioce信号插值的装置,方法和程序
    • US20070271091A1
    • 2007-11-22
    • US11797701
    • 2007-05-07
    • Yasushi Sato
    • Yasushi Sato
    • G10L11/04
    • G10L21/0364G10L19/09G10L19/097G10L25/18
    • A voice signal interpolation apparatus is provided which can restore original human voices from human voices in a compressed state while maintaining a high sound quality. When a voice signal representative of a voice to be interpolated is acquired by a voice data input unit 1, a pitch deriving unit 2 filters this voice signal to identify a pitch length from the filtering result. A pitch length fixing unit 3 makes the voice signal have a constant time length of a section corresponding to a unit pitch, and generates pitch waveform data. A sub-band dividing unit 4 converts the pitch waveform data into sub-band data representative of a spectrum. A plurality of sub-band data pieces are averaged by an averaging unit 5 and thereafter a sub-band synthesizing unit 6 converts the sub-band data pieces into a signal representative of a waveform of the voice by a sub-band synthesizing unit 6. The time length of this signal in each section is restored by a pitch restoring unit 7 and a sound output unit 8 reproduces the sound represented by the signal.
    • 提供了一种语音信号插值装置,其能够在保持高音质的同时在压缩状态下恢复来自人声音的原始人声。 当语音数据输入单元1获取表示要内插的语音的语音信号时,音调导出单元2对该语音信号进行滤波,以从滤波结果中识别音调长度。 音高固定单元3使语音信号具有与单位音调对应的部分的恒定时间长度,并产生音调波形数据。 子带分割单元4将音调波形数据转换为表示频谱的子带数据。 多个子带数据由平均单元5进行平均,此后,子带合成单元6将子带数据段转换成表示子频带合成单元6的声音波形的信号。 每个部分中的该信号的时间长度由音调恢复单元7恢复,并且声音输出单元8再现由该信号表示的声音。
    • 7. 发明申请
    • Speech signal interpolation device, speech signal interpolation method, and program
    • 语音信号插值装置,语音信号插补方法和程序
    • US20040153314A1
    • 2004-08-05
    • US10477320
    • 2003-11-10
    • Yasushi Sato
    • G10L011/04
    • G10L21/0364G10L19/09G10L19/097G10L25/18
    • A voice signal interpolation apparatus is provided which can restore original human voices from human voices in a compressed state while maintaining a high sound quality. When a voice signal representative of a voice to be interpolated is acquired by a voice data input unit 1, a pitch deriving unit 2 filters this voice signal to identify a pitch length from the filtering result. A pitch length fixing unit 3 makes the voice signal have a constant time length of a section corresponding to a unit pitch, and generates pitch waveform data. A sub-band dividing unit 4 converts the pitch waveform data into sub-band data representative of a spectrum. A plurality of sub-band data pieces are averaged by an averaging unit 5 and thereafter a sub-band synthesizing unit 6 converts the sub-band data pieces into a signal representative of a waveform of the voice by a sub-band synthesizing unit 6. The time length of this signal in each section is restored by a pitch restoring unit 7 and a sound output unit 8 reproduces the sound represented by the signal.
    • 提供了一种语音信号插值装置,其能够在保持高音质的同时在压缩状态下恢复来自人声音的原始人声。 当语音数据输入单元1获取表示要内插的语音的语音信号时,音调导出单元2对该语音信号进行滤波,以从滤波结果中识别音调长度。 音高固定单元3使语音信号具有与单位音调对应的部分的恒定时间长度,并产生音调波形数据。 子带分割单元4将音调波形数据转换为表示频谱的子带数据。 多个子带数据由平均单元5进行平均,此后,子带合成单元6将子带数据段转换成表示子频带合成单元6的声音波形的信号。 每个部分中的该信号的时间长度由音调恢复单元7恢复,并且声音输出单元8再现由该信号表示的声音。
    • 8. 发明申请
    • Device and method for interpolating frequency components of signal
    • 用于内插信号频率分量的装置和方法
    • US20040098431A1
    • 2004-05-20
    • US10362421
    • 2003-02-25
    • Yasushi Sato
    • G06F007/38
    • G10L21/038G10L19/0204G10L19/097H04B1/667
    • A frequency interpolation apparatus is provided which reproduces a signal similar to an original signal by approximately recovering suppressed frequency components, from an input signal having the suppressed frequency components in a specific frequency band of the original signal. The input signal is divided into a plurality of signal component sets each having frequency components in a frequency band among a plurality of frequency bands, and a signal component set in the band with the suppressed signal components is synthesized from the plurality of divided signal component sets and added to the input signal. Each of the plurality of divided signal component sets is frequency-converted to a signal component set in the same frequency band, and the signal component set in the band with the suppressed signal components is synthesized through linear combination of the frequency-converted signal component sets. Spectrum envelope information of the frequency components not suppressed but residual in the original signal is extracted and the level of the signal component set to be synthesized is determined from the spectrum envelope information.
    • 提供了一种频率内插装置,其从原始信号的特定频带中具有抑制的频率分量的输入信号中大致恢复抑制的频率分量,再现与原始信号类似的信号。 输入信号被分成多个信号分量集合,每个信号分量集合具有多个频带中的频带中的频率分量,并且从多个分割信号分量集合合成具有抑制信号分量的频带中设置的信号分量 并添加到输入信号。 多个分割信号分量集合中的每一个被频率转换成在相同频带中设置的信号分量,并且通过频率转换信号分量集合的线性组合来合成具有抑制信号分量的频带中设置的信号分量 。 从频谱包络信息中提取未抑制但原始信号中的残差的频率分量的频谱包络信息,并确定要合成的信号分量的电平。
    • 9. 发明授权
    • Method and apparatus for subsampling phase spectrum information
    • 二次采样相位谱信息的方法和装置
    • US06678649B2
    • 2004-01-13
    • US10066073
    • 2002-02-01
    • Sharath Manjunath
    • Sharath Manjunath
    • G10L1912
    • G10L19/097G10L19/02G10L25/27
    • Method and apparatus for subsampling phase spectrum information by analyzing and reconstructing a prototype of a frame. The prototype is analyzed by correlating phase parameters generated from the prototype with phase parameters generated from a reference prototype in multiple frequency bands. The prototype is reconstructed using linear phase shift values by producing a set of phase parameters of the reference prototype, generating a set of linear phase shift values associated with the prototype, and composing a phase vector from the set of phase parameters and the set of linear phase shift values across multiple frequency bands. The prototype is reconstructed using circular rotation values by producing a set of circular rotation values associated with the prototype, generating a set of bandpass waveforms associated with the phase parameters of the reference prototype in multiple frequency bands, and modifying the set of bandpass waveforms based upon the circular rotation values.
    • 通过分析和重建帧的原型对相位频谱信息进行子采样的方法和装置。 通过将从原型产生的相位参数与在多个频带中的参考原型生成的相位参数相关联来分析原型。 使用线性相移值通过产生参考原型的一组相位参数来重构原型,产生与原型相关联的一组线性相移值,以及从相位参数集合和线性组合构成相位矢量 跨越多个频带的相移值。 通过产生与原型相关联的一组圆形旋转值,使用循环旋转值重构原型,产生与多个频带中的参考原型的相位参数相关联的一组带通波形,以及基于 循环旋转值。