会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Forward error correction in speech coding
    • 语音编码中的前向纠错
    • US06757654B1
    • 2004-06-29
    • US09569312
    • 2000-05-11
    • Magnus WesterlundAnders NohlgrenJonas SvedbergAnders UvlidenJim Sundqvist
    • Magnus WesterlundAnders NohlgrenJonas SvedbergAnders UvlidenJim Sundqvist
    • G10L1302
    • G10L19/005
    • An improved forward error correction (FEC) technique for coding speech data provides an encoder module which primary-encodes an input speech signal using a primary synthesis model to produce primary-encoded data, and redundant-encodes the input speech signal using a redundant synthesis model to produce redundant-encoded data. A packetizer combines the primary-encoded data and the redundant-encoded data into a series of packets and transmits the packets over a packet-based network, such as an Internet Protocol (IP) network. A decoding module primary-decodes the packets using the primary synthesis model, and redundant-decodes the packets using the redundant synthesis model. The technique provides interaction between the primary synthesis model and the redundant synthesis model during and after decoding to improve the quality of a synthesized output speech signal. Such “interaction,” for instance, may take the form of updating states in one model using the other model.
    • 用于编码语音数据的改进的前向纠错(FEC)技术提供了一种编码器模块,其使用主要合成模型对输入语音信号进行一次编码以产生初始编码数据,并且使用冗余合成模型对输入语音信号进行冗余编码 以产生冗余编码数据。 分组器将主编码数据和冗余编码数据组合成一系列分组,并通过诸如因特网协议(IP)网络的基于分组的网络传送分组。 解码模块使用主合成模型对分组进行主要解码,并使用冗余合成模型对分组进行冗余解码。 该技术在解码期间和之后提供主要合成模型和冗余合成模型之间的交互以提高合成输出语音信号的质量。 例如,这种“交互”可以采用其他模型在一个模型中更新状态的形式。
    • 7. 发明申请
    • Method and apparatus for increasing perceived interactivity in communications systems
    • 增加通信系统中感知交互性的方法和装置
    • US20050227657A1
    • 2005-10-13
    • US10819376
    • 2004-04-07
    • Tomas FrankkilaJonas SvedbergKrister SvanbroBjorn SvenssonTomas Jonsson
    • Tomas FrankkilaJonas SvedbergKrister SvanbroBjorn SvenssonTomas Jonsson
    • G10L13/06H04L12/56H04Q7/38
    • G10L21/04
    • Perceived interactivity in user communications is achieved by reducing a perceived delay switching the active transmitter in the communication without having to reduce actual transmission and setup delays associated with a communication exchange. A sound signal is identified in the user communication. The sound signal is analyzed to identify or estimate a sound signal segment. The sound signal segment is preferably (though not necessarily) located at the beginning or the end of the sound signal. The sound signal segment may be selected directly from the sound signal itself, from a modified version of the sound signal, or from a signal associated with the sound signal. A determination is made that a length or duration of the sound signal segment should be or can be modified. One or more modifications for the sound signal segment are determined and are provided to one or more processing units to perform the modification(s).
    • 用户通信中的感知交互性通过减少在通信中切换有源发射机的感知延迟而不用减少与通信交换相关联的实际传输和建立延迟来实现。 在用户通信中识别声音信号。 分析声音信号以识别或估计声音信号段。 声音信号段优选地(尽管不一定)位于声音信号的开始或结束处。 可以从声音信号本身,声音信号的修改版本或与声音信号相关联的信号直接选择声音信号段。 确定声音信号段的长度或持续时间应该是或可以被修改。 确定声音信号段的一个或多个修改,并将其提供给一个或多个处理单元以执行修改。
    • 9. 发明授权
    • Method to determine the excitation pulse positions within a speech frame
    • 确定语音帧内的激励脉冲位置的方法
    • US6064956A
    • 2000-05-16
    • US930951
    • 1998-01-05
    • Jonas Svedberg
    • Jonas Svedberg
    • G10L19/10
    • G10L19/10
    • A determination is made of the positions within a speech frame for a given number of excitation pulses in a linear predictive speech encoder. A combination of two known methods is used. The positions of the excitation pulses are calculated in a number of calculation stages according to the first known method. The positions of the excitation pulses are then calculated in a number of calculation stages in accordance with the second method to obtain one of a number of pulse placements. Each calculation according to the second method begins at a starting point from one of a number of positions calculated in accordance with the first method. The proportion between the number of calculation stages in the first method and the second method is chosen so as to obtain the least calculation complexity for a certain given speech quality.
    • PCT No.PCT / SE96 / 00465 Sec。 371日期:1998年1月5日 102(e)日期1998年1月5日PCT提交1996年4月10日PCT公布。 第WO96 / 32712号公报 日期1996年10月17日对线性预测语音编码器中给定数量的激励脉冲的语音帧内的位置进行确定。 使用两种已知方法的组合。 根据第一已知方法,在多个计算阶段中计算激励脉冲的位置。 然后根据第二种方法在多个计算阶段中计算激励脉冲的位置,以获得多个脉冲放置中的一个。 根据第二种方法的每个计算都是从根据第一种方法计算出的多个位置之一开始的起始点。 选择第一种方法中的计算级数与第二种方法之间的比例,以便为某种给定的语音质量获得最小的计算复杂度。
    • 10. 发明授权
    • Efficient speech stream conversion
    • 有效的语音流转换
    • US08543388B2
    • 2013-09-24
    • US12095709
    • 2005-11-30
    • Nicklas SandgrenJonas Svedberg
    • Nicklas SandgrenJonas Svedberg
    • G10L19/00
    • G10L19/012G10L19/173
    • Speech frames of a first speech coding scheme are utilized as speech frames of a second speech coding scheme, where the speech coding schemes use similar core compression schemes for the speech frames, preferably bit stream compatible. An occurrence of a state mismatch in an energy parameter between the first speech coding scheme and the second speech coding scheme is identified, preferably either by determining an occurrence of a predetermined speech evolution, such as a speech type transition, e.g. an onset of speech following a period of speech inactivity, or by tentative decoding of the energy parameter in the two encoding schemes followed by a comparison. Subsequently, the energy parameter in at least one frame of the second speech coding scheme following the occurrence of the state mismatch is adjusted. The present invention also presents transcoders and communications systems providing such transcoding functionality.
    • 第一语音编码方案的语音帧被用作第二语音编码方案的语音帧,其中语音编码方案对于语音帧使用类似的核心压缩方案,优选地与比特流兼容。 识别出第一语音编码方案和第二语音编码方案之间的能量参数中的状态失配的发生,优选地通过确定诸如语音类型转换的预定语音演进的发生,例如, 在语音不活动的时期之后的语音开始,或者通过对两个编码方案中的能量参数的暂时解码进行比较。 随后,调整在发生状态失配之后的第二语音编码方案的至少一帧中的能量参数。 本发明还提供了提供这种代码转换功能的代码转换器和通信系统。