会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Receiver-driven layered error correction multicast over heterogeneous packet networks
    • 接收器驱动的分层纠错多播在异构分组网络上
    • US07697514B2
    • 2010-04-13
    • US11109250
    • 2005-04-18
    • Philip ChouAlbert WangSanjeev MehrotraAlexander Mohr
    • Philip ChouAlbert WangSanjeev MehrotraAlexander Mohr
    • H04L12/56
    • H04L1/007H04L1/0009H04L1/0017H04L1/06H04L1/1812H04L2001/0093
    • A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission. A preferred process for generating the error correction streams involves the use of a unique adaptation of the Forward Error Correction (FEC) techniques. This process encodes the transmission data using a linear transform which adds redundant elements. The redundancy permits losses to be corrected because any of the original data elements can be derived from any of the encoded elements. Thus, as long as enough of the encoded data elements are received so as to equal the number of the original data elements, it is possible to derive all the original elements.
    • 一种用于在异构分组网络(例如因特网)下校正在实时媒体的接收机驱动分层多播(RLM)期间发生的错误和损失的系统和方法。 这是通过用一层或多层纠错信息增强RLM来实现的。 这允许每个接收机通过订阅至少一个纠错层来分别优化所接收的音频和视频信息的质量。 理想地,RLM中的每个源层将具有与其相关联的一个或多个多播的纠错数据流(即,层)。 每个纠错层将包含可用于替换相关源层丢失的分组的信息。 提出了多于一个纠错层,因为包含在替换相关源流中丢失的分组所需的数据流中的一些纠错分组本身可能在传输中丢失。 用于产生纠错流的优选过程涉及使用前向纠错(FEC)技术的唯一适配。 该过程使用添加冗余元素的线性变换对传输数据进行编码。 冗余允许修正损失,因为任何原始数据元素可以从任何编码元素导出。 因此,只要接收到足够的编码数据元素以便等于原始数据元素的数量,就有可能导出所有的原始元素。
    • 2. 发明授权
    • System and method for providing high-quality stretching and compression of a digital audio signal
    • 用于提供数字音频信号的高质量拉伸和压缩的系统和方法
    • US07337108B2
    • 2008-02-26
    • US10660325
    • 2003-09-10
    • Dinei FlorencioPhilip ChouLi-Wei He
    • Dinei FlorencioPhilip ChouLi-Wei He
    • G10L11/06G10L21/04H04B1/66
    • G10L21/04G10L2025/935
    • An adaptive “temporal audio scaler” is provided for automatically stretching and compressing frames of audio signals received across a packet-based network. Prior to stretching or compressing segments of a current frame, the temporal audio scaler first computes a pitch period for each frame for sizing signal templates used for matching operations in stretching and compressing segments. Further, the temporal audio scaler also determines the type or types of segments comprising each frame. These segment types include “voiced” segments, “unvoiced” segments, and “mixed” segments which include both voiced and unvoiced portions. The stretching or compression methods applied to segments of each frame are then dependent upon the type of segments comprising each frame. Further, the amount of stretching and compression applied to particular segments is automatically variable for minimizing signal artifacts while still ensuring that an overall target stretching or compression ratio is maintained for each frame.
    • 提供了一种自适应“时间音频缩放器”,用于自动地拉伸和压缩通过基于分组的网络接收的音频信号的帧。 在拉伸或压缩当前帧的段之前,时间音频缩放器首先计算用于每个帧的音调周期,用于调整用于拉伸和压缩段中的匹配操作的信号模板。 此外,时间音频缩放器还确定包括每个帧的片段的类型或类型。 这些段类型包括“有声”段,“无声”段和包括有声和无声部分的“混合”段。 然后,应用于每个帧的段的拉伸或压缩方法取决于包括每个帧的段的类型。 此外,施加到特定段的拉伸和压缩量可自动变化以最小化信号伪影,同时仍然确保为每个帧维持整体目标拉伸或压缩比。
    • 7. 发明申请
    • System and method for real-time detection and preservation of speech onset in a signal
    • 用于实时检测和保存信号中语音发生的系统和方法
    • US20050055201A1
    • 2005-03-10
    • US10660326
    • 2003-09-10
    • Dinei FlorencioPhilip Chou
    • Dinei FlorencioPhilip Chou
    • G10L11/02G10L11/06
    • G10L25/87G10L2025/783
    • A “speech onset detector” provides a variable length frame buffer in combination with either variable transmission rate or temporal speech compression for buffered signal frames. The variable length buffer buffers frames that are not clearly identified as either speech or non-speech frames during an initial analysis. Buffering of signal frames continues until a current frame is identified as either speech or non-speech. If the current frame is identified as non-speech, buffered frames are encoded as non-speech frames. However, if the current frame is identified as a speech frame, buffered frames are searched for the actual onset point of the speech. Once that onset point is identified, the signal is either transmitted in a burst, or a time-scale modification of the buffered signal is applied for compressing buffered frames beginning with the frame in which onset point is detected. The compressed frames are then encoded as one or more speech frames.
    • “语音起始检测器”提供了可变长度帧缓冲器,与缓冲信号帧的可变传输速率或时间语音压缩相结合。 可变长度缓冲器缓冲在初始分析期间未被清楚地识别为语音或非语音帧的帧。 信号帧的缓冲持续到当前帧被识别为语音或非语音。 如果当前帧被识别为非语音,则缓冲帧被编码为非语音帧。 然而,如果当前帧被识别为语音帧,则搜索缓冲的帧用于语音的实际起始点。 一旦该起始点被识别,则信号以突发方式发送,或者缓冲信号的时间尺度修改被应用于从检测到起始点的帧开始的缓冲帧。 然后将压缩的帧编码为一个或多个语音帧。
    • 8. 发明授权
    • Video image compression using weighted wavelet hierarchical vector
quantization
    • 使用加权小波分层矢量量化的视频图像压缩
    • US5602589A
    • 1997-02-11
    • US293612
    • 1994-08-19
    • Mohan VishwanathPhilip Chou
    • Mohan VishwanathPhilip Chou
    • H04N7/30G06T9/00H03M7/40H04N7/26H04N7/28
    • G06T9/008H04N19/40H04N19/62H04N19/63H04N19/90H04N19/94H04N19/42
    • A weighted wavelet hierarchical vector quantization (WWHVQ) procedure is initiated by obtaining an N.times.N pixel image where 8 bits per pixel (steps 10 and 12). A look-up operation is performed to obtain data representing a discrete wavelet transform (DWT) followed by a quantization of the data (step 14). Upon completion of the look-up, a data compression will have been performed. Further stages and look-up will result in further compression of the data, i.e., 4:1, 8:1, 16:1, 32:1, 64:1, . . . etc. Accordingly, a determination is made whether the compression is complete (step 16). If the compression is incomplete, further look-up is performed. If the compression is complete, however, the compressed data is transmitted (step 18). It is determined at a gateway whether further compression is required (step 19). If so, transcoding is performed (step 20). The receiver receives the compressed data (step 22). Subsequently, a second look-up operation is performed to obtain data representing an inverse discrete wavelet transform of the decompressed data (step 24). After one iteration, the data is decompressed by a factor of two. Further iterations allows for further decompression of the data. Accordingly, a determination is made whether decompression is complete (step 26). If the decompression is in incomplete, further look-ups are performed. If, however, the decompression is complete, the WWHVQ procedure is ended (step 28).
    • 通过获得每像素8位(步骤10和12)的N×N像素图像来启动加权小波分层矢量量化(WWHVQ)过程。 执行查找操作以获得表示离散小波变换(DWT)的数据,随后数据的量化(步骤14)。 完成查找后,将执行数据压缩。 进一步的阶段和查找将导致数据的进一步压缩,即4:1,8:1,16:1,32:1,64:1。 。 。 因此,确定压缩是否完成(步骤16)。 如果压缩不完整,则进一步查找。 然而,如果压缩完成,则传送压缩数据(步骤18)。 在网关确定是否需要进一步的压缩(步骤19)。 如果是,则执行代码转换(步骤20)。 接收器接收压缩数据(步骤22)。 随后,执行第二查找操作以获得表示解压缩数据的逆离散小波变换的数据(步骤24)。 一次迭代后,数据被解压缩一倍。 进一步的迭代允许进一步解压缩数据。 因此,确定减压是否完成(步骤26)。 如果解压缩不完整,则进行进一步的查找。 然而,如果解压缩完成,则WWHVQ过程结束(步骤28)。
    • 10. 发明授权
    • System and method for real-time detection and preservation of speech onset in a signal
    • 用于实时检测和保存信号中语音发生的系统和方法
    • US07412376B2
    • 2008-08-12
    • US10660326
    • 2003-09-10
    • Dinei FlorencioPhilip Chou
    • Dinei FlorencioPhilip Chou
    • G10L11/00G10L11/02
    • G10L25/87G10L2025/783
    • A “speech onset detector” provides a variable length frame buffer in combination with either variable transmission rate or temporal speech compression for buffered signal frames. The variable length buffer buffers frames that are not clearly identified as either speech or non-speech frames during an initial analysis. Buffering of signal frames continues until a current frame is identified as either speech or non-speech. If the current frame is identified as non-speech, buffered frames are encoded as non-speech frames. However, if the current frame is identified as a speech frame, buffered frames are searched for the actual onset point of the speech. Once that onset point is identified, the signal is either transmitted in a burst, or a time-scale modification of the buffered signal is applied for compressing buffered frames beginning with the frame in which onset point is detected. The compressed frames are then encoded as one or more speech frames.
    • “语音起始检测器”提供了可变长度帧缓冲器,与缓冲信号帧的可变传输速率或时间语音压缩相结合。 可变长度缓冲器缓冲在初始分析期间未被清楚地识别为语音或非语音帧的帧。 信号帧的缓冲持续到当前帧被识别为语音或非语音。 如果当前帧被识别为非语音,则缓冲帧被编码为非语音帧。 然而,如果当前帧被识别为语音帧,则搜索缓冲的帧用于语音的实际起始点。 一旦该起始点被识别,则信号以突发方式发送,或者缓冲信号的时间尺度修改被应用于从检测到起始点的帧开始的缓冲帧。 然后将压缩的帧编码为一个或多个语音帧。