会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Using web FAQ data for creating self-service speech applications
    • 使用Web常见问题数据创建自助语音应用程序
    • US07558734B1
    • 2009-07-07
    • US12271912
    • 2008-11-16
    • Osamuyimen T. StewartDavid M. LubenskyEa-Ee JanXiang Li
    • Osamuyimen T. StewartDavid M. LubenskyEa-Ee JanXiang Li
    • G10L21/00
    • G06F17/30861G06F17/2785G06F17/30684G10L15/26H04M3/4938
    • In one example, this invention presents a method of providing the same self-service content that is available on the web interface to users contacting by telephone, knowing that the web and telephone are fundamentally different user interfaces. In one embodiment, this patent seeks to protect the general idea of how to playback web data in real-time to the user over the speech interface. For this purpose, a method is presented comprising of the general steps through which the web data is initially sent to an automatic transformation module. Then, that transformation module refines or re-structures the web data to make it suitable for the speech interface. The algorithm in the module is predicated on the user interface principles of cognitive complexity and limitations on short term memory based on which FAQ types are classified into one of the following four classes: simple, medium, complex, and complex-complex.
    • 在一个示例中,本发明提供了一种方法,即知道网络和电话是根本上不同的用户界面的,通过电话向网络接口提供与用户联系的相同的自助服务内容。 在一个实施例中,该专利旨在保护如何通过语音界面实时地向用户播放web数据的一般思想。 为此,提出了一种方法,其包括最初将web数据发送到自动变换模块的一般步骤。 然后,该转换模块对网络数据进行优化或重构,使其适合于语音界面。 模块中的算法基于用户界面的认知复杂性原理和短期记忆的限制,基于哪些常见问题类型分为以下四个类别之一:简单,中等,复杂和复杂。
    • 3. 发明授权
    • Method and apparatus for speaker recognition using selected spectral
information
    • 使用所选光谱信息进行扬声器识别的方法和装置
    • US5666466A
    • 1997-09-09
    • US365598
    • 1994-12-27
    • Qiguang LinJames L. FlanaganEa-Ee Jan
    • Qiguang LinJames L. FlanaganEa-Ee Jan
    • G10L17/00G10L7/08
    • G10L17/02G10L17/00G10L17/04G10L25/24
    • A method and apparatus are disclosed for robust, text-independent (and text-dependent) speaker recognition in which identification of a speaker is based on selected spectral information from the speaker's voice. Traditionally, speaker recognition systems (i) render a speech sample in the frequency domain to produce a spectrum, (ii) produce cepstrum coefficients from the spectrum, (iii) produce a codebook from the cepstrum coefficients, and (iv) use the codebook as the feature measure for comparing training speech samples with testing speech samples. The present invention, on the other hand, introduces the important and previously unknown step of truncating the spectrum prior to producing the cepstrum coefficients. Through the use of selected spectra as the feature measure for speaker recognition, the present invention has been shown to yield significant improvements in performance over prior art systems.
    • 公开了用于稳健,文本无关(和文本相关)的扬声器识别的方法和装置,其中扬声器的识别基于来自扬声器的声音的所选频谱信息。 传统上,讲话者识别系统(i)在频域中呈现语音样本以产生频谱,(ii)从频谱产生倒谱系数,(iii)从倒谱系数产生码本,以及(iv)使用码本作为 将训练语音样本与测试语音样本进行比较的特征措施。 另一方面,本发明介绍了在产生倒谱系数之前截断频谱的重要且未知的步骤。 通过使用所选择的光谱作为说话者识别的特征量度,已经证明本发明在现有技术系统方面显着提高了性能。
    • 5. 发明授权
    • Resource allocation for voice processing applications
    • 语音处理应用的资源分配
    • US07206387B2
    • 2007-04-17
    • US10645051
    • 2003-08-21
    • Ea-Ee JanBenoit MaisonAndrzei Sakrajda
    • Ea-Ee JanBenoit MaisonAndrzei Sakrajda
    • H04M1/64
    • H04M3/50H04L67/1002H04L2012/6443H04L2012/6481H04M2201/39H04M2201/40H04M2201/41
    • A voice processing system is provided in which sets of engines running on a plurality of servers are configured differently from one another. The sets of engines may be configured to achieve different trade-offs between performance of a task and resources required to perform the task. In the voice processing system, a task routing server is provided that assigns different sets of sub-tasks to different sets of task engines. The number of engines used to perform a task and the number of engines in each set are adjusted. By adjusting the parameters settings for the set of engines based on the type of application, the particular requirements of the application, or the nature and importance of the subtasks, for example, advantages such as improvement of resource utilization and the hardware and software costs reduction may be obtained.
    • 提供语音处理系统,其中在多个服务器上运行的引擎组彼此不同地配置。 发动机组可以被配置为在执行任务和执行任务所需的资源之间实现不同的权衡。 在语音处理系统中,提供了将不同的子任务集分配给不同的任务引擎组的任务路由服务器。 调整用于执行任务的引擎数量和每组中的引擎数量。 通过基于应用程序的类型,应用程序的特定要求或子任务的性质和重要性调整引擎集的参数设置,例如资源利用率的提高以及降低硬件和软件成本的优点 可以获得。
    • 8. 发明申请
    • METHOD AND SYSTEM FOR USING A STATISTICAL LANGUAGE MODEL AND AN ACTION CLASSIFIER IN PARALLEL WITH GRAMMAR FOR BETTER HANDLING OF OUT-OF-GRAMMAR UTTERANCES
    • 使用统计语言模型的方法和系统和与GRAMMAR并行的动作分类器,用于更好地处理超出灰度的UTTERANCES
    • US20080270135A1
    • 2008-10-30
    • US11742149
    • 2007-04-30
    • Vaibhava GoelRamesh GopinathEa-Ee JanKarthik Visweswariah
    • Vaibhava GoelRamesh GopinathEa-Ee JanKarthik Visweswariah
    • G10L15/18
    • G10L15/1822G10L15/193G10L15/197
    • A method (and system) of handling out-of-grammar utterances includes building a statistical language model for a dialog state using, generating sentences and semantic interpretations for the sentences using finite state grammar, building a statistical action classifier, receiving user input, carrying out recognition with the finite state grammar, carrying out recognition with the statistical language model, using the statistical action classifier to find semantic interpretations, comparing an output from the finite state grammar and an output from the statistical language model, deciding which output of the output from the finite state grammar and the output from the statistical language model to keep as a final recognition output, selecting the final recognition output, and outputting the final recognition result, wherein the statistical action classifier, the finite state grammar and the statistical language model are used in conjunction to carry out speech recognition and interpretation.
    • 处理语法语法的方法(和系统)包括使用有限状态语法为对话状态建立统计语言模型,使用有限状态语法生成句子和语义解释,构建统计动作分类器,接收用户输入,携带 使用有限状态语法进行识别,使用统计语言模型进行识别,使用统计动作分类器来查找语义解释,比较有限状态语法的输出和来自统计语言模型的输出,决定输出的哪个输出 从有限状态语法和统计语言模型的输出,作为最终识别输出,选择最终识别输出,并输出最终识别结果,其中统计动作分类器,有限状态语法和统计语言模型是 结合使用来进行语音识别和解释 。