会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Using web FAQ data for creating self-service speech applications
    • 使用Web常见问题数据创建自助语音应用程序
    • US07558734B1
    • 2009-07-07
    • US12271912
    • 2008-11-16
    • Osamuyimen T. StewartDavid M. LubenskyEa-Ee JanXiang Li
    • Osamuyimen T. StewartDavid M. LubenskyEa-Ee JanXiang Li
    • G10L21/00
    • G06F17/30861G06F17/2785G06F17/30684G10L15/26H04M3/4938
    • In one example, this invention presents a method of providing the same self-service content that is available on the web interface to users contacting by telephone, knowing that the web and telephone are fundamentally different user interfaces. In one embodiment, this patent seeks to protect the general idea of how to playback web data in real-time to the user over the speech interface. For this purpose, a method is presented comprising of the general steps through which the web data is initially sent to an automatic transformation module. Then, that transformation module refines or re-structures the web data to make it suitable for the speech interface. The algorithm in the module is predicated on the user interface principles of cognitive complexity and limitations on short term memory based on which FAQ types are classified into one of the following four classes: simple, medium, complex, and complex-complex.
    • 在一个示例中,本发明提供了一种方法,即知道网络和电话是根本上不同的用户界面的,通过电话向网络接口提供与用户联系的相同的自助服务内容。 在一个实施例中,该专利旨在保护如何通过语音界面实时地向用户播放web数据的一般思想。 为此,提出了一种方法,其包括最初将web数据发送到自动变换模块的一般步骤。 然后,该转换模块对网络数据进行优化或重构,使其适合于语音界面。 模块中的算法基于用户界面的认知复杂性原理和短期记忆的限制,基于哪些常见问题类型分为以下四个类别之一:简单,中等,复杂和复杂。
    • 3. 发明授权
    • Methods and apparatus for activating telephone services in response to
speech
    • 响应语音激活电话服务的方法和装置
    • US5719921A
    • 1998-02-17
    • US609029
    • 1996-02-29
    • George J. VysotskyAyman O. AsadiDavid M. LubenskyVijay R. RamanJayant M. Naik
    • George J. VysotskyAyman O. AsadiDavid M. LubenskyVijay R. RamanJayant M. Naik
    • G10L15/00G10L15/06G10L15/20G10L15/22G10L15/26G10L15/28H04M3/42H04M3/44G10L9/08H04M1/30H04M1/66
    • H04M1/271G10L15/065G10L15/20G10L15/22G10L15/26G10L15/34H04M3/42204H04M3/44G10L2015/088H04M2201/40H04M3/42
    • Methods and apparatus for activating telephone services in response to speech are described. A directory including names is maintained for each customer. A speaker dependent speech template and a telephone number for each name, is maintained as part of each customer's directory. Speaker independent speech templates are used for recognizing commands. The present invention has the advantage of permitting a customer to place a call by speaking a person's name which serves as a destination identifier without having to speak an additional command or steering word to place the call. This is achieved by treating the receipt of a spoken name in the absence of a command as an implicit command to place a call. Explicit speaker independent commands are used to invoke features or services other than call placement. Speaker independent and speaker dependent speech recognition are performed on a customer's speech in parallel. An arbiter is used to decide which function or service should be performed when an apparent conflict arises as a result of both the speaker dependent and speaker independent speech recognition step outputs. Stochastic grammars, word spotting and/or out-of-vocabulary rejection are used as part of the speech recognition process to provide a user friendly interface which permits the use of spontaneous speech. Voice verification is performed on a selective basis where security is of concern.
    • 描述了响应于语音激活电话服务的方法和装置。 为每个客户维护包含名称的目录。 每个名字的说话者依赖语音模板和电话号码都作为每个客户目录的一部分进行维护。 扬声器独立语音模板用于识别命令。 本发明的优点在于,允许客户通过说出作为目的地标识符的人的姓名来进行呼叫,而不用说另外的命令或指导词来进行呼叫。 这是通过在没有命令的情况下处理接收到口语名称作为发出呼叫的隐式命令来实现的。 独立于显示扬声器的命令用于调用除呼叫位置之外的功能或服务。 扬声器独立和扬声器相关语音识别是在客户演讲中并行执行的。 当由于说话人依赖和说话者独立的语音识别步骤输出而产生明显的冲突时,仲裁器用于决定应该执行哪个功能或服务。 语音识别过程的一部分使用随机语法,单词发音和/或超出词汇拒绝,以提供允许使用自发语音的用户友好界面。 语音验证是在安全性受到关注的基础上进行的。
    • 4. 发明申请
    • METHOD AND APPARATUS FOR FAST TRANSLATION MEMORY SEARCH
    • 用于快速翻译记忆搜索的方法和装置
    • US20130231916A1
    • 2013-09-05
    • US13412104
    • 2012-03-05
    • Juan M. HuertaDavid M. LubenskyCheng Wu
    • Juan M. HuertaDavid M. LubenskyCheng Wu
    • G06F17/28
    • G06F17/2827G06F17/2836
    • Methods and systems for fast translation memory search include, in response to an input query string, identifying a plurality of hypothesis strings stored in a translation memory as candidates to match the query string. One or more candidates are eliminated, using a processor, where string lengths between the candidates and the query string are at least a cutoff value representing a string edit distance. One or more candidates are eliminated where differences in word frequency distributions between the candidates and the query string are at least the cutoff value. One or more candidates are eliminated by employing a dynamic programming matrix where string edit distances between the candidates and the query string are at least the cutoff value. A number of remaining candidates are outputted as matches to the query string.
    • 用于快速翻译存储器搜索的方法和系统包括响应于输入查询字符串,将存储在翻译存储器中的多个假设字符串识别为与查询字符串匹配的候选。 消除一个或多个候选者,使用处理器,候选者和查询字符串之间的串长度至少是表示字符串编辑距离的截止值。 删除一个或多个候选者,其中候选和查询字符串之间的字频率分布的差异至少为截止值。 通过采用动态编程矩阵来消除一个或多个候选者,其中候选和查询串之间的字符串编辑距离至少为截止值。 剩余的多个候选者作为匹配输出到查询字符串。
    • 6. 发明申请
    • METHOD AND SYSTEM FOR IMPROVED SPEECH RECOGNITION
    • 改进语音识别的方法和系统
    • US20090287483A1
    • 2009-11-19
    • US12120316
    • 2008-05-14
    • Raymond L. CoEe-ee JanDavid M. Lubensky
    • Raymond L. CoEe-ee JanDavid M. Lubensky
    • G10L15/00
    • G10L15/22G10L15/08G10L2015/223
    • A method for speech recognition includes: prompting a user with a first query to input speech into a speech recognition engine; determining if the inputted speech is correctly recognized; wherein in the event the inputted speech is correctly recognized proceeding to a new task; wherein in the event the inputted speech is not correctly recognized, prompting the user repeatedly with the first query to input speech into the speech recognition engine, and determining if the inputted speech is correctly recognized until a predefined limit on repetitions has been met; wherein in the event the predefined limit has been met without correctly recognizing the inputted user speech, prompting speech input from the user with a secondary query for redundant information; and cross-referencing the user's n-best result from the first query with the n-best result from the second query to obtain a top hypothesis.
    • 一种用于语音识别的方法包括:提示用户进行第一次查询以将语音输入到语音识别引擎中; 确定输入的语音是否被正确识别; 其中在所输入的语音被正确识别进行到新任务的情况下; 其中在所输入的语音未被正确识别的情况下,用所述第一询问反复提示所述用户向所述语音识别引擎输入语音,并且确定所输入的语音是否被正确识别,直到已经满足预定的重复限制为止; 其中在没有正确地识别所输入的用户语音的情况下满足所述预定义限制的情况下,用冗余信息的辅助查询提示来自用户的语音输入; 并用第二个查询的n个最佳结果交叉引用第一个查询中用户的最佳结果,以获得最高假设。
    • 9. 发明授权
    • Using partial information to improve dialog in automatic speech recognition systems
    • 使用部分信息来改善自动语音识别系统中的对话
    • US07624014B2
    • 2009-11-24
    • US12206531
    • 2008-09-08
    • Osamuyimen T. StewartDavid M. Lubensky
    • Osamuyimen T. StewartDavid M. Lubensky
    • G10L15/00
    • G10L15/193
    • A method, system and computer readable device for recognizing a partial utterance in an automatic speech recognition (ASR) system where said method comprising the steps of, receiving, by a ASR recognition unit, an input signal representing a speech utterance or word and transcribing the input signal into text, interpreting, by a ASR interpreter unit, whether the text is either a positive or a negative match to a list of automated options by matching the text with a grammar or semantic database representing the list of automated options, wherein if the ASR interpreter unit results in said positive match proceeding to a next input signal and if the ASR interpreter unit results in said negative match rejecting the text as representing said partial utterance, and processing, by a linguistic filtering unit, the rejected text to derive a correct match between the rejected text and the grammar or semantic database. And, then using the derived word for responding to the user in the next dialog turn in order to reduce or eliminate churn in the human-computer spoken dialog interaction.
    • 一种用于识别自动语音识别(ASR)系统中的部分发音的方法,系统和计算机可读设备,其中所述方法包括以下步骤:由ASR识别单元接收表示语音话语或单词的输入信号,并且转录 输入信号到文本中,由ASR解释器单元解释文本是否与自动选项列表的正或负匹配,通过将文本与表示自动选项列表的语法或语义数据库相匹配,其中如果 ASR解释器单元导致所述正匹配进行到下一个输入信号,并且如果ASR解释器单元导致所述否定匹配拒绝文本表示所述部分话语,并且由语言过滤单元处理被拒绝的文本以导出正确的 拒绝文本与语法或语义数据库之间的匹配。 然后,在下一个对话框中使用派生词来响应用户,以减少或消除人机对话交互中的流失。