专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US07558734B1 Using web FAQ data for creating self-service speech applications 有权
标题翻译：使用Web常见问题数据创建自助语音应用程序
公开(公告)号：US07558734B1
公开(公告)日：2009-07-07
申请号：US12271912
申请日：2008-11-16
申请人： Osamuyimen T. Stewart , David M. Lubensky , Ea-Ee Jan , Xiang Li
发明人： Osamuyimen T. Stewart , David M. Lubensky , Ea-Ee Jan , Xiang Li
IPC分类号： G10L21/00
CPC分类号： G06F17/30861 , G06F17/2785 , G06F17/30684 , G10L15/26 , H04M3/4938
摘要： In one example, this invention presents a method of providing the same self-service content that is available on the web interface to users contacting by telephone, knowing that the web and telephone are fundamentally different user interfaces. In one embodiment, this patent seeks to protect the general idea of how to playback web data in real-time to the user over the speech interface. For this purpose, a method is presented comprising of the general steps through which the web data is initially sent to an automatic transformation module. Then, that transformation module refines or re-structures the web data to make it suitable for the speech interface. The algorithm in the module is predicated on the user interface principles of cognitive complexity and limitations on short term memory based on which FAQ types are classified into one of the following four classes: simple, medium, complex, and complex-complex.
摘要翻译：在一个示例中，本发明提供了一种方法，即知道网络和电话是根本上不同的用户界面的，通过电话向网络接口提供与用户联系的相同的自助服务内容。在一个实施例中，该专利旨在保护如何通过语音界面实时地向用户播放web数据的一般思想。为此，提出了一种方法，其包括最初将web数据发送到自动变换模块的一般步骤。然后，该转换模块对网络数据进行优化或重构，使其适合于语音界面。模块中的算法基于用户界面的认知复杂性原理和短期记忆的限制，基于哪些常见问题类型分为以下四个类别之一：简单，中等，复杂和复杂。

2. 发明授权

US5745649A Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories 失效
标题翻译：使用多个不同的多层感知结构来自动语音识别来模拟多个不同的音素类别
公开(公告)号：US5745649A
公开(公告)日：1998-04-28
申请号：US820937
申请日：1997-03-19
申请人： David M. Lubensky
发明人： David M. Lubensky
IPC分类号： G10L15/14 , G10L15/16 , G10L5/04
CPC分类号： G10L15/16 , G10L15/142
摘要： For speech recognition systems a method for modeling context-dependent phonetic categories using artificial neural nets has been described. First, linguistically motivated context-clustering is employed to reduce the number of context-dependent categories. Second, phone-specific MLP structures are used where the number of outputs in each MLP is based on the number of left and right contexts occurring in a training database. The structure of each MLP can be automatically determined using the cascade-correlation learning algorithm.
摘要翻译：对于语音识别系统，已经描述了使用人造神经网络来对上下文相关语音分类建模的方法。首先，采用语言动机的上下文聚类来减少上下文相关类别的数量。第二，使用电话特定的MLP结构，其中每个MLP中的输出数量基于在训练数据库中发生的左右上下文的数量。可以使用级联相关学习算法自动确定每个MLP的结构。

3. 发明授权

US5719921A Methods and apparatus for activating telephone services in response to speech 失效
标题翻译：响应语音激活电话服务的方法和装置
公开(公告)号：US5719921A
公开(公告)日：1998-02-17
申请号：US609029
申请日：1996-02-29
申请人： George J. Vysotsky , Ayman O. Asadi , David M. Lubensky , Vijay R. Raman , Jayant M. Naik
发明人： George J. Vysotsky , Ayman O. Asadi , David M. Lubensky , Vijay R. Raman , Jayant M. Naik
IPC分类号： G10L15/00 , G10L15/06 , G10L15/20 , G10L15/22 , G10L15/26 , G10L15/28 , H04M3/42 , H04M3/44 , G10L9/08 , H04M1/30 , H04M1/66
CPC分类号： H04M1/271 , G10L15/065 , G10L15/20 , G10L15/22 , G10L15/26 , G10L15/34 , H04M3/42204 , H04M3/44 , G10L2015/088 , H04M2201/40 , H04M3/42
摘要： Methods and apparatus for activating telephone services in response to speech are described. A directory including names is maintained for each customer. A speaker dependent speech template and a telephone number for each name, is maintained as part of each customer's directory. Speaker independent speech templates are used for recognizing commands. The present invention has the advantage of permitting a customer to place a call by speaking a person's name which serves as a destination identifier without having to speak an additional command or steering word to place the call. This is achieved by treating the receipt of a spoken name in the absence of a command as an implicit command to place a call. Explicit speaker independent commands are used to invoke features or services other than call placement. Speaker independent and speaker dependent speech recognition are performed on a customer's speech in parallel. An arbiter is used to decide which function or service should be performed when an apparent conflict arises as a result of both the speaker dependent and speaker independent speech recognition step outputs. Stochastic grammars, word spotting and/or out-of-vocabulary rejection are used as part of the speech recognition process to provide a user friendly interface which permits the use of spontaneous speech. Voice verification is performed on a selective basis where security is of concern.
摘要翻译：描述了响应于语音激活电话服务的方法和装置。为每个客户维护包含名称的目录。每个名字的说话者依赖语音模板和电话号码都作为每个客户目录的一部分进行维护。扬声器独立语音模板用于识别命令。本发明的优点在于，允许客户通过说出作为目的地标识符的人的姓名来进行呼叫，而不用说另外的命令或指导词来进行呼叫。这是通过在没有命令的情况下处理接收到口语名称作为发出呼叫的隐式命令来实现的。独立于显示扬声器的命令用于调用除呼叫位置之外的功能或服务。扬声器独立和扬声器相关语音识别是在客户演讲中并行执行的。当由于说话人依赖和说话者独立的语音识别步骤输出而产生明显的冲突时，仲裁器用于决定应该执行哪个功能或服务。语音识别过程的一部分使用随机语法，单词发音和/或超出词汇拒绝，以提供允许使用自发语音的用户友好界面。语音验证是在安全性受到关注的基础上进行的。

4. 发明申请

US20130231916A1 METHOD AND APPARATUS FOR FAST TRANSLATION MEMORY SEARCH 有权
标题翻译：用于快速翻译记忆搜索的方法和装置
公开(公告)号：US20130231916A1
公开(公告)日：2013-09-05
申请号：US13412104
申请日：2012-03-05
申请人： Juan M. Huerta , David M. Lubensky , Cheng Wu
发明人： Juan M. Huerta , David M. Lubensky , Cheng Wu
IPC分类号： G06F17/28
CPC分类号： G06F17/2827 , G06F17/2836
摘要： Methods and systems for fast translation memory search include, in response to an input query string, identifying a plurality of hypothesis strings stored in a translation memory as candidates to match the query string. One or more candidates are eliminated, using a processor, where string lengths between the candidates and the query string are at least a cutoff value representing a string edit distance. One or more candidates are eliminated where differences in word frequency distributions between the candidates and the query string are at least the cutoff value. One or more candidates are eliminated by employing a dynamic programming matrix where string edit distances between the candidates and the query string are at least the cutoff value. A number of remaining candidates are outputted as matches to the query string.
摘要翻译：用于快速翻译存储器搜索的方法和系统包括响应于输入查询字符串，将存储在翻译存储器中的多个假设字符串识别为与查询字符串匹配的候选。消除一个或多个候选者，使用处理器，候选者和查询字符串之间的串长度至少是表示字符串编辑距离的截止值。删除一个或多个候选者，其中候选和查询字符串之间的字频率分布的差异至少为截止值。通过采用动态编程矩阵来消除一个或多个候选者，其中候选和查询串之间的字符串编辑距离至少为截止值。剩余的多个候选者作为匹配输出到查询字符串。

5. 发明申请

US20120136646A1 Data Security System 有权
标题翻译：数据安全系统
公开(公告)号：US20120136646A1
公开(公告)日：2012-05-31
申请号：US12956739
申请日：2010-11-30
申请人： Carl J. Kraenzel , David M. Lubensky , Baiju Dhirajlal Mandalia , Cheng Wu
发明人： Carl J. Kraenzel , David M. Lubensky , Baiju Dhirajlal Mandalia , Cheng Wu
IPC分类号： G06F17/28
CPC分类号： G06F17/289 , G06F17/2854 , G10L13/02 , G10L15/26
摘要： A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.
摘要翻译：一种用于翻译信息的方法，计算机系统和计算机程序产品。计算机系统接收翻译信息。计算机系统基于用于响应于接收信息的信息的安全性的一组规则来识别信息的部分。计算机系统将该部分信息发送到多个翻译系统。响应于从信息的各个部分接收来自多个翻译系统的翻译结果，计算机系统组合各个部分的翻译结果以形成信息的综合翻译。

6. 发明申请

US20090287483A1 METHOD AND SYSTEM FOR IMPROVED SPEECH RECOGNITION 有权
标题翻译：改进语音识别的方法和系统
公开(公告)号：US20090287483A1
公开(公告)日：2009-11-19
申请号：US12120316
申请日：2008-05-14
申请人： Raymond L. Co , Ee-ee Jan , David M. Lubensky
发明人： Raymond L. Co , Ee-ee Jan , David M. Lubensky
IPC分类号： G10L15/00
CPC分类号： G10L15/22 , G10L15/08 , G10L2015/223
摘要： A method for speech recognition includes: prompting a user with a first query to input speech into a speech recognition engine; determining if the inputted speech is correctly recognized; wherein in the event the inputted speech is correctly recognized proceeding to a new task; wherein in the event the inputted speech is not correctly recognized, prompting the user repeatedly with the first query to input speech into the speech recognition engine, and determining if the inputted speech is correctly recognized until a predefined limit on repetitions has been met; wherein in the event the predefined limit has been met without correctly recognizing the inputted user speech, prompting speech input from the user with a secondary query for redundant information; and cross-referencing the user's n-best result from the first query with the n-best result from the second query to obtain a top hypothesis.
摘要翻译：一种用于语音识别的方法包括：提示用户进行第一次查询以将语音输入到语音识别引擎中; 确定输入的语音是否被正确识别; 其中在所输入的语音被正确识别进行到新任务的情况下; 其中在所输入的语音未被正确识别的情况下，用所述第一询问反复提示所述用户向所述语音识别引擎输入语音，并且确定所输入的语音是否被正确识别，直到已经满足预定的重复限制为止; 其中在没有正确地识别所输入的用户语音的情况下满足所述预定义限制的情况下，用冗余信息的辅助查询提示来自用户的语音输入; 并用第二个查询的n个最佳结果交叉引用第一个查询中用户的最佳结果，以获得最高假设。

7. 发明授权

US09002696B2 Data security system for natural language translation 有权
标题翻译：自然语言翻译数据安全系统
公开(公告)号：US09002696B2
公开(公告)日：2015-04-07
申请号：US12956739
申请日：2010-11-30
申请人： Carl J. Kraenzel , David M. Lubensky , Baiju Dhirajlal Mandalia , Cheng Wu
发明人： Carl J. Kraenzel , David M. Lubensky , Baiju Dhirajlal Mandalia , Cheng Wu
IPC分类号： G06F17/18 , G06F17/27 , G06F17/28 , G10L13/02 , G10L15/26
CPC分类号： G06F17/289 , G06F17/2854 , G10L13/02 , G10L15/26
摘要： A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.
摘要翻译：一种用于翻译信息的方法，计算机系统和计算机程序产品。计算机系统接收翻译信息。计算机系统基于用于响应于接收信息的信息的安全性的一组规则来识别信息的部分。计算机系统将该部分信息发送到多个翻译系统。响应于从信息的各个部分接收来自多个翻译系统的翻译结果，计算机系统组合各个部分的翻译结果以形成信息的综合翻译。

8. 发明申请

US20120310629A1 SYSTEMS AND METHODS FOR AUTOMATICALLY DETERMINING CULTURE-BASED BEHAVIOR IN CUSTOMER SERVICE INTERACTIONS 失效
标题翻译：用于自动确定客户服务交互中基于文化的行为的系统和方法
公开(公告)号：US20120310629A1
公开(公告)日：2012-12-06
申请号：US13572215
申请日：2012-08-10
申请人： Osamuyimen T. Stewart , David M. Lubensky , Joyram Chakraborty
发明人： Osamuyimen T. Stewart , David M. Lubensky , Joyram Chakraborty
IPC分类号： G10L11/00 , G06F17/27
CPC分类号： G10L15/22 , G06Q30/02
摘要： Systems and methods are provided to automatically determine culture-based behavioral tendencies and preferences of individuals in the context of customer service interactions. For example, systems and methods are provided to process natural language dialog input of an individual to detect linguistic features indicative of individualistic and collectivistic behavioral tendencies and predict whether such individual will be cooperative or uncooperative with automated customer service.
摘要翻译：提供系统和方法以在客户服务交互的背景下自动确定基于文化的行为倾向和个人偏好。例如，提供系统和方法来处理个人的自然语言对话输入以检测指示个人主义和集体主义行为倾向的语言特征，并且预测这样的个人是否与自动化客户服务协作或不合作。

9. 发明授权

US07624014B2 Using partial information to improve dialog in automatic speech recognition systems 有权
标题翻译：使用部分信息来改善自动语音识别系统中的对话
公开(公告)号：US07624014B2
公开(公告)日：2009-11-24
申请号：US12206531
申请日：2008-09-08
申请人： Osamuyimen T. Stewart , David M. Lubensky
发明人： Osamuyimen T. Stewart , David M. Lubensky
IPC分类号： G10L15/00
CPC分类号： G10L15/193
摘要： A method, system and computer readable device for recognizing a partial utterance in an automatic speech recognition (ASR) system where said method comprising the steps of, receiving, by a ASR recognition unit, an input signal representing a speech utterance or word and transcribing the input signal into text, interpreting, by a ASR interpreter unit, whether the text is either a positive or a negative match to a list of automated options by matching the text with a grammar or semantic database representing the list of automated options, wherein if the ASR interpreter unit results in said positive match proceeding to a next input signal and if the ASR interpreter unit results in said negative match rejecting the text as representing said partial utterance, and processing, by a linguistic filtering unit, the rejected text to derive a correct match between the rejected text and the grammar or semantic database. And, then using the derived word for responding to the user in the next dialog turn in order to reduce or eliminate churn in the human-computer spoken dialog interaction.
摘要翻译：一种用于识别自动语音识别（ASR）系统中的部分发音的方法，系统和计算机可读设备，其中所述方法包括以下步骤：由ASR识别单元接收表示语音话语或单词的输入信号，并且转录输入信号到文本中，由ASR解释器单元解释文本是否与自动选项列表的正或负匹配，通过将文本与表示自动选项列表的语法或语义数据库相匹配，其中如果 ASR解释器单元导致所述正匹配进行到下一个输入信号，并且如果ASR解释器单元导致所述否定匹配拒绝文本表示所述部分话语，并且由语言过滤单元处理被拒绝的文本以导出正确的拒绝文本与语法或语义数据库之间的匹配。然后，在下一个对话框中使用派生词来响应用户，以减少或消除人机对话交互中的流失。

10. 发明授权

US07487084B2 Apparatus, program storage device and method for testing speech recognition in the mobile environment of a vehicle 有权
标题翻译：用于在车辆的移动环境中测试语音识别的装置，程序存储装置和方法
公开(公告)号：US07487084B2
公开(公告)日：2009-02-03
申请号：US10210667
申请日：2002-07-31
申请人： Andrew Aaron , Subrata K. Das , David M. Lubensky
发明人： Andrew Aaron , Subrata K. Das , David M. Lubensky
IPC分类号： G10L15/00
CPC分类号： G10L15/01
摘要： A testing arrangement provided for speech recognition systems in vehicles. Preferably included are a “mobile client” secured in the vehicle and driven around at a desired speed, an audio system and speaker which plays back a set of prerecorded utterances stored digitally in a computer arrangement such that the speech of a human being is simulated, transmission of the speech signal to a server, followed by speech recognition and signal-to-noise ratio (SNR) computation. Here, the acceptability of the vehicular speech recognition system is preferably determined via comparison with pre-specified standards of recognition accuracy and SNR values.
摘要翻译：为车辆中的语音识别系统提供的测试装置。优选地包括固定在车辆中并以所需速度驱动的“移动客户端”，音频系统和扬声器，其回放一组预先存储的话语，其以数字方式存储在计算机装置中，使得人的语音被模拟，将语音信号传输到服务器，随后进行语音识别和信噪比（SNR）计算。这里，车辆语音识别系统的可接受性优选地通过与预先指定的识别精度和SNR值的标准相比较来确定。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式