会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Conversational computing via conversational virtual machine
    • 通过对话虚拟机进行会话计算
    • US07137126B1
    • 2006-11-14
    • US09806565
    • 1999-10-01
    • Daniel CoffmanLiam D. ComerfordSteven DeGennaroEdward A. EpsteinPonani GopalakrishnanStephane H. MaesDavid Nahamoo
    • Daniel CoffmanLiam D. ComerfordSteven DeGennaroEdward A. EpsteinPonani GopalakrishnanStephane H. MaesDavid Nahamoo
    • G06F9/54G06F9/50G06F9/44G10L15/00
    • H04M3/50G06F17/30899G10L15/22G10L15/285G10L2015/228H04L67/02H04M1/72561H04M3/42204H04M3/44H04M3/493H04M3/4931H04M3/4936H04M3/4938H04M7/00H04M2201/40H04M2201/60H04M2203/355H04M2250/74
    • A conversational computing system that provides a universal coordinated multi-modal conversational user interface (CUI) (10) across a plurality of conversationally aware applications (11) (i.e., applications that “speak” conversational protocols) and conventional applications (12). The conversationally aware maps, applications (11) communicate with a conversational kernel (14) via conversational application APIs (13). The conversational kernel (14) controls the dialog across applications and devices (local and networked) on the basis of their registered conversational capabilities and requirements and provides a unified conversational user interface and conversational services and behaviors. The conversational computing system may be built on top of a conventional operating system and APIs (15) and conventional device hardware (16). The conversational kernel (14) handles all I/O processing and controls conversational engines (18). The conversational kernel (14) converts voice requests into queries and converts outputs and results into spoken messages using conversational engines (18) and conversational arguments (17). The conversational application API (13) conveys all the information for the conversational kernel (14) to transform queries into application calls and conversely convert output into speech, appropriately sorted before being provided to the user.
    • 一种对话计算系统,其跨越多个会话感知应用(11)(即,“说”对话协议的应用“)和常规应用(12)提供通用协调多模态对话用户界面(CUI)(10)。 对话感知地图,应用程序(11)通过对话应用程序API(13)与对话内核(14)进行通信。 对话内核(14)根据其注册的会话能力和要求,控制应用和设备(本地和网络)之间的对话,并提供统一的会话用户界面和对话服务和行为。 对话计算系统可以构建在常规操作系统和API(15)和常规设备硬件(16)之上。 对话内核(14)处理所有I / O处理和控制对话引擎(18)。 会话内核(14)将语音请求转换为查询,并将会话引擎(18)和会话参数(17)将输出和结果转换为口语消息。 对话应用程序API(13)传达对话内核(14)的所有信息,以将查询转换成应用程序调用,并相反地将输出转换为语音,在提供给用户之前进行适当排序。
    • 2. 发明授权
    • Conversational computing via conversational virtual machine
    • 通过对话虚拟机进行会话计算
    • US07729916B2
    • 2010-06-01
    • US11551901
    • 2006-10-23
    • Daniel CoffmanLiam D. ComerfordSteven DeGennaroEdward A. EpsteinPonani GopalakrishnanStephane H. MaesDavid Nahamoo
    • Daniel CoffmanLiam D. ComerfordSteven DeGennaroEdward A. EpsteinPonani GopalakrishnanStephane H. MaesDavid Nahamoo
    • G10L15/22G10L15/28
    • H04M3/50G06F17/30899G10L15/22G10L15/285G10L2015/228H04L67/02H04M1/72561H04M3/42204H04M3/44H04M3/493H04M3/4931H04M3/4936H04M3/4938H04M7/00H04M2201/40H04M2201/60H04M2203/355H04M2250/74
    • A conversational computing system that provides a universal coordinated multi-modal conversational user interface (CUI) 10 across a plurality of conversationally aware applications (11) (i.e., applications that “speak” conversational protocols) and conventional applications (12). The conversationally aware applications (11) communicate with a conversational kernel (14) via conversational application APIs (13). The conversational kernel 14 controls the dialog across applications and devices (local and networked) on the basis of their registered conversational capabilities and requirements and provides a unified conversational user interface and conversational services and behaviors. The conversational computing system may be built on top of a conventional operating system and APIs (15) and conventional device hardware (16). The conversational kernel (14) handles all I/O processing and controls conversational engines (18). The conversational kernel (14) converts voice requests into queries and converts outputs and results into spoken messages using conversational engines (18) and conversational arguments (17). The conversational application API (13) conveys all the information for the conversational kernel (14) to transform queries into application calls and conversely convert output into speech, appropriately sorted before being provided to the user.
    • 一种对话计算系统,其跨越多个会话感知应用(11)(即,“说”对话协议的应用)和常规应用(12)提供通用协调多模态对话用户界面(CUI)10。 对话感知应用(11)通过对话应用API(13)与对话内核(14)通信。 会话核心14基于其注册的对话能力和需求来控制应用和设备(本地和网络)之间的对话,并提供统一的对话用户界面和对话服务和行为。 对话计算系统可以构建在常规操作系统和API(15)和常规设备硬件(16)之上。 对话内核(14)处理所有I / O处理和控制对话引擎(18)。 会话内核(14)将语音请求转换为查询,并将会话引擎(18)和会话参数(17)将输出和结果转换为口语消息。 对话应用程序API(13)传达对话内核(14)的所有信息,以将查询转换成应用程序调用,并相反地将输出转换为语音,在提供给用户之前进行适当排序。
    • 3. 发明授权
    • Speech recognition system with improved rejection of words and sounds
not in the system vocabulary
    • 语音识别系统改进了排除词和声音的系统词汇
    • US5465317A
    • 1995-11-07
    • US62972
    • 1993-05-18
    • Edward A. Epstein
    • Edward A. Epstein
    • G10L15/06G10L11/02G10L15/14G10L15/28G10L5/06
    • G10L25/78
    • A speech recognizer that selects a command model for a current sound if the best match score for the current sound exceeds its corresponding threshold score. The threshold score is assigned a confidence score based on the best match score and recognition threshold of a prior sound. When the best match score for the current sound exceeds a "poor" confidence score but is less than a "good" confidence score: (a) the word corresponding to the acoustic model having the best match score is accepted as highly likely to correspond to the measured sound if the previously recognized word was accepted as having a high likelihood of corresponding to the previous sound; (b) the word corresponding to the acoustic model having the best match score is rejected as highly unlikely to correspond to the measured sound if the previously recognized word was rejected as having a low likelihood of corresponding to the previous sound; or (c) if there is sufficient intervening silence between a previously rejected word and the current word, then the current word is also accepted as having a high likelihood of corresponding to the measured current sound.
    • 如果当前声音的最佳匹配分数超过其对应的阈值分数,则语音识别器选择当前声音的命令模型。 基于最佳匹配分数和先前声音的识别阈值为阈值分数分配置信度分数。 当当前声音的最佳匹配分数超过“不良”置信度分数,但小于“好”置信度分数时:(a)对应于具有最佳匹配分数的声学模型的单词被接受为极有可能对应于 如果先前识别的字被接受为具有对应于先前声音的高可能性的测量声音; (b)如果先前识别的字被拒绝为具有对应于先前声音的低可能性,则与具有最佳匹配得分的声学模型相对应的字被拒绝非常不可能对应于测量的声音; 或者(c)如果先前拒绝的字与当前字之间存在足够的静音,则当前字也被接受为对应于所测量的当前声音的高可能性。
    • 8. 发明授权
    • Speech coding apparatus with single-dimension acoustic prototypes for a
speech recognizer
    • 具有用于语音识别器的单维声学原型的语音编码装置
    • US5280562A
    • 1994-01-18
    • US770495
    • 1991-10-03
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • G10L19/00G10L15/02G10L19/02H03M7/30G10L9/02
    • G10L19/038H03M7/3082
    • In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals having parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vectors signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
    • 在语音识别和语音编码中,在一系列时间间隔期间测量话音的至少两个特征的值,以产生一系列特征向量信号。 存储仅具有一个参数值的多个单维原型矢量信号。 具有表示第一特征值的参数值和至少两个其它单维原型矢量信号的至少两个单维原型矢量信号具有表示第二特征值的参数值。 多个复合尺寸原型矢量信号具有唯一的识别值,并且包括一个第一维和一个第二维原型矢量信号。 至少两个复合维度原型矢量信号包括相同的第一维原型矢量信号。 将每个特征向量信号的特征值与化合物维度原型矢量信号的参数值进行比较,以获得原型匹配分数。 具有特征矢量信号的具有最佳原型匹配分数的复合维度原型矢量信号的识别值被输出为将被识别的话语的编码表示的序列。 针对多个语音单元中的每一个生成包括语音单元与语音编码表示序列之间的匹配的接近度的估计的匹配分数。 显示具有最佳匹配分数的一个或多个最佳候选语音单元的至少一个语音子单元。