    • 2. Granted invention patent
    • System and method for low-latency web-based text-to-speech without plugins
    • US09240180B2
    • 2016-01-19
    • US13308860
    • 2011-12-01
    • Alistair D. Conkie; Mark Charles Beutnagel; Taniya Mishra
    • Alistair D. Conkie; Mark Charles Beutnagel; Taniya Mishra
    • G10L13/00; G10L13/08; G10L13/10
    • G10L13/04; G10L13/10
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
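The caching behavior described in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the punctuation-based splitter stands in for real intonational-phrase detection, the SHA-256 key is one assumed choice of "unique identifier", and `CachedTTS`, `split_phrases`, `phrase_key`, and `fake_synthesize` are hypothetical names introduced here.

```python
import hashlib
import re


def split_phrases(text):
    """Assumed stand-in for intonational-phrase splitting: break after punctuation."""
    parts = re.split(r"(?<=[.,;:!?])\s+", text.strip())
    return [p for p in parts if p]


def phrase_key(phrase):
    """Assumed unique identifier: a SHA-256 digest of the phrase text."""
    return hashlib.sha256(phrase.encode("utf-8")).hexdigest()


class CachedTTS:
    """Returns audio per phrase, calling the synthesizer only on cache misses."""

    def __init__(self, synthesize):
        self.synthesize = synthesize  # callable: phrase text -> audio bytes
        self.cache = {}               # identifier -> audio bytes
        self.server_calls = 0         # counts simulated TTS server round trips

    def audio_for(self, phrase):
        key = phrase_key(phrase)
        if key not in self.cache:
            self.server_calls += 1
            self.cache[key] = self.synthesize(phrase)
        return self.cache[key]

    def speak(self, text):
        return [self.audio_for(p) for p in split_phrases(text)]


def fake_synthesize(phrase):
    """Placeholder for a TTS server call; returns dummy bytes, not real audio."""
    return b"AUDIO:" + phrase.encode("utf-8")
```

With this sketch, `CachedTTS(fake_synthesize).speak("Hello there, welcome back. Hello there, goodbye.")` yields four clips while contacting the placeholder synthesizer only three times, because the repeated phrase is served from the cache.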
    • 8. Invention patent application
    • SYSTEM AND METHOD FOR TIGHTLY COUPLING AUTOMATIC SPEECH RECOGNITION AND SEARCH
    • US20110144995A1
    • 2011-06-16
    • US12638649
    • 2009-12-15
    • Srinivas BANGALORE; Taniya MISHRA
    • Srinivas BANGALORE; Taniya MISHRA
    • G10L15/00; G06F17/30
    • G10L15/18; G06F17/30637; G06F17/30663; G10L15/083
    • Disclosed herein are systems, methods, and computer-readable storage media for performing a search. A system configured to practice the method first receives from an automatic speech recognition (ASR) system a word lattice based on a speech query and receives indexed documents from an information repository. The system composes, based on the word lattice and the indexed documents, at least one triple including a query word, a selected indexed document, and a weight. The system generates an N-best path through the word lattice based on the at least one triple and re-ranks the ASR output based on the N-best path. The system aggregates each weight across the query words to generate N-best listings and returns search results for the speech query based on the re-ranked ASR output and the N-best listings. The lattice can be a confusion network, the arc density of which can be adjusted for a desired performance level.
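The step of aggregating weights across query words can be illustrated with a small sketch. The abstract does not say how weights are computed or combined, so summing per document is an assumption made here for illustration, and `n_best_documents` is a hypothetical helper name. Lattice traversal and ASR re-ranking are out of scope for this sketch.

```python
from collections import defaultdict


def n_best_documents(triples, n):
    """Rank documents from (query_word, document, weight) triples.

    Assumption: a document's score is the sum of its weights over all
    query words; ties are broken by document identifier.
    """
    totals = defaultdict(float)
    for _word, doc, weight in triples:
        totals[doc] += weight
    ranked = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:n]


# Illustrative triples only; these are not data from the patent.
example_triples = [
    ("pizza", "doc_a", 0.5),
    ("pisa", "doc_b", 0.25),
    ("hut", "doc_a", 0.5),
    ("hut", "doc_c", 0.125),
]
```

Calling `n_best_documents(example_triples, 2)` ranks `doc_a` first because it collects weight from two different query words.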
    • 9. Granted invention patent
    • System and method for generating challenge utterances for speaker verification
    • US09318114B2
    • 2016-04-19
    • US12954094
    • 2010-11-24
    • Ilija Zeljkovic; Taniya Mishra; Amanda Stent; Ann K. Syrdal; Jay Wilpon
    • Ilija Zeljkovic; Taniya Mishra; Amanda Stent; Ann K. Syrdal; Jay Wilpon
    • G10L17/00; G10L17/24; G10L15/08; G10L17/26
    • G10L17/24; G10L15/02; G10L15/08; G10L17/00; G10L17/04; G10L17/26; G10L2015/025
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media relating to speaker verification. In one aspect, a system receives a first user identity from a second user and, based on the identity, accesses voice characteristics. The system randomly generates a challenge sentence according to a rule and/or grammar, based on the voice characteristics, and prompts the second user to speak the challenge sentence. The system verifies that the second user is the first user if the spoken challenge sentence matches the voice characteristics. In an enrollment aspect, the system constructs an enrollment phrase that covers a minimum threshold of unique speech sounds based on speaker-distinctive phonemes, phoneme clusters, and prosody. The user then utters the enrollment phrase, and the system extracts voice characteristics for the user from the uttered enrollment phrase. The system generates a user profile, based on the voice characteristics, for generating random challenge sentences according to a grammar.
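Random generation of a challenge sentence from a grammar might look like the sketch below. The toy grammar, the vocabulary, and the names `GRAMMAR`, `expand`, and `challenge_sentence` are all assumptions introduced for illustration. The abstract also conditions generation on the enrolled speaker's voice characteristics, which this sketch does not model.

```python
import random

# Toy context-free grammar (an assumption, not taken from the patent).
# Keys are non-terminals; any symbol that is not a key is a literal word.
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["the", "N"], ["a", "N"]],
    "VP": [["V", "NP"]],
    "N": [["river"], ["judge"], ["window"]],
    "V": [["watches"], ["follows"]],
}


def expand(symbol, rng):
    """Recursively expand a symbol into a list of words."""
    if symbol not in GRAMMAR:
        return [symbol]
    production = rng.choice(GRAMMAR[symbol])
    words = []
    for part in production:
        words.extend(expand(part, rng))
    return words


def challenge_sentence(rng=None):
    """Generate one random sentence; defaults to an OS-backed random source."""
    rng = rng or random.SystemRandom()
    return " ".join(expand("S", rng))
```

Passing a seeded `random.Random` makes the output repeatable for testing, while the default `random.SystemRandom` is harder to predict, which matters if the challenge is meant to resist replayed recordings.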