会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 71. 发明授权
    • Using pitch during speech recognition post-processing to improve recognition accuracy
    • 在语音识别后处理中使用音调来提高识别精度
    • US09484027B2
    • 2016-11-01
    • US12635346
    • 2009-12-10
    • Xufang ZhaoUma Arun
    • Xufang ZhaoUma Arun
    • G10L21/00G10L25/00G10L15/20G10L25/90G10L25/15
    • G10L15/20G10L25/15G10L25/90G10L2015/027
    • A method of automated speech recognition in a vehicle. The method includes receiving audio in the vehicle, pre-processing the received audio to generate acoustic feature vectors, decoding the generated acoustic feature vectors to produce at least one speech hypothesis, and post-processing the at least one speech hypothesis using pitch to improve speech recognition accuracy. The speech hypothesis can be accepted as recognized speech during post-processing if pitch is present in the received audio. Alternatively, a pitch count for the received audio can be determined, N-best speech hypotheses can be post-processed by comparing the pitch count to syllable counts associated with the speech hypotheses, and the speech hypothesis having a syllable count equal to the pitch count can be accepted as recognized speech.
    • 一种在车辆中自动语音识别的方法。 该方法包括在车辆中接收音频,对接收的音频进行预处理以产生声学特征向量,解码所生成的声学特征向量以产生至少一个语音假设,以及使用音高对语音假设进行后处理以改善语音 识别精度。 如果接收到的音频中存在音调,则语音假设可以在后处理中被接受为识别语音。 或者,可以确定接收到的音频的音调计数,通过将音调计数与与语音假设相关联的音节计数进行比较,可以对N个最佳语音假设进行后处理,并且具有等于音高计数的音节计数的语音假设 可以被接受为公认的演讲。
    • 76. 发明授权
    • Voice activity detection and pitch estimation
    • 语音活动检测和音调估计
    • US09384759B2
    • 2016-07-05
    • US13590022
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • G10L21/00G10L25/00G10L25/93G10L15/00G10L15/20G10L25/78G10L25/90G10L25/18
    • G10L25/78G10L25/18G10L25/90G10L25/93
    • Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
    • 实现包括可操作以通过检测声门脉冲来检测可听信号中的语音活动的系统,方法和/或设备。 一系列声门脉冲的主频被视为自然语音的语调模式或旋律,也称为音调。 然而,如上所述,语音通信通常在存在噪声和/或其他干扰的情况下发生。 反过来,通过噪声和/或其他干扰,有声语音的波动在与人类语音相关联的频谱的某些部分被屏蔽。 在一些实现中,通过将与人类语音相关联的频谱划分成多个子带来便于语音活动的检测,以便识别支配噪声和/或特别是子带中的其它推断的声门脉冲。 另外和/或替代地,在一些实现中,进一步分析以提供检测到的语音活动的音高估计。