会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 7. 发明申请
    • AUDIO RECOGNITION DEVICE AND AUDIO RECOGNITION METHOD
    • 音频识别装置和音频识别方法
    • US20100324897A1
    • 2010-12-23
    • US12518075
    • 2007-12-07
    • Tadashi EmoriYoshifumi Onishi
    • Tadashi EmoriYoshifumi Onishi
    • G10L15/06
    • G10L15/187G10L15/04G10L15/063G10L15/142G10L15/183G10L15/197
    • Acoustic models and language models are learned according to a speaking length which indicates a length of a speaking section in speech data, and speech recognition process is implemented by using the learned acoustic models and language models. A speech recognition apparatus includes means (103) for detecting a speaking section in speech data (101) and for generating a section information which indicates the detected speaking section, means (104) for recognizing a data part corresponding to a section information in the speech data as well as text data (102) written from the speech data and for classifying the data part based on a speaking length thereof, and means (106) for learning acoustic models and language models (107) by using the classified data part (105).
    • 声学模型和语言模型根据说话长度来学习,该长度表示语音数据中的发音部分的长度,并且通过使用学习的声学模型和语言模型来实现语音识别过程。 语音识别装置包括用于检测语音数据(101)中的说话部分并且用于生成指示检测到的说话部分的部分信息的装置(103),用于识别对应于语音中的部分信息的数据部分的装置(104) 数据以及从语音数据写入的文本数据(102),并且用于基于其语音长度对数据部分进行分类;以及用于通过使用分类数据部分(105)来学习声学模型和语言模型(107)的装置(106) )。
    • 8. 发明授权
    • Language model learning system, language model learning method, and language model learning program
    • 语言模型学习系统,语言模型学习方法和语言模型学习程序
    • US08831943B2
    • 2014-09-09
    • US12302962
    • 2007-05-30
    • Tadashi EmoriYoshifumi Onishi
    • Tadashi EmoriYoshifumi Onishi
    • G10L15/04G10L15/06G10L15/00G10L15/197G10L15/183
    • G10L15/197G10L15/183
    • A language model learning system for learning a language model on an identifiable basis relating to a word error rate used in speech recognition. The language model learning system (10) includes a recognizing device (101) for recognizing an input speech by using a sound model and a language model and outputting the recognized word sequence as the recognition result, a reliability degree computing device (103) for computing the degree of reliability of the word sequence, and a language model parameter updating device (104) for updating the parameters of the language model by using the degree of reliability. The language model parameter updating device updates the parameters of the language model to heighten the degree of reliability of the word sequence the computed degree of reliability of which is low when the recognizing device recognizes by using the updated language model and the reliability degree computing device computes the degree of reliability.
    • 一种语言模型学习系统,用于在与语音识别中使用的单词错误率相关的可识别基础上学习语言模型。 语言模型学习系统(10)包括:识别装置(101),用于通过使用声音模型和语言模型识别输入语音,并输出识别的字序列作为识别结果;可靠度计算装置(103),用于计算 字序列的可靠度,以及语言模型参数更新装置(104),用于通过使用可靠度更新语言模型的参数。 语言模型参数更新装置更新语言模型的参数,以在识别装置通过使用更新的语言模型识别并且可靠性度计算装置计算时,提高计算的可靠性程度低的字序列的可靠性程度 可靠性程度。
    • 9. 发明申请
    • RECOGNIZER WEIGHT LEARNING DEVICE, SPEECH RECOGNIZING DEVICE, AND SYSTEM
    • 识别器重量学习装置,语音识别装置和系统
    • US20100318358A1
    • 2010-12-16
    • US12525930
    • 2008-01-18
    • Yoshifumi OnishiTadashi Emori
    • Yoshifumi OnishiTadashi Emori
    • G10L15/00
    • G10L15/08G10L15/32
    • A speech recognition apparatus (110) selects an optimum recognition result from recognition results output from a set of speech recognizers (s1-sM) based on a majority decision. This decision is implemented with taking into account weight values, as to the set of the speech recognizers, learned by a learning apparatus (100). The learning apparatus includes a unit (103) selecting speech recognizers corresponding to characteristics of speech for learning (101), a unit (104) finding recognition results of the speech for learning by using the selected speech recognizers, a unit (105) unifying the recognition results and generating a word string network, and a unit (106) finding weight values concerning a set of the speech recognizers by implementing learning processing. When finding weight values, the learning apparatus selects a word from each arc set in the word string network based on a majority decision which is taken into account candidates of weight value, and outputs weight value candidates which minimize a recognition error rate of a word string formed of the selected words, as a learning result.
    • 语音识别装置(110)基于多数决定从一组语音识别器(s1-sM)输出的识别结果中选择最佳识别结果。 考虑到由学习装置(100)学习的语音识别器的集合的权重值来实现该决定。 所述学习装置包括:单元(103),选择与学习用语音(101)对应的语音识别符,单元(104),通过使用所选择的语音识别器寻找用于学习的语音的识别结果的单元(104) 识别结果并产生字串网络;以及单元(106),通过实施学习处理来找出关于一组语音识别器的权重值。 当找到权重值时,学习装置基于考虑到权重值的候选的多数决定来选择字串网络中的每个弧设置的单词,并且输出使字串的识别错误率最小化的权重值候选 由所选词组成,作为学习成果。
    • 10. 发明授权
    • Audio recognition apparatus and speech recognition method using acoustic models and language models
    • 使用声学模型和语言模型的音频识别装置和语音识别方法
    • US08706487B2
    • 2014-04-22
    • US12518075
    • 2007-12-07
    • Tadashi EmoriYoshifumi Onishi
    • Tadashi EmoriYoshifumi Onishi
    • G10L15/06
    • G10L15/187G10L15/04G10L15/063G10L15/142G10L15/183G10L15/197
    • Acoustic models and language models are learned according to a speaking length which indicates a length of a speaking section in speech data, and speech recognition process is implemented by using the learned acoustic models and language models. A speech recognition apparatus includes means (103) for detecting a speaking section in speech data (101) and for generating a section information which indicates the detected speaking section, means (104) for recognizing a data part corresponding to a section information in the speech data as well as text data (102) written from the speech data and for classifying the data part based on a speaking length thereof, and means (106) for learning acoustic models and language models (107) by using the classified data part (105).
    • 声学模型和语言模型根据说话长度来学习,该长度表示语音数据中的发音部分的长度,并且通过使用学习的声学模型和语言模型来实现语音识别过程。 语音识别装置包括用于检测语音数据(101)中的说话部分并且用于生成指示检测到的说话部分的部分信息的装置(103),用于识别对应于语音中的部分信息的数据部分的装置(104) 数据以及从语音数据写入的文本数据(102),并且用于基于其语音长度对数据部分进行分类;以及用于通过使用分类数据部分(105)来学习声学模型和语言模型(107)的装置(106) )。