专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US09009035B2 Method for processing multichannel acoustic signal, system therefor, and program 有权
标题翻译：多通道声信号处理方法，系统及程序
公开(公告)号：US09009035B2
公开(公告)日：2015-04-14
申请号：US13201354
申请日：2010-02-08
申请人： Masanori Tsujikawa , Ryosuke Isotani , Tadashi Emori , Yoshifumi Onishi
发明人： Masanori Tsujikawa , Ryosuke Isotani , Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L21/02 , H04B15/00 , G10L21/0272
CPC分类号： G10L21/0272
摘要： A method for processing multichannel acoustic signals which processes input signals of a plurality of channels including the voices of a plurality of speaking persons. The method is characterized by detecting the voice section of each speaking person or each channel, detecting overlapped sections wherein the detected voice sections are common between channels, determining a channel to be subjected to crosstalk removal and the section thereof by use of at least voice sections not including the detected overlapped sections, and removing crosstalk in the sections of the channel to be subjected to the crosstalk removal.
摘要翻译：一种用于处理多声道声信号的方法，其处理包括多个说话人的声音的多个声道的输入信号。该方法的特征在于检测每个说话人或每个频道的语音部分，检测重叠部分，其中检测到的语音部分在频道之间是公共的，通过使用至少语音部分确定要进行串扰消除的频道及其部分不包括检测到的重叠部分，并且消除要进行串扰消除的信道的部分中的串扰。

2. 发明申请

US20120029915A1 METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREFOR, AND PROGRAM 有权
标题翻译：用于处理多通道声学信号的方法，系统及其程序
公开(公告)号：US20120029915A1
公开(公告)日：2012-02-02
申请号：US13201354
申请日：2010-02-08
申请人： Masanori Tsujikawa , Ryosuke Isotani , Tadashi Emori , Yoshifumi Onishi
发明人： Masanori Tsujikawa , Ryosuke Isotani , Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L21/02
CPC分类号： G10L21/0272
摘要： A method for processing multichannel acoustic signals which processes input signals of a plurality of channels including the voices of a plurality of speaking persons. The method is characterized by detecting the voice section of each speaking person or each channel, detecting overlapped sections wherein the detected voice sections are common between channels, determining a channel to be subjected to crosstalk removal and the section thereof by use of at least voice sections not including the detected overlapped sections, and removing crosstalk in the sections of the channel to be subjected to the crosstalk removal.
摘要翻译：一种用于处理多声道声信号的方法，其处理包括多个说话人的声音的多个声道的输入信号。该方法的特征在于检测每个说话人或每个频道的语音部分，检测重叠部分，其中检测到的语音部分在频道之间是公共的，通过使用至少语音部分确定要进行串扰消除的频道及其部分不包括检测到的重叠部分，并且消除要进行串扰消除的信道的部分中的串扰。

3. 发明申请

US20120046940A1 METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREOF, AND PROGRAM 有权
标题翻译：用于处理多通道声学信号的方法，系统及程序
公开(公告)号：US20120046940A1
公开(公告)日：2012-02-23
申请号：US13201389
申请日：2010-02-08
申请人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi , Ryosuke Isotani
发明人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi , Ryosuke Isotani
IPC分类号： G10L11/00
CPC分类号： G10L21/0272
摘要： A method for processing multichannel acoustic signals, whereby input signals of a plurality of channels including the voices of a plurality of speaking persons are processed. The method is characterized by comprising: calculating the first feature quantity of the input signals of the multichannels for each channel; calculating similarity of the first feature quantity of each channel between the channels; selecting channels having high similarity; separating signals using the input signals of the selected channels; inputting the input signals of the channels having low similarity and the signals after the signal separation; and detecting a voice section of each speaking person or each channel.
摘要翻译：一种用于处理多声道声信号的方法，由此处理包括多个说话人的声音的多个声道的输入信号。该方法的特征在于包括：计算每个信道的多信道的输入信号的第一特征量; 计算通道之间每个通道的第一特征量的相似度; 选择具有高相似性的信道; 使用所选择的信道的输入信号分离信号; 输入具有低相似性的信道的输入信号和信号分离之后的信号; 并且检测每个说话人或每个频道的语音部分。

4. 发明授权

US08954323B2 Method for processing multichannel acoustic signal, system thereof, and program 有权
标题翻译：多通道声信号处理方法，系统及程序
公开(公告)号：US08954323B2
公开(公告)日：2015-02-10
申请号：US13201389
申请日：2010-02-08
申请人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi , Ryosuke Isotani
发明人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi , Ryosuke Isotani
IPC分类号： G10L21/02 , G10L15/20 , G10L21/0272
CPC分类号： G10L21/0272
摘要： A method for processing multichannel acoustic signals, whereby input signals of a plurality of channels including the voices of a plurality of speaking persons are processed. The method is characterized by comprising: calculating the first feature quantity of the input signals of the multichannels for each channel; calculating similarity of the first feature quantity of each channel between the channels; selecting channels having high similarity; separating signals using the input signals of the selected channels; inputting the input signals of the channels having low similarity and the signals after the signal separation; and detecting a voice section of each speaking person or each channel.
摘要翻译：一种用于处理多声道声信号的方法，由此处理包括多个说话人的声音的多个声道的输入信号。该方法的特征在于包括：计算每个信道的多信道的输入信号的第一特征量; 计算通道之间每个通道的第一特征量的相似度; 选择具有高相似性的信道; 使用所选择的信道的输入信号分离信号; 输入具有低相似性的信道的输入信号和信号分离之后的信号; 并且检测每个说话人或每个频道的语音部分。

5. 发明授权

US09064499B2 Method for processing multichannel acoustic signal, system therefor, and program 有权
标题翻译：多通道声信号处理方法，系统及程序
公开(公告)号：US09064499B2
公开(公告)日：2015-06-23
申请号：US13201375
申请日：2010-02-08
申请人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi
发明人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L21/02 , G10L15/20 , G10L21/0272 , G10L19/008
CPC分类号： G10L21/0272 , G10L19/008
摘要： A method for processing multichannel acoustic signals which is characterized by calculating the feature quantity of each channel from the input signals of a plurality of channels, calculating similarity between the channels in the feature quantity of each channel, selecting channels having high similarity, and separating signals using the input signals of the selected channels.
摘要翻译：一种处理多声道声信号的方法，其特征在于根据多个声道的输入信号计算每个声道的特征量，计算每个声道的特征量中的声道之间的相似度，选择具有高相似性的声道，以及分离信号使用所选通道的输入信号。

6. 发明申请

US20120029916A1 METHOD FOR PROCESSING MULTICHANNEL ACOUSTIC SIGNAL, SYSTEM THEREFOR, AND PROGRAM 有权
标题翻译：用于处理多通道声学信号的方法，系统及其程序
公开(公告)号：US20120029916A1
公开(公告)日：2012-02-02
申请号：US13201375
申请日：2010-02-08
申请人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi
发明人： Masanori Tsujikawa , Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L15/00 , G10L11/00
CPC分类号： G10L21/0272 , G10L19/008
摘要： A method for processing multichannel acoustic signals which is characterized by calculating the feature quantity of each channel from the input signals of a plurality of channels, calculating similarity between the channels in the feature quantity of each channel, selecting channels having high similarity, and separating signals using the input signals of the selected channels.
摘要翻译：一种处理多声道声信号的方法，其特征在于根据多个声道的输入信号计算每个声道的特征量，计算每个声道的特征量中的声道之间的相似度，选择具有高相似性的声道，以及分离信号使用所选通道的输入信号。

7. 发明申请

US20100324897A1 AUDIO RECOGNITION DEVICE AND AUDIO RECOGNITION METHOD 有权
标题翻译：音频识别装置和音频识别方法
公开(公告)号：US20100324897A1
公开(公告)日：2010-12-23
申请号：US12518075
申请日：2007-12-07
申请人： Tadashi Emori , Yoshifumi Onishi
发明人： Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L15/06
CPC分类号： G10L15/187 , G10L15/04 , G10L15/063 , G10L15/142 , G10L15/183 , G10L15/197
摘要： Acoustic models and language models are learned according to a speaking length which indicates a length of a speaking section in speech data, and speech recognition process is implemented by using the learned acoustic models and language models. A speech recognition apparatus includes means (103) for detecting a speaking section in speech data (101) and for generating a section information which indicates the detected speaking section, means (104) for recognizing a data part corresponding to a section information in the speech data as well as text data (102) written from the speech data and for classifying the data part based on a speaking length thereof, and means (106) for learning acoustic models and language models (107) by using the classified data part (105).
摘要翻译：声学模型和语言模型根据说话长度来学习，该长度表示语音数据中的发音部分的长度，并且通过使用学习的声学模型和语言模型来实现语音识别过程。语音识别装置包括用于检测语音数据（101）中的说话部分并且用于生成指示检测到的说话部分的部分信息的装置（103），用于识别对应于语音中的部分信息的数据部分的装置（104）数据以及从语音数据写入的文本数据（102），并且用于基于其语音长度对数据部分进行分类;以及用于通过使用分类数据部分（105）来学习声学模型和语言模型（107）的装置（106））。

8. 发明授权

US08831943B2 Language model learning system, language model learning method, and language model learning program 有权
标题翻译：语言模型学习系统，语言模型学习方法和语言模型学习程序
公开(公告)号：US08831943B2
公开(公告)日：2014-09-09
申请号：US12302962
申请日：2007-05-30
申请人： Tadashi Emori , Yoshifumi Onishi
发明人： Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L15/04 , G10L15/06 , G10L15/00 , G10L15/197 , G10L15/183
CPC分类号： G10L15/197 , G10L15/183
摘要： A language model learning system for learning a language model on an identifiable basis relating to a word error rate used in speech recognition. The language model learning system (10) includes a recognizing device (101) for recognizing an input speech by using a sound model and a language model and outputting the recognized word sequence as the recognition result, a reliability degree computing device (103) for computing the degree of reliability of the word sequence, and a language model parameter updating device (104) for updating the parameters of the language model by using the degree of reliability. The language model parameter updating device updates the parameters of the language model to heighten the degree of reliability of the word sequence the computed degree of reliability of which is low when the recognizing device recognizes by using the updated language model and the reliability degree computing device computes the degree of reliability.
摘要翻译：一种语言模型学习系统，用于在与语音识别中使用的单词错误率相关的可识别基础上学习语言模型。语言模型学习系统（10）包括：识别装置（101），用于通过使用声音模型和语言模型识别输入语音，并输出识别的字序列作为识别结果;可靠度计算装置（103），用于计算字序列的可靠度，以及语言模型参数更新装置（104），用于通过使用可靠度更新语言模型的参数。语言模型参数更新装置更新语言模型的参数，以在识别装置通过使用更新的语言模型识别并且可靠性度计算装置计算时，提高计算的可靠性程度低的字序列的可靠性程度可靠性程度。

9. 发明申请

US20100318358A1 RECOGNIZER WEIGHT LEARNING DEVICE, SPEECH RECOGNIZING DEVICE, AND SYSTEM 有权
标题翻译：识别器重量学习装置，语音识别装置和系统
公开(公告)号：US20100318358A1
公开(公告)日：2010-12-16
申请号：US12525930
申请日：2008-01-18
申请人： Yoshifumi Onishi , Tadashi Emori
发明人： Yoshifumi Onishi , Tadashi Emori
IPC分类号： G10L15/00
CPC分类号： G10L15/08 , G10L15/32
摘要： A speech recognition apparatus (110) selects an optimum recognition result from recognition results output from a set of speech recognizers (s1-sM) based on a majority decision. This decision is implemented with taking into account weight values, as to the set of the speech recognizers, learned by a learning apparatus (100). The learning apparatus includes a unit (103) selecting speech recognizers corresponding to characteristics of speech for learning (101), a unit (104) finding recognition results of the speech for learning by using the selected speech recognizers, a unit (105) unifying the recognition results and generating a word string network, and a unit (106) finding weight values concerning a set of the speech recognizers by implementing learning processing. When finding weight values, the learning apparatus selects a word from each arc set in the word string network based on a majority decision which is taken into account candidates of weight value, and outputs weight value candidates which minimize a recognition error rate of a word string formed of the selected words, as a learning result.
摘要翻译：语音识别装置（110）基于多数决定从一组语音识别器（s1-sM）输出的识别结果中选择最佳识别结果。考虑到由学习装置（100）学习的语音识别器的集合的权重值来实现该决定。所述学习装置包括：单元（103），选择与学习用语音（101）对应的语音识别符，单元（104），通过使用所选择的语音识别器寻找用于学习的语音的识别结果的单元（104）识别结果并产生字串网络;以及单元（106），通过实施学习处理来找出关于一组语音识别器的权重值。当找到权重值时，学习装置基于考虑到权重值的候选的多数决定来选择字串网络中的每个弧设置的单词，并且输出使字串的识别错误率最小化的权重值候选由所选词组成，作为学习成果。

10. 发明授权

US08706487B2 Audio recognition apparatus and speech recognition method using acoustic models and language models 有权
标题翻译：使用声学模型和语言模型的音频识别装置和语音识别方法
公开(公告)号：US08706487B2
公开(公告)日：2014-04-22
申请号：US12518075
申请日：2007-12-07
申请人： Tadashi Emori , Yoshifumi Onishi
发明人： Tadashi Emori , Yoshifumi Onishi
IPC分类号： G10L15/06
CPC分类号： G10L15/187 , G10L15/04 , G10L15/063 , G10L15/142 , G10L15/183 , G10L15/197
摘要： Acoustic models and language models are learned according to a speaking length which indicates a length of a speaking section in speech data, and speech recognition process is implemented by using the learned acoustic models and language models. A speech recognition apparatus includes means (103) for detecting a speaking section in speech data (101) and for generating a section information which indicates the detected speaking section, means (104) for recognizing a data part corresponding to a section information in the speech data as well as text data (102) written from the speech data and for classifying the data part based on a speaking length thereof, and means (106) for learning acoustic models and language models (107) by using the classified data part (105).
摘要翻译：声学模型和语言模型根据说话长度来学习，该长度表示语音数据中的发音部分的长度，并且通过使用学习的声学模型和语言模型来实现语音识别过程。语音识别装置包括用于检测语音数据（101）中的说话部分并且用于生成指示检测到的说话部分的部分信息的装置（103），用于识别对应于语音中的部分信息的数据部分的装置（104）数据以及从语音数据写入的文本数据（102），并且用于基于其语音长度对数据部分进行分类;以及用于通过使用分类数据部分（105）来学习声学模型和语言模型（107）的装置（106））。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式