会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Subtitle generation and retrieval combining document with speech recognition
    • 字幕生成和检索将文档与语音识别相结合
    • US07739116B2
    • 2010-06-15
    • US11338100
    • 2006-01-23
    • Kohtaroh MiyamotoNoriko NegishiKenichi Arakawa
    • Kohtaroh MiyamotoNoriko NegishiKenichi Arakawa
    • G10L11/00G10L21/00
    • G06F17/2745G10L15/26G10L2015/088H04N5/44504H04N21/4884
    • Provides subtitle generation methods and apparatus which recognizes voice in a presentation to generate subtitles thereof, and retrieval apparatus for retrieving character strings by use of the subtitles. An apparatus of the present invention includes: a extraction unit for extracting text from presentation documents; an analysis unit for morphologically analyzing text to decompose it into words; a generation unit for generating common keywords by assigning weights to words; a registration unit for adding common keywords to a voice recognition dictionary; a recognition unit for recognizing voice in a presentation; a record unit for recording the correspondence between page and time by detecting page switching events; a regeneration unit for regenerating common keywords by further referring to the correspondence between page and time; a control unit for controlling the display of subtitles, common keywords, text and master subtitles; and a note generation unit for generating speaker notes from subtitles.
    • 提供识别演示文稿中的语音以产生字幕的字幕生成方法和装置,以及通过使用字幕来检索字符串的检索装置。 本发明的装置包括:提取单元,用于从演示文档中提取文本; 用于形态分析文本以将其分解成单词的分析单元; 生成单元,用于通过向单词分配权重来生成公共关键字; 用于将常用关键字添加到语音识别词典的注册单元; 识别单元,用于识别呈现中的声音; 用于通过检测页面切换事件来记录页面和时间之间的对应关系的记录单元; 再生单元,用于通过进一步参考页面和时间之间的对应关系来再现公共关键字; 控制单元,用于控制字幕,常用关键字,文本和主字幕的显示; 以及用于从字幕产生扬声器音符的音符生成单元。
    • 5. 发明申请
    • Multiple Language/Media Translation Optimization
    • 多语言/媒体翻译优化
    • US20120179451A1
    • 2012-07-12
    • US13423537
    • 2012-03-19
    • Kohtaroh MiyamotoAli Sobhi
    • Kohtaroh MiyamotoAli Sobhi
    • G06F17/28
    • G06F17/289G10L15/26
    • A mechanism is provided for optimizing a language/media translation map. A user input is received comprising an input language/media selection, one or more output languages/medias selections, and a threshold for at least one of accuracy or throughput of one or more requested language/media translations. For each of the one or more requested language media translations, a determination is made as to whether an accuracy or throughput of a selected one of an automated translation system or a human resource translator is above the threshold for the at least one of accuracy or throughput. Responsive to the accuracy or throughput being above the threshold, either the selected one of the automated translation system or the selected one of the human resource translator is added to a multiple language/media translation map. An optimized multiple language/media translation map is then generated for use by a translation orchestration module in the data processing system.
    • 提供了一种用于优化语言/媒体翻译图的机制。 接收包括输入语言/媒体选择,一个或多个输出语言/媒体选择以及一个或多个所请求的语言/媒体翻译的精度或吞吐量中的至少一个的阈值的用户输入。 对于所述一个或多个所请求的语言媒体翻译中的每一个,确定自动翻译系统或人力资源翻译器中所选择的一个的准确度或吞吐量是否在准确度或吞吐量中的至少一个之上的阈值之上 。 响应于高于阈值的准确度或吞吐量,所选择的一个自动翻译系统或所选择的一个人力资源翻译器被添加到多语言/媒体翻译图。 然后生成优化的多语言/媒体翻译图,供数据处理系统中的翻译编排模块使用。
    • 6. 发明授权
    • Correction of a caption produced by speech recognition
    • 更正由语音识别产生的字幕
    • US07729917B2
    • 2010-06-01
    • US11688939
    • 2007-03-21
    • Kohtaroh MiyamotoKenichi ArakawaToshiya Ohgane
    • Kohtaroh MiyamotoKenichi ArakawaToshiya Ohgane
    • G10L21/00
    • G10L15/22G10L15/08
    • A device of the present invention obtains a character string of a speech recognition result and a confidence factor thereof. A time monitor monitors time and determines whether or not processing is delayed by checking the confidence factor and time status. When the processing is not delayed, a checker is asked to perform manual judgment. In this event, speech is processed and the manual judgment of the speech recognition result is performed on the basis of the processed speech. When the processing is delayed, automatic judgment is performed by use of the confidence factor. When the character string is judged to be correct as a result of the manual judgment or the automatic judgment, the character string is displayed as a confirmed character string. When the character string is judged to be incorrect, automatic correction is performed by matching on the basis of a next candidate obtained by the speech recognition, texts and attributes of the presentation, a script text, and the like. Character string after the automatic correction is displayed as an unconfirmed character string.
    • 本发明的装置获得语音识别结果的字符串和置信因子。 时间监视器监视时间,并通过检查置信因子和时间状态来确定处理是否被延迟。 当处理不延迟时,要求检查员进行手动判断。 在这种情况下,处理语音,并且基于处理的语音来执行语音识别结果的手动判断。 当处理延迟时,通过使用置信因子进行自动判断。 当作为手动判断或自动判断的结果判断为正确的字符串时,字符串被显示为确认的字符串。 当判断字符串不正确时,通过基于通过语音识别获得的下一个候选者,演示的文本和属性,脚本文本等进行匹配来执行自动校正。 自动修正后的字符串显示为未确认的字符串。
    • 8. 发明申请
    • APPARATUS AND METHOD FOR RENDERING CONTENTS, CONTAINING SOUND DATA, MOVING IMAGE DATA AND STATIC IMAGE DATA, HARMLESS
    • 用于渲染内容,包含声音数据,移动图像数据和静态图像数据的装置和方法
    • US20080262841A1
    • 2008-10-23
    • US11871331
    • 2007-10-12
    • Kohtaroh MiyamotoYohei Ikawa
    • Kohtaroh MiyamotoYohei Ikawa
    • G10L15/00H04N5/91
    • H04N7/1675G11B20/00086G11B20/00137G11B20/0021G11B20/00804G11B27/034G11B27/105G11B27/11H04N21/2353H04N21/6334H04N21/8355
    • A method of rendering multimedia contents harmless is described. The method includes: reading out a predetermined word and the contents from a recording apparatus; replacing the predetermined word in transcript data with a different word, and setting the transcript data including the different word, and the predetermined word, respectively, as transcript data of harmless contents, and as transcript data of unique information; replacing the predetermined word with the different word, and setting the sound data including the different word and the predetermined word according to a time when the predetermined word appears in the firstly mentioned transcript data, respectively, as sound data of the harmless contents, and as sound data of the unique information; replacing the predetermined word in the presentation data with the different word, and the predetermined word, respectively, as presentation data of the harmless contents, and as presentation data of the unique information; recording the harmless contents; and recording the unique information.
    • 描述了使多媒体内容无害化的方法。 该方法包括:从记录装置读出预定字和内容; 用不同的词代替抄本数据中的预定词,并将包括不同单词和预定单词的抄本数据分别设置为无害内容的抄本数据,以及唯一信息的抄录数据; 用不同的字替换预定字,并且分别将预定词出现在首先提及的转录数据中的时间设置为包括不同字和预定字的声音数据作为无害内容的声音数据,并且作为 声音数据的独特信息; 分别用不同的单词和预定单词代替表示数据中的预定单词作为无害内容的显示数据,并将其作为唯一信息的显示数据; 记录无害内容; 并记录唯一信息。
    • 9. 发明授权
    • Computer resources access control apparatus and method
    • 计算机资源访问控制装置及方法
    • US6101569A
    • 2000-08-08
    • US111029
    • 1998-07-07
    • Kohtaroh MiyamotoKenichi Okuyama
    • Kohtaroh MiyamotoKenichi Okuyama
    • G06F12/00G06F9/46G06T1/20
    • G06F9/52Y10S707/99938
    • The present invention is directed to obtaining a correct processing result without an inexpedience such as a starvation by having a plurality of processes gain an access in parallel to a resource such as a VRAM. When one of a plurality of processing means requests an exclusive access to a portion of a resource, a lock range processing part 122 permits an exclusive access only when the process does not result in an inexpedience, for example, when other processing means is not permitted to gain an exclusive access to an overlapped portion. Further, the lock range processing part 122 inhibits permission of an exclusive access to other portion overlapping a portion and permits an exclusive access to such portion when the number of permissions of exclusive accesses to other portion overlapping this portion exceeds a given number while an exclusive access to such portion is not permitted. The lock range processing part 122 thus prevents a starvation from occurring by limiting the number of permissions of an exclusive access to other overlapping portion passing a portion which overlaps the other overlapping portion.
    • 本发明旨在通过使多个进程获得与诸如VRAM的资源并行的访问来获得正确的处理结果而没有诸如饥饿的无效性。 当多个处理装置中的一个请求对资源的一部分进行专用访问时,锁定范围处理部分122仅在处理不导致不能执行时允许独占访问,例如当不允许其他处理装置时 获得对重叠部分的独占访问权限。 此外,锁定范围处理部分122禁止对与部分重叠的其他部分的独占访问权限,并且当与该部分重叠的其他部分的排他访问权限的数量超过给定的数量时,允许对这些部分的独占访问,而专用访问 不允许这样的部分。 因此,锁定范围处理部分122通过限制通过与其他重叠部分重叠的部分的其他重叠部分的专用访问权限的数量来防止出现饥饿。