专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

71. 发明授权

US08682672B1 Synchronous transcript display with audio/video stream in web cast environment 有权
标题翻译：同步录音显示与Web /环境中的音频/视频流
公开(公告)号：US08682672B1
公开(公告)日：2014-03-25
申请号：US10944258
申请日：2004-09-17
申请人： Tommy Ha , Kamalaksha Ghosh
发明人： Tommy Ha , Kamalaksha Ghosh
IPC分类号： G10L21/00 , G10L25/00 , G10L15/00 , G10L21/06 , G06F3/14
CPC分类号： H04N21/47217 , G06F3/1423 , G06F17/3082 , G09B5/02 , G09G5/12 , G09G2370/02 , H04N7/15 , H04N21/234336 , H04N21/4307
摘要： A system and method is described that permits synchronization of a transcript with an audio/video stream of a webcast. The system also permits a user to perform a search of the transcript and then to jump in the webcast audio/video stream to the point identified during the search.
摘要翻译：描述了允许将抄本与网络广播的音频/视频流同步的系统和方法。该系统还允许用户执行对抄本的搜索，然后将网络广播音频/视频流跳转到搜索期间识别的点。

72. 发明授权

US08650027B2 Electrolaryngeal speech reconstruction method and system thereof 有权
标题翻译：电喉语音重建方法及其系统
公开(公告)号：US08650027B2
公开(公告)日：2014-02-11
申请号：US13603226
申请日：2012-09-04
申请人： Mingxi Wan , Liang Wu , Supin Wang , Zhifeng Niu , Congying Wan
发明人： Mingxi Wan , Liang Wu , Supin Wang , Zhifeng Niu , Congying Wan
IPC分类号： G10L11/00 , G10L17/00 , G10L13/00 , G10L21/06
CPC分类号： G06K9/00221 , G10L21/0364 , G10L2021/0575
摘要： The invention provides an electrolaryngeal speech reconstruction method and a system thereof. Firstly, model parameters are extracted from the collected speech as a parameter library, then facial images of a speaker are acquired and then transmitted to an image analyzing and processing module to obtain the voice onset and offset times and the vowel classes, then a waveform of a voice source is synthesized by a voice source synthesis module, finally, the waveform of the above voice source is output by an electrolarynx vibration output module, wherein the voice source synthesis module firstly sets the model parameters of a glottal voice source so as to synthesize the waveform of the glottal voice source, and then a waveguide model is used to simulate sound transmission in a vocal tract and select shape parameters of the vocal tract according to the vowel classes.
摘要翻译：本发明提供一种电咽喉语音重建方法及其系统。首先，从收集的语音中提取模型参数作为参数库，然后获取一个说话者的面部图像，然后传送给图像分析处理模块，以获得语音起始和偏移时间以及元音类，然后是波形一个语音源由语音源合成模块合成，最后，上述语音源的波形由电声振动输出模块输出，其中语音源合成模块首先设置声门声源的模型参数，以便合成声门声源的波形，然后使用波导模型来模拟声道中的声音传播，并根据元音类选择声道的形状参数。

73. 发明申请

US20130339032A1 SERVER AND METHOD OF CONTROLLING THE SAME 有权
标题翻译：服务器及其控制方法
公开(公告)号：US20130339032A1
公开(公告)日：2013-12-19
申请号：US13918538
申请日：2013-06-14
申请人： SAMSUNG ELECTRONICS CO., LTD.
发明人： Seung-il YOON , Tae-hwan CHA
IPC分类号： G10L21/06
CPC分类号： G10L21/06 , G10L15/22 , G10L25/54 , H04N21/42203 , H04N21/472
摘要： A server which interacts with a display apparatus is provided. The server includes a storage unit configured to store conversation patterns for each service domain, a communication unit configured to receive a user's voice from the display apparatus, and a control unit configured to determine a service domain including the user's voice, generate response information corresponding to the user's voice based on a conversation pattern of the determined service domain, and to control the communication unit to transmit the response information to the display apparatus. When it is determined that a currently received user's voice is included in another service domain which is different from a service domain including a previously received user's voice, the control unit generates the response information corresponding to the currently received user's voice based on a conversation pattern of the other service domain.
摘要翻译：提供了与显示装置交互的服务器。服务器包括：存储单元，被配置为存储每个服务域的会话模式;通信单元，被配置为从显示装置接收用户的语音;以及控制单元，被配置为确定包括用户的语音的服务域，生成对应于基于所确定的服务域的会话模式的用户的语音，并且控制通信单元将响应信息发送到显示装置。当确定当前接收的用户的话音被包括在不同于包括先前接收到的用户语音的服务域的另一服务域中时，控制单元基于当前接收到的用户的语音的对话模式生成与当前接收到的用户的语音相对应的响应信息其他服务域。

74. 发明申请

US20130268276A1 Menu Hierarchy Skipping Dialog for Directed Dialog Speech Recognition 有权
标题翻译：菜单层次跳转对话框用于定向对话语音识别
公开(公告)号：US20130268276A1
公开(公告)日：2013-10-10
申请号：US13908411
申请日：2013-06-03
申请人： AT&T Intellectual Property II, L.P.
发明人： Hary E. Blanchard
IPC分类号： G10L21/06
CPC分类号： G10L25/51 , G06F3/167 , G09B5/04 , G10L15/22 , G10L15/26 , G10L21/06 , G10L2015/223 , G10L2015/228 , H04M3/4936
摘要： A method and a processing device for managing an interactive speech recognition system is provided. Whether a voice input relates to expected input, at least partially, of any one of a group of menus different from a current menu is determined. If the voice input relates to the expected input, at least partially, of any one of the group of menus different from the current menu, skipping to the one of the group of menus is performed. The group of menus is different from the current menu include menus at multiple hierarchical levels.
摘要翻译：提供了一种用于管理交互式语音识别系统的方法和处理装置。确定语音输入是否与至少部分地与当前菜单不同的一组菜单中的任何一个的预期输入相关。如果语音输入与期望的输入相关，至少部分地与不同于当前菜单的一组菜单中的任一个菜单相关，则跳过到该组菜单中的一个菜单。菜单组与当前菜单不同，包括多层次的菜单。

75. 发明申请

US20130253929A1 MOBILE SYSTEMS AND METHODS OF SUPPORTING NATURAL LANGUAGE HUMAN-MACHINE INTERACTIONS 有权
标题翻译：移动系统和支持自然语言人机交互的方法
公开(公告)号：US20130253929A1
公开(公告)日：2013-09-26
申请号：US13898045
申请日：2013-05-20
申请人： VoiceBox Technologies, Inc.
发明人： Chris Weider , Richard Kennewick , Mike Kennewick , Robert A. Kennewick , Philippe Di Cristo , Samuel Menaker , Lynn Elise Armstrong
IPC分类号： G10L21/06
CPC分类号： G10L15/1815 , G06F17/30864 , G10L15/22 , G10L21/06 , G10L2015/223 , G10L2015/227 , G10L2015/228 , H04M2250/74
摘要： A mobile system is provided that includes speech-based and non-speech-based interfaces for telematics applications. The mobile system identifies and uses context, prior information, domain knowledge, and user specific profile data to achieve a natural environment for users that submit requests and/or commands in multiple domains. The invention creates, stores and uses extensive personal profile information for each user, thereby improving the reliability of determining the context and presenting the expected results for a particular question or command. The invention may organize domain specific behavior and information into agents, that are distributable or updateable over a wide area network.
摘要翻译：提供了一种移动系统，其包括用于远程信息处理应用的基于语音和非基于语音的接口。移动系统识别和使用上下文，先前信息，域知识和用户特定的简档数据，以为在多个域中提交请求和/或命令的用户实现自然环境。本发明为每个用户创建，存储和使用广泛的个人简档信息，从而提高确定上下文的可靠性并呈现特定问题或命令的预期结果。本发明可以将域特定行为和信息组织成可在广域网上分发或更新的代理。

76. 发明申请

US20130197917A1 METHODS AND SYSTEMS FOR UTILIZING VOICE COMMANDS ONBOARD AN AIRCRAFT 审中-公开
标题翻译：利用飞机上的声音命令的方法和系统
公开(公告)号：US20130197917A1
公开(公告)日：2013-08-01
申请号：US13859301
申请日：2013-04-09
申请人： HONEYWELL INTERNATIONAL INC.
发明人： Xian Qin Dong , Xiao Long Qin
IPC分类号： G10L21/06
CPC分类号： G10L21/06 , G08G5/0021 , G10L15/183 , G10L15/22 , G10L21/0208 , G10L21/0216 , G10L2015/223 , G10L2015/225 , G10L2015/228
摘要： Methods and systems are provided for utilizing audio commands onboard an aircraft. A method comprises identifying a flight phase for the aircraft, resulting in an identified flight phase, receiving an audio input, resulting in received audio input, filtering the received audio input in a manner that is influenced by the identified flight phase for the aircraft, resulting in filtered audio input, and validating the filtered audio input as a first voice command of a first plurality of possible voice commands.
摘要翻译：提供了用于利用飞机上的音频命令的方法和系统。一种方法包括识别飞行器的飞行阶段，导致识别的飞行阶段，接收音频输入，导致接收到的音频输入，以受飞行器识别的飞行阶段的影响的方式过滤所接收的音频输入，导致在经过滤波的音频输入中，并且将经滤波的音频输入验证为第一多个可能语音命令的第一语音命令。

77. 发明申请

US20130197915A1 SPEECH-BASED USER INTERFACE FOR A MOBILE DEVICE 有权
标题翻译：基于语音的用户界面，用于移动设备
公开(公告)号：US20130197915A1
公开(公告)日：2013-08-01
申请号：US13628657
申请日：2012-09-27
申请人： GM Global Technology Operations LLC
发明人： Denis R. Burke , Danilo Gurovich , Daniel E. Rudman , Keith A. Fry , Shane M. McCutchen , Marco T. Carnevale , Mukesh Gupta
IPC分类号： G10L21/06
CPC分类号： G10L15/265 , G10L21/06 , H04M1/6083 , H04M2250/74 , H04W92/08
摘要： A method of providing hands-free services using a mobile device having wireless access to computer-based services includes carrying out a completed speech session via a mobile device without any physical interaction with the mobile device, wherein the speech session includes receiving a speech input from a user, and obtaining from a cloud service a service result responsive to the speech input, and providing the service result as a speech response presented to the user.
摘要翻译：使用具有对基于计算机的服务的无线接入的移动设备提供免提服务的方法包括经由移动设备执行完成的语音会话而没有与移动设备的任何物理交互，其中语音会话包括接收来自用户，并且从云服务获得响应于所述语音输入的服务结果，以及将所述服务结果提供为呈现给所述用户的语音响应。

78. 发明授权

US08498873B2 Establishing a multimodal advertising personality for a sponsor of multimodal application 有权
标题翻译：为多模式应用的赞助者建立多式联运广告个性
公开(公告)号：US08498873B2
公开(公告)日：2013-07-30
申请号：US13535588
申请日：2012-06-28
申请人： Charles W. Cross, Jr. , Hilary A. Pike
发明人： Charles W. Cross, Jr. , Hilary A. Pike
IPC分类号： G10L21/00 , G10L21/06
CPC分类号： G10L21/00 , G06Q30/02
摘要： Establishing a multimodal advertising personality for a sponsor of a multimodal application, including associating one or more vocal demeanors with a sponsor of a multimodal application and presenting a speech portion of the multimodal application for the sponsor using at least one of the vocal demeanors associated with the sponsor.
摘要翻译：为多模式应用的赞助者建立多模式广告个性，包括将一个或多个声音风格与多模态应用的赞助者联系起来，并使用至少一个与所述多模态应用相关联的声音风格向赞助者呈现多模式应用的语音部分赞助。

79. 发明申请

US20130191131A1 ELECTRONIC BOOK WITH VOICE EMULATION FEATURES 有权
公开(公告)号：US20130191131A1
公开(公告)日：2013-07-25
申请号：US13757409
申请日：2013-02-01
申请人： ADREA, LLC
发明人： John S. HENDRICKS , Michael L. Asmussen
IPC分类号： G10L21/06
CPC分类号： G10L13/02 , G06F1/1613 , G09B5/065 , G10L21/06
摘要： A method and system for providing text-to-audio conversion of an electronic book displayed on a viewer. A user selects a portion of displayed text and converts it into audio. The text-to-audio conversion may be performed via a header file and pre-recorded audio for each electronic book, via text-to-speech conversion, or other available means. The user may select manual or automatic text-to audio conversion. The automatic text-to-audio conversion may be performed by automatically turning the pages of the electronic book or by the user manually turning the pages. The user may also select to convert the entire electronic book, or portions of it, into audio. The user may also select an option to receive an audio definition of a particular word in the electronic book. The present invention allows a user to control the system by selecting options from a screen or by entering voice commands.

80. 发明授权

US08494859B2 Universal processing system and methods for production of outputs accessible by people with disabilities 有权
标题翻译：通用加工系统和生产残疾人无障碍产出的方法
公开(公告)号：US08494859B2
公开(公告)日：2013-07-23
申请号：US10686127
申请日：2003-10-15
申请人： Joe P. Said , David A. Schleppenbach
发明人： Joe P. Said , David A. Schleppenbach
IPC分类号： G10L21/06
CPC分类号： G06F17/227 , G06F17/218 , G06F17/22
摘要： DEAF-core technology converts inputs to outputs accessible to people with disabilities. Communication is improved with DEAF-core technology by using data storage and transmission format that includes both semantic information and content. User-defined input, responsible for conveying semantic information, and raw analog input, such as text, are converted into a unique XML format (“gh XML”). “gh XML” includes standard XML encoded with accessibility information that allows a user to communicate both verbal (text) and non-verbal (semantic) information as part of the input. “gh XML” is a temporary format which is further converted using XSLT (extensible Stylesheet Language Transformations) into individual versions of XML specific to each output. After the “gh XML” is converted into the desired XML format, custom rendering engines specific to the desired output convert the individual version of XML into a viable analog format for display.
摘要翻译： DEAF核心技术将输入转换为残疾人可访问的输出。通过使用包含语义信息和内容的数据存储和传输格式，通过DEAF核心技术改进通信。负责传达语义信息的用户定义输入以及原始模拟输入（如文本）将被转换成唯一的XML格式（“gh XML”）。 “gh XML”包括使用可访问性信息编码的标准XML，允许用户将语言（文本）和非语言（语义）信息作为输入的一部分进行通信。 “gh XML”是一种临时格式，它将使用XSLT（可扩展样式表语言转换）进一步转换为每个输出专用的XML的各个版本。在将“gh XML”转换为所需的XML格式之后，特定于所需输出的自定义渲染引擎会将单个版本的XML转换为可行的模拟格式进行显示。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式