会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 72. 发明授权
    • Electrolaryngeal speech reconstruction method and system thereof
    • 电喉语音重建方法及其系统
    • US08650027B2
    • 2014-02-11
    • US13603226
    • 2012-09-04
    • Mingxi WanLiang WuSupin WangZhifeng NiuCongying Wan
    • Mingxi WanLiang WuSupin WangZhifeng NiuCongying Wan
    • G10L11/00G10L17/00G10L13/00G10L21/06
    • G06K9/00221G10L21/0364G10L2021/0575
    • The invention provides an electrolaryngeal speech reconstruction method and a system thereof. Firstly, model parameters are extracted from the collected speech as a parameter library, then facial images of a speaker are acquired and then transmitted to an image analyzing and processing module to obtain the voice onset and offset times and the vowel classes, then a waveform of a voice source is synthesized by a voice source synthesis module, finally, the waveform of the above voice source is output by an electrolarynx vibration output module, wherein the voice source synthesis module firstly sets the model parameters of a glottal voice source so as to synthesize the waveform of the glottal voice source, and then a waveguide model is used to simulate sound transmission in a vocal tract and select shape parameters of the vocal tract according to the vowel classes.
    • 本发明提供一种电咽喉语音重建方法及其系统。 首先,从收集的语音中提取模型参数作为参数库,然后获取一个说话者的面部图像,然后传送给图像分析处理模块,以获得语音起始和偏移时间以及元音类,然后是波形 一个语音源由语音源合成模块合成,最后,上述语音源的波形由电声振动输出模块输出,其中语音源合成模块首先设置声门声源的模型参数,以便合成 声门声源的波形,然后使用波导模型来模拟声道中的声音传播,并根据元音类选择声道的形状参数。
    • 73. 发明申请
    • SERVER AND METHOD OF CONTROLLING THE SAME
    • 服务器及其控制方法
    • US20130339032A1
    • 2013-12-19
    • US13918538
    • 2013-06-14
    • SAMSUNG ELECTRONICS CO., LTD.
    • Seung-il YOONTae-hwan CHA
    • G10L21/06
    • G10L21/06G10L15/22G10L25/54H04N21/42203H04N21/472
    • A server which interacts with a display apparatus is provided. The server includes a storage unit configured to store conversation patterns for each service domain, a communication unit configured to receive a user's voice from the display apparatus, and a control unit configured to determine a service domain including the user's voice, generate response information corresponding to the user's voice based on a conversation pattern of the determined service domain, and to control the communication unit to transmit the response information to the display apparatus. When it is determined that a currently received user's voice is included in another service domain which is different from a service domain including a previously received user's voice, the control unit generates the response information corresponding to the currently received user's voice based on a conversation pattern of the other service domain.
    • 提供了与显示装置交互的服务器。 服务器包括:存储单元,被配置为存储每个服务域的会话模式;通信单元,被配置为从显示装置接收用户的语音;以及控制单元,被配置为确定包括用户的语音的服务域,生成对应于 基于所确定的服务域的会话模式的用户的语音,并且控制通信单元将响应信息发送到显示装置。 当确定当前接收的用户的话音被包括在不同于包括先前接收到的用户语音的服务域的另一服务域中时,控制单元基于当前接收到的用户的语音的对话模式生成与当前接收到的用户的语音相对应的响应信息 其他服务域。
    • 80. 发明授权
    • Universal processing system and methods for production of outputs accessible by people with disabilities
    • 通用加工系统和生产残疾人无障碍产出的方法
    • US08494859B2
    • 2013-07-23
    • US10686127
    • 2003-10-15
    • Joe P. SaidDavid A. Schleppenbach
    • Joe P. SaidDavid A. Schleppenbach
    • G10L21/06
    • G06F17/227G06F17/218G06F17/22
    • DEAF-core technology converts inputs to outputs accessible to people with disabilities. Communication is improved with DEAF-core technology by using data storage and transmission format that includes both semantic information and content. User-defined input, responsible for conveying semantic information, and raw analog input, such as text, are converted into a unique XML format (“gh XML”). “gh XML” includes standard XML encoded with accessibility information that allows a user to communicate both verbal (text) and non-verbal (semantic) information as part of the input. “gh XML” is a temporary format which is further converted using XSLT (extensible Stylesheet Language Transformations) into individual versions of XML specific to each output. After the “gh XML” is converted into the desired XML format, custom rendering engines specific to the desired output convert the individual version of XML into a viable analog format for display.
    • DEAF核心技术将输入转换为残疾人可访问的输出。 通过使用包含语义信息和内容的数据存储和传输格式,通过DEAF核心技术改进通信。 负责传达语义信息的用户定义输入以及原始模拟输入(如文本)将被转换成唯一的XML格式(“gh XML”)。 “gh XML”包括使用可访问性信息编码的标准XML,允许用户将语言(文本)和非语言(语义)信息作为输入的一部分进行通信。 “gh XML”是一种临时格式,它将使用XSLT(可扩展样式表语言转换)进一步转换为每个输出专用的XML的各个版本。 在将“gh XML”转换为所需的XML格式之后,特定于所需输出的自定义渲染引擎会将单个版本的XML转换为可行的模拟格式进行显示。