会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Efficient lexical trending topic detection over streams of data using a modified sequitur algorithm
    • 使用修改的Sequitur算法对数据流进行有效的词汇趋势主题检测
    • US08838599B2
    • 2014-09-16
    • US12780850
    • 2010-05-14
    • Zhichen XuYun FuNeal Sample
    • Zhichen XuYun FuNeal Sample
    • G06F17/30
    • G06F17/30616
    • Embodiments are directed towards a Modified Sequitur algorithm (MSA) using pipelining and indexed arrays to identify trending topics within a plurality of documents having user generated content (UGC). The documents are parallelized and distributed across a plurality of network devices, which place at least some of the received documents into a buffer for which the MSA may then be applied to the documents within the buffer to identify n-grams or phrases within the documents' contents. The identified phrases are further analyzed to remove extraneous co-occurrences of phrases, and/or words based on a part of speech analysis. A weighting of the remaining phrases is used to identify trending topic phrases. Links to content in the plurality of UGC documents that is associated with the trending topic phrases may then be displayed to a client device.
    • 实施例针对使用流水线和索引数组来修改具有用户生成内容(UGC)的多个文档内的趋势主题的修改的序列算法(MSA)。 这些文档被并行化并且分布在多个网络设备上,这些网络设备将至少一些接收到的文档放置在缓冲器中,然后可以将MSA应用于缓冲器中的文档,以识别文档中的n个或多个短语, 内容。 进一步分析识别的短语,以消除基于词性分析的短语和/或单词的无关共存。 使用剩余短语的加权来识别趋势主题短语。 然后可以将与趋势主题短语相关联的多个UGC文档中的内容的链接显示给客户端设备。
    • 6. 发明申请
    • EFFICIENT LEXICAL TRENDING TOPIC DETECTION OVER STREAMS OF DATA USING A MODIFIED SEQUITUR ALGORITHM
    • 使用修改的序列算法在数据流上进行有效的LEXICAL TRENDING主题检测
    • US20110282874A1
    • 2011-11-17
    • US12780850
    • 2010-05-14
    • Zhichen XuYun FuNeal Sample
    • Zhichen XuYun FuNeal Sample
    • G06F17/30
    • G06F17/30616
    • Embodiments are directed towards a Modified Sequitur algorithm (MSA) using pipelining and indexed arrays to identify trending topics within a plurality of documents having user generated content (UGC). The documents are parallelized and distributed across a plurality of network devices, which place at least some of the received documents into a buffer for which the MSA may then be applied to the documents within the buffer to identify n-grams or phrases within the documents' contents. The identified phrases are further analyzed to remove extraneous co-occurrences of phrases, and/or words based on a part of speech analysis. A weighting of the remaining phrases is used to identify trending topic phrases. Links to content in the plurality of UGC documents that is associated with the trending topic phrases may then be displayed to a client device.
    • 实施例针对使用流水线和索引数组来修改具有用户生成内容(UGC)的多个文档内的趋势主题的修改的序列算法(MSA)。 这些文档被并行化并且分布在多个网络设备上,这些网络设备将至少一些接收到的文档放置在缓冲器中,然后可以将MSA应用于缓冲器中的文档,以识别文档中的n个或多个短语, 内容。 进一步分析识别的短语,以消除基于词性分析的短语和/或单词的无关共存。 使用剩余短语的加权来识别趋势主题短语。 然后可以将与趋势主题短语相关联的多个UGC文档中的内容的链接显示给客户端设备。
    • 7. 发明授权
    • Dynamic bloom filter for caching query results
    • 动态布局过滤器用于缓存查询结果
    • US07548908B2
    • 2009-06-16
    • US11475427
    • 2006-06-26
    • Yun FuZhichen XuJianchang Mao
    • Yun FuZhichen XuJianchang Mao
    • G06F7/00
    • G06F17/30864G06F17/30902Y10S707/99933
    • Methods, systems, and machine-readable media are disclosed for searching a corpus of information by utilizing a Bloom filter for caching query results. According to one aspect of the present invention, a method of caching information from a corpus of information can include populating one or more Bloom filters with a plurality of bits representative of information in the corpus of information. A search request can be received identifying requested information from the corpus of information. One or more bits in the filter(s) associated with the requested information can be checked and the requested information can be retrieved from the corpus of information based on results of said checking. Furthermore, the filter(s) can be used to determine which information to make available to a particular user in a system where certain information is associated with or access is limited to certain users or groups of users.
    • 公开了用于通过利用布隆过滤器来搜索查询结果来搜索信息语料库的方法,系统和机器可读介质。 根据本发明的一个方面,一种从信息语料库缓存信息的方法可以包括用表示信息语料库中的信息的多个比特填充一个或多个布隆过滤器。 可以从信息语料库中识别搜索请求信息。 可以检查与请求的信息相关联的过滤器中的一个或多个位,并且可以基于所述检查的结果从信息语料库检索所请求的信息。 此外,过滤器可以用于确定哪些信息可用于特定用户在某些信息相关联或访问受限于特定用户或用户组的系统中。