    • 2. Granted invention patent
    • System and method for analyzing streams and counting stream items on multi-core processors
    • US08321579B2
    • 2012-11-27
    • US11828732
    • 2007-07-26
    • Charu Chandra Aggarwal; Rajesh Bordawekar; Dina Thomas; Philip Shilung Yu
    • G06F15/16
    • G06F17/18
    • Systems and methods for parallel stream item counting are disclosed. A data stream is partitioned into portions and the portions are assigned to a plurality of processing cores. A sequential kernel is executed at each processing core to compute a local count for items in an assigned portion of the data stream for that processing core. The counts are aggregated for all the processing cores to determine a final count for the items in the data stream. A frequency-aware counting method (FCM) for data streams includes dynamically capturing relative frequency phases of items from a data stream and placing the items in a sketch structure using a plurality of hash functions where a number of hash functions is based on the frequency phase of the item. A zero-frequency table is provided to reduce errors due to absent items.
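The partition, local count, and aggregate steps described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: exact `Counter` tallies stand in for the sketch structure, the frequency-aware hashing and zero-frequency table are left out, a thread pool stands in for processing cores, and the function names and round-robin partitioning are assumptions made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def partition(stream, num_workers):
    """Split the stream into num_workers round-robin portions (assumed scheme)."""
    return [stream[i::num_workers] for i in range(num_workers)]


def local_count(portion):
    """Sequential kernel: count the items of one assigned portion."""
    return Counter(portion)


def parallel_count(stream, num_workers=4):
    """Count items per portion in parallel, then aggregate the local counts."""
    portions = partition(stream, num_workers)
    # Threads stand in for processing cores in this sketch.
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        local_counts = list(pool.map(local_count, portions))
    total = Counter()
    for counts in local_counts:
        total.update(counts)  # Counter.update adds counts together
    return total


if __name__ == "__main__":
    print(parallel_count(["a", "b", "a", "c", "b", "a"], num_workers=3))
```

Because each portion is counted independently, the only shared step is the final aggregation, which is what lets the per-portion kernel stay sequential.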
    • 3. Invention patent application
    • System and method for indexing type-annotated web documents
    • US20090049035A1
    • 2009-02-19
    • US11891921
    • 2007-08-14
    • Hao He; Haixun Wang; Philip Shilung Yu
    • G06F7/06; G06F17/30
    • G06F16/951
    • Methods and apparatus generate an index for use in a document retrieval system where the index is organized by type and keyword. Redundancy in the index is reduced by organizing type entries in a hierarchy of internal and leaf nodes. Determining whether to generate an inverted list for a type is based on the position of the type in the hierarchy; generally inverted lists are generated only for types corresponding to leaf nodes. Redundancy is further reduced by re-using inverted lists generated for keywords for types when there is an overlap between keywords and types. Search performance using the document retrieval index is improved by adding entries corresponding to combinations of keywords and types. The intersections of inverted lists associated with the keywords and types comprising the combinations are determined and added to the index for use in search operations. Determining whether to add an entry for a keyword-type combination is made on a cost-benefit analysis dependent, at least in part, on the proximity of the keyword to type in documents containing the combination.
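The indexing idea in the abstract can be illustrated roughly as follows: inverted lists are stored only for leaf types, an internal type is answered from the lists of its descendants, and a keyword-type combination entry is the intersection of the two lists. This is a hedged sketch, not the method claimed in the application. The `HIERARCHY` table, the document layout, and all function names are hypothetical, and the cost-benefit test for deciding which combinations to store is omitted.

```python
from collections import defaultdict

# Hypothetical type hierarchy: internal node -> child types.
HIERARCHY = {"place": ["city", "country"]}


def build_index(docs):
    """Build inverted lists for keywords and for leaf types only.

    docs maps doc_id -> {"keywords": [...], "types": [...]}, where the
    listed types are assumed to be leaf types.
    """
    keyword_lists = defaultdict(set)
    type_lists = defaultdict(set)
    for doc_id, doc in docs.items():
        for keyword in doc["keywords"]:
            keyword_lists[keyword].add(doc_id)
        for leaf_type in doc["types"]:
            type_lists[leaf_type].add(doc_id)
    return keyword_lists, type_lists


def docs_for_type(type_name, type_lists):
    """Resolve a type: a leaf reads its own list, an internal node unions its children."""
    children = HIERARCHY.get(type_name)
    if children is None:
        return set(type_lists.get(type_name, set()))
    result = set()
    for child in children:
        result |= docs_for_type(child, type_lists)
    return result


def combination_entry(keyword, type_name, keyword_lists, type_lists):
    """Intersect the keyword list with the type list to form a combination entry."""
    return keyword_lists.get(keyword, set()) & docs_for_type(type_name, type_lists)
```

Storing no list for `place` avoids duplicating the `city` and `country` lists, at the cost of a union at query time.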