会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • GENERATING SKETCHES SENSITIVE TO HIGH-OVERLAP ESTIMATION
    • 生成对高超估计敏感的草图
    • US20130159352A1
    • 2013-06-20
    • US13328901
    • 2011-12-16
    • Marshall W. Bern
    • Marshall W. Bern
    • G06F17/30
    • G06F17/30212
    • A versioning system determines an amount by which a first collection and a second collection of data objects overlap. The system divides the first collection of data objects into m possibly overlapping groups of average size s and computes one combined hash result for each group. The system then constructs a first sketch vector with n elements based on the combined hash results. A respective element of the first sketch vector is selected, using a selection function, from the combined hash results that are computed with the hash function corresponding to the element's index. Next, the system receives a second sketch vector for the second collection of data objects, and determines a sketch-vector overlap between the first and second sketch vectors. The system then computes a data-object overlap between the first and second collections of data objects based on the sketch-vector overlap.
    • 版本控制系统确定数据对象的第一集合和第二集合重叠的量。 该系统将第一个数据对象集合分成m个可能重叠的平均大小的组,并为每个组计算一个组合的哈希结果。 然后,该系统基于组合的散列结果构造具有n个元素的第一草图向量。 使用选择函数从使用与元素索引对应的散列函数计算的组合哈希结果中选择第一草图向量的相应元素。 接下来,系统接收用于数据对象的第二集合的第二草图矢量,并且确定第一和第二草图矢量之间的草图矢量重叠。 然后,系统基于草图向量重叠来计算第一和第二数据对象集合之间的数据对象重叠。